ABSTRACT

Our toy example, which we'll see a third (and last) version of in the next chapter, had the model train on a tiny set of data – small enough to pass all observations to the model in one go. What if that wasn't the case? Say we had 10,000 items instead, and every item was an RGB image of size 256 × 256 pixels. Even on very powerful hardware, we could not possibly train a model on the complete data all at once.