Use an already trained Keras model to predict on lots of data #35

Open · mrocklin opened this issue Aug 31, 2018 · 17 comments
Labels: help wanted (Extra attention is needed)

Comments

@mrocklin (Member)

A common approach is to train on a small amount of data and then use the trained model to predict on lots of data. We could do this with ParallelPostFit in dask-ml, or with X.map_blocks or df.map_partitions. In either case we might want to be careful to avoid repeated serialization costs. For example, in the following case I suspect that we include the serialized model in every task:

# maybe bad?
model = load_model()
predictions = X.map_blocks(model.predict)  

It's probably better to encourage the user to keep the model delayed:

# maybe better
model = dask.delayed(load_model)()
predictions = X.map_blocks(model.predict)  

We should also ensure that dask-ml does this correctly and includes the model as a single task in the graph, so that it gets sent around appropriately (cc @TomAugspurger).

I'm also generally curious whether a Keras model that lives on the GPU will eventually make its way back onto the GPU when deserialized.
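For reference, a minimal end-to-end sketch of the delayed-model pattern (illustrative, not from this issue: the "model.h5" path, the array shapes, and the assumption that the model maps (n, 20) inputs to (n, 1) outputs are all placeholders):

import dask
import dask.array as da
import numpy as np
from tensorflow.keras.models import load_model

def predict_chunk(model, chunk):
    # Runs on a worker with a concrete, already-deserialized model.
    return model.predict(chunk)

# One task for the model: the serialized weights appear once in the
# graph and are shipped to workers once, not once per chunk.
model = dask.delayed(load_model)("model.h5")  # hypothetical path

X = da.random.random((100_000, 20), chunks=(10_000, 20))

# One delayed predict call per chunk, reassembled into a dask array.
parts = [
    da.from_delayed(dask.delayed(predict_chunk)(model, chunk),
                    shape=(n, 1), dtype=np.float32)
    for chunk, n in zip(X.to_delayed().ravel(), X.chunks[0])
]
predictions = da.concatenate(parts, axis=0)
result = predictions.compute()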

@TomAugspurger (Member)

FYI, I started on this at https://gist.github.com/TomAugspurger/2889a052b5fec4d691f83ba2062d2d92

As you predicted, X.map_blocks(model.predict) was slow.

I stopped as soon as I hit an error and haven't done any profiling yet. I'll pick it up again soon, but I don't want anyone else to duplicate effort.

@mrocklin (Member, Author)

mrocklin commented Oct 25, 2018 via email

@TomAugspurger (Member)

Oh, and /profile-server is going to be extremely useful here. On a whim, I tried X.map_blocks(delayed(model.predict)) and the scheduler has been at 100% CPU for a minute while the workers are idle.

@TomAugspurger (Member)

Right, I think I'm stalled on deserializing the TensorFlow graph in a new process: https://gist.github.com/33efb49efe611701ef122f577d0e0430

@TomAugspurger (Member)

Probably putting this on the back burner for now, in case others want to take a look.

@AakashKumarNain

@TomAugspurger @mrocklin Once we have our stacked delayed dask array, can't we just generate batches of data from it on the fly? Something like this:

data = [da.from_delayed(x, shape=(224, 224, 3), dtype=np.float32) for x in images]

nb_batches = 100
for i in range(nb_batches):
    batch_images, batch_labels = next(data)  # just an example to show the idea
    model.train_on_batch(batch_images, batch_labels)

Is there any way to do this?

@mrocklin (Member, Author)

It depends on what you mean by "batch", I guess. You can slice into x in a variety of ways:

index = np.random.randint(0, x.shape[0], size=10)
batch = x[index]

Some ways of slicing will be cheap (like the one above) and some won't, depending on the chunk structure.
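An illustrative sketch of the difference (shapes and chunking are placeholders): slicing along chunk boundaries touches a single chunk, while fancy indexing with random positions may gather rows from many chunks:

import dask.array as da
import numpy as np

x = da.random.random((50_000, 224, 224, 3), chunks=(32, 224, 224, 3))

cheap = x[0:32]    # aligned with one chunk: a single task
index = np.random.randint(0, x.shape[0], size=32)
costly = x[index]  # may pull rows from many different chunks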

@AakashKumarNain

AakashKumarNain commented Jun 14, 2019

Thanks @mrocklin. I'll elaborate a bit. Say I have 50,000 images on disk; I cannot load all the data into memory at once. Normally we would use a generator that yields batches of data: for example, with a batch size of 32, each batch contains 32 images, which is then fed to the model for training.

Now, a plain Python generator uses only one core. So instead of a Python generator, let us say we build delayed dask arrays as

data = [da.from_delayed(x, shape=(224, 224, 3), dtype=np.float32) for x in images]

The shape of the final stacked array would be (50000, 224, 224, 3). What is the best way to iterate over this delayed array so that on each iteration I get a chunk of 32 images?

@mrocklin (Member, Author)

The same way you would with NumPy:

for i in range(0, x.shape[0], 32):
    chunk = x[i:i+32, ...]

chunk is a dask array here. I'm not sure if that's what you want. You might want to call compute, or delay the fit call (although Keras sometimes has issues with moving to other threads).
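A concrete sketch of that loop, assuming x and y are dask arrays and model is a compiled Keras model (all three names are placeholders carried over from the discussion):

batch_size = 32
for i in range(0, x.shape[0], batch_size):
    # Materialize just this slice in memory, then train on it.
    batch_x = x[i:i + batch_size].compute()
    batch_y = y[i:i + batch_size].compute()
    model.train_on_batch(batch_x, batch_y)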

@AakashKumarNain

AakashKumarNain commented Jun 14, 2019

Cool, thanks a lot for your time. Yeah, I am aware of those issues; that is why I want to use dask only for batch generation, with no delayed calls to fit.

@bw4sz

bw4sz commented Sep 9, 2019

@AakashKumarNain I have a similar use case. Did you find performance improvements when transitioning from NumPy to dask for reading image slices from file?

@skeller88

@AakashKumarNain same question here. What code did you end up using? How was the performance? I want to use a keras.utils.Sequence subclass so I can leverage Keras's fit_generator, so I'm thinking of something that keeps the images in a dask array and loads each batch into memory:

from typing import Tuple

import dask.array as da
import numpy as np
from tensorflow import keras

class DaskImageSequence(keras.utils.Sequence):
    def __init__(self, x: da.Array, y: da.Array, batch_size: int):
        self.x = x
        self.y = y
        self.batch_size = batch_size

    def __len__(self):
        # shape[0] of a dask array is a plain int, so no compute() is needed
        return int(np.ceil(self.x.shape[0] / self.batch_size))

    def __getitem__(self, batch_num) -> Tuple[np.ndarray, np.ndarray]:
        # Slice one batch lazily, then materialize it in memory
        start = batch_num * self.batch_size
        stop = start + self.batch_size
        batch_x = self.x[start:stop].compute()
        batch_y = self.y[start:stop].compute()
        return batch_x, batch_y
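A hedged usage sketch (train_x, train_y, and model are placeholders; fit_generator was the Keras API at the time, and newer versions of Keras accept a Sequence directly in fit):

seq = DaskImageSequence(train_x, train_y, batch_size=32)
model.fit_generator(seq, epochs=5)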

@AakashKumarNain

@skeller88 I didn't try it; I was trying to benchmark it against tf.data instead. But this certainly looks good so far.

@bw4sz

bw4sz commented Jan 15, 2020

@mrocklin I think I'm stumbling on exactly the "# maybe bad?" issue mentioned at the top.

Pseudo-code (working on a reproducible example):

We have a large number of numpy arrays (geospatial tiles) and an object detection model.

This works in serial:

    model = create_model()
    results = []
    for tile in tilelist:
        boxes = model.predict_tile(tile)
        results.append(boxes)

Following your suggestion from above, this

    results = []
    for tile in tilelist:
        model = dask.delayed(create_model)()
        boxes = dask.delayed(model.predict_tile)(tile)
        results.append(boxes)
        
    all_boxes = dask.compute(*results)

fails with what looks like a multiprocessing-related TensorFlow error:

builtins.ValueError: Tensor Tensor("filtered_detections/map/TensorArrayStack/TensorArrayGatherV3:0", shape=(?, 300, 4), dtype=float32) is not an element of this graph.

One level up in the traceback is tensorflow/python/framework/ops.py, line 3796, in as_graph_element:

    with self._lock:
      return self._as_graph_element_locked(obj, allow_tensor, allow_operation)

I'm testing on a LocalCluster on CPU, but will eventually move to SLURM with GPUs.

Perhaps related to dask/distributed#878 and dask/dask-ml#281.

For anyone who comes looking: I'm giving up here because I think it's a bit of a red herring. I was just using LocalCluster to run tests, and my sense is that this is part of the problem. I can see that Keras serialization is an ongoing challenge, and my ultimate goal is to get this running on a SLURM cluster, which in this case might be quite a bit simpler. Leaving this note here for others; I will open a reproducible example tomorrow.

The problem persists on dask-jobqueue, and I've looked through all the pertinent issues. I feel like the answer is known, but not obviously documented.
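For anyone hitting the same "is not an element of this graph" error, one commonly suggested workaround (a sketch, not a verified fix for this exact setup) is to build or load the model inside the task, so each worker process constructs its own TensorFlow graph and nothing graph-shaped is ever serialized; create_model, predict_tile, and tilelist are carried over from the comment above:

    import dask

    def predict_one(tile):
        # Building the model inside the task keeps the TensorFlow graph
        # local to the worker process; it is never pickled or shipped.
        model = create_model()
        return model.predict_tile(tile)

    results = dask.compute(*[dask.delayed(predict_one)(tile) for tile in tilelist])

Reloading the model once per task is wasteful; a per-worker cache (see the caching sketch further below) is the usual refinement.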

@bw4sz

bw4sz commented Jan 16, 2020

I created a working example here for those who find this link: dask/distributed#2333

@mrocklin (Member, Author)

This keeps coming up. I'm adding it to the core maintenance project board.
https://stackoverflow.com/questions/61924824/how-to-do-model-predict-using-distributed-dask-with-a-pre-trained-keras-model
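A per-worker caching sketch in the spirit of the discussions linked above (the attribute name and "model.h5" path are illustrative; distributed.get_worker is a real helper, but stashing the model on the worker object is a convention, not an official API; the dtype assumes float32 predictions):

from distributed import get_worker
from tensorflow.keras.models import load_model

def predict_with_cache(chunk):
    worker = get_worker()
    # Load the model once per worker process and reuse it across tasks.
    model = getattr(worker, "_cached_keras_model", None)
    if model is None:
        model = load_model("model.h5")  # hypothetical path
        worker._cached_keras_model = model
    return model.predict(chunk)

predictions = X.map_blocks(predict_with_cache, dtype="float32")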

@bw4sz

bw4sz commented May 23, 2020 via email

@jacobtomlinson added the "help wanted (Extra attention is needed)" label on Oct 14, 2021