Transformations #68
Replies: 2 comments
-
3 types of transformation
@hub.apply(...)
def my_apply(sample):
sample["image"][5] = ndvi(sample["image"][0:3])
@hub.transform(dtype=...)
def my_transform(sample):
res = copy(sample)
res["image"][5] = ndvi(sample["image"][0:3])
return res
@hub.generator(dtype=...)
def my_generator(sample):
yield ...
yield ...
yield ...
yield ...
ds2 = my_generator(ds1)
for i in ds2:
pass
ds2.store("s3://...") |
Beta Was this translation helpful? Give feedback.
-
From the feedback, we received today from a customer. They would need to specify where each step (either generator, transform, or apply) would be deployed including either GPU or CPU. Then they also asked how they can run a model on the data and connect the preprocessing pipeline to it. Let's say if you are doing normalization as a preprocessing step for generating the dataset and training the model, then the same should also apply during the inference side. Finally, The insight we have regarding chunk-based storage needs to be used for processing, otherwise it's no different from Ray or Dask processes. |
Beta Was this translation helpful? Give feedback.
-
How to design the transformation of the dataset into another dataset?
We should finalize the API for v1.0 transformation
Beta Was this translation helpful? Give feedback.
All reactions