Replies: 2 comments 1 reply
-
Another comment - is it reasonable to assume that all the labels can be stored in memory? |
Beta Was this translation helpful? Give feedback.
-
Thanks @avihu111 ! Meanwhile, |
Beta Was this translation helpful? Give feedback.
-
Some thoughts about fuse dataset structure.
the DatasetDefault class is not super intuitive in my opinion. few issues:
allow_uncached_sample_morphing
is pretty complex and exposes too much under-the-hood IMO.allow_uncached_sample_morphing: when enabled, allows an Op, to return None, or to return multiple samples (in a list)
In contrast, the class
DatasetWrapSeqToDict
fits the "spirit" of the library (take PyTorch modules and wrap them in a NDict for simpler extendibility) - but most examples do not use it.IMO, the fuse dataset should expect to wrap a PyTorch dataset and expose some add functionality above it (naming the dataset outputs, possibly caching), rather trying to replace it.
Beta Was this translation helpful? Give feedback.
All reactions