v4.4.0
API:
- Add
PartialDecodingsupport, to decode only a subset of the features (for performances) - Catalog now expose links to KnowYourData visualisations
tfds.as_numpysupports datasets withNone- Dataset generated with
disable_shuffling=Trueare now read in generation order. - Loading datasets from files now supports custom
tfds.features.FeatureConnector tfds.testing.mock_datanow supports- non-scalar tensors with dtype
tf.string builder_from_filesand path-based community datasets
- non-scalar tensors with dtype
- File format automatically restored (for datasets generated with
tfds.builder(..., file_format=)). - Many new reinforcement learning datasets
- Various bug fixes and internal improvements like:
- Dynamically set number of worker thread during extraction
- Update progression bar during download even if downloads are cached
Dataset creation:
- Add
tfds.features.LabeledImagefor semantic segmentation (like image but with additionalinfo.features['image_label'].namelabel metadata) - Add float32 support for
tfds.features.Image(e.g. for depth map) - All FeatureConnector can now have a
Nonedimension anywhere (previously restricted to the first position). tfds.features.Tensor()can have arbitrary number of dynamic dimension (Tensor(..., shape=(None, None, 3, None)))tfds.features.Tensorcan now be serialised as bytes, instead of float/int values (to allow better compression):Tensor(..., encoding='zlib')- Add script to add TFDS metadata files to existing TF-record (see doc).
- New guide on common implementation gotchas
Thank you all for your support and contribution!