v3.2.0
·
3921 commits
to master
since this release
Future breaking change:
- The
tfds.features.text
encoding API is deprecated. Please use tensorflow_text instead.
New features
API:
- Add a
tfds.ImageFolder
andtfds.TranslateFolder
to easily create custom datasets with your custom data. - Add a
tfds.ReadConfig(input_context=)
to shard dataset, for better multi-worker compatibility (#1426). - The default
data_dir
can be controlled by theTFDS_DATA_DIR
environment variable. - Better usability when developing datasets outside TFDS
- Downloads are always cached
- Checksum are optional
- Added a
tfds.show_statistics(ds_info)
to display FACETS OVERVIEW. Note: This require the dataset to have been generated with the statistics. - Open source various scripts to help deployment/documentation (Generate catalog documentation, export all metadata files,...)
Documentation:
- Catalog display images (example)
- Catalog shows which dataset have been recently added and are only available in
tfds-nightly
nights_stay
Breaking compatibility change:
- Fix deterministic example order on Windows when path was used as key (this only impact a few datasets). Now example order should be the same on all platforms.
- Remove
tfds.load('image_label_folder')
in favor of the more user-friendlytfds.ImageFolder
Other:
- Various performances improvements for both generation and reading (e.g. use
__slot__
, fix parallelisation bug intf.data.TFRecordReader
,...) - Various fixes (typo, types annotations, better error messages, fixing dead links, better windows compatibility,...)
Thanks to all our contributors who help improving the state of dataset for the entire research community!