Release v1.2.0
What's Changed - Brief Version
New features
- Add Skill Up section to documentation (#920, #933, #935, #945, #949, #953, #959, #960, #967)
- Add LossDynamicsAnalyzer for noisy label detection (#928)
- Add Apache Arrow format support (#931, #948)
- Add sort transform (#931)
Enhancements
- Add multiprocessing to DatumaroBinaryBase (#897)
- Refactor merge code (#901, #906)
- Refactor download CLI commands (#909)
- Refactor CLI commands w/ and w/o project (#910, #952)
- Refactor Media to be initialized from explicit sources (#911, #921, #944)
- Refactor hl_ops.py (#912)
- Add tfds:uc_merced and tfds:eurosat download (#914)
- Migrate documentation framework to Sphinx (#917, #922, #947, #954, #958, #961, #962, #963, #964, #965, #969)
- Update merge tutorial for real life usecase (#930)
- Abbreviate "detect-format" to "detect" for prettifying (#951)
Bug fixes
- Add UserWarning if an invalid media_type comes to image statistics computation (#891)
- Fix negated is_encrypted (#907)
- Save extra images of PointCloud when exporting to datumaro format (#918)
- Fix log issue when importing celeba and align celeba dataset (#919)
What's Changed - Full Version
- Update PR template by @chuneuny-emily in #866
- Add UserWarning if an invalid media_type comes to image statistics computation by @vinnamkim in #891
- Add pre-commit by @vinnamkim in #892
- Skip nbqa-ruff installation with python3.7 by @JihwanEom in #894
- Add multiprocessing to DatumaroBinaryBase by @vinnamkim in #897
- Add notification to Teams channel for cron jobs on failure by @vinnamkim in #903
- Rename version variable by @cih9088 in #900
- Refactor operations by @vinnamkim in #901
- Move merge policies to datumaro/components/merge directory by @vinnamkim in #906
- Add CODEOWNERS and auto issue assignment by @vinnamkim in #902
- Fix negated property by @cih9088 in #907
- Refactor download CLI command by @vinnamkim in #909
- Refactor PointCloud by @cih9088 in #911
- Fix unit test errors from tensorflow-datasets==4.9.0 by @vinnamkim in #915
- [Doc] sphinx documentation with content reformatting by @wonjuleee in #917
- Refactor CLI commands w/ and w/o project by @vinnamkim in #910
- Add tfds:uc_merced and tfds:eurosat download by @vinnamkim in #914
- Fix log issue when importing celeba and align celeba dataset by @sooahleex in #919
- Refactor hl_ops.py by @vinnamkim in #912
- Fix not saving extra images of PointCloud in datumaro format by @cih9088 in #918
- Infra/fix health check by @vinnamkim in #923
- Revised bandit exclusion comments by @yunchu in #904
- Fix for #903 by @vinnamkim in #905
- Fixed code-scan workflow for the bandit by @yunchu in #925
- [Doc] add tutorials for level 3 and 4 by @wonjuleee in #920
- Added toxenv for snyk scanning by @yunchu in #926
- Add off the shelf progress reporters by @cih9088 in #927
- Refactor Images by @cih9088 in #921
- Fix docs build warning from myst by @yunchu in #922
- Add LossDynamicsAnalyzer for noisy label detection by @vinnamkim in #928
- Add Apache Arrow format support by @cih9088 in #931
- Fix regex for release candidates by @vinnamkim in #938
- Merge back 1.2.0rc1 by @vinnamkim in #937
- Fix arrow format patch error on python 3.7 by @cih9088 in #939
- Refactor merge code by @vinnamkim in #929
- Add unit test for ArrowDataset by @cih9088 in #934
- Update merge tutorial for real life usecase by @vinnamkim in #930
- Fix validator and add notebooks and document for level-up validator by @wonjuleee in #933
- Refactor merge cli command by @vinnamkim in #932
- Updates docs for release by @yunchu in #936
- Update basic skills - 02_dataset_download by @vinnamkim in #935
- Add init to arrow/mapper by @cih9088 in #943
- Refactor Image/PointCloud 2 by @cih9088 in #944
- Add basic skills - 01_project_manipulation by @vinnamkim in #945
- Revised docs deployments by @yunchu in #947
- Abbreviate "detect-format" to "detect" for prettifying by @vinnamkim in #951
- Add Arrow format doc by @cih9088 in #948
- Add basic skills - Level 10: Data Generation by @bonhunko in #953
- Add basic skills - Level 11: Project Versioning by @vinnamkim in #949
- Move project free commands to outside again by @vinnamkim in #952
- Revisit 01_project_manipulation by @vinnamkim in #956
- Align docs with the recent CLI changes by @vinnamkim in #954
- Revisit Jupyter notebook examples by @vinnamkim in #955
- Update 3rd party txt by @vinnamkim in #957
- Fix doc building warnings by @vinnamkim in #958
- Fix docs in data_formats section by @cih9088 in #961
- Add document for data subset aggregation and dataset merge by @wonjuleee in #960
- Fix Misc section's weird nesting structure and broken links by @vinnamkim in #962
- Fix quick start guide by @vinnamkim in #963
- Add Intermediate skills - Level 9: Dataset Exploration from a Query Image/Text by @sooahleex in #959
- Find and replace for the recent CLI changes by @vinnamkim in #964
- Fix links in Explanation-Formats by @vinnamkim in #965
- Update Level Up docs by @JihwanEom in #967
- Fix docs context-free commands subsection by @cih9088 in #969
- Update Datumaro version to 1.2.0 by @vinnamkim in #968
New Contributors
- @JihwanEom made their first contribution in #894
Full Changelog: v1.1.1...1.2.0