Releases: openvinotoolkit/datumaro
Release v1.3.1
What's Changed
- Update version information to 1.3.0 by @wonjuleee in #1028
- Fix cityscapes detection error by @cih9088 in #1029
Full Changelog: v1.3.0...v1.3.1
v1.3.0
What's Changed
- Merge back 1.2.0rc2 by @vinnamkim in #942
- Merge back 1.2.0rc3 by @vinnamkim in #950
- Merge back 1.2.0 by @vinnamkim in #970
- Fix documentation links by @JihwanEom in #972
- fix typo - level 10 data generation by @bonhunko in #975
- Warn coco format users by @cih9088 in #974
- Add CocoRoboflowImporter by @vinnamkim in #976
- Fix having empty string for `rootpath` in coco format by @cih9088 in #979
- Fix Mapillary Vistas data format by @wonjuleee in #977
- Fix bytes property bug by @cih9088 in #978
- Use autosummary for fully-automatic Python module docs generation by @vinnamkim in #973
- Remedying apache-beam dependency conflicts by @yunchu in #983
- Fix project level CVAT for images format import by @vinnamkim in #980
- Fix broken directory structure of Imagenet format by @cih9088 in #985
- Fix an info message when using the convert CLI command with no args.input_format by @vinnamkim in #982
- Fix media contents error in arrow format by @cih9088 in #986
- Move tfds-dev dependency to tests requirements by @yunchu in #988
- Merge back release 1.2.1 by @vinnamkim in #991
- Fix Synthia data format and Add Synthia AL and SF formats by @wonjuleee in #987
- Enrich stack trace while importing dataset to improve user experience by @cih9088 in #992
- Save and load hashkey for explorer by @sooahleex in #981
- Grant write permission to auto-issue-assigner by @vinnamkim in #995
- Fix typo & broken links in notebooks by @JihwanEom in #994
- Add intermediate skill docs for filter by @sooahleex in #996
- Fix VOC to have task-specific formats by @wonjuleee in #997
- Add COCO Roboflow section by @vinnamkim in #1000
- Add MOT and MOTS data format docs by @vinnamkim in #999
- update code-scan workflow by @yunchu in #998
- Drop a malformed transform from StackedTransform automatically by @vinnamkim in #1001
- Improve RemoveAnnotations to cope with annotation ids by @wonjuleee in #1004
- Add Segment Anything data format by @cih9088 in #1005
- Add Corrector transformation by @wonjuleee in #1006
- Version up to 1.3.0rc1 by @vinnamkim in #1007
- Add segment anything doc by @cih9088 in #1009
- Implement ReindexAnnotations transform by @vinnamkim in #1008
- Fixed typo in example command by @gabrielthomsen in #1010
- Refactor: change Exporter::apply() to Exporter::_apply_impl() by @vinnamkim in #1013
- Refactor supported data format document by @wonjuleee in #1014
- Add Jupyter notebook example of noisy label detection for detection tasks by @vinnamkim in #1011
- Refactor hands-on-example document by @wonjuleee in #1016
- Refactor explanation document by @wonjuleee in #1017
- Update links and outdated contents in contributing.md by @sooahleex in #1018
- Add notebooks for import/export detection data by @wonjuleee in #1020
- Fix missing word & typos in explanation docs by @JihwanEom in #1021
- Support for format-agnostic storage and loading of hashkey by @sooahleex in #1003
- Trivy csv export by @yunchu in #1019
- Updated 3rd party program statement by @yunchu in #1022
- Constraint cityscapes dataset structure by @cih9088 in #1024
- Update 02_merge_detection_data notebook with real datasets by @wonjuleee in #1025
- Add notebook for importing/exporting segmentation data by @wonjuleee in #1023
- Update release notes by @wonjuleee in #1027
New Contributors
- @gabrielthomsen made their first contribution in #1010
Full Changelog: v1.2.0...v1.3.0
Release v1.3.0rc1
What's Changed
- Merge back 1.2.0rc2 by @vinnamkim in #942
- Merge back 1.2.0rc3 by @vinnamkim in #950
- Merge back 1.2.0 by @vinnamkim in #970
- Fix documentation links by @JihwanEom in #972
- fix typo - level 10 data generation by @bonhunko in #975
- Warn coco format users by @cih9088 in #974
- Add CocoRoboflowImporter by @vinnamkim in #976
- Fix having empty string for `rootpath` in coco format by @cih9088 in #979
- Fix Mapillary Vistas data format by @wonjuleee in #977
- Fix bytes property bug by @cih9088 in #978
- Use autosummary for fully-automatic Python module docs generation by @vinnamkim in #973
- Remedying apache-beam dependency conflicts by @yunchu in #983
- Fix broken directory structure of Imagenet format by @cih9088 in #985
- Move tfds-dev dependency to tests requirements by @yunchu in #988
- Merge back release 1.2.1 by @vinnamkim in #991
- Fix Synthia data format and Add Synthia AL and SF formats by @wonjuleee in #987
- Enrich stack trace while importing dataset to improve user experience by @cih9088 in #992
- Save and load hashkey for explorer by @sooahleex in #981
- Grant write permission to auto-issue-assigner by @vinnamkim in #995
- Fix typo & broken links in notebooks by @JihwanEom in #994
- Add intermediate skill docs for filter by @sooahleex in #996
- Fix VOC to have task-specific formats by @wonjuleee in #997
- Add COCO Roboflow section by @vinnamkim in #1000
- Add MOT and MOTS data format docs by @vinnamkim in #999
- update code-scan workflow by @yunchu in #998
- Drop a malformed transform from StackedTransform automatically by @vinnamkim in #1001
- Improve RemoveAnnotations to cope with annotation ids by @wonjuleee in #1004
- Add Segment Anything data format by @cih9088 in #1005
- Add Corrector transformation by @wonjuleee in #1006
- Version up to 1.3.0rc1 by @vinnamkim in #1007
Full Changelog: v1.2.1...v1.3.0rc1
Release v1.2.1
What's Changed
- Fix project level CVAT for images format import by @vinnamkim in #980
- Fix an info message when using the convert CLI command with no args.input_format by @vinnamkim in #982
- Fix media contents error in arrow format by @cih9088 in #986
Full Changelog: v1.2.0...v1.2.1
Release v1.2.0
What's Changed - Brief Version
New features
- Add Skill Up section to documentation (#920, #933, #935, #945, #949, #953, #959, #960, #967)
- Add LossDynamicsAnalyzer for noisy label detection (#928)
- Add Apache Arrow format support (#931, #948)
- Add sort transform (#931)
Enhancements
- Add multiprocessing to DatumaroBinaryBase (#897)
- Refactor merge code (#901, #906)
- Refactor download CLI commands (#909)
- Refactor CLI commands w/ and w/o project (#910, #952)
- Refactor Media to be initialized from explicit sources (#911, #921, #944)
- Refactor hl_ops.py (#912)
- Add tfds:uc_merced and tfds:eurosat download (#914)
- Migrate documentation framework to Sphinx (#917, #922, #947, #954, #958, #961, #962, #963, #964, #965, #969)
- Update merge tutorial for real life usecase (#930)
- Abbreviate "detect-format" to "detect" for prettifying (#951)
Bug fixes
- Add UserWarning if an invalid media_type comes to image statistics computation (#891)
- Fix negated `is_encrypted` (#907)
- Save extra images of PointCloud when exporting to datumaro format (#918)
- Fix log issue when importing celeba and align celeba dataset (#919)
What's Changed - Full Version
- Update PR template by @chuneuny-emily in #866
- Add UserWarning if an invalid media_type comes to image statistics computation by @vinnamkim in #891
- Add pre-commit by @vinnamkim in #892
- Skip nbqa-ruff installation with python3.7 by @JihwanEom in #894
- Add multiprocessing to DatumaroBinaryBase by @vinnamkim in #897
- Add notification to Teams channel for cron jobs on failure by @vinnamkim in #903
- Rename version variable by @cih9088 in #900
- Refactor operations by @vinnamkim in #901
- Move merge policies to datumaro/components/merge directory by @vinnamkim in #906
- Add CODEOWNERS and auto issue assignment by @vinnamkim in #902
- Fix negated property by @cih9088 in #907
- Refactor download CLI command by @vinnamkim in #909
- Refactor PointCloud by @cih9088 in #911
- Fix unit test errors from tensorflow-datasets==4.9.0 by @vinnamkim in #915
- [Doc] sphinx documentation with content reformatting by @wonjuleee in #917
- Refactor CLI commands w/ and w/o project by @vinnamkim in #910
- Add tfds:uc_merced and tfds:eurosat download by @vinnamkim in #914
- Fix log issue when importing celeba and align celeba dataset by @sooahleex in #919
- Refactor hl_ops.py by @vinnamkim in #912
- Fix not saving extra images of PointCloud in datumaro format by @cih9088 in #918
- Infra/fix health check by @vinnamkim in #923
- Revised bandit exclusion comments by @yunchu in #904
- Fix for #903 by @vinnamkim in #905
- Fixed code-scan workflow for the bandit by @yunchu in #925
- [Doc] add tutorials for level 3 and 4 by @wonjuleee in #920
- Added toxenv for snyk scanning by @yunchu in #926
- Add off the shelf progress reporters by @cih9088 in #927
- Refactor Images by @cih9088 in #921
- Fix docs build warning from myst by @yunchu in #922
- Add LossDynamicsAnalyzer for noisy label detection by @vinnamkim in #928
- Add Apache Arrow format support by @cih9088 in #931
- Fix regex for release candidates by @vinnamkim in #938
- Merge back 1.2.0rc1 by @vinnamkim in #937
- Fix arrow format patch error on python 3.7 by @cih9088 in #939
- Refactor merge code by @vinnamkim in #929
- Add unit test for ArrowDataset by @cih9088 in #934
- Update merge tutorial for real life usecase by @vinnamkim in #930
- Fix validator and add notebooks and document for level-up validator by @wonjuleee in #933
- Refactor merge cli command by @vinnamkim in #932
- Updates docs for release by @yunchu in #936
- Update basic skills - 02_dataset_download by @vinnamkim in #935
- Add init to `arrow/mapper` by @cih9088 in #943
- Refactor Image/PointCloud 2 by @cih9088 in #944
- Add basic skills - 01_project_manipulation by @vinnamkim in #945
- Revised docs deployments by @yunchu in #947
- Abbreviate "detect-format" to "detect" for prettifying by @vinnamkim in #951
- Add Arrow format doc by @cih9088 in #948
- Add basic skills - Level 10: Data Generation by @bonhunko in #953
- Add basic skills - Level 11: Project Versioning by @vinnamkim in #949
- Move project free commands to outside again by @vinnamkim in #952
- Revisit 01_project_manipulation by @vinnamkim in #956
- Align docs with the recent CLI changes by @vinnamkim in #954
- Revisit Jupyter notebook examples by @vinnamkim in #955
- Update 3rd party txt by @vinnamkim in #957
- Fix doc building warnings by @vinnamkim in #958
- Fix docs in `data_formats` section by @cih9088 in #961
- Add document for data subset aggregation and dataset merge by @wonjuleee in #960
- Fix Misc section's weird nesting structure and broken links by @vinnamkim in #962
- Fix quick start guide by @vinnamkim in #963
- Add Intermediate skills - Level 9: Dataset Exploration from a Query Image/Text by @sooahleex in #959
- Find and replace for the recent CLI changes by @vinnamkim in https://github.com/openvin...
Release v1.2.0rc3
What's Changed
- Fix regex for release candidates by @vinnamkim in #938
- Add init to `arrow/mapper` by @cih9088 in #943
- Refactor Image/PointCloud 2 by @cih9088 in #944
- Add basic skills - 01_project_manipulation by @vinnamkim in #945
- Revised docs deployments by @yunchu in #947
Full Changelog: v1.2.0rc2...v1.2.0rc3
Release v1.2.0rc2
What's Changed
- Fix arrow format patch error on python 3.7 by @cih9088 in #939
- Refactor merge code by @vinnamkim in #929
- Add unit test for ArrowDataset by @cih9088 in #934
- Update merge tutorial for real life usecase by @vinnamkim in #930
- Fix validator and add notebooks and document for level-up validator by @wonjuleee in #933
- Refactor merge cli command by @vinnamkim in #932
- Updates docs for release by @yunchu in #936
- Update basic skills - 02_dataset_download by @vinnamkim in #935
Full Changelog: v1.2.0rc1...v1.2.0rc2
Release v1.2.0rc1
What's Changed
- Update PR template by @chuneuny-emily in #866
- Add UserWarning if an invalid media_type comes to image statistics computation by @vinnamkim in #891
- Add pre-commit by @vinnamkim in #892
- Skip nbqa-ruff installation with python3.7 by @JihwanEom in #894
- Add multiprocessing to DatumaroBinaryBase by @vinnamkim in #897
- Add notification to Teams channel for cron jobs on failure by @vinnamkim in #903
- Rename version variable by @cih9088 in #900
- Refactor operations by @vinnamkim in #901
- Move merge policies to datumaro/components/merge directory by @vinnamkim in #906
- Add CODEOWNERS and auto issue assignment by @vinnamkim in #902
- Fix negated property by @cih9088 in #907
- Refactor download CLI command by @vinnamkim in #909
- Refactor PointCloud by @cih9088 in #911
- Fix unit test errors from tensorflow-datasets==4.9.0 by @vinnamkim in #915
- [Doc] sphinx documentation with content reformatting by @wonjuleee in #917
- Refactor CLI commands w/ and w/o project by @vinnamkim in #910
- Add tfds:uc_merced and tfds:eurosat download by @vinnamkim in #914
- Fix log issue when importing celeba and align celeba dataset by @sooahleex in #919
- Refactor hl_ops.py by @vinnamkim in #912
- Fix not saving extra images of PointCloud in datumaro format by @cih9088 in #918
- Infra/fix health check by @vinnamkim in #923
- Revised bandit exclusion comments by @yunchu in #904
- Fix for #903 by @vinnamkim in #905
- Fixed code-scan workflow for the bandit by @yunchu in #925
- [Doc] add tutorials for level 3 and 4 by @wonjuleee in #920
- Added toxenv for snyk scanning by @yunchu in #926
- Add off the shelf progress reporters by @cih9088 in #927
- Refactor Images by @cih9088 in #921
- Fix docs build warning from myst by @yunchu in #922
- Add LossDynamicsAnalyzer for noisy label detection by @vinnamkim in #928
- Add Apache Arrow format support by @cih9088 in #931
New Contributors
- @JihwanEom made their first contribution in #894
Full Changelog: v1.1.1...v1.2.0rc1
Release v1.1.1
What's Changed
- Fix to not export absolute media path in Datumaro and DatumaroBinary formats by @vinnamkim in #896
- Change pypi_publish.yml to publish_sdist_to_pypi.yml by @vinnamkim in #895
Full Changelog: v1.1.0...v1.1.1
Release v1.1.0
What's Changed - Brief Version
New features
- Add with_subset_dirs decorator (Add ImagenetWithSubsetDirsImporter) (#816)
- Add CommonSemanticSegmentationWithSubsetDirsImporter (#826)
- Add DatumaroBinary format (#828, #829, #830, #831, #880, #883)
- Add Searcher CLI documentation (#838)
- Add version to dataset exported as datumaro format (#842)
- Add Ava action data format support (#847)
- Add Shift Analyzer (both covariate and label shifts) (#855)
- Add YOLO Loose format (#856)
- Add Ultralytics YOLO format (#859)
Enhancements
- Refactor Datumaro format code and test code (#824)
- Add publish to PyPI Github action (#867)
- Add --no-media-encryption option (#875)
Bug fixes
- Fix image filenames and anomaly mask appearance in MVTec exporter (#835)
- Fix CIFAR10 and 100 detect function (#836)
- Fix celeba and align_celeba detect function (#837)
- Choose the top priority detect format for all directory depths (#839)
- Fix MVTec format detect function (#843)
- Fix wrong `__len__()` of Subset when the item is removed (#854)
- Fix mask visualization bug (#860)
- Fix detect unit tests to test false negatives as well (#868)
What's Changed - Full Version
- Add daily/weekly test triggers by @chuneuny-emily in #811
- Raise ImportError on importing malformed COCO directory by @vinnamkim in #812
- Upload data explorer model in public storage by @sooahleex in #813
- Merge back releases/v1.0.0 to develop for taping out v1.0.0.rc by @vinnamkim in #818
- Add with_subset_dirs decorator by @vinnamkim in #816
- Skip some video unit tests on MacOS by @vinnamkim in #825
- Update copyright year in PR template by @vinnamkim in #823
- Refactor Datumaro format code and test code by @vinnamkim in #824
- Add CommonSemanticSegmentationWithSubsetDirsImporter by @vinnamkim in #826
- Develop DatumaroBinaryFormat to export/import the dataset header & DatasetItem by @vinnamkim in #828
- Update weekly_check.yml by @yunchu in #833
- Remove Codacy badge in readme by @chuneuny-emily in #834
- Implement DatumaroBinaryFormat to export/import the image dataset completely by @vinnamkim in #829
- Fix bugs in mvtec exporter by @djdameln in #835
- [Doc] Add documentation for searcher cli by @sooahleex in #838
- Support PointCloud dataset by DatumaroBinary format by @vinnamkim in #830
- [TEST] move test_utils.py to tests package by @yunchu in #841
- Fix CIFAR10 and 100 detect function by @vinnamkim in #836
- Merge back/releases/v1.0.0 by @vinnamkim in #846
- [HOTFIX] Update ipas_default.config by @yunchu in #848
- [HOTFIX] Update ipas_default.config by @yunchu in #849
- [develop] Update .gitattributes by @yunchu in #851
- Choose the top priority detect format for all directory depths by @vinnamkim in #839
- mark datumaro library version when exporting as datumaro by @bonhunko in #842
- Add AVA action data format support by @wonjuleee in #847
- Fix MVTec format detect function by @vinnamkim in #843
- Fix wrong `__len__()` of Subset when the item is removed by @vinnamkim in #854
- Fix celeba and align_celeba detect function by @vinnamkim in #837
- Fix mask visualization bug by @vinnamkim in #860
- Add YOLO Loose format by @vinnamkim in #856
- Add ShiftAnalyzer to compute covariate and label shift between two datasets by @wonjuleee in #855
- Add Ultralytics YOLO format by @vinnamkim in #859
- Add full encryption/decryption functionalities for image datasets to DatumaroBinary format by @vinnamkim in #831
- [Doc] AVA action and MVTec data formats by @wonjuleee in #865
- Update PR template by @chuneuny-emily in #869
- Fix detect unit tests to test false negatives as well by @vinnamkim in #868
- Update doc for regex rules in transform by @sooahleex in #874
- Add publish to PyPI Github action by @vinnamkim in #867
- Add Video formats (video_frames and video_keyframes) by @wonjuleee in #872
- Add prefix to secret key by @vinnamkim in #878
- Add missing requirements by @cih9088 in #877
- Add --no-media-encryption option by @vinnamkim in #875
- Update 3rd-party.txt by @chuneuny-emily in #884
- Add datumaro binary format documentation by @vinnamkim in #880
- Add multiprocessing to DatumaroBinaryExporter by @vinnamkim in #883
- Version up to Releases/1.1.0 by @vinnamkim in #886
Full Changelog: v1.0.0...v1.1.0