Fashion12K German Queries dataset

To create Fashion12K German Queries dataset we sampled 12k images from Fashion200K dataset and annotated them with German and English queries using Toloka.

Each row in the dataset consists of three entries:

image url (link to s3 bucket where the original image is hosted),
English query,
German query.

Downloading dataset

Dataset can be downloaded:

directly via tsv file,
or by using docArray from Jina AI (our collaborator on the project) via this python script.

Acknowledgements

Fashion200K dataset dataset is used under the Apache License 2.0.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
LICENSE		LICENSE
README.md		README.md
dataset_JinaAI_docarray_script.py		dataset_JinaAI_docarray_script.py
queries_full_dataset.tsv		queries_full_dataset.tsv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fashion12K German Queries dataset

Downloading dataset

Acknowledgements

About

Releases

Packages

Contributors 2

Languages

License

Toloka/Fashion12K_german_queries

Folders and files

Latest commit

History

Repository files navigation

Fashion12K German Queries dataset

Downloading dataset

Acknowledgements

About

Resources

License

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages