Merge pull request #22 from noath/asr_example
ASR dataset collecting pipeline (example)
Losik authored Jun 28, 2021
2 parents 0dfa7d6 + fc28e5f commit c219f9d
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions examples/README.md
@@ -15,7 +15,9 @@ Why it may be useful:
| [Question answering on SQuAD](https://github.com/Toloka/toloka-kit/tree/main/examples/SQUAD2.0) <br/><br/> [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/Toloka/toloka-kit/blob/main/examples/SQUAD2.0/SQUAD2.0_processing.ipynb) | Solving the problem of question answering on the SQuAD2.0 dataset. Answers to questions are collected from and validated by human performers. One of the most popular tasks in natural language processing. | ```NLP```, ```Question Answering```, ```Texts```, ```Benchmark```, ```Verification Project```|
| [Image gathering](https://github.com/Toloka/toloka-kit/tree/main/examples/image_gathering) <br/><br/> [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/Toloka/toloka-kit/blob/main/examples/image_gathering/image_gathering.ipynb) | The goal for this project is to collect a dataset of dogs' and cats' images. Performers will be asked to take a photo of their pet and specify its species. |```CV```, ```Classification```, ```Collecting```, ```Dataset```|
| [Simplest Spatial Crowdsourcing](https://github.com/Toloka/toloka-kit/tree/main/examples/2.spatial_crowdsourcing/0.simplest_example) <br/><br/> [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/Toloka/toloka-kit/blob/main/examples/2.spatial_crowdsourcing/0.simplest_example/spatial_crowdsourcing.ipynb) | In this example, we will collect pictures of the Moscow metro entrances. This example also can be reused for production tasks such as monitoring the state of objects, checking the presence of an organization or other physical object. |```Spatial Crowdsourcing```, ```Outdoor monitoring```, ```Collecting```|
| [ASR/TTS based on Wikipedia articles](https://github.com/noath/asr-datasets-pipeline) <br/><br/> [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/noath/asr-datasets-pipeline/blob/main/ASR_pipeline.ipynb) | This example contains a full speech data collection pipeline, from extracting raw texts to labeling and validating speech recordings. | ```ASR```, ```TTS```, ```Texts```, ```Verification Project```, ```Audio samples collection```|
| Blood cells classification | Available soon | ```CV```, ```Classification```, ```Medicine```, ```Benchmark```|

# Need more examples?
If you have an example of data labeling using toloka-kit, do not hesitate to send us a [pull request](https://github.com/Toloka/toloka-kit/pulls). Ideally, a great example should cover the following aspects:
- Problem statement;