Orca test data
To enable meaningful comparison of the performance of orca-specific models, this page presents open test sets that are specific to killer whale signals. The highest priority of the AI for Orcas project is classification of signals from the endangered Southern Resident Killer Whales (SRKWs), so test sets for their signals are listed first, starting with calls (with placeholders for whistles and clicks). Orcasound is also interested in classifiers for other common signals in the Salish Sea, so test sets for other species and sources, like Bigg's killer whales, are listed subsequently.
- Prospects for additional Orcasound test data:
- Pod.Cast candidates for additional rounds of annotation
- Current listener log
* High signal:noise ratio
* 27 Sep 2017 -- 1/2? hour of data from the Orcasound Lab node
* [2017 listener log](https://docs.google.com/spreadsheets/d/1ssBfsHqtCVk_K-0_N5k5mZnpzHAFzNSq-LKfQOJtkyE/edit#gid=2)
* Labeled first in Pod.Cast by Scott, Akash, and Prakruti
* Labels verified by Scott in Audacity
* Labels
* Audio data
* Metadata
* Intermediate signal:noise ratio
* 05 Jul 2019 -- 1/2 hour of data from the Orcasound Lab node labeled in Audacity by Scott
* Labels: [only calls](https://acoustic-sandbox.s3-us-west-2.amazonaws.com/labeled-data/classification/killer-whales/southern-residents/20190705/orcasound-lab/test-only/OS_7_05_2019_08_24_00_labels-SV_200210_only_calls.txt), [other signals](https://acoustic-sandbox.s3-us-west-2.amazonaws.com/labeled-data/classification/killer-whales/southern-residents/20190705/orcasound-lab/test-only/OS_7_05_2019_08_24_00_labels-SV_200210_other_signals.txt) -- row N holds the start/end times and a label ("call," a specific stereotyped call ID, or "?" to indicate a probable but not 100% certain call); row N+1 starts with \ and then contains the lower and upper frequency bounds.
`aws --no-sign-request s3 cp s3://acoustic-sandbox/labeled-data/classification/killer-whales/southern-residents/20190705/orcasound-lab/test-only/OS_7_05_2019_08_24_00_labels-SV_200210_only_calls.txt .`
`aws --no-sign-request s3 cp s3://acoustic-sandbox/labeled-data/classification/killer-whales/southern-residents/20190705/orcasound-lab/test-only/OS_7_05_2019_08_24_00_labels-SV_200210_other_signals.txt .`
* [Audio data](https://s3.console.aws.amazon.com/s3/object/acoustic-sandbox/labeled-data/classification/killer-whales/southern-residents/20190705/orcasound-lab/test-only/OS_7_05_2019_08_24_00_.wav) -- in WAV format
`aws --no-sign-request s3 cp s3://acoustic-sandbox/labeled-data/classification/killer-whales/southern-residents/20190705/orcasound-lab/test-only/OS_7_05_2019_08_24_00_.wav .`
* Metadata
* Low signal:noise ratio
* 14 Nov 2019 -- 2.5 hours of data from the Port Townsend node labeled in Audacity by Scott
* Labels
* Audio data
* Metadata
- Direct link to open training and test sets (coming soon?)
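
The label layout described for the 05 Jul 2019 test set (start/end times plus a label in row N, then a row starting with \ that holds the lower and upper frequency bounds) can be read with a short parser. The following is a minimal Python sketch, not project code: the `Annotation` class and `parse_labels` function are hypothetical names, fields are assumed to be tab-separated (as in Audacity label exports), times are assumed to be in seconds and frequencies in Hz, and the sample rows are made-up values for illustration only, not taken from the actual label files.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Annotation:
    """One labeled signal: time bounds, label text, optional frequency bounds."""
    start_s: float
    end_s: float
    label: str
    low_hz: Optional[float] = None
    high_hz: Optional[float] = None


def parse_labels(text: str) -> List[Annotation]:
    """Parse label text where row N is 'start<TAB>end<TAB>label' and
    row N+1 is '\\<TAB>low<TAB>high' (assumed tab-separated)."""
    annotations: List[Annotation] = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        fields = line.split("\t")
        if fields[0].startswith("\\"):
            # Frequency row: attach the bounds to the preceding annotation.
            if annotations and len(fields) >= 3:
                annotations[-1].low_hz = float(fields[1])
                annotations[-1].high_hz = float(fields[2])
            continue
        # Time row: start, end, and an optional label.
        label = fields[2] if len(fields) > 2 else ""
        annotations.append(Annotation(float(fields[0]), float(fields[1]), label))
    return annotations


# Made-up sample rows in the assumed layout (not real annotations).
sample = (
    "12.5\t13.75\tcall\n"
    "\\\t500.0\t8000.0\n"
    "20.0\t21.0\t?\n"
    "\\\t1000.0\t6000.0\n"
)

for ann in parse_labels(sample):
    print(ann)
```

To run this against a downloaded label file, read the file's contents into a string and pass it to `parse_labels`. Rows labeled "?" can then be filtered out or kept, depending on whether uncertain calls should count as positives when scoring a classifier.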