Showing 101 changed files with 6,834 additions and 4,443 deletions.
@@ -7,7 +7,7 @@ branches:
 notifications:
   email: false
 
-dist: xenial
+dist: bionic
 
 addons:
   apt:
@@ -0,0 +1,69 @@
.. _classify_speakers:

**********************
Speaker classification
**********************

The Montreal Forced Aligner can use trained ivector models (see :ref:`train_ivector` for more information about training
these models) to classify or cluster utterances according to speakers.

Steps to classify speakers:

1. Make sure the steps in :ref:`installation` have been completed and that you are in the same Conda/virtual environment
   that MFA was installed in.
2. Run the following command, substituting the arguments with your own paths:

   .. code-block:: bash

      mfa classify_speakers corpus_directory ivector_extractor_path output_directory

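
The command in step 2 can also be launched from a script. The following is a minimal sketch of one way to assemble the
argument list in Python. The helper name ``build_classify_command`` and the placeholder paths are illustrative and not
part of MFA; the only flags used are ones documented on this page.

```python
import shlex
import subprocess


def build_classify_command(corpus_directory, ivector_extractor_path, output_directory,
                           num_speakers=None, cluster=False, num_jobs=None):
    """Assemble the argument list for ``mfa classify_speakers`` (hypothetical helper)."""
    command = [
        "mfa", "classify_speakers",
        str(corpus_directory), str(ivector_extractor_path), str(output_directory),
    ]
    # Optional flags are appended only when the caller asks for them.
    if num_speakers is not None:
        command += ["--num_speakers", str(num_speakers)]
    if cluster:
        command.append("--cluster")
    if num_jobs is not None:
        command += ["--num_jobs", str(num_jobs)]
    return command


if __name__ == "__main__":
    # Placeholder paths; substitute your own.
    command = build_classify_command(
        "corpus_directory", "ivector_extractor_path", "output_directory",
        num_speakers=4, cluster=True,
    )
    print(shlex.join(command))
    # Uncomment to run MFA itself (requires the environment MFA was installed in):
    # subprocess.run(command, check=True)
```

Passing the arguments as a list rather than a single string avoids shell quoting problems when paths contain spaces.
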
If the input uses TextGrids, the output TextGrids will have utterances sorted into tiers, one tier per identified
speaker. At the moment, there is no way to retrain the classifier based on new data.

If the files in the input corpus directory do not have TextGrids associated with them, then the speaker classifier will
output speaker directories, each with a text file that contains all the utterances that were classified for that speaker.

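
When the output consists of speaker directories rather than TextGrids, the result can be inspected with a short script.
The sketch below assumes that each speaker directory holds plain-text files with a ``.txt`` extension and one utterance
per line; neither detail is stated on this page, so adjust the pattern to what MFA actually writes. The function name
``utterances_by_speaker`` is hypothetical.

```python
from pathlib import Path


def utterances_by_speaker(output_directory):
    """Map each speaker directory name to the non-empty lines of its text files."""
    result = {}
    for speaker_dir in sorted(Path(output_directory).iterdir()):
        if not speaker_dir.is_dir():
            continue  # skip stray files at the top level
        lines = []
        # Assumption: utterance lists are .txt files with one utterance per line.
        for text_file in sorted(speaker_dir.glob("*.txt")):
            content = text_file.read_text(encoding="utf8")
            lines.extend(line.strip() for line in content.splitlines() if line.strip())
        result[speaker_dir.name] = lines
    return result
```
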
Options available:

.. option:: -h
            --help

   Display help message for the command

.. option:: -t DIRECTORY
            --temp_directory DIRECTORY

   Temporary directory root to use for aligning, default is ``~/Documents/MFA``

.. option:: -j NUMBER
            --num_jobs NUMBER

   Number of jobs to use; defaults to 3, set higher if you have more
   processors available and would like to process faster

.. option:: -s NUMBER
            --num_speakers NUMBER

   Number of speakers to return. If ``--cluster`` is present, this specifies the number of clusters. Otherwise,
   MFA will sort speakers according to the first pass classification, take the top ``NUMBER`` speakers, and reclassify
   the utterances using only those speakers.

.. option:: --cluster

   MFA will cluster utterance ivectors into the number of speakers specified by ``--num_speakers``

.. option:: -v
            --verbose

   The aligner will print out more information if present

.. option:: -d
            --debug

   The aligner will run in debug mode

.. option:: -c
            --clean

   Forces removal of temporary files in ``~/Documents/MFA``
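
The behaviour of ``--num_speakers`` without ``--cluster`` (classify, keep the most frequent speakers, then reclassify)
can be illustrated with a toy example. This is a sketch of the idea only and not MFA's implementation: the ``scores``
structure (utterance mapped to per-speaker scores, higher meaning a better match) and the function name
``reclassify_top_speakers`` are assumptions made for illustration.

```python
from collections import Counter


def reclassify_top_speakers(scores, num_speakers):
    """scores: {utterance: {speaker: score}}; returns {utterance: speaker}."""
    # First pass: assign each utterance to its best-scoring speaker.
    first_pass = {
        utterance: max(speaker_scores, key=speaker_scores.get)
        for utterance, speaker_scores in scores.items()
    }
    # Rank speakers by how many utterances they received and keep the top ones.
    counts = Counter(first_pass.values())
    kept = sorted(speaker for speaker, _ in counts.most_common(num_speakers))
    # Second pass: reassign every utterance using only the kept speakers.
    return {
        utterance: max(kept, key=lambda s: speaker_scores.get(s, float("-inf")))
        for utterance, speaker_scores in scores.items()
    }


if __name__ == "__main__":
    toy_scores = {
        "utt1": {"A": 0.9, "B": 0.1, "C": 0.3},
        "utt2": {"A": 0.8, "B": 0.2, "C": 0.1},
        "utt3": {"A": 0.2, "B": 0.7, "C": 0.6},
        "utt4": {"A": 0.1, "B": 0.6, "C": 0.5},
        "utt5": {"A": 0.4, "B": 0.3, "C": 0.5},
    }
    print(reclassify_top_speakers(toy_scores, num_speakers=2))
```

In the toy data, ``utt5`` is first assigned to speaker ``C``; because ``C`` receives only one utterance it is dropped
when two speakers are kept, and ``utt5`` moves to ``A``, its best match among the remaining speakers.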