forked from agermanidis/autosub
-
-
Notifications
You must be signed in to change notification settings - Fork 595
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
29 changed files
with
2,414 additions
and
217 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,8 +1,10 @@ | ||
bin/ | ||
*.spec | ||
*pyc | ||
*.egg-info | ||
*html | ||
build/ | ||
dist/ | ||
.DS_Store | ||
MANIFEST | ||
*#* | ||
*#* |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,7 +1,31 @@ | ||
# pyQtAutosub | ||
# pyTranscriber | ||
|
||
Under development | ||
pyTranscriber is an application that can be used to generate <b>automatic transcription / automatic subtitles </b> for audio/video files through a friendly graphical user interface. The hard work of speech recognition is made by the <a href="https://cloud.google.com/speech/">Google Speech Recognition API</a> using <a href="https://github.com/agermanidis/autosub">Autosub</a>. | ||
<br> | ||
<br> | ||
![pyTranscriber1](doc/screenshot3.png?raw=true "pyTranscriber") | ||
<br> | ||
<br> | ||
pyTranscriber is a improved version of my previous project <a href="https://github.com/raryelcostasouza/JAutosub">JAutosub (Java)</a>, created because of the limitations, issues, and overhead of mixing this 2 different languages on a single project. | ||
<br> | ||
<br> | ||
The app by default outputs the subtitles as .srt and the transcribed audio on the user interface as well as .txt files. SRT Files can be edited using <a href="http://www.aegisub.org/">Aegisub</a>. | ||
Internet connection is REQUIRED because it uses the <a href="https://cloud.google.com/speech/">Google Cloud Speech Server</a> for the job, in the same way as the <a href="https://support.google.com/youtube/answer/6373554?hl=en">Youtube Automatic Subtitles</a>. | ||
<br> | ||
<br> | ||
IMPORTANT: As speech recognition technology is still not fully accurate, the <b>accuracy</b> of the result can vary a lot, depending on many factors, mainly the <b>quality/clarity</b> of the audio. Ideally the audio input should not have background noise, sound effects or music. If there is a single speaker and he speaks in a clear and slow speed seems that the recognition is much more accurate. Sometimes, under ideal/lucky conditions it is possible to get a <a href="https://medium.com/@mlockrey/youtube-s-incredible-95-accuracy-rate-on-auto-generated-captions-b059924765d5">accuracy result close to 95%</a>. | ||
<br> | ||
<br> | ||
![pyTranscriber2](doc/screenshot2.png?raw=true "pyTranscriber") | ||
![pyTranscriber3](doc/screenshot1.png?raw=true "pyTranscriber") | ||
<br> | ||
<br> | ||
<h1>For Users - Download the Linux app</h1> | ||
Coming soon | ||
|
||
<h1>For Developers - Technical Details</h1> | ||
Check at <a href="https://github.com/raryelcostasouza/pyTranscriber/blob/master/pytranscriber/doc/technical_details.txt">technical_details.txt<a> | ||
|
||
### License | ||
|
||
MIT | ||
GPL v3 |
Empty file.
Oops, something went wrong.