Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Why do the numbers in the ASR results not have a start and end timestamp? #911

Open
hpjang opened this issue Oct 30, 2024 · 4 comments
Open

Comments

@hpjang
Copy link

hpjang commented Oct 30, 2024

No description provided.

@hpjang hpjang changed the title Why do the numbers in the results not have a start and end timestamp? Why do the numbers in the ASR results not have a start and end timestamp? Oct 30, 2024
@hpjang
Copy link
Author

hpjang commented Oct 30, 2024

whisperXoutput

you can see 1462 dosen't have start and end's time stamp

@rkulyassa
Copy link

rkulyassa commented Oct 30, 2024

See #314, #717, #789, #792, ...

Transcript words which do not contain characters in the alignment models dictionary e.g. "2014." or "£13.60" cannot be aligned and therefore are not given a timing.

The solution is to pass --suppress_numerals or suppress_numerals=True.

@randyburden
Copy link

To provide a counter solution, if you want to keep the numerals, instead of suppressing them, and not have it convert a numeral such as 7 into seven, then you can run some post-processing logic to look at the timestamp for the word before and after the numeral to fill-in the missing timestamp values for the numeral. This is the strategy we use and it works very well.

@rkulyassa
Copy link

@randyburden That was actually my naive approach as well. Though what you describe may become problematic if the numeral is located at the beginning or end of the sentence, and say, you want to partition the audio there. Then you enter magic number territory with having to determine offsets and etc. I guess it ultimately depends on your use case

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants