Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Translations are not contextual #1

Open
cxdy opened this issue Apr 28, 2019 · 0 comments
Open

Translations are not contextual #1

cxdy opened this issue Apr 28, 2019 · 0 comments
Assignees

Comments

@cxdy
Copy link
Owner

cxdy commented Apr 28, 2019

I forgot how languages work when building this.

Is there TL;DR for too long, didnt type? If not, this will be it's birth.

Anyways, TL;DT
As of the current version, each individual subtitle is translated out of context, so the translations don't necessarily make sense. I only (and barely) speak English, so I have no idea exactly how bad they are, but we need to figure out a way to determine if subtitles are related to each other, and then translate them so that they make some kind of sense.

IDEAS:

  • Build the SRT file through the program and run it through multiple languages at once
    This is probably the most difficult approach, but also probably our best bet (and it's cool as fuck). Only thing is, I don't know how to do that yet. If we can figure it out, maybe there's a way to tell if a different person is speaking?

  • Determine if the times in the previous & next subtitle are close
    00:00:11,636 --> 00:00:13,221
    I'm assuming the format is HR:MIN:SEC,MS? I'm just not 100% what the number after the comma is - seems like a safe assumption though. I know we talked about it before but I forgot.
    Anyway, I guess for this approach, we could just create an empty list and add the subtitles that start within a second or two of the previous subtitle, run it and see if it makes sense? Only thing is, I don't know how we'd determine if it makes any sense with a computer. I can't speak other languages so I'm not confident that I can teach a computer how to speak other languages.

@cxdy cxdy assigned cxdy and kpmgeek Apr 28, 2019
@github-staff github-staff deleted a comment from p4u1d34n Oct 24, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants