Discussion: Multi-Track Support #409

Open
pajowu opened this issue Dec 7, 2022 · 1 comment
@pajowu
Member

pajowu commented Dec 7, 2022

This issue collects the discussion about supporting multiple tracks.

In my eyes, multi-track support has several facets:

Purpose

What are multi-track recordings used for / which use cases do we want to focus on?

  • Different tracks for different speakers
  • One track for speaking, one track for music & jingles
  • Different languages on separate tracks

Import

At the moment we diarize the imported audio and do speaker detection on it. For multi-track files, this might not be needed. It could be replaced with splitting the audio at speaker turns (i.e. one track goes silent while another stops being silent) or similar. The tracks could also provide speaker identification.
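To make the "split at speaker turns" idea concrete, here is a rough sketch, not the project's actual import code. It assumes each track is already decoded into a sequence of float samples in [-1, 1], and it treats a frame as "speaking" when its RMS energy reaches a fixed threshold. The names `Turn`, `frame_rms`, `speaker_turns`, and the default `frame_len` / `threshold` values are all made up for illustration; a real implementation would likely need smoothing and a proper voice-activity detector.

```python
from dataclasses import dataclass
from math import sqrt
from typing import Dict, List, Sequence


@dataclass
class Turn:
    track: str    # name of the track that is active (hypothetical field names)
    start: float  # seconds
    end: float    # seconds


def frame_rms(samples: Sequence[float], frame_len: int) -> List[float]:
    """Root-mean-square energy of each consecutive frame of `frame_len` samples."""
    energies = []
    for i in range(0, len(samples), frame_len):
        frame = samples[i:i + frame_len]
        energies.append(sqrt(sum(s * s for s in frame) / len(frame)))
    return energies


def speaker_turns(tracks: Dict[str, Sequence[float]], sample_rate: int,
                  frame_len: int = 1024, threshold: float = 0.02) -> List[Turn]:
    """Per track, group consecutive frames at or above `threshold` into turns."""
    turns: List[Turn] = []
    frame_dur = frame_len / sample_rate
    for name, samples in tracks.items():
        energies = frame_rms(samples, frame_len)
        duration = len(samples) / sample_rate
        start = None  # index of the first frame of the currently open turn
        for idx, energy in enumerate(energies):
            active = energy >= threshold
            if active and start is None:
                start = idx
            elif not active and start is not None:
                turns.append(Turn(name, start * frame_dur, idx * frame_dur))
                start = None
        if start is not None:
            # Track is still active at the end of the audio; close the turn there.
            turns.append(Turn(name, start * frame_dur, duration))
    # Order turns across all tracks by start time.
    return sorted(turns, key=lambda t: (t.start, t.track))
```

Since every turn carries the name of its track, the track name could double as the speaker label, which is the "tracks could also provide speaker identification" point above. Note that this sketch keeps overlapping turns from different tracks as they are; it does not decide who "wins".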

Editing

I'm not sure how editing true multi-track projects would work: Should we just "flatten" them to our current format? How do we deal with overlapping segments? Should we display them in a special way to make clear that those are two tracks running in parallel (maybe split the editor left/right)? We might need to do a few designs first.
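As one possible reading of "flattening", the sketch below merges segments from all tracks into a single time-ordered list and flags every segment that starts before an earlier one has ended, so an editor could render those differently. This is only an illustration of the idea under my own assumptions: the `Segment` class, its fields, and the `flatten` function are hypothetical and not the project's current document format.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    track: str    # source track / speaker (hypothetical field names)
    start: float  # seconds
    end: float    # seconds
    text: str
    overlaps_previous: bool = False  # set by flatten()


def flatten(segments: List[Segment]) -> List[Segment]:
    """Order segments by start time and flag those that begin before an earlier one ends."""
    flat: List[Segment] = []
    latest_end = float("-inf")  # latest end time seen so far across all tracks
    for seg in sorted(segments, key=lambda s: (s.start, s.end)):
        flat.append(Segment(seg.track, seg.start, seg.end, seg.text,
                            overlaps_previous=seg.start < latest_end))
        latest_end = max(latest_end, seg.end)
    return flat
```

With this approach nothing is lost when flattening: the track name stays on each segment, and the `overlaps_previous` flag marks exactly the places where a side-by-side display would matter.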

Export

We already kind of have multi-track export using otio, which creates one track per speaker. I'm not sure what additional formats we should support.

@pajowu added the enhancement (New feature or request), fileformat, and UI / UX labels on Dec 7, 2022
@pajowu changed the title from "Multi-Track Support" to "Discussion: Multi-Track Support" on Dec 7, 2022
@jasontucker

Agreed, this feature is an important one to me. I record multitrack and would love to have a single transcription of the combined conversation. Each person is on their own track, so you know who is talking on each track. MacWhisper supports this but has a hard time handling it well.
https://goodsnooze.gumroad.com/l/macwhisper
