Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clarifications on annotation #211

Open
MikeMpapa opened this issue Oct 14, 2024 · 0 comments
Open

Clarifications on annotation #211

MikeMpapa opened this issue Oct 14, 2024 · 0 comments

Comments

@MikeMpapa
Copy link

MikeMpapa commented Oct 14, 2024

Hi there,
I am working on a building a new dataset in Spanish (polysyllabic language). I have gone though MakeDiffSinger but I still have some gaps. I would be grateful if you could sanity check me on my understanding and share any thoughts you might have

Questions for clarifications:

  1. ph_seq: These are sequences of phonemes or syllables?
    Currently I using phonemes and their timestamps as provided by MFA. I am using a pre-trained Spanish model available by MFA. Would you recommend training a new one on my specific data?

  2. note_dur: The midi notes should be estimated over phonemes, syllables, or words?
    Now I estimated one note for each phoneme and assumed ph_dur==note_dure

  3. ph_num: The number of phonemes in each word or in each syllable?
    Now I assumed the number of phonemes in each word

  4. note_seq: Do you think SOME would suffice to get a first shot at this ? I would speculate yes?

  5. is_slur: how would you define slur in this context? I have not found plenty of resources on this topic
    Now I assumed no slurs at all

  6. SPs and APs: Would you recommend doing that manually or using the enhance script might be OK for a first shot?

Thanks!

@MikeMpapa MikeMpapa changed the title Clarification on annotation Clarifications on annotation Oct 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant