VGGish mel-band frame length/hop size? #1251
-
Hi!
Replies: 1 comment
-
Hi @seunggookim,
Yes! The frame size is 400 samples (25 ms) and the hop size is 160 samples (10 ms). These parameters are inherited from the original implementation, and we hardcode them so that the model cannot be fed incorrect mel-spectrograms.
Maybe you couldn't work out the exact numbers because of the zero-padding (`startFromZero=False`) introduced by the FrameCutter algorithm?
I really like the idea of returning timestamps. Please feel free to file a feature request for it :)
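In the meantime, the timestamps can be approximated by hand. Below is a minimal sketch in plain Python (no Essentia calls), assuming a 16 kHz sample rate, which is what 400 samples = 25 ms implies, and assuming that `startFromZero=False` centers the first frame on sample 0, so that it begins half a frame before the audio starts. The function name `frame_times` and the `n_frames` argument are hypothetical, introduced only for illustration.

```python
SAMPLE_RATE = 16000  # Hz; assumed, implied by 400 samples = 25 ms
FRAME_SIZE = 400     # samples (25 ms), from the reply above
HOP_SIZE = 160       # samples (10 ms), from the reply above


def frame_times(n_frames, start_from_zero=False):
    """Return a list of (start, center) times in seconds, one per frame.

    With start_from_zero=False the first frame is assumed to be centered
    on sample 0, so its start time is negative (zero-padded region).
    """
    # Offset of the first frame's first sample relative to the audio start.
    offset = 0 if start_from_zero else -(FRAME_SIZE // 2)
    times = []
    for i in range(n_frames):
        start = offset + i * HOP_SIZE
        center = start + FRAME_SIZE / 2
        times.append((start / SAMPLE_RATE, center / SAMPLE_RATE))
    return times


if __name__ == "__main__":
    for start, center in frame_times(3):
        print(f"start={start:+.4f}s center={center:.4f}s")
```

Under these assumptions, frame `i` is centered at `i * 160 / 16000` seconds, so consecutive mel frames are 10 ms apart and the first one starts 12.5 ms before the audio.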