The scaling is off? Can we get a tab\tool\workflow to create matching spectrograms? #112
TheRealBlackNet
started this conversation in
Ideas
Replies: 1 comment
-
Sounds reasonable. I saw this pop up: https://civitai.com/?query=riffusion which may also help. (Given the flute / harp requirement, you may also want to look into Band-in-a-Box.) UPDATE - just found this
-
On the left is the input image, annotated with the frequency in hertz and the pixel offset from the top.
Next to it is a linear spectrogram from "Sonic Visualiser".
Observation:
It looks like some frequency bands are more detailed (have more pixels) than others.
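One possible explanation for the uneven detail, offered as an assumption rather than something confirmed in this thread, is that the input image uses a mel-scaled frequency axis while the Sonic Visualiser view is linear. The sketch below maps a pixel row (counted from the top) to hertz under the HTK mel formula. The function names, the image height of 512 and the 0 to 10000 Hz range are all illustrative assumptions and should be replaced with the values your spectrogram was actually generated with.

```python
import math


def hz_to_mel(f_hz):
    """HTK-style mel scale."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def row_to_hz(row_from_top, height, f_min, f_max):
    """Frequency of a pixel row, assuming row 0 (top) is f_max and the
    bottom row is f_min, with rows spaced evenly on the mel scale."""
    m_min, m_max = hz_to_mel(f_min), hz_to_mel(f_max)
    frac = 1.0 - row_from_top / (height - 1)
    return mel_to_hz(m_min + frac * (m_max - m_min))


if __name__ == "__main__":
    # Assumed parameters: 512 rows covering 0 to 10000 Hz.
    for row in (0, 1, 256, 510, 511):
        print(row, round(row_to_hz(row, 512, 0.0, 10000.0), 1))
```

If the assumption holds, one pixel near the top spans far more hertz than one pixel near the bottom, which would match the impression that low bands "have more pixels".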
Background:
I wanted to use the Automatic1111 riffusion tab to create sounds. They should not be AI generated, though; they should be placed by my own tool.
The idea is that I have a bunch of stencils of musical instruments and copy them into an image along the time axis. The input will be a description from outside, like:
Flute: |---x---|-x---x-| ...
Harp: |-x--x--|--x-x--| ...
For each x I copy the stencil into the image, with n * beat speed as the distance between them. (It's a Dwarf Fortress music description.)
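The placement step described above can be sketched roughly as follows. This is a minimal illustration, not an existing riffusion or Automatic1111 feature: `hit_steps`, `paste_stencil`, `render_track` and the `pixels_per_step` parameter are hypothetical names, the image is a plain list of pixel rows, and overlapping stencils are merged by keeping the brighter pixel, which is only one of several possible choices.

```python
def hit_steps(pattern):
    """Return the 0-based step index of every 'x' in a pattern such as
    '|---x---|-x---x-|'. Bar lines and any other characters are ignored,
    so pass only the pattern part, not the instrument name."""
    cells = [c for c in pattern if c in "-x"]
    return [i for i, c in enumerate(cells) if c == "x"]


def paste_stencil(canvas, stencil, x_offset, y_offset=0):
    """Copy stencil into canvas in place, keeping the larger pixel value
    where they overlap and clipping at the canvas edges."""
    for y, row in enumerate(stencil):
        for x, value in enumerate(row):
            cy, cx = y + y_offset, x + x_offset
            if 0 <= cy < len(canvas) and 0 <= cx < len(canvas[0]):
                canvas[cy][cx] = max(canvas[cy][cx], value)


def render_track(pattern, stencil, pixels_per_step, height):
    """Build a blank canvas wide enough for the pattern and paste the
    stencil at every hit, pixels_per_step pixels apart per step."""
    n_cells = sum(1 for c in pattern if c in "-x")
    canvas = [[0] * (n_cells * pixels_per_step) for _ in range(height)]
    for step in hit_steps(pattern):
        paste_stencil(canvas, stencil, step * pixels_per_step)
    return canvas


if __name__ == "__main__":
    print(hit_steps("|---x---|-x---x-|"))  # [3, 8, 12]
```

Here `pixels_per_step` stands in for "n * beat speed" converted to image columns; the right value depends on how many milliseconds one spectrogram column covers.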
Problem:
My first tests with Paint sounded off, so I started to check the frequencies, which led to the image above. Moving the pixels of a spectrogram by hand does not sound nice in the reconstruction.
I tried to use the model, but getting only a single sound from a flute, piano and so on is not possible; my feeling is that "just a C on a flute" is not something it was trained on.
Question\Feature request\Idea:
Can we please get a "put in a WAV, get back a usable spectrogram" tool in the web app or in the Automatic1111 tab?
Then I could create a set of correctly scaled stencils from the spectrograms and build a collage of sounds.
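Until such a tool exists, the general shape of a "WAV in, spectrogram out" step can be sketched with the Python standard library alone. This is a slow, naive illustration of a short-time Fourier transform and not the conversion riffusion itself uses: `read_wav_mono16`, `stft_magnitudes`, `bin_to_hz` and the default frame sizes are hypothetical, it only handles 16-bit PCM, and it produces a linear frequency axis, so the result would still need to be rescaled to whatever axis and intensity mapping the model expects.

```python
import cmath
import math
import struct
import wave


def read_wav_mono16(path):
    """Read a 16-bit PCM WAV file; return (sample_rate, samples in [-1, 1])."""
    with wave.open(path, "rb") as wf:
        if wf.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        channels = wf.getnchannels()
        rate = wf.getframerate()
        raw = wf.readframes(wf.getnframes())
    values = struct.unpack("<%dh" % (len(raw) // 2), raw)
    # Keep only the first channel if the file has several.
    return rate, [v / 32768.0 for v in values[::channels]]


def stft_magnitudes(samples, frame_size=256, hop=128):
    """Return one list of frame_size // 2 + 1 magnitudes per time frame,
    using a Hann window and a naive DFT (fine for short clips only)."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_size)
              for n in range(frame_size)]
    twiddle = [cmath.exp(-2j * math.pi * k / frame_size)
               for k in range(frame_size)]
    frames = []
    for start in range(0, len(samples) - frame_size + 1, hop):
        chunk = [samples[start + n] * window[n] for n in range(frame_size)]
        frames.append([
            abs(sum(chunk[n] * twiddle[(k * n) % frame_size]
                    for n in range(frame_size)))
            for k in range(frame_size // 2 + 1)
        ])
    return frames


def bin_to_hz(k, sample_rate, frame_size):
    """Centre frequency of DFT bin k."""
    return k * sample_rate / frame_size
```

Each inner list is one image column (lowest frequency first). Cropping the columns and rows around a single note would give one stencil; for real material a library FFT would be the practical choice.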