refactor!: change model to expect mel-band oriented tensors instead of time-oriented ones #39

roedoejet · 2024-10-30T01:18:47Z

semanticdiff-com · 2024-10-30T01:18:49Z

Review changes with

Changed Files

File	Status
hfgl/utils.py	53% smaller
hfgl/cli.py	12% smaller

SamuelLarkin

make the code a little bit more readable

SamuelLarkin · 2024-11-06T16:52:21Z

hfgl/cli.py

+    if time_oriented:
+        data = data.transpose(0, 1)
+    data_size = data.size()
+    if (
+        checkpoint["hyper_parameters"]["config"]["preprocessing"]["audio"]["n_mels"]
+        not in data_size
+    ):
+        raise ValueError(
+            f"Your model expects a spectrogram of dimensions [K (Mel bands), T (frames)] where K == {checkpoint['hyper_parameters']['config']['preprocessing']['audio']['n_mels']} but you provided a tensor of size {data_size}"
+        )
+    if (
+        data_size[0]
+        != checkpoint["hyper_parameters"]["config"]["preprocessing"]["audio"]["n_mels"]
+    ):
+        raise ValueError(
+            f"We expected the first dimension of your Mel spectrogram to correspond with the number of Mel bands declared by your model ({checkpoint['hyper_parameters']['config']['preprocessing']['audio']['n_mels']}). Instead, we found you model has the dimensions {data_size}. If your spectrogram is time-oriented, please re-run this command with the '--time-oriented' flag."
+        )


Suggested change

-    if time_oriented:
-        data = data.transpose(0, 1)
-    data_size = data.size()
-    if (
-        checkpoint["hyper_parameters"]["config"]["preprocessing"]["audio"]["n_mels"]
-        not in data_size
-    ):
-        raise ValueError(
-            f"Your model expects a spectrogram of dimensions [K (Mel bands), T (frames)] where K == {checkpoint['hyper_parameters']['config']['preprocessing']['audio']['n_mels']} but you provided a tensor of size {data_size}"
-        )
-    if (
-        data_size[0]
-        != checkpoint["hyper_parameters"]["config"]["preprocessing"]["audio"]["n_mels"]
-    ):
-        raise ValueError(
-            f"We expected the first dimension of your Mel spectrogram to correspond with the number of Mel bands declared by your model ({checkpoint['hyper_parameters']['config']['preprocessing']['audio']['n_mels']}). Instead, we found you model has the dimensions {data_size}. If your spectrogram is time-oriented, please re-run this command with the '--time-oriented' flag."
-        )
+    if time_oriented:
+        data = data.transpose(0, 1)
+    data_size = data.size()
+    config_n_mels = checkpoint["hyper_parameters"]["config"]["preprocessing"]["audio"]["n_mels"]
+    if (config_n_mels not in data_size):
+        raise ValueError(
+            f"Your model expects a spectrogram of dimensions [K (Mel bands), T (frames)] where K == {config_n_mels} but you provided a tensor of size {data_size}"
+        )
+    if (data_size[0] != config_n_mels):
+        raise ValueError(
+            f"We expected the first dimension of your Mel spectrogram to correspond with the number of Mel bands declared by your model ({config_n_mels}). Instead, we found you model has the dimensions {data_size}. If your spectrogram is time-oriented, please re-run this command with the '--time-oriented' flag."
+        )
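For readers who want to try the check outside the CLI, here is a minimal sketch of the same logic as a standalone function. It is not code from this PR: `validate_mel_shape` is a hypothetical helper, it works on a plain shape tuple standing in for `data.size()` so that it runs without PyTorch, and the explicit 2-D check is an added assumption.

```python
from typing import Sequence, Tuple


def validate_mel_shape(
    shape: Sequence[int], n_mels: int, time_oriented: bool = False
) -> Tuple[int, ...]:
    """Return the shape as [K (Mel bands), T (frames)] or raise ValueError.

    Hypothetical helper: `shape` stands in for `data.size()` and `n_mels`
    for the value read from the checkpoint config.
    """
    shape = tuple(shape)
    if len(shape) != 2:
        # Assumption of this sketch: the spectrogram is a 2-D tensor.
        raise ValueError(f"Expected a 2-D spectrogram but got shape {shape}")
    if time_oriented:
        # Mirrors data.transpose(0, 1): [T, K] becomes [K, T].
        shape = (shape[1], shape[0])
    if n_mels not in shape:
        # No dimension matches the model's Mel band count at all.
        raise ValueError(
            "Your model expects a spectrogram of dimensions "
            f"[K (Mel bands), T (frames)] where K == {n_mels} "
            f"but you provided a tensor of size {shape}"
        )
    if shape[0] != n_mels:
        # A dimension matches, but it is not the first one.
        raise ValueError(
            f"Expected the first dimension to be the number of Mel bands ({n_mels}) "
            f"but found dimensions {shape}. If your spectrogram is time-oriented, "
            "re-run with the '--time-oriented' flag."
        )
    return shape


if __name__ == "__main__":
    print(validate_mel_shape((80, 200), n_mels=80))  # (80, 200)
    print(validate_mel_shape((200, 80), n_mels=80, time_oriented=True))  # (80, 200)
    try:
        validate_mel_shape((200, 80), n_mels=80)
    except ValueError as err:
        print(err)
```

One limitation of this kind of check: when the number of frames happens to equal the number of Mel bands, both orientations pass, so the orientation cannot be verified from the shape alone.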

joanise

Please accept Sam's change request, otherwise this looks fine.

joanise · 2024-12-09T22:31:06Z

hfgl/cli.py

Yes, please accept this suggestion, the code is quite hard to read.

joanise · 2024-12-09T22:33:17Z

hfgl/cli.py

+        != checkpoint["hyper_parameters"]["config"]["preprocessing"]["audio"]["n_mels"]
+    ):
+        raise ValueError(
+            f"We expected the first dimension of your Mel spectrogram to correspond with the number of Mel bands declared by your model ({checkpoint['hyper_parameters']['config']['preprocessing']['audio']['n_mels']}). Instead, we found you model has the dimensions {data_size}. If your spectrogram is time-oriented, please re-run this command with the '--time-oriented' flag."


This warning message is good, probably giving the user the feedback I was asking for in the top-level PR.

joanise · 2024-12-09T22:49:43Z

Note: you'll need to rebase and resolve the merge conflict: we now have to use shell_complete, not autocompletion.

roedoejet mentioned this pull request Oct 30, 2024

consolidate spectrogram dimensions EveryVoiceTTS/EveryVoice#572

Open

SamuelLarkin requested changes Nov 6, 2024

refactor!: change model to expect mel-band oriented tensors instead of time-oriented ones

7baaeff

joanise requested changes Dec 9, 2024

joanise force-pushed the dev.ap/513 branch from b301c36 to 7baaeff Compare December 10, 2024 17:40
