spec-to-wav synthesis produces some errors from existing mels #513

roedoejet · 2024-07-18T21:23:37Z

For some reason I had to change write(f"{data_path}.wav", sr, wav) to write(f"{data_path}.wav", sr, wav[0]) - I should investigate in hfgl/cli.py

The text was updated successfully, but these errors were encountered:

roedoejet · 2024-07-18T22:58:15Z

related to #507

fixes #513

SamuelLarkin · 2024-11-01T15:38:37Z

I also stumbled on this using

everyvoice synthesize from-spec \
  --input preprocessed/spec/LJ033-0048--speaker_0--eng--spec-22050-mel-librosa.pt \
  --model logs_and_checkpoints/VocoderExperiment/base/checkpoints/last.ckpt

╭───────────────────────────────────────────────────── Traceback (most recent call last) ──────────────────────────────────────────────────────╮
│ /fs/hestia_Hnrc/ict/sam037/git/EveryVoice/everyvoice/model/vocoder/HiFiGAN_iSTFT_lightning/hfgl/cli.py:154 in synthesize                     │
│                                                                                                                                              │
│   151 │   except (TypeError, ValidationError) as e:                                                                                          │
│   152 │   │   logger.error(f"Unable to load {generator_path}: {e}")                                                                          │
│   153 │   │   sys.exit(1)                                                                                                                    │
│ ❱ 154 │   wav, sr = synthesize_data(data, vocoder_model, vocoder_config)                                                                     │
│   155 │   logger.info(f"Writing file {data_path}.wav")                                                                                       │
│   156 │   write(f"{data_path}.wav", sr, wav[0])                                                                                              │
│   157                                                                                                                                        │
│                                                                                                                                              │
│ /fs/hestia_Hnrc/ict/sam037/git/EveryVoice/everyvoice/model/vocoder/HiFiGAN_iSTFT_lightning/hfgl/utils.py:85 in synthesize_data               │
│                                                                                                                                              │
│    82 │   │   wavs = inverse_spectral_transform(mag * torch.exp(phase * 1j)).unsqueeze(-2)                                                   │
│    83 │   else:                                                                                                                              │
│    84 │   │   with torch.no_grad():                                                                                                          │
│ ❱  85 │   │   │   wavs = model.generator(data.transpose(1, 2))                                                                               │
│    86 │   # squeeze to remove the channel dimension                                                                                          │
│    87 │   return (                                                                                                                           │
│    88 │   │   wavs.squeeze(1).cpu().numpy(),                                                                                                 │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
IndexError: Dimension out of range (expected to be in range of [-2, 1], but got 2)

roedoejet added the bug Something isn't working label Jul 18, 2024

roedoejet self-assigned this Jul 18, 2024

roedoejet mentioned this issue Jul 18, 2024

synthesize from-spec with [T, K] tensor produces error #507

Closed

roedoejet added this to the beta milestone Sep 9, 2024

roedoejet added a commit that referenced this issue Oct 30, 2024

chore: update submodule

7a2df23

fixes #513

roedoejet mentioned this issue Oct 30, 2024

consolidate spectrogram dimensions #572

Open

roedoejet added a commit that referenced this issue Oct 30, 2024

chore: update submodule

9884822

fixes #513

roedoejet added a commit that referenced this issue Oct 31, 2024

chore: update submodule

d16bc5b

fixes #513

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

spec-to-wav synthesis produces some errors from existing mels #513

spec-to-wav synthesis produces some errors from existing mels #513

roedoejet commented Jul 18, 2024

roedoejet commented Jul 18, 2024

SamuelLarkin commented Nov 1, 2024

spec-to-wav synthesis produces some errors from existing mels #513

spec-to-wav synthesis produces some errors from existing mels #513

Comments

roedoejet commented Jul 18, 2024

roedoejet commented Jul 18, 2024

SamuelLarkin commented Nov 1, 2024