> Sorry for having so many questions! But can you explain what differences? The parameters look compatible: https://huggingface.co/nvidia/bigvgan_v2_24khz_100band_256x/blob/main/config.json "num_me...
> Im using it with Pinokio
Not sure if pinokio also compatible with training/finetuning.
If you just want to do inference, use `gradio_app.py` and `inference-cli.py`
Sorry for having so many questions! But can you explain what differences? The parameters look compatible:
https://huggingface.co/nvidia/bigvgan_v2_24khz_100band_256x/blob/main/config.json
"nu...
> Why does this need a retrain? I thought F5 just outputs mel-spectrograms without any dependencies on the downstream vocoder used for inference?
They have some difference in extract mel-spectro...
hi, i know you have scripts/prepare_emilia.py
my question is how if i have my own dataset? what format should i provide? what is the structure needed to finetuned? the docs is not that clear
Does this work? https://huggingface.co/nvidia/bigvgan_v2_24khz_100band_256x
The params seem to match:
sample_rate: 24000
n_fft: 1024
hop_length: 256
n_mels: 100
> @SWivid thank you for the amazing repo !
>
> @kunci115 i create simple script prepare dataset to use with format csv local
>
> ```
> my_data/
> │
> ├── wavs/
> │ ├── audio1.wav
> │ ...