SWivid/F5-TTS Events in 2024 - Ecosyste.ms: Timeline

SWivid created a comment on an issue on SWivid/F5-TTS

October 24, 2024 5:38am

will close this issue, feel free to open if further questions

SWivid closed an issue on SWivid/F5-TTS

October 24, 2024 5:38am

中文如何进行更精细的控制，如多音字、连读等

中文的有些句子多音字、断句和连读模型读的还有一些问题，如何进行控制， eg：各个区县市更以其得天独厚的户外休闲旅游资源吸引着来自海内外的游客 question: 各个区县市未连读，户外休闲旅游资源未连读被拆开读。还有多音字如何控制，试了也不支持输入拼音，是否未进行这部分训练支持。

SWivid opened a pull request on SWivid/F5-TTS

October 24, 2024 5:36am

SWivid created a comment on an issue on SWivid/F5-TTS

October 24, 2024 5:34am

> So it's possible for DynamicBatchSampler to sometimes exceed the frames_threshold. The `self.get_frame_len()` is intended to get duration exactly for a queried index, while `__getitem__` do ca...

View on GitHub

SWivid created a comment on an issue on SWivid/F5-TTS

October 24, 2024 5:24am

Our reproduced E2 model doesn't train with that scheme. Check E2 paper: ![image](https://github.com/user-attachments/assets/cad5427e-9794-4b85-8116-8d7411d07ccd) We just use characters, no random...

View on GitHub

SinishaDjukic starred SWivid/F5-TTS

October 24, 2024 5:20am

SWivid created a comment on an issue on SWivid/F5-TTS

October 24, 2024 5:19am

可以提供一个口音轻的中文声音作为参考音频

View on GitHub

SWivid created a comment on an issue on SWivid/F5-TTS

October 24, 2024 5:16am

Use lower case as we suggested in readme, or you are telling model to read letter by letter Also check if reference audio uploaded correctly, will show waveform if so

View on GitHub

Sunny-an starred SWivid/F5-TTS

October 24, 2024 4:30am

HaileyStorm opened an issue on SWivid/F5-TTS

October 24, 2024 4:25am

ARPAbet phoneme pronunciation not working.

When I try to generate text with ARPAbet phones in parenthesis like you see in the "Specifying the pronunciation without model re-training" of https://www.microsoft.com/en-us/research/project/e2-tt...