Ecosyste.ms: Timeline

Browse the timeline of events for every public repo on GitHub. Data updated hourly from GH Archive.

SWivid/F5-TTS

448486810 starred SWivid/F5-TTS
mhenrichsen created a comment on an issue on SWivid/F5-TTS
Here are the complete logs: ``` Epoch 1/10: 67%|█████████████████████████████████████████████████████████████████████████▎ | 5143/7653 [05:15<1:32:12, 2.20s/st...

KelvinHuang66 opened an issue on SWivid/F5-TTS
Error reading Arabic numerals
Arabic numerals in Chinese text are automatically read out in English. How can this be fixed?
awaaate starred SWivid/F5-TTS
zzhdbw starred SWivid/F5-TTS
WGS-note opened an issue on SWivid/F5-TTS
How many voices can be supported?
Question: how many voices can be supported? How do I switch between different voices? Thanks!
SWivid pushed 3 commits to main SWivid/F5-TTS
  • Add Voice Chat feature 92887e3
  • Merge branch 'main' into main 3125a28
  • Merge pull request #236 from jpgallegoar/main Add Voice Chat feature 1f582a6

SWivid closed a pull request on SWivid/F5-TTS
Add Voice Chat feature
@SWivid This one I will not merge myself because I need your confirmation. This adds an AI Voice Chat feature using LLM Qwen2.5-3B (very fast), 3gb only, but needs pip install transformers_stream_g...
nsyring created a comment on an issue on SWivid/F5-TTS
> > > > > Will you consider to make a Drag n Drop (via Gradio) UI easy to TRAIN other languages locally? I would like to train Hebrew language, but I'm not a programmer. > > > > > If you'll make a...

jesterxu starred SWivid/F5-TTS
SWivid pushed 1 commit to dev SWivid/F5-TTS

SWivid closed a pull request on SWivid/F5-TTS
merge inference_gradio
chenluyuZZ starred SWivid/F5-TTS
SWivid created a comment on an issue on SWivid/F5-TTS
will close this issue, feel free to open if further questions

SWivid closed an issue on SWivid/F5-TTS
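How to get finer control over Chinese, such as polyphonic characters and connected reading
For some Chinese sentences, the model still has problems with polyphonic characters, phrase breaks, and connected reading. How can these be controlled? e.g.: 各个区县市更以其得天独厚的户外休闲旅游资源吸引着来自海内外的游客 (roughly: "All districts, counties, and cities attract visitors from home and abroad with their uniquely favorable outdoor leisure tourism resources"). Question: 各个区县市 is not read as one connected phrase, and 户外休闲旅游资源 is split apart instead of being read as one connected phrase. Also, how can polyphonic characters be controlled? I tried, and pinyin input is not supported either. Was this part not covered in training?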
ljpadam starred SWivid/F5-TTS
SWivid opened a pull request on SWivid/F5-TTS
merge inference_gradio
SWivid created a comment on an issue on SWivid/F5-TTS
> So it's possible for DynamicBatchSampler to sometimes exceed the frames_threshold. The `self.get_frame_len()` is intended to get duration exactly for a queried index, while `__getitem__` do ca...

SWivid created a comment on an issue on SWivid/F5-TTS
Our reproduced E2 model doesn't train with that scheme. Check E2 paper: ![image](https://github.com/user-attachments/assets/cad5427e-9794-4b85-8116-8d7411d07ccd) We just use characters, no random...

SinishaDjukic starred SWivid/F5-TTS
SWivid created a comment on an issue on SWivid/F5-TTS
You can provide a Chinese voice with a light accent as the reference audio

SWivid created a comment on an issue on SWivid/F5-TTS
Use lower case as we suggested in readme, or you are telling model to read letter by letter. Also check if reference audio uploaded correctly, will show waveform if so

land007 forked SWivid/F5-TTS

land007/F5-TTS

Sunny-an starred SWivid/F5-TTS
HaileyStorm opened an issue on SWivid/F5-TTS
ARPAbet phoneme pronunciation not working.
When I try to generate text with ARPAbet phones in parenthesis like you see in the "Specifying the pronunciation without model re-training" of https://www.microsoft.com/en-us/research/project/e2-tt...
robine-w starred SWivid/F5-TTS
devheroo starred SWivid/F5-TTS
wayneg123 starred SWivid/F5-TTS
lishu44275 opened an issue on SWivid/F5-TTS
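Why is there an accent when reading English? Is there no more native-sounding pronunciation?
I cloned a Chinese voice, but when it reads English it does not sound native. How can I make it sound native?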
Litton-Lei starred SWivid/F5-TTS