Text-to-Speech

Model Card for f5-tts-formosan-all-finetune

Model Details

F5-TTS fine-tuned on all Formosan data (ithuan, fb ilrdf dict, klokah), excluding samples that contain only one word, with IPA as input.
Only the ami and trv parts of the ithuan data are included.
G2P is taken from this repo.
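
The single-word exclusion described above can be illustrated with a minimal sketch. The actual preprocessing code is not shown in this card, so the sample layout (`audio`, `text` keys), the placeholder transcripts, and the whitespace-based word count are all assumptions made for illustration only.

```python
# Hypothetical sample records; the real dataset format is not documented here.
samples = [
    {"audio": "a.wav", "text": "word"},
    {"audio": "b.wav", "text": "two words"},
    {"audio": "c.wav", "text": "a longer sentence"},
]


def has_multiple_words(transcript: str) -> bool:
    """Return True if the transcript has more than one whitespace-separated token."""
    return len(transcript.split()) > 1


# Keep only samples whose transcript contains more than one word.
kept = [s for s in samples if has_multiple_words(s["text"])]
```

Counting words by whitespace is the simplest choice; a different tokenization may be needed if the transcripts use other word separators.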

Training Details

  • learning rate: 0.00001
  • batch size per gpu: 6400
  • batch size type: frame
  • max samples: 64
  • grad accumulation steps: 1
  • max grad norm: 1
  • epochs: 210 (1704780 steps; the current checkpoint is at step 1081600, after which the loss rises)
  • num warmup updates: 27040
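
To put the numbers above in context, the following sketch derives a few quantities from them. The sample rate (24 kHz) and mel hop length (256) are assumptions based on the usual F5-TTS defaults and are not stated in this card; everything else comes from the list above.

```python
# Values taken from the training details above.
TOTAL_EPOCHS = 210
TOTAL_STEPS = 1_704_780
CURRENT_STEP = 1_081_600
WARMUP_UPDATES = 27_040
FRAMES_PER_BATCH = 6_400  # batch size per GPU, counted in mel frames

# Assumptions (not stated in this card): usual F5-TTS mel settings.
SAMPLE_RATE_HZ = 24_000
HOP_LENGTH = 256

# Steps per epoch implied by the planned schedule.
steps_per_epoch = TOTAL_STEPS // TOTAL_EPOCHS

# How far into the schedule the current checkpoint is, in epochs.
epochs_completed = CURRENT_STEP / steps_per_epoch

# Share of the planned schedule spent on learning-rate warmup.
warmup_fraction = WARMUP_UPDATES / TOTAL_STEPS

# Approximate audio duration per GPU batch under the assumed mel settings.
seconds_per_batch = FRAMES_PER_BATCH * HOP_LENGTH / SAMPLE_RATE_HZ

print(f"steps per epoch:      {steps_per_epoch}")
print(f"epochs completed:     {epochs_completed:.1f}")
print(f"warmup fraction:      {warmup_fraction:.2%}")
print(f"audio per GPU batch:  {seconds_per_batch:.1f} s")
```

Because the batch size type is frame, the batch size bounds the total audio length per GPU step rather than the number of utterances; the max samples setting caps the utterance count separately.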

Model Sources

Uses

Please refer to the source repo.

Demo

https://huggingface.co/spaces/ithuan/formosan-f5-tts

Model tree for ithuan/f5-tts-formosan-all-finetune

Base model: SWivid/F5-TTS (this model is a finetune)
