It is a pity that:
- There is no way to indicate the correct stress placement when a word is pronounced. In my language, stress plays a primary role. Sometimes it can take an entire hour to get a decent result, even though the video itself is generated quite quickly (on Distilled GGUF Q6_K).
- It's impossible to influence the voice timbre.
- There are no markers to indicate which character is speaking. When two characters are involved, they often take over each other's lines, or even both voice the same text simultaneously.
- The program's developers maintain a proud silence and rarely respond to user questions.