Skip to content

It is a pity that #93

@Rolf-100R

Description

@Rolf-100R

It is a pity that:

  1. There are no tools to prompt for the correct stress placement when pronouncing a word. In my language, stress plays a primary role. Sometimes it can take an entire hour to get a decent result, even though the video itself is generated quite quickly (on Distilled GGUF Q6_K).

  2. It's impossible to influence the voice timbre.

  3. There are no markers to indicate people. When two characters are involved, they often intercept each other's lines, or even both simultaneously voice the same text.

  4. The program's developers maintain a proud silence and rarely respond to user questions.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions