Skip to content

some different suggestions regarding the engine #56

@king-dahmanus

Description

@king-dahmanus

Hello developers. I'm not a dev, but I am suggesting some improvements and features/ideas for this engine.
First with the shorter one, to improve the quality of the voice, you need to change the encoder. From what I've hird of the samples, this engine is using griffinlim encoder which sounds robotic. You need to change it to use something like hifigam or any other better encoder. Hifi gan sounds promising.
For the second one, I suggest making this engine available for windows assistive technologies by making a sapi5(speech application programming interface) distribution of the engine so screen readers like NVDA or jaws, text rraders like balabolka or textaloud, and many other programs can use it. The voice has to be optimized for responsiveness, meaning faster than realtime output and no lag or delay before or in the middle of the speech. Hope you consider my suggestions. Thanks, and hope we can discuss this.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions