Skip to content

Consider Adding Microsoft Edge TTS as a Voice Provider #118

@rosscado

Description

@rosscado

A user has suggested adding Microsoft Edge Text-to-Speech (TTS) as a voice provider in Say Pi. This could potentially lower costs, reduce latency, and improve multilingual support.

Key Points/Assumptions

  1. Edge TTS is free to use and offers multilingual natural TTS.
  2. It's accessed through a local browser API, potentially reducing latency compared to remote API calls.
  3. Could help reduce server usage and associated costs.
  4. May allow for faster request processing.

Considerations for Implementation

  • Integration complexity: Evaluate the effort required to incorporate Edge TTS into Say Pi's architecture.
  • Quality comparison: Compare voice quality with current providers (11 Labs, Inflection AI).

Current Voice Providers

  • 11 Labs: Highest quality TTS in the industry
  • Inflection AI: Uses 11 Labs in the background with their own voices

Next Steps

  • Research the Edge TTS API and its integration requirements
  • Conduct a quality comparison test
  • Evaluate potential cost savings and performance improvements

Metadata

Metadata

Assignees

No one assigned

    Labels

    edgeMicrosoft Edge browserttsText to Speech (voice synthesis)user reportedA user raised this issue.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions