Live Chat and TTS Narrative Maker and Translator App, with Predetermined (Customizable with style prompting) Voices as well as Custom Voice Mode

AjarnSpencer/Endless-Text-Narrator


Endless Text Narrator

An advanced text-to-speech narrator and real-time conversational AI application built with React, TypeScript, Electron, and the Google Gemini API.

The application creates long-form audio narrations from text, with SSML support, voice style customization, and multi-language translation. It also provides a real-time voice conversation mode with live transcription.

App Screenshot

Features

  • Narration Mode: Convert large blocks of text into a single, downloadable .wav audio file, perfect for long-form content.
  • SSML Support: Fine-tune audio output with Speech Synthesis Markup Language (<break>, <emphasis>, <prosody>) for professional-grade narration.
  • Multi-Language & Accents: Narrate and translate text into dozens of languages. It also supports regional accents like UK and Indian English.
  • Custom Voice Styles: Use pre-built, high-quality voices or analyze a recording of your own voice to create a unique AI speaking style.
  • Conversation Mode: Engage in real-time, low-latency voice conversations with a Gemini-powered AI.
  • Live Transcription: See a live transcript of the conversation as you speak with the AI.
  • Cross-Platform Desktop App: Packaged with Electron for a native experience on Windows, macOS, and Linux.
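
As an illustration of the SSML tags named above, the following is a minimal sketch of composing narration markup in TypeScript. The helper names (`escapeXml`, `pause`, `emphasize`, `prosody`, `speak`) are hypothetical and not taken from this project's source. Which attribute values the app actually accepts is an assumption here, so treat the values as placeholders.

```typescript
// Hypothetical helpers for composing SSML; names are illustrative only.
type ProsodyOptions = { rate?: string; pitch?: string };

// Escape XML special characters so narration text cannot break the markup.
function escapeXml(text: string): string {
  return text
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&apos;");
}

// Insert a silent pause of the given length in milliseconds.
function pause(ms: number): string {
  return `<break time="${ms}ms"/>`;
}

// Wrap text in an <emphasis> element.
function emphasize(
  text: string,
  level: "strong" | "moderate" | "reduced" = "moderate",
): string {
  return `<emphasis level="${level}">${escapeXml(text)}</emphasis>`;
}

// Wrap text in a <prosody> element, emitting only the attributes that were set.
function prosody(text: string, opts: ProsodyOptions): string {
  const attrs = [
    opts.rate ? ` rate="${escapeXml(opts.rate)}"` : "",
    opts.pitch ? ` pitch="${escapeXml(opts.pitch)}"` : "",
  ].join("");
  return `<prosody${attrs}>${escapeXml(text)}</prosody>`;
}

// Join already-built fragments inside the <speak> root element.
function speak(...parts: string[]): string {
  return `<speak>${parts.join("")}</speak>`;
}

const ssml = speak(
  escapeXml("Chapter one."),
  pause(500),
  emphasize("Listen closely", "strong"),
  prosody(" & read slowly.", { rate: "slow" }),
);
console.log(ssml);
```

Escaping the narration text before wrapping it keeps characters such as `&` and `<` from being read as markup, which matters when pasting large blocks of arbitrary text.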

Screenshots:

App Screenshot

Prerequisites

To run this application, you will need a Google Gemini API key. All AI-powered features rely on the Gemini API.

  • You can obtain a free API key from Google AI Studio.
  • Note that the free tier has usage limits. For more extensive use, you may need to set up billing on a Google Cloud project.

Installation & Setup

  • Prebuilt releases are available for Windows (.exe) and Linux (.zip):

You can download the latest release for your operating system from the Releases page.

Once installed, the application prompts you to enter your Google Gemini API key on first launch. The key is kept in the application's local storage on your machine and is only sent to the Gemini API.
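
The key handling described above can be sketched as follows. This is a minimal illustration, not the app's actual code: the storage key name `gemini_api_key` and the functions `saveApiKey` and `loadApiKey` are hypothetical. The sketch takes any object with the Web Storage `getItem`/`setItem`/`removeItem` methods, so in the app's window it could be given `window.localStorage`, while the in-memory stand-in below lets it run anywhere.

```typescript
// Minimal subset of the Web Storage API used by this sketch.
interface KeyValueStore {
  getItem(key: string): string | null;
  setItem(key: string, value: string): void;
  removeItem(key: string): void;
}

// Hypothetical storage key; the name the app really uses is not documented here.
const API_KEY_STORAGE_KEY = "gemini_api_key";

// In-memory stand-in for localStorage, useful outside a browser window.
class MemoryStore implements KeyValueStore {
  private data = new Map<string, string>();
  getItem(key: string): string | null {
    const value = this.data.get(key);
    return value === undefined ? null : value;
  }
  setItem(key: string, value: string): void {
    this.data.set(key, value);
  }
  removeItem(key: string): void {
    this.data.delete(key);
  }
}

// Store the key after trimming whitespace; reject empty input.
function saveApiKey(store: KeyValueStore, rawKey: string): boolean {
  const key = rawKey.trim();
  if (key.length === 0) return false;
  store.setItem(API_KEY_STORAGE_KEY, key);
  return true;
}

// Return the saved key, or null when the first-launch prompt should be shown.
function loadApiKey(store: KeyValueStore): string | null {
  const key = store.getItem(API_KEY_STORAGE_KEY);
  return key !== null && key.length > 0 ? key : null;
}

const store = new MemoryStore();
console.log(loadApiKey(store) === null ? "prompt for key" : "key present");
saveApiKey(store, "  example-key-not-real  ");
console.log(loadApiKey(store));
```

Note that local storage is not encrypted, so anyone with access to the user profile on the machine can read the saved key.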

How to Run Locally (for Development)

  1. Clone the repository:

    git clone https://github.com/AjarnSpencer/endless-text-narrator.git
    cd endless-text-narrator
  2. Install dependencies:

    npm install
  3. Run the React development server:

    npm run dev
  4. In a separate terminal, run the Electron app:

    npm run electron:start

    This will open a native desktop window with the application running and connected to the development server, with hot-reloading enabled.
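
To show how an Electron window can point at the development server in one mode and at the built files in another, here is a small sketch of that decision as a pure function. The dev server URL, the build output path, and the name `resolveStartTarget` are all assumptions for illustration; check the project's own configuration for the real values.

```typescript
// Where the desktop window should load the UI from.
type StartTarget =
  | { kind: "url"; value: string }
  | { kind: "file"; value: string };

// Assumed values: the real dev server port and build output path may differ.
const DEV_SERVER_URL = "http://localhost:5173";
const PRODUCTION_INDEX = "dist/index.html";

// Packaged builds load the bundled HTML file; development loads the live
// server so that hot-reloading works.
function resolveStartTarget(
  isPackaged: boolean,
  devUrl: string = DEV_SERVER_URL,
): StartTarget {
  return isPackaged
    ? { kind: "file", value: PRODUCTION_INDEX }
    : { kind: "url", value: devUrl };
}

// In an Electron main process this could be wired up roughly as:
//   const target = resolveStartTarget(app.isPackaged);
//   if (target.kind === "url") win.loadURL(target.value);
//   else win.loadFile(target.value);
console.log(resolveStartTarget(false));
console.log(resolveStartTarget(true));
```

Keeping the decision in a plain function makes it easy to check without launching Electron, which is why the Electron calls appear only in the comment.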

How to Build Executables

To compile the application into installers for Windows, macOS, and Linux, run the following command:

npm run electron:package

The process will first build the React application for production, then package it using electron-builder. The final executables and installers will be located in the /release directory.

Contributing

Contributions are welcome! Please read our Contributing Guidelines to get started. Also, please review our Code of Conduct.

License

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.

Author: Ajarn Spencer Littlewood and Gemini CLI Unleashed in Collaboration with AI Studio

Built with AI Studio

