diff --git a/examples/server/README.md b/examples/server/README.md
index 6f30f15..18c072b 100644
--- a/examples/server/README.md
+++ b/examples/server/README.md
@@ -83,9 +83,16 @@ curl http://127.0.0.1:8080/v1/audio/speech \
 
 The only required parameter is `input` otherwise generation configuration will be determined by the defaults set on server initialization, and the `response_format` will use `wav`. The `response_format` field currently supports only `wav` and `aiff` audio formats.
 
+#### Voices
+
+For models that support voices, a complete JSON list of supported voices can be queried via the voices endpoint, `/v1/audio/voices`:
+
+```bash
+curl http://127.0.0.1:8080/v1/audio/voices
+```
+
 ### Future Work
 
 Future work will include:
 * Support for token authentication and permissioning
-* Multiple model support
 * Streaming audio, for longform audio generation.
diff --git a/examples/server/public/index.html b/examples/server/public/index.html
index ffaa29c..068eb9c 100644
--- a/examples/server/public/index.html
+++ b/examples/server/public/index.html
@@ -60,41 +60,6 @@
         gap: 10px;
       }
 
-      select {
-        appearance: base-select;
-        flex-grow: 1;
-        box-sizing: border-box;
-        padding: 10px;
-        border: 1px solid #d1d5db;
-        border-radius: 6px;
-        background: none;
-        font-family: inherit;
-        font-size: 0.875rem;
-        transition:
-          border-color 0.2s,
-          box-shadow 0.2s;
-      }
-
-      select:focus {
-        outline: none;
-        border-color: #3b82f6;
-        box-shadow: 0 0 0 2px rgba(59, 130, 246, 0.2);
-      }
-
-      ::picker(select) {
-        appearance: base-select;
-        flex-grow: 1;
-        box-sizing: border-box;
-        padding: 10px;
-        border: 1px solid #d1d5db;
-        border-radius: 6px;
-        font-family: inherit;
-        font-size: 0.875rem;
-        transition:
-          border-color 0.2s,
-          box-shadow 0.2s;
-      }
-
       .refresh-btn {
         padding-right: 9.5px;
         padding-left: 9.5px;
@@ -136,7 +101,8 @@
       }
 
       textarea,
-      input[type="text"] {
+      input[type="text"],
+      select {
         box-sizing: border-box;
         width: 100%;
         padding: 10px;
@@ -148,7 +114,8 @@
       }
 
       textarea:focus,
-      input[type="text"]:focus {
+      input[type="text"]:focus,
+      select:focus {
         outline: none;
         border-color: #3b82f6;
         box-shadow: 0 0 0 2px rgba(59, 130, 246, 0.2);
@@ -159,6 +126,25 @@
         resize: vertical;
       }
 
+      select {
+        appearance: base-select;
+        background: none;
+      }
+
+      ::picker(select) {
+        appearance: base-select;
+        flex-grow: 1;
+        box-sizing: border-box;
+        padding: 10px;
+        border: 1px solid #d1d5db;
+        border-radius: 6px;
+        font-family: inherit;
+        font-size: 0.875rem;
+        transition:
+          border-color 0.2s,
+          box-shadow 0.2s;
+      }
+
       .slider-container {
         margin-top: 8px;
       }
@@ -369,6 +355,14 @@

TTS.cpp Server API

API key for authentication (does nothing for now)

+
+ + +

Voice to use for the speech (not all models have voices)

+
+