From 6210be5927bab2b1be7e03e8a4eb348a17e43010 Mon Sep 17 00:00:00 2001
From: Peggy0422
Date: Mon, 10 Nov 2025 10:50:59 +0800
Subject: [PATCH 1/5] Update media.wsdl
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

1. Added AddTTSAudioClip request and AddTTSAudioClip response for sending a text and its TTS configuration to the device (1621-1652) (2036-2041) (2418-2422) (2935-2943).
2. Added complex type "TTS Audio" (1465-1485) for TTSConfiguration to support the TTS function. It includes the parameters Content, Language and VoiceType.
3. Updated AudioClipCapabilities with TTSCapabilities (177-181), and added a complex type for TTSCapabilities (201-220) to indicate that the device supports the TTS function and its corresponding configuration. The complex type TTSCapabilities includes MaxContentLength, TTSLanguage and TTSVoiceType.
4. Added simpleTypes TTSLanguage (220-231) and TTSVoiceType (232-238).
---
 wsdl/ver20/media/wsdl/media.wsdl | 126 ++++++++++++++++++++++++++++++-
 1 file changed, 124 insertions(+), 2 deletions(-)

diff --git a/wsdl/ver20/media/wsdl/media.wsdl b/wsdl/ver20/media/wsdl/media.wsdl
index 3a9729588..a086dd718 100644
--- a/wsdl/ver20/media/wsdl/media.wsdl
+++ b/wsdl/ver20/media/wsdl/media.wsdl
@@ -174,6 +174,11 @@ IN NO EVENT WILL THE CORPORATION OR ITS MEMBERS OR THEIR AFFILIATES BE LIABLE FO
+
+
+ Indicates device has TTS capability.
+
+
@@ -194,6 +199,46 @@ IN NO EVENT WILL THE CORPORATION OR ITS MEMBERS OR THEIR AFFILIATES BE LIABLE FO
+
+
+
+
+ Indicates the maximum length of content of a text for device to convert to an audio clip.
+
+
+
+
+ Indicates what language(s) the device supports for TTS funciton.
+
+
+
+
+ Indicates the voice type for TTS funciton.
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
@@ -1417,8 +1462,31 @@ IN NO EVENT WILL THE CORPORATION OR ITS MEMBERS OR THEIR AFFILIATES BE LIABLE FO
-
+
+
+
+
+
+
+ Content of the audio clip.
+
+
+
+
+ Language for the audio clip content, it is the same as the language during playback.
+
+
+
+
+ The voice type of audio clip playback.
+
+
+
+
+
+
+
@@ -1551,7 +1619,40 @@ IN NO EVENT WILL THE CORPORATION OR ITS MEMBERS OR THEIR AFFILIATES BE LIABLE FO
-
+
+
+
+
+
+
+ Optional token associated with the audio clip.
+
+
+
+
+ Audio clip configuration to add.
+
+
+
+
+ The configuration for the TTS audio clip to add.
+
+
+
+
+
+
+
+
+
+
+ Unique token of the TTS audio clip to be uploaded.
+
+
+
+
+
+
@@ -1934,6 +2035,12 @@ IN NO EVENT WILL THE CORPORATION OR ITS MEMBERS OR THEIR AFFILIATES BE LIABLE FO
+
+
+
+
+
+
@@ -2310,6 +2417,12 @@ image will be updated automatically and independent from calls to GetSnapshotUri
+
+ This operation sends a text and its configuartion to device that supports TTS function, so that device could convert the text into an audio clip and play it according to audio clip Configuration and TTS Configuration.
+
+
+
+
 This operation modifies the existing audio clip configuration on the device.
@@ -2821,6 +2934,15 @@ image will be updated automatically and independent from calls to GetSnapshotUri
+
+
+
+
+
+
+
+
+

From d2607c705d84333dcc0e74903b74151d4bf9eceb Mon Sep 17 00:00:00 2001
From: Peggy0422
Date: Mon, 10 Nov 2025 11:29:28 +0800
Subject: [PATCH 2/5] Update Media2.xml
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

1. Added detailed descriptions for AddTTSAudioClip operations, explaining their purpose, parameters, and responses (2359-2416).
2.
Updated audio clip Capabilities with TTSCapabilities (2698-2700).
---
 doc/Media2.xml | 64 +++++++++++++++++++++++++++++++++++++++++++++++++-
 1 file changed, 63 insertions(+), 1 deletion(-)

diff --git a/doc/Media2.xml b/doc/Media2.xml
index 3ac5f4337..91ac3649d 100644
--- a/doc/Media2.xml
+++ b/doc/Media2.xml
@@ -2355,7 +2355,65 @@
 When the size of the audio clip upload exceeds the MaxAudioClipSize parameter in KB, the device should return an HTTP 413, Request Entity Too Large error to the client.
-
+
+ AddTTSAudioClip
+ This operation adds a text and its configuartion to the device so that device could convert the text to an audio clip and play it based on the configuration.
+ The response to the command includes a unique token for this converted audio clip.
+ If the device is unable to support certain language,then returns error message.
+
+
+ request
+
+ Token - optional[tt:ReferenceToken]
+ Optional token associated with the audio clip.
+ Configuration -
+ [tr2:AudioClip]
+ Audio clip configuration to add.
+ TTSConfiguration -
+ [tr2:TTSAudio]
+ TTS configuration to add.
+
+
+
+ response
+
+ Token - [tt:ReferenceToken]
+ Unique token of the TTS audio clip to be uploaded.
+
+
+
+ faults
+
+ env:Receiver - ter:Action - ter:MaxAudioClipLimit
+ The maximum number of audio clip configurations supported by the device has been reached.
+ env:Sender - ter:InvalidArgVal - ter:InvalidConfig
+ The configuration parameters are not possible to set.
+ env:Sender - ter:InvalidArgVal - ter:InvalidLanguage
+ The language is not supported.
+
+
+
+ access class
+
+ WRITE_SYSTEM
+
+
+
+
+ Note: Audio clip uploads to the device can fail in the following scenarios, and a specific HTTP error code should be returned to the client when an upload fails.
+
+
+ When the upload URL has expired, the device should return an HTTP 410 error to the client.
+
+
+ When the format of the audio clip upload does not match the Type parameter in the audio clip configuration, the device should return an HTTP 415 Unsupported Media Type error to the client.
+
+
+ When the size of the audio clip upload exceeds the MaxAudioClipSize parameter in KB, the device should return an HTTP 413, Request Entity Too Large error to the client.
+
+
+
 SetAudioClip
 This operation modifies the existing audio clip configuration on the device.
@@ -2637,6 +2695,10 @@
 The following audio clip Capabilities are available:
+
+ TTSCapabilities
+ Indicates device supports TTS function and corresponding configuration.
+
 MaxAudioClipLimit
 Indicates the maximum number of audio clips that can be uploaded to the device.

From 043366eb1947d88ce17266f91234e54314c355b0 Mon Sep 17 00:00:00 2001
From: Peggy0422
Date: Mon, 10 Nov 2025 15:35:48 +0800
Subject: [PATCH 3/5] Update media.wsdl

update code line information for TTS function
---
 wsdl/ver20/media/wsdl/media.wsdl | 78 ++++++++++++++++++++------------
 1 file changed, 44 insertions(+), 34 deletions(-)

diff --git a/wsdl/ver20/media/wsdl/media.wsdl b/wsdl/ver20/media/wsdl/media.wsdl
index a086dd718..5a12208ae 100644
--- a/wsdl/ver20/media/wsdl/media.wsdl
+++ b/wsdl/ver20/media/wsdl/media.wsdl
@@ -174,11 +174,13 @@ IN NO EVENT WILL THE CORPORATION OR ITS MEMBERS OR THEIR AFFILIATES BE LIABLE FO
+
 Indicates device has TTS capability.
+
@@ -199,26 +201,29 @@ IN NO EVENT WILL THE CORPORATION OR ITS MEMBERS OR THEIR AFFILIATES BE LIABLE FO
+
-
-
- Indicates the maximum length of content of a text for device to convert to an audio clip.
-
-
-
-
- Indicates what language(s) the device supports for TTS funciton.
-
-
-
-
- Indicates the voice type for TTS funciton.
-
-
+
+
+
+ Indicates the maximum length of content of a text for device to convert to an audio clip.
+
+
+
+
+ Indicates what language(s) the device supports for TTS funciton.
+
+
+
+
+ Indicates the voice type for TTS funciton.
+
+
+
-
+
@@ -231,14 +236,15 @@ IN NO EVENT WILL THE CORPORATION OR ITS MEMBERS OR THEIR AFFILIATES BE LIABLE FO
-
+
-
+
+
@@ -1462,31 +1468,30 @@ IN NO EVENT WILL THE CORPORATION OR ITS MEMBERS OR THEIR AFFILIATES BE LIABLE FO
-
-
+
-
+
- Content of the audio clip.
+ Content of the audio clip.
-
-
+
+
- Language for the audio clip content, it is the same as the language during playback.
+ Language for the audio clip content, it is the same as the language during playback.
-
-
+
+
- The voice type of audio clip playback.
+ The voice type of audio clip playback.
-
+
-
-
+
+
@@ -1619,7 +1624,7 @@ IN NO EVENT WILL THE CORPORATION OR ITS MEMBERS OR THEIR AFFILIATES BE LIABLE FO
-
+
@@ -1652,7 +1657,7 @@ IN NO EVENT WILL THE CORPORATION OR ITS MEMBERS OR THEIR AFFILIATES BE LIABLE FO
-
+
@@ -2035,12 +2040,14 @@ IN NO EVENT WILL THE CORPORATION OR ITS MEMBERS OR THEIR AFFILIATES BE LIABLE FO
+
+
@@ -2417,12 +2424,13 @@ image will be updated automatically and independent from calls to GetSnapshotUri
+
 This operation sends a text and its configuartion to device that supports TTS function, so that device could convert the text into an audio clip and play it according to audio clip Configuration and TTS Configuration.
-
+
 This operation modifies the existing audio clip configuration on the device.
@@ -2934,6 +2942,7 @@ image will be updated automatically and independent from calls to GetSnapshotUri
+
@@ -2943,6 +2952,7 @@ image will be updated automatically and independent from calls to GetSnapshotUri
+

From 43f83bf547cd96a15eb317faeb6ed23f65015f77 Mon Sep 17 00:00:00 2001
From: Peggy0422
Date: Tue, 11 Nov 2025 21:13:33 +0800
Subject: [PATCH 4/5] Update media.wsdl

correct some editorial errors
---
 wsdl/ver20/media/wsdl/media.wsdl | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/wsdl/ver20/media/wsdl/media.wsdl b/wsdl/ver20/media/wsdl/media.wsdl
index 5a12208ae..5ee00b771 100644
--- a/wsdl/ver20/media/wsdl/media.wsdl
+++ b/wsdl/ver20/media/wsdl/media.wsdl
@@ -213,7 +213,7 @@ IN NO EVENT WILL THE CORPORATION OR ITS MEMBERS OR THEIR AFFILIATES BE LIABLE FO
- Indicates what language(s) the device supports for TTS funciton.
+ Indicates what language(s) the device supports for TTS function.

From ea5b5ddf7bd20a2e4295b899ea7a84cfc1d064d6 Mon Sep 17 00:00:00 2001
From: Peggy0422
Date: Mon, 1 Dec 2025 17:08:37 +0800
Subject: [PATCH 5/5] Update Media2.xml

Updated the description of the AddTTSAudioClip operation to clarify the parameters and response.
Updated the description of TTSCapabilities.
---
 doc/Media2.xml | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/doc/Media2.xml b/doc/Media2.xml
index b1ba5d110..602f62098 100644
--- a/doc/Media2.xml
+++ b/doc/Media2.xml
@@ -2358,9 +2358,9 @@
 AddTTSAudioClip
- This operation adds a text and its configuartion to the device so that device could convert the text to an audio clip and play it based on the configuration.
- The response to the command includes a unique token for this converted audio clip.
- If the device is unable to support certain language,then returns error message.
+ This operation adds a text, audio clip configuration and TTS configuration to the device, for device converting the text to an audio clip based on the TTS configuration.
+ The response to the command includes a unique token for this converted audio clip.
+ If the device is unable to support language specified in the TTS configuration, the associated configuration will be deleted from the device.
 request
@@ -2814,7 +2814,7 @@
 TTSCapabilities
- Indicates device supports TTS function and corresponding configuration.
+ Indicates device supports TTS function and TTS configuration.
 MaxAudioClipLimit