Rework TranscriptionManager #860

Closed
peter-olai wants to merge 4 commits into NPC-Prefab-Improvements from Transcribe-Backend-Option

Conversation

@peter-olai peter-olai commented May 1, 2025

  • What kind of change does this PR introduce? (Bug fix, feature, docs update, ...)
    This PR introduces server-side transcription: sound files are sent to an external server run in chat-service.

  • What is the current behavior? (You can also link to an open issue here)
    Transcription runs only on the client side.

  • What is the new behavior? (if this is a feature change)
    Audio can be sent to a backend for transcription to reduce transcription time. Local transcription has also been optimized.

  • Does this PR introduce a breaking change? (What changes might users need to make in their application due to this PR?)
    If you change where the server runs, you must manually set the server IP on the TranscriptionManager. Changing it after the build is not implemented; that option was scrapped due to time constraints.

  • Other information:

MAKE SURE TO SET TRANSCRIPTIONMANAGER SETTINGS TO YOUR NEEDS.
image
If the server transcription option is unticked, transcription runs locally; if it is ticked, transcription goes through the Transcription Api Url.
image
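
For reviewers, the toggle described above could be sketched roughly as follows. This is a minimal illustration in plain C#, not the code in this PR: `ITranscriber`, `ServerTranscriber`, `TranscriberFactory`, and `useServerTranscription` are hypothetical names, and the request format (raw WAV bytes in the POST body, plain-text transcript in the response) is an assumption about the chat-service endpoint.

```csharp
using System;
using System.Net.Http;
using System.Net.Http.Headers;
using System.Threading.Tasks;

// Hypothetical abstraction: both the local and the server path return a transcript.
public interface ITranscriber
{
    Task<string> TranscribeAsync(byte[] wavBytes);
}

// Hypothetical server-side path. Assumes the endpoint accepts raw WAV bytes
// and answers with the transcript as plain text.
public sealed class ServerTranscriber : ITranscriber
{
    private static readonly HttpClient Http = new HttpClient();
    private readonly Uri _apiUrl;

    public ServerTranscriber(string transcriptionApiUrl)
    {
        _apiUrl = new Uri(transcriptionApiUrl);
    }

    public async Task<string> TranscribeAsync(byte[] wavBytes)
    {
        using (var content = new ByteArrayContent(wavBytes))
        {
            content.Headers.ContentType = new MediaTypeHeaderValue("audio/wav");
            using (HttpResponseMessage response = await Http.PostAsync(_apiUrl, content))
            {
                // Throws HttpRequestException on a non-success status code.
                response.EnsureSuccessStatusCode();
                return await response.Content.ReadAsStringAsync();
            }
        }
    }
}

public static class TranscriberFactory
{
    // Unticked: use the local transcriber. Ticked: send audio to the configured URL.
    public static ITranscriber Create(
        bool useServerTranscription,
        string transcriptionApiUrl,
        ITranscriber localTranscriber)
    {
        if (!useServerTranscription)
        {
            return localTranscriber;
        }

        if (string.IsNullOrWhiteSpace(transcriptionApiUrl))
        {
            throw new ArgumentException(
                "A Transcription Api Url is required when server transcription is enabled.",
                nameof(transcriptionApiUrl));
        }

        return new ServerTranscriber(transcriptionApiUrl);
    }
}
```

Failing fast on an empty URL is one way to surface the manual server-IP requirement early, since the address cannot be changed after the build.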

There is an import located in Assets > Samples > 6 - MyTry.

This needs to be reworked into a cleaner package before the project is exported later on.

peter-olai added 3 commits May 1, 2025 13:36
Updated multiple Unity scenes and scripts to integrate server-side transcription capabilities.
Key changes include the addition of `ServerTranscriptionManager` for managing audio transcription through a server API, modifications to `MicrophoneDemo.cs` for improved audio recording, and enhancements to UI elements for user interaction.
Also updated occlusion culling and render settings in `MyTry.unity`, along with adjustments to prefab properties across various assets.

refs: #1
- Modified `_nPCs` array size in `ReceptionOutdoor.unity`.
- Removed `transcriptionApiUrl` and updated metadata in prefab.
- Enhanced `Electronic Highway Sign SDF.asset` with new glyph metrics.
- Changed API endpoint to local server in `AIRequest.cs`.
- Simplified query handling in `AIConversationController.cs`.
- Disabled server transcription in `TranscriptionManager.prefab`.

refs: #1
@peter-olai peter-olai requested a review from HaluBoi May 1, 2025 19:47
@peter-olai peter-olai self-assigned this May 1, 2025

@HaluBoi HaluBoi left a comment

There's a problem with finding the microphone when running the application on the Quest, so let's fix that before merging these changes 👍

@HaluBoi HaluBoi closed this May 2, 2025