Add Voice Command Interface using Pipecat #2
The available Pipecat version is 0.0.57.
I'll check the comments on PR #2 as requested. Looking at the PR details now.
I've reviewed the voice command implementation using Pipecat and it looks great overall! Here are a few observations and suggestions for improvement:
Potential Issues
Suggested Improvements
These are just suggestions for future enhancements - the current implementation is already a solid foundation for voice commands in CompUse!
I've fixed the ImportError by changing AudioFrame to AudioRawFrame. According to the Pipecat documentation, the correct class name is AudioRawFrame. The change is:
- from pipecat.frames.frames import AudioFrame, EndFrame, TextFrame
+ from pipecat.frames.frames import AudioRawFrame, EndFrame, TextFrame
This should resolve the error.
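If the code ever needs to tolerate more than one candidate class name, a small lookup helper is one option. This is only a sketch: `import_first_available` is a hypothetical helper, not part of Pipecat or of this PR, and the source only confirms that `AudioRawFrame` is the name that imports from `pipecat.frames.frames`. The demo uses a standard-library module so it runs without Pipecat installed.

```python
import importlib
from typing import Any, Sequence


def import_first_available(module_name: str, candidates: Sequence[str]) -> Any:
    """Return the first attribute named in `candidates` that the module exposes.

    Hypothetical helper for illustration; raises ImportError if none is found.
    """
    module = importlib.import_module(module_name)
    for name in candidates:
        if hasattr(module, name):
            return getattr(module, name)
    raise ImportError(f"none of {list(candidates)} found in {module_name!r}")


if __name__ == "__main__":
    # Demonstrated on the standard library so the sketch runs anywhere.
    # For this PR the call would presumably look like:
    #   import_first_available("pipecat.frames.frames", ["AudioRawFrame", "AudioFrame"])
    cls = import_first_available("collections", ["NoSuchName", "OrderedDict"])
    print(cls.__name__)
```

Listing the preferred name first keeps the behaviour predictable: the fallback is only consulted when the preferred name is missing.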
I've fixed the import error in the PR. The issue was that the code was trying to import from
Changes made:
This should resolve the import error you were seeing. |
Voice Command Interface for CompUse
This PR adds a voice command interface to CompUse using the Pipecat library as requested. The implementation allows users to control their computer using voice commands, which are processed and executed through the CompUse agent.
Features
Implementation Details
voice_tools.py: Core implementation of the voice command interface using Pipecat
VoiceCommandManager class for managing voice recognition
voice_cli.py: CLI interface for using voice commands
Updated Dependencies: Added Pipecat and related dependencies to requirements.txt
Documentation: Added VOICE_COMMANDS.md with detailed usage instructions
Usage
Run the voice command interface:
Optional arguments:
--wake-word TEXT: Set a custom wake word (default: "computer")
--auto-start: Automatically start voice recognition on startup
When voice recognition is active, say the wake word followed by your command:
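The options and the wake-word behaviour described above can be sketched as follows. This is not the PR's actual voice_cli.py code: `build_parser` and `extract_command` are hypothetical names, and the matching rule (the transcript must start with the wake word, case-insensitively, followed by a non-alphanumeric character or nothing) is an assumption about how "wake word followed by your command" might be interpreted.

```python
import argparse
from typing import Optional


def build_parser() -> argparse.ArgumentParser:
    """Build a parser exposing the two documented options (sketch only)."""
    parser = argparse.ArgumentParser(description="Voice command interface (sketch)")
    parser.add_argument("--wake-word", default="computer", metavar="TEXT",
                        help="custom wake word")
    parser.add_argument("--auto-start", action="store_true",
                        help="start voice recognition on startup")
    return parser


def extract_command(transcript: str, wake_word: str) -> Optional[str]:
    """Return the text spoken after the wake word, or None if it does not apply."""
    text = transcript.strip()
    wake = wake_word.strip().lower()
    if not wake or not text.lower().startswith(wake):
        return None
    rest = text[len(wake):]
    if rest and rest[0].isalnum():
        # The wake word is only a prefix of a longer word, e.g. "computers".
        return None
    command = rest.lstrip(" ,.!?:;").strip()
    return command or None


if __name__ == "__main__":
    args = build_parser().parse_args(["--wake-word", "computer"])
    print(extract_command("Computer, open the browser", args.wake_word))
```

Rejecting transcripts where the wake word is merely the start of a longer word avoids triggering on ordinary speech, at the cost of missing commands that the recogniser transcribes without a word boundary.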
Configuration
Required environment variables:
OPENAI_API_KEY: For Whisper speech recognition
ELEVENLABS_API_KEY (optional): For voice feedback
ELEVENLABS_VOICE_ID (optional): For voice feedback
COMPUSE_WAKE_WORD (optional): Custom wake word
Next Steps