A Python-based application that uses OpenAI Whisper for voice recognition to control a robotic hand via a serial connection.
- Voice Recognition: Uses Whisper
basemodel for offline-capable, high-accuracy transcription. - Serial Communication: Send 5-bit binary commands (e.g.,
01101) to an Arduino or similar microcontroller. - Dynamic Mapping: Assign specific recognized phrases to custom binary commands.
- Continuous Listening: Toggle "Start Listening" for hands-free control.
- Real-time Logs: Monitor recognition and serial output through a built-in log.
- π Python 3.8+
- ποΈ Microphone
- π€ Serial Device (Arduino, ESP32, etc.)
pip install numpy sounddevice scipy openai-whisper pyserial PyQt5Note: openai-whisper depends on ffmpeg. Ensure it is installed on your system.
- Select Serial Port: Choose the COM port/Device path of your robotic hand.
- Connect: Click the π Connect button.
- Assign Commands:
- Click ποΈ Record Voice (3 seconds).
- Once recognized, enter a 5-bit binary command (e.g.,
10001) in the input box. - Click β Assign.
- Control:
- Speak the assigned phrase while the app is "Listening" or use the manual record button.
- The corresponding binary command will be sent over Serial.
main.py: Core application logic and threads.fancy_ui.py: PyQt5 GUI layout and styling.
MIT