
FishAI

A fully fledged, AI-powered assistant running on a Big Mouth Billy Bass!
FishAI has evolved into a hybrid edge/cloud IoT device. The "Fish" (Client) runs on a Raspberry Pi with vision and audio processing, while the "Brain" (Server) runs in the cloud (or locally) handling intelligence, state, and complex logic.

Table of Contents
  1. About The Project
  2. Getting Started
  3. Usage
  4. Roadmap
  5. Contributing
  6. License
  7. Contact
  8. Acknowledgments

About The Project

What started as a silly summer project to clone GPTARS.ai has evolved into a possibly over-engineered IoT platform. The project now utilizes a Client-Server architecture:

  1. The Fish (Client): A Raspberry Pi 4b inside the fish that handles:

    • Wake Word Detection: Uses Picovoice Porcupine to listen for keywords.
    • Vision: Captures images via a connected camera to send to the AI.
    • Animatronics: Controls the mouth, head, and tail via H-Bridges.
    • Audio: Plays responses using mpg123.
    • Health Monitoring: Reports CPU/Temp stats to the dashboard.
  2. The Brain (Cloud/Server): A Flask-based backend (deployable to AWS App Runner via Terraform) that handles:

    • Intelligence: Google Gemini 2.5 Flash for multimodal understanding (Text + Image).
    • Speech: ElevenLabs for voice synthesis.
    • State Management: Redis for command queuing (movements, speech) and health status.
  3. The Dashboard: A Next.js web application for configuring personalities, monitoring device health, and manual control.
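The command-queuing described above can be sketched roughly as follows. This is an illustrative model, not the project's actual code: the real Brain uses Redis lists for this, and the command names and payload shapes here are assumptions. An in-memory deque stands in for Redis so the example runs anywhere.

```python
import json
from collections import deque

class CommandQueue:
    """Stand-in for the Brain's Redis-backed per-device command queue."""

    def __init__(self):
        self._q = deque()

    def push(self, kind, payload):
        # The Brain enqueues movement/speech commands as JSON strings.
        self._q.append(json.dumps({"kind": kind, "payload": payload}))

    def pop(self):
        # The Fish polls and executes commands in FIFO order.
        return json.loads(self._q.popleft()) if self._q else None

queue = CommandQueue()
queue.push("move", {"part": "mouth", "duration_ms": 300})
queue.push("speak", {"audio": "reply.mp3"})
first = queue.pop()
print(first["kind"])  # → move
```

FIFO ordering matters here: the mouth movement must be dequeued before (or alongside) the speech command so the animatronics stay in sync with the audio.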

(back to top)

Built With

  • Hardware Control: Python, OpenCV (Computer Vision)
  • AI Services: Google Gemini, Picovoice, ElevenLabs
  • Backend: Flask, Redis, Docker
  • Frontend: Next.js, React, TypeScript
  • Infrastructure: AWS (App Runner, ECR), Terraform

(back to top)

Getting Started

Because the project is split into three parts (Client, Server, Dashboard), setup is a bit more involved than before.

Hardware Prerequisites

  • Raspberry Pi (4b recommended)
  • Big Mouth Billy Bass
  • Camera: USB Webcam or Pi Camera (for vision capabilities)
  • Microphone: INMP441 MEMS mic or USB mic
  • 3 H-Bridges (for motor control)
  • 9V battery/power supply
  • Speaker/Audio output

Installation

  1. Get API Keys You will need keys for: Google Gemini, ElevenLabs, and Picovoice.

  2. Clone the Repo

    git clone https://github.com/ben-kahl/FishAI.git
    cd FishAI

Part 1: The Server (Cloud/Local)

You can run the server locally or deploy it to AWS.

  • Local Run: Ensure you have a local Redis instance running (redis-server).

    cd cloud
    pip install -r cloud_requirements.txt
    python server.py
  • AWS Deployment: Infrastructure is managed via Terraform.

    cd cloud
    # Ensure AWS credentials are set
    terraform init
    terraform apply
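To show how the server ties the pieces together, here is a minimal sketch of query handling: pick a personality, build a prompt, call the model, return a reply. The function names, personality texts, and flow are assumptions for illustration; the real server.py wires this into Flask routes, calls Gemini, and hands the reply to ElevenLabs for synthesis. The model call is stubbed so the example is self-contained.

```python
# Hypothetical sketch of the Brain's query flow (names are assumptions).
PERSONALITIES = {
    "normal": "You are a helpful talking fish.",
    "sassy": "You are a sarcastic talking fish.",
}

def build_prompt(personality: str, query: str) -> str:
    # Prepend the active personality's system text to the user query.
    system = PERSONALITIES.get(personality, PERSONALITIES["normal"])
    return f"{system}\nUser: {query}"

def handle_query(personality: str, query: str, model=lambda p: "blub") -> dict:
    # `model` stands in for the Gemini call; in the real server the reply
    # would then be synthesized and queued for the Fish to play.
    reply = model(build_prompt(personality, query))
    return {"personality": personality, "reply": reply}

print(handle_query("sassy", "What do you see?"))
```

Keeping the personality text separate from the query is what lets the dashboard switch personalities at runtime without touching the client.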

Part 2: The Fish (Raspberry Pi)

Run this on the Raspberry Pi.

  1. Install system dependencies:
    sudo apt-get install mpg123 libatlas-base-dev
  2. Install Python dependencies:
    cd client
    pip install -r client_requirements.txt
  3. Configure Environment: Create a .env file in the client/ directory:
    PICOVOICE_API_KEY=<YOUR_KEY>
    CLOUD_URL=http://<YOUR_SERVER_IP>:5000
    MICROPHONE_INDEX=0
    CAMERA_INDEX=0
  4. Start the Client:
    python main.py
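The client might read the .env values from step 3 along these lines. This is a sketch, not the project's actual loader: it uses plain os.environ (assuming the variables are exported or loaded by something like python-dotenv), and the defaults shown are illustrative assumptions.

```python
import os

def load_client_config() -> dict:
    # Read the variables from step 3; device indices are parsed as ints.
    # Defaults here are assumptions for illustration only.
    return {
        "picovoice_key": os.environ.get("PICOVOICE_API_KEY", ""),
        "cloud_url": os.environ.get("CLOUD_URL", "http://localhost:5000"),
        "mic_index": int(os.environ.get("MICROPHONE_INDEX", "0")),
        "camera_index": int(os.environ.get("CAMERA_INDEX", "0")),
    }

os.environ["MICROPHONE_INDEX"] = "2"
print(load_client_config()["mic_index"])  # → 2
```

Parsing the indices to ints up front avoids passing strings to audio/camera APIs that expect integer device indices.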

Part 3: The Dashboard (Optional)

A web interface to control the fish.

  1. Navigate to the dashboard directory:
    cd fish-dashboard
  2. Install and Run:
    npm install
    npm run dev

(back to top)

Usage

Once everything is running:

  1. Voice Control: Say the wake word (default: "Jarvis" or similar, based on your .ppn file). The Fish will wake up, snap a picture of what it sees, listen to your query, and respond using Gemini's multimodal capabilities.
  2. Web Control: Open the Next.js dashboard to:
    • Change the Fish's "Personality" (e.g., Normal, Sassy, Excited).
    • View live health stats (Temperature, CPU load).
    • Manually send text queries.
    • Control volume.
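The health stats shown on the dashboard could be gathered on the Pi roughly like this. The file paths are the standard Linux thermal and load-average interfaces; how the client actually reports them to the Brain (endpoint, payload shape) is not shown here and would be an assumption.

```python
import os

def read_health() -> dict:
    # CPU temperature from the Pi's thermal zone (reported in millidegrees C);
    # returns None when the file is absent (e.g. on non-Linux hosts).
    temp_c = None
    path = "/sys/class/thermal/thermal_zone0/temp"
    if os.path.exists(path):
        with open(path) as f:
            temp_c = int(f.read().strip()) / 1000.0
    # 1-minute load average as a rough CPU-load proxy.
    load = os.getloadavg()[0] if hasattr(os, "getloadavg") else None
    return {"cpu_temp_c": temp_c, "load_1m": load}

print(read_health())
```

The client would periodically POST a dict like this to the server (at CLOUD_URL), where it lands in Redis for the dashboard to read.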

(back to top)

Roadmap

  • Voice Pipeline: Voice-to-audio AI responses, end to end.
  • Vision Capabilities: The fish can now "see" you.
  • Cloud Infrastructure: Terraform + AWS App Runner support.
  • Web Dashboard: Full Next.js control panel.
  • Personality Engine: Dynamic personality switching via Redis.

See the open issues for a full list of proposed features (and known issues).

(back to top)

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
  3. Commit your Changes (git commit -m 'Add some AmazingFeature')
  4. Push to the Branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

(back to top)

License

Distributed under the MIT License. See LICENSE for more information.

(back to top)

Contact

Ben Kahl - linkedin-url

Project Link: https://github.com/ben-kahl/FishAI

(back to top)