Zikos - AI Music Teacher POC

A proof-of-concept AI music teacher that combines LLM chat interaction with audio analysis and MIDI generation for personalized music instruction.

Status

🚧 Early development - POC implementation - Vibe-coding involved - Not to be used as-is

Quick Overview

  • Audio Input: User recordings analyzed via signal processing tools
  • LLM: Qwen2.5/Qwen3 models with excellent function calling support (or Llama 3.3 70B)
  • Output: Text feedback + MIDI-generated musical examples with notation
  • Architecture: FastAPI backend + TypeScript frontend
  • Backends: Supports both llama-cpp-python (GGUF) and HuggingFace Transformers (safetensors)

Hardware Support

Zikos tries to support a wide range of hardware configurations:

  • CPU-only: Works without GPU (very slow, but functional)
  • Small GPU (8GB VRAM): RTX 3060Ti, RTX 3070, etc. - Qwen2.5-7B recommended
  • Medium GPU (16-24GB VRAM): RTX 3090, RTX 4090, etc. - Qwen2.5-14B or Llama 3.3 70B
  • Large GPU (80GB+ VRAM): H100, A100, etc. - Qwen3-32B with 128K context window
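
As a rough sketch of how these tiers translate into settings, the main knobs are which model LLM_MODEL_PATH points at, the context size (LLM_N_CTX), and how much of the model is offloaded to the GPU (LLM_N_GPU_LAYERS). The values below are illustrative, assuming LLM_N_GPU_LAYERS behaves like llama-cpp-python's n_gpu_layers (0 = CPU only, -1 = offload everything); see CONFIGURATION.md for tuned, hardware-specific settings.

# CPU-only (slow but functional)
export LLM_MODEL_PATH=./models/Qwen2.5-7B-Instruct-Q4_K_M.gguf
export LLM_N_GPU_LAYERS=0

# Small/medium GPU: offload all layers, keep a moderate context
export LLM_N_GPU_LAYERS=-1
export LLM_N_CTX=32768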

Setup

Prerequisites

  • Python 3.11+
  • FFmpeg (for audio preprocessing)
  • LLM model file (GGUF or HuggingFace Transformers format) - see Downloading Models below
  • GPU recommended (8GB+ VRAM) but CPU-only is supported

Installation

# Create virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install .

# Install JavaScript dependencies (for TypeScript frontend)
npm install
npm run build  # Build TypeScript to JavaScript

# Set environment variables
cp .env.example .env  # On Windows: copy .env.example .env
# Edit .env with your settings (especially LLM_MODEL_PATH)

Environment Variables

Zikos can be configured via environment variables. Copy .env.example to .env and adjust values for your setup.
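
For reference, a minimal .env might contain just the variables used elsewhere in this README; .env.example documents the full set:

LLM_MODEL_PATH=./models/Qwen2.5-7B-Instruct-Q4_K_M.gguf
LLM_N_CTX=32768
LLM_N_GPU_LAYERS=0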

Downloading Models

You can download models using the provided helper script. See MODEL_RECOMMENDATIONS.md for detailed recommendations.

# List available models
python scripts/download_model.py --list

# Download a model (e.g. Qwen2.5-7B Instruct, Q4 quantization) into ./models
python scripts/download_model.py qwen2.5-7b-instruct-q4 -o ./models

# With Hugging Face token (for private models)
python scripts/download_model.py qwen3-32b-instruct -t YOUR_TOKEN

The script supports both GGUF (llama-cpp-python) and Transformers (HuggingFace) formats. After downloading, LLM_MODEL_PATH is configured automatically if your .env was created by the setup script; otherwise, set it manually:

# For GGUF models
export LLM_MODEL_PATH=./models/Qwen2.5-7B-Instruct-Q4_K_M.gguf
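
For Transformers-format models, LLM_MODEL_PATH instead points at the downloaded model directory rather than a single file; the directory name below is illustrative, so match it to whatever the download script actually produced:

# For Transformers (safetensors) models -- directory name is illustrative
export LLM_MODEL_PATH=./models/Qwen3-32B-Instruct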

Note: The script requires huggingface_hub for Transformers models. Install with:

# Recommended: install model download helpers
pip install -e ".[model-download]"

# Or install individually
pip install huggingface_hub

Run

python run.py

API will be available at http://localhost:8000
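
Since the backend is FastAPI, the auto-generated interactive API docs are served once the app is up; a quick smoke test could look like this (routes other than the standard FastAPI /docs page are not guaranteed):

# Probe the interactive API docs to confirm the server is responding
curl -I http://localhost:8000/docs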

Docker

Zikos can be run using Docker, which handles all dependencies and setup automatically.

Prerequisites

  • Docker and Docker Compose installed
  • LLM model file downloaded to ./models/ directory (see Downloading Models)

Using Docker Compose (Recommended)

The easiest way to run Zikos with Docker:

# Set the model filename (optional, defaults to Llama-3.1-8B-Instruct-Q4_K_M.gguf)
export LLM_MODEL_FILE=Qwen2.5-7B-Instruct-Q4_K_M.gguf

# Build and start the container
docker-compose up --build

# Or run in detached mode
docker-compose up -d --build

The API will be available at http://localhost:8000. The container automatically:

  • Builds the frontend TypeScript code
  • Mounts your ./models directory (read-only) for model access
  • Creates and mounts storage directories for audio, MIDI, and notation files
  • Sets up environment variables with sensible defaults
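
To check that the container came up cleanly (particularly in detached mode), the standard Docker Compose commands apply; the service name is whatever docker-compose.yml defines:

docker-compose ps        # Show container status
docker-compose logs -f   # Follow logs from all services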

Using Docker Directly

# Build the image
docker build -t zikos .

# Run the container
docker run -d \
  --name zikos \
  -p 8000:8000 \
  -v "$(pwd)/models:/app/models:ro" \
  -v "$(pwd)/audio_storage:/app/audio_storage" \
  -v "$(pwd)/midi_storage:/app/midi_storage" \
  -v "$(pwd)/notation_storage:/app/notation_storage" \
  -e LLM_MODEL_PATH=/app/models/Qwen2.5-7B-Instruct-Q4_K_M.gguf \
  -e LLM_N_CTX=32768 \
  -e LLM_N_GPU_LAYERS=0 \
  zikos

Docker Configuration

The Docker setup uses volumes to persist data:

  • ./models → /app/models (read-only): Model files
  • ./audio_storage → /app/audio_storage: Uploaded audio files
  • ./midi_storage → /app/midi_storage: Generated MIDI files
  • ./notation_storage → /app/notation_storage: Generated notation files

Environment variables can be customized in docker-compose.yml or passed via -e flags when using docker run. See Environment Variables for available options.

Note: For GPU support, you'll need to configure Docker with GPU access (e.g., --gpus all flag or Docker Compose GPU configuration) and adjust LLM_N_GPU_LAYERS accordingly.
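
As a sketch of what that might look like with plain docker run, the main changes are the --gpus flag and a non-zero LLM_N_GPU_LAYERS:

# --gpus all requires NVIDIA GPU access on the host (e.g. the NVIDIA Container Toolkit).
# LLM_N_GPU_LAYERS=-1 assumes the value is passed through to llama-cpp-python,
# where -1 offloads all layers; adjust to your VRAM if needed.
docker run -d \
  --name zikos \
  --gpus all \
  -p 8000:8000 \
  -v "$(pwd)/models:/app/models:ro" \
  -e LLM_MODEL_PATH=/app/models/Qwen2.5-7B-Instruct-Q4_K_M.gguf \
  -e LLM_N_GPU_LAYERS=-1 \
  zikos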

Development

Dependencies

  • LLM: Qwen2.5-7B/14B (recommended), Qwen3-32B (for H100) or similar models, via dual backend support
    • llama-cpp-python: For GGUF models (Qwen2.5, Llama 3.3)
    • HuggingFace Transformers: For safetensors models (Qwen3)
  • Audio Processing: librosa, torchaudio, soundfile
  • MIDI: Music21 for processing, FluidSynth for synthesis
  • Backend: FastAPI with WebSocket support
  • Frontend: TypeScript + Web Audio API

Code Quality

The project uses:

  • ruff: Fast Python linter
  • black: Code formatter
  • mypy: Static type checker
  • pytest: Testing framework with coverage
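
These are the stock CLIs for each tool, so running them locally should mirror what CI does (paths and any extra flags live in pyproject.toml and the pre-commit config):

ruff check .                    # Lint
black --check .                 # Verify formatting (drop --check to reformat)
mypy backend                    # Type-check the backend (path assumed)
pytest -m "not comprehensive"   # Fast test suite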

Project Structure

zikos/
├── backend/
│   └── zikos/          # Python backend code
│       ├── api/        # FastAPI routes
│       ├── mcp/        # MCP tools and server
│       ├── services/   # Business logic
│       ├── config.py   # Configuration
│       └── main.py     # FastAPI app
├── frontend/           # TypeScript/HTML frontend
│   ├── src/            # TypeScript source files
│   ├── dist/           # Compiled JavaScript (generated)
│   └── index.html      # Main HTML file
├── tests/              # Test code
├── scripts/            # Utility scripts (model download, env setup)
├── CONFIGURATION.md    # Hardware-specific configuration guide (includes H100 optimization)
├── MODEL_RECOMMENDATIONS.md # Model recommendations
├── DESIGN.md           # Architecture design and future roadmap
├── TOOLS.md            # MCP tools specification
└── SYSTEM_PROMPT.md    # LLM system prompt

Python env setup

pip install ".[dev]"

# Optional: generate a pinned requirements.txt for reproducible builds
# (pip-compile is provided by the pip-tools package)
pip install pip-tools
pip-compile pyproject.toml -o requirements.txt
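
Once the pinned file exists, a clean environment can be reproduced from it:

pip install -r requirements.txt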

Pre-commit Hooks

# Install pre-commit hooks (runs checks before commit)
pre-commit install

# Run hooks manually
pre-commit run --all-files

Testing

This project follows Test-Driven Development (TDD) principles with comprehensive test coverage.

  • Unit tests: Test individual components in isolation
  • Integration tests: Test API endpoints and service interactions
  • Coverage target: Minimum 80% code coverage

Note: Comprehensive tests (LLM inference, heavy audio processing) and integration tests are excluded from default pytest runs and pre-commit hooks to keep commit times reasonable. These tests are marked comprehensive or integration and require model files or significant resources. LLM integration tests exercise real tool calling and are critical for catching bugs that mocked tests miss.

Run tests:

pytest -m "not comprehensive"  # Run all but comprehensive tests
pytest -m integration    # Run integration tests
pytest -m ""             # Run all tests including comprehensive and integration

Continuous Integration

The project uses GitHub Actions for CI/CD. The workflow (.github/workflows/ci.yml) runs automatically on pushes and pull requests to main and develop branches.

CI Jobs

  1. Test (Python 3.11, 3.12, 3.13)

    • Runs unit tests with coverage (minimum 75% required)
    • Runs integration tests (excluding comprehensive tests)
    • Uploads coverage to Codecov (Python 3.13 only)
    • Installs system dependencies (libsndfile, ffmpeg, fluidsynth, etc.)
  2. Lint

    • Runs ruff for linting
    • Runs black --check for code formatting
    • Runs mypy for type checking
  3. TypeScript Type Check

    • Runs TypeScript type checking
    • Runs ESLint for frontend code quality
  4. Frontend Tests

    • Runs frontend test suite with coverage
    • Uploads coverage to Codecov

All jobs must pass for a PR to be mergeable. The CI ensures code quality, type safety, and test coverage across multiple Python versions and the frontend.
