Karaoke Vocal Trainer

A local, browser-based vocal training tool with two modes: Karaoke (play backing tracks and score your pitch in real time) and Warmup (voice exercises without needing audio/MIDI files). Built entirely in vanilla JavaScript with the Web Audio API and Canvas 2D — no framework, no server, no dependencies.

Demo

The main screen shows a scrolling note-roll canvas (notes in blue/green/yellow moving right toward a vertical hit line), a glowing purple pitch ball tracking your live voice, an overview strip at the bottom, and the score/accuracy counters in the top bar.

Features

Karaoke Mode

Drag-and-drop or file-picker loading of WAV/MP3 audio (multiple files mixed automatically) and a MIDI file with the vocal melody
Real-time YIN pitch detection via AudioWorkletProcessor (~40 ms latency), with ScriptProcessorNode fallback for older browsers
Scrolling note-roll canvas with colour-coded verdicts: green = hit, yellow = near, dark = miss
Pitch ball that tracks your live voice with smooth interpolation
Scoring: hit (≤1 semitone off) = 100 pts, near (≤2 semitones off) = 50 pts, miss = 0 pts; final accuracy shown as a percentage and star rating (1–3 stars)
Lyrics support: paste raw text, auto-distributed across notes; shown on note tiles and in a resizable side panel
Per-note lyric editor to fine-tune text assignments
Transpose slider (−6 to +6 semitones) and practice speed mode (75%)
Metronome with configurable BPM using Web Audio API scheduled oscillators
Session recording: saves mixed audio (phonogram + mic) and mic-only track as .webm
Save/load entire session as a .vkt project file (includes MIDI, audio, lyrics, and note bindings)
Overview strip with click-to-seek navigation; keyboard seek with arrow keys
Shareable results PNG screenshot (640×360)
Input/output device selection (Chrome 110+)

Warmup Mode

Single Note Hold — select any note (C3–B4), hold it for the target duration, and see your accuracy and stability score. No audio/MIDI files required.
Piano Note Match — an on-screen 3-octave piano plays a random target; you sing to match it for instant hit/near/miss feedback.
Scale / Pattern Practice — 8 built-in patterns (major scale, minor scale, arpeggio, chromatic, etc.) scroll across a canvas at a configurable BPM and root note. A full accuracy breakdown is shown at the end.
Mic volume meter visible in all warmup sub-modes
All three sub-modes share the existing pitch detector — no duplicate audio logic

Requirements

Browser: Chrome 110+ (recommended for full feature set), Firefox 115+
Microphone: connected and accessible to the browser
Karaoke mode: requires a .mid file (vocal melody) and one or more WAV/MP3 audio files
Warmup mode: requires only a microphone — no audio or MIDI files needed

Quick Start

How to Run

Open index.html directly in Chrome 110+ (double-click).

Note: For best pitch detection accuracy, serve via HTTP instead: python -m http.server 8080 → http://localhost:8080 Direct file:// launch uses a fallback audio engine and works fine for casual use.

On launch the app opens the Mode Select screen. Choose Warmup to practise without any files, or Karaoke to load audio + MIDI and score your singing.

How It Works

Pitch Detection (shared)

Microphone audio is routed to pitch-processor.js, an AudioWorkletProcessor that accumulates samples in a 2048-sample ring buffer and runs the YIN algorithm every ~40 ms. YIN computes the cumulative mean normalised difference function, finds the first minimum below threshold 0.10, and applies parabolic interpolation for sub-sample accuracy. The detected frequency is converted to a MIDI note number. The same detector powers both Karaoke and Warmup modes.

Karaoke Mode

The MIDI file is parsed as raw binary: the app reads all tracks, builds a tempo map from 0xFF 0x51 meta-events, converts delta-ticks to seconds using piecewise tempo interpolation, and merges consecutive same-pitch notes separated by ≤ 0.15 s. Audio files are decoded with AudioContext.decodeAudioData() and played simultaneously through a shared GainNode.

The note-roll canvas redraws every animation frame. Notes are positioned using tileX = hitLineX + (note.startSec - currentTime) × SCROLL_SPEED and tileY = noteToY(midiNote) — a linear mapping from pitch range to canvas height. The pitch ball's Y position exponentially lerps toward the target at factor 0.12 each frame.

Scoring per note:

if total_samples == 0          → miss
if hit_samples / total >= 0.4  → hit   (+100 pts)
if (hit+near) / total >= 0.4   → near  (+50 pts)
else                           → miss  (+0 pts)

Final accuracy: (hits + nears × 0.5) / totalNotes × 100 %

Star thresholds: ★★★ ≥ 85%, ★★ ≥ 60%, ★ ≥ 30%.

Warmup Mode

Single Note Hold: The selected target MIDI note is compared each frame against state.detectedNote. Samples within WARMUP_HOLD_HIT_SEMITONES (0.5 st) are "in-tune"; within WARMUP_HOLD_NEAR_SEMITONES (1.5 st) are "close". A moving stability window (WARMUP_STABILITY_WINDOW = 40 frames) drives the stability fill bar.

Piano Note Match: A random MIDI note is picked from the piano range (MIDI 48–84). The DOM-rendered piano keyboard highlights the target key. The first frame where detectedNote is within tolerance of the target registers the verdict.

Scale Practice: generateScaleNotes(presetId, rootNote, bpm) builds a NoteEvent[] array from the preset's interval array. A dedicated #warmupScaleCanvas renders a scrolling note-roll (same coordinate model as the karaoke canvas) with a configurable lead-in. Per-note verdicts are accumulated over each note's window using the same hit/near/miss semitone logic.

Keyboard Shortcuts

Key	Action
`Space`	Play / Pause (Karaoke mode)
`R`	Restart (Karaoke mode)
`M`	Metronome on / off (Karaoke mode)
`L`	Show / hide lyrics panel (Karaoke mode)
`T`	Transpose +1 semitone (Karaoke mode)
`?` or `/`	Show keyboard shortcut help
`Esc`	Close any open dialog or modal
`←` / `→`	Seek −5 s / +5 s (Karaoke mode)
Click on overview strip	Seek to clicked position (Karaoke mode)

File Format Notes

MIDI (Karaoke mode)

Any MIDI format (0, 1, 2) is accepted; all tracks are parsed and merged.
All channels are included — there is no channel filter. Supply a MIDI file with only the vocal melody.
SMPTE time division is not supported (throws an error).
NoteOn with velocity 0 is treated as NoteOff.
Notes shorter than 0.02 s are discarded. Consecutive same-pitch notes with a gap ≤ 0.15 s are merged.

Audio (Karaoke mode)

WAV files are recommended (most reliable cross-browser decoding).
MP3 files are supported but decode failures are per-file and non-fatal.
Multiple audio files are decoded separately and played simultaneously through the same GainNode — useful for separate instrumental tracks.

Project Structure

index.html           — Complete application (HTML + CSS + JavaScript in a single IIFE)
pitch-processor.js   — AudioWorkletProcessor for YIN pitch detection (loaded at runtime)

Extending the Project

Add warmup presets — add an entry to WARMUP_PRESETS (id, label key, intervals array, direction). generateScaleNotes() will pick it up automatically; no UI code changes needed.
Add song transposition UI presets — store preset semitone offsets in an array and map them to buttons that call ui.transposeSlider.dispatchEvent(new InputEvent('input')) after setting ui.transposeSlider.value.
Add polyphonic/chord display — parseMidi() currently keeps overlapping notes; the renderer renders them at their respective noteToY() positions. To highlight chords, group NoteEvent[] by overlapping time windows before passing to the renderer.
Add pitch deviation history trail — push detected MIDI note + timestamp into a circular buffer in renderFrame() and draw a fading polyline before the pitch ball.
Add a practice loop — store loopStart and loopEnd in state, check them in renderFrame(), and call startAudioPlayback(loopStart / state.playbackRate) when currentSec >= loopEnd.
Add MIDI channel filter — add a MIDI_CHANNEL constant to CONFIG and filter events in parseMidi() before building pendingNotes.

License

MIT — see LICENSE file.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
LICENSE		LICENSE
README.md		README.md
index.html		index.html
pitch-processor.js		pitch-processor.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Karaoke Vocal Trainer

Demo

Features

Karaoke Mode

Warmup Mode

Requirements

Quick Start

How to Run

How It Works

Pitch Detection (shared)

Karaoke Mode

Warmup Mode

Keyboard Shortcuts

File Format Notes

MIDI (Karaoke mode)

Audio (Karaoke mode)

Project Structure

Extending the Project

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Karaoke Vocal Trainer

Demo

Features

Karaoke Mode

Warmup Mode

Requirements

Quick Start

How to Run

How It Works

Pitch Detection (shared)

Karaoke Mode

Warmup Mode

Keyboard Shortcuts

File Format Notes

MIDI (Karaoke mode)

Audio (Karaoke mode)

Project Structure

Extending the Project

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages