BakBak - Audio Recording & Transcription App

BakBak is an audio recording and transcription application built with React and TanStack Router. It allows users to record, transcribe, translate, and study language audio recordings.

Features

Audio recording and playback
Automatic transcription of recordings
Translation to different languages
Notes and vocabulary tracking
Recording sharing and collaboration

Tech Stack

Frontend: React 19, TanStack Router, TanStack Query
Backend: TanStack Server, AWS Services
Database: SQLite with better-sqlite3
Storage: AWS S3
Authentication: Better Auth
Transcription/Translation: AWS Transcribe/Translate

Getting Started

Requirements

Node.js 16+ (for crypto.randomUUID support)
Modern browser with crypto.randomUUID support
AWS S3 bucket with proper credentials configured
Docker & Docker Compose (for production deployment)

Development

Clone the repository

git clone https://github.com/yourusername/bakbak.git
cd bakbak

Install dependencies

npm install

Create a .env file

cp .env.example .env
# Edit .env with your AWS S3 bucket and other settings

Set up the database

npm run db:setup

Start development server

make dev

Production Deployment

For production deployment with Docker:

Clone and configure

git clone https://github.com/yourusername/bakbak.git
cd bakbak

Deploy

make prod

On first run, this creates a .env file from the template. Edit it with your configuration and run make prod again.

Database

See DATABASE.md for detailed information about the database schema and models.

Environment Configuration

Required Environment Variables

# REQUIRED: S3 Storage Configuration
AWS_S3_BUCKET=your-recordings-bucket

# REQUIRED: AWS Configuration  
AWS_REGION=us-east-1
AWS_ACCESS_KEY_ID=your-access-key
AWS_SECRET_ACCESS_KEY=your-secret-key

# REQUIRED: Authentication
GOOGLE_CLIENT_ID=your-google-client-id
GOOGLE_CLIENT_SECRET=your-google-client-secret

Storage Organization

All recordings are organized with this exact structure:

your-bucket/
└── recordings/
    ├── user-123/
    │   ├── 2024-01-15/
    │   │   ├── uuid-1.webm
    │   │   ├── uuid-1-transcription.json
    │   │   └── uuid-1-translation-es.json
    │   └── 2024-01-16/
    │       └── uuid-2.webm
    └── user-456/
        └── 2024-01-15/
            └── uuid-3.webm

Benefits:

Complete user isolation: No cross-user access possible
Date-based organization: Easy cleanup and management by date
UUID identification: Globally unique identifiers prevent conflicts
Related file grouping: All recording derivatives stored together

API Routes

The application uses TanStack Router for file-based routing. API routes are available at:

/api/recordings - Recording management
/api/recordings/:id/transcribe - Transcription endpoints
/api/recordings/:id/translate - Translation endpoints

Available Commands

Development

make dev          # Run development server
npm run build     # Build for production
npm run lint      # Lint and format code
npm run db:setup  # Set up database schema

Production

make prod    # Build and run production deployment
make stop    # Stop production containers
make logs    # View application logs
make clean   # Clean up containers and images
make help    # Show all available commands

Usage Examples

Audio Upload Implementation

The audio upload follows TanStack Start best practices with a server function co-located with the route:

// In src/routes/recordings/new.tsx
import { createServerFn } from "@tanstack/react-start";

const uploadAudioRecording = createServerFn({ method: "POST" })
  .validator((data) => {
    // Validate FormData with audioFile, userId, fileExtension
    if (!(data instanceof FormData)) {
      throw new Error("Invalid form data");
    }
    // ... validation logic
  })
  .handler(async ({ data: { audioFile, userId, fileExtension } }) => {
    // Generate structured S3 path with UUID
    // Upload directly to S3 using presigned URLs
    // Return URL and metadata
  });

// Usage in component:
const formData = new FormData();
formData.append('audioFile', audioFile);
formData.append('userId', session.user.id);
formData.append('fileExtension', 'webm');

const result = await uploadAudioRecording({ data: formData });

Working with Storage Paths

import { RecordingStoragePaths } from "~/services/storage/RecordingStoragePaths";

// Generate structured paths (no fallbacks)
const audioPath = RecordingStoragePaths.getAudioPath("user123");
const transcriptionPath = RecordingStoragePaths.getTranscriptionPath("user123", "recording-uuid");
const translationPath = RecordingStoragePaths.getTranslationPath("user123", "recording-uuid", "es");

// List all recordings for a user on a specific date
const userDatePath = RecordingStoragePaths.getUserDatePath("user123", "2024-01-15");

// Extract information from paths (returns null if invalid format)
const userId = RecordingStoragePaths.extractUserId(audioPath);
const recordingUUID = RecordingStoragePaths.extractRecordingUUID(audioPath);
const date = RecordingStoragePaths.extractDate(audioPath);

Database Access Examples

import { RecordingModel } from './database/models';

// Find all recordings for a user
const recordings = RecordingModel.findByUserId('user-123');

// Get a specific recording
const recording = RecordingModel.findById('recording-456');

// Create a new recording
const newRecording = RecordingModel.create({
  userId: 'user-123',
  title: 'Practice Session',
  description: 'Japanese conversation practice',
  filePath: 's3://recordings/user-123/recording.webm',
  language: 'Japanese',
  duration: 120,
  sourceType: RecordingSourceType.SELF_RECORDED
});

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 67 Commits
.claude		.claude
.vscode		.vscode
better-auth_migrations		better-auth_migrations
docs		docs
ideas		ideas
public		public
scripts		scripts
src		src
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.prettierignore		.prettierignore
CLAUDE.md		CLAUDE.md
DATABASE.md		DATABASE.md
Dockerfile		Dockerfile
Makefile		Makefile
README.md		README.md
TRANSCRIBE.md		TRANSCRIBE.md
docker-compose.yml		docker-compose.yml
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tailwind.config.mjs		tailwind.config.mjs
tsconfig.json		tsconfig.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BakBak - Audio Recording & Transcription App

Features

Tech Stack

Getting Started

Requirements

Development

Production Deployment

Database

Environment Configuration

Required Environment Variables

Storage Organization

API Routes

Available Commands

Development

Production

Usage Examples

Audio Upload Implementation

Working with Storage Paths

Database Access Examples

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

samratjha96/bakbak

Folders and files

Latest commit

History

Repository files navigation

BakBak - Audio Recording & Transcription App

Features

Tech Stack

Getting Started

Requirements

Development

Production Deployment

Database

Environment Configuration

Required Environment Variables

Storage Organization

API Routes

Available Commands

Development

Production

Usage Examples

Audio Upload Implementation

Working with Storage Paths

Database Access Examples

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages