Machine Learning Project Template

A batteries-included template for PyTorch machine learning projects using Lightning, wandb, and modern Python tooling.

Features

PyTorch Lightning: Structured training framework with minimal boilerplate
Pydantic Configuration: Type-safe configuration management
Weights & Biases: Integrated experiment tracking
Modern Tooling: Built with uv for fast dependency management
Code Quality: Pre-configured with ruff, mypy, pytest, and pre-commit hooks
Git-based Versioning: Automatic experiment naming using git commit hashes

Project Structure

.
├── src/template/
│   ├── config.py              # Pydantic configuration classes
│   ├── lightning_module.py    # Base Lightning module
│   ├── datasets/              # Dataset implementations
│   ├── modeling/              # Model architectures
│   └── scripts/
│       └── train.py          # Training script
├── tests/                     # Test files
├── pyproject.toml            # Project metadata and dependencies
├── .pre-commit-config.yaml   # Pre-commit hooks configuration
└── .env.example              # Example environment variables

Quick Start

1. Install uv

curl -LsSf https://astral.sh/uv/install.sh | sh

2. Use this template for a new project

When creating a new project from this template:

Clone or fork this repository
Rename the src/template directory to your project name:
```
mv src/template src/your_project_name
```
Update pyproject.toml:
- Change name = "template" to your project name
- Update module-name = ["template"] to your project name
- Update the train script path in [project.scripts]
Update import statements in Python files to use your new project name

3. Install dependencies

uv sync

4. Set up environment variables

Copy the example environment file and add your API keys:

cp .env.example .env
# Edit .env and add your wandb API key and other credentials

5. Install pre-commit hooks

uv run pre-commit install

Usage

Training

Run the training script:

uv run train <data_root> --project my-project --num_devices 1

Available arguments:

data_root: Path to your dataset (required)
--project: Wandb project name (default: "jigsaw-2025")
--num_devices: Number of GPUs to use (default: 1)
--num_workers: Number of data loading workers (default: 12)
--log_root: Directory for logs and checkpoints (default: "logs")
--checkpoint_path: Resume from checkpoint
--weights_path: Load model weights
--debug: Enable debug mode
--fast_dev_run: Run a quick test with minimal data

Configuration

Edit src/template/config.py to customize hyperparameters:

from pydantic import BaseModel

class Config(BaseModel):
    # Reproducibility
    seed: int = 42

    # Data
    test_split: float = 0.1
    batch_size: int = 16

    # Training
    max_epochs: int = 200
    early_stopping_patience: int = 30
    learning_rate: float = 1e-4
    min_learning_rate: float = 1e-6
    weight_decay: float = 1e-2

Implementing Your Model

Create your Lightning module by inheriting from BaseLightningModule:

from template.lightning_module import BaseLightningModule

class MyModel(BaseLightningModule):
    def training_step(self, batch, batch_idx):
        # Your training logic here
        pass

    def validation_step(self, batch, batch_idx):
        # Your validation logic here
        pass

Add your dataset in src/template/datasets/:

from torch.utils.data import Dataset

class MyDataset(Dataset):
    def __init__(self, data_root, config):
        # Initialize your dataset
        pass

Update the training script to use your model and dataset

Development

Running Tests

uv run pytest

Type Checking

uv run mypy src/

Linting and Formatting

uv run ruff check src/
uv run ruff format src/

Pre-commit Hooks

Pre-commit hooks will automatically run on every commit to ensure code quality. To run manually:

uv run pre-commit run --all-files

Dependencies

Core dependencies:

PyTorch: Deep learning framework (with GPU support)
Lightning: High-level PyTorch wrapper
Pydantic: Data validation and configuration
Wandb: Experiment tracking
python-dotenv: Environment variable management

Development tools:

ruff: Fast Python linter and formatter
mypy: Static type checker
pytest: Testing framework
pre-commit: Git hooks for code quality

Build System

This project uses uv_build as the build backend, which is significantly faster than traditional build systems like setuptools or hatchling.

To build the project:

uv build

License

See LICENSE file for details.

Contributing

Create a new branch for your feature
Make your changes
Ensure all tests pass and pre-commit hooks succeed
Submit a pull request

Tips

Experiment names automatically include the git commit hash for reproducibility
Use .env for sensitive information (API keys, credentials)
The config system uses Pydantic for type safety and validation
Lightning automatically handles distributed training, gradient accumulation, and mixed precision

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.github/workflows		.github/workflows
src/template		src/template
tests		tests
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Machine Learning Project Template

Features

Project Structure

Quick Start

1. Install uv

2. Use this template for a new project

3. Install dependencies

4. Set up environment variables

5. Install pre-commit hooks

Usage

Training

Configuration

Implementing Your Model

Development

Running Tests

Type Checking

Linting and Formatting

Pre-commit Hooks

Dependencies

Build System

License

Contributing

Tips

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

will-rice/ml-template

Folders and files

Latest commit

History

Repository files navigation

Machine Learning Project Template

Features

Project Structure

Quick Start

1. Install uv

2. Use this template for a new project

3. Install dependencies

4. Set up environment variables

5. Install pre-commit hooks

Usage

Training

Configuration

Implementing Your Model

Development

Running Tests

Type Checking

Linting and Formatting

Pre-commit Hooks

Dependencies

Build System

License

Contributing

Tips

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages