KaliCalibNet is a neural network model for basketball court calibration and keypoint (grid + baskets) detection.
It is an implementation of the architecture presented in KaliCalib: A Framework for Basketball Court Registration by Adrien Maglo, Astrid Orcesi, and Quoc Cuong Pham.
My implementation is part of a larger project to provide more advanced analytics to smaller college basketball programs. I am particularly focused on the Division II Mountain East Conference.
This repository includes:
- Model Architecture (`src/models/network.py`)
- Custom Non-Local Block (`src/models/layers.py`)
- Dataset Processing & Augmentations (`src/data/`)
- Loss Functions (`src/training/losses.py`)
- Court Utilities (homography, geometry, etc.) (`src/utils/court.py`)
- Training Script (`scripts/train.py`)
- Inference Script (`scripts/inference.py`)
- Config (`configs/default.yaml`)
- Heatmap Generation (`scripts/generate_single_heatmap.py`)
- Project Overview
- Installation & Environment Setup
- Data Structure & Preparation
- Data Labeling (with Video Demonstration)
- Training
- Inference / Testing
- Scripts Overview
- Configuration Options
KaliCalibNet uses a ResNet-18 backbone, modified with dilated convolutions and optional Non-Local Blocks for capturing long-range spatial dependencies. It is designed to:
- Ingest an RGB image of a basketball court.
- Predict a (K+1)-channel heatmap, where the channels correspond to:
  - 91 grid point positions (perspective-aware or uniformly spaced),
  - 1 channel for the upper basket (UB),
  - 1 channel for the lower basket (LB),
  - and 1 for the background.
- Enable further homography-based transformations to register the court with real-world coordinates.
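The channel layout above can be summarized in code. This is a minimal sketch; the exact ordering is an assumption (the authoritative ordering lives in the dataset code under `src/data/`):

```python
# Assumed channel-index layout: 91 grid points, then the two baskets,
# with the background channel last.
CHANNEL_NAMES = [f"grid_{i}" for i in range(91)] + ["ub", "lb", "background"]

assert len(CHANNEL_NAMES) == 94  # K = 93 keypoints + 1 background channel
```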
The code expects your dataset directory to be structured as follows:
```
data/
├── train/
│   ├── images/
│   │   ├── example_1.jpg
│   │   ├── example_2.jpg
│   │   └── ...
│   └── labels/
│       ├── example_1.npz
│       ├── example_2.npz
│       └── ...
└── val/
    ├── images/
    │   ├── val_1.jpg
    │   ├── val_2.jpg
    │   └── ...
    └── labels/
        ├── val_1.npz
        ├── val_2.npz
        └── ...
```
Each `.npz` file should contain:
- 91 grid channels: `grid_0`, `grid_1`, ..., `grid_90`
- `ub` (upper basket) channel
- `lb` (lower basket) channel
- `background` channel
All heatmaps are at 1/4 the resolution of the input image (based on `output_stride: 4` in the config). For example, if your image is 1920x1080, each channel in the `.npz` will be 480x270.
To generate the `.npz` binary mask heatmaps you can use `scripts/generate_single_heatmap.py`. This script reads a JSON label with a few manually labeled court points, computes the homography, and then projects the standard court grid + baskets into the image to create the binary masks.
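The binary masks themselves are just small disks (`disk_radius: 5` in the config) centered on each projected keypoint. A minimal numpy sketch of that idea, not the script's actual code:

```python
import numpy as np

def disk_mask(h, w, cx, cy, radius=5):
    """Binary disk centered on a keypoint, like one .npz label channel."""
    ys, xs = np.ogrid[:h, :w]
    return ((xs - cx) ** 2 + (ys - cy) ** 2 <= radius ** 2).astype(np.uint8)

# One channel at heatmap resolution with a keypoint at (x=100, y=50):
mask = disk_mask(270, 480, cx=100, cy=50, radius=5)
```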
I have a separate custom labeling system consisting of a Python backend for serving images and a React frontend for an intuitive labeling experience. The labeling process is designed to be efficient and user-friendly:
1. **Initial Calibration**
   - The first 4 keypoints must be manually labeled.
   - These points are mapped to corresponding locations on a standard basketball court.
   - This initial mapping establishes the homography transformation.
2. **Assisted Labeling**
   - After the initial 4 points are set, the system displays pink dots showing estimated locations of the remaining keypoints.
   - These estimates are computed using the established homography.
   - Users can click near any pink dot to label that keypoint.
   - The system automatically selects the keypoint nearest to the click location, streamlining the labeling process.
3. **Export Process**
   - Labels are exported as JSON files containing the keypoint correspondences.
   - These JSON files are then processed using `scripts/generate_single_heatmap.py` to create the `.npz` binary masks needed for training.
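The homography behind the assisted-labeling step can be estimated from the 4 initial correspondences with the standard Direct Linear Transform. This is a self-contained numpy sketch of that technique (the repo may use OpenCV instead); `homography_from_points` and `project` are illustrative names:

```python
import numpy as np

def homography_from_points(src, dst):
    """DLT: estimate a 3x3 homography from >= 4 correspondences src[i] -> dst[i]."""
    A = []
    for (x, y), (u, v) in zip(src, dst):
        A.append([x, y, 1, 0, 0, 0, -u * x, -u * y, -u])
        A.append([0, 0, 0, x, y, 1, -v * x, -v * y, -v])
    # The homography is the null vector of A, i.e. the last right-singular vector.
    _, _, vt = np.linalg.svd(np.asarray(A, dtype=float))
    H = vt[-1].reshape(3, 3)
    return H / H[2, 2]

def project(H, pts):
    """Apply homography H to an (N, 2) array of points."""
    homog = np.hstack([pts, np.ones((len(pts), 1))])
    out = homog @ H.T
    return out[:, :2] / out[:, 2:3]
```

Once `H` is known, projecting the remaining standard-court keypoints through it yields the pink-dot estimates shown to the labeler.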
Review `configs/default.yaml`:

```yaml
model:
  n_keypoints: 93
  input_size: [1920, 1080]  # width, height
  output_stride: 4

training:
  n_epochs: 200
  batch_size: 2
  learning_rate: 0.0001
  lr_decay_epoch: 66
  keypoint_weight: 10
  background_weight: 1

data:
  court_width: 500
  court_length: 940
  border: 0
  grid_size: [7, 13]
  disk_radius: 5

augmentation:
  color_jitter:
    brightness: 0.7
    contrast: 0.5
    saturation: 0.5
    hue: 0.5
  random_flip_prob: 0.5

evaluation:
  visualize: true
  save_predictions: true
  metrics:
    - mse
    - detection_rate
```

You can tune `batch_size`, `learning_rate`, etc. either via YAML or CLI arguments.
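The `keypoint_weight` / `background_weight` pair up-weights the sparse keypoint channels against the dominant background channel during training. A minimal numpy sketch of one way such a weighted per-channel MSE could look (the actual loss lives in `src/training/losses.py` and may differ):

```python
import numpy as np

def weighted_mse(pred, target, keypoint_weight=10.0, background_weight=1.0):
    """Weighted per-channel MSE; assumes the last channel is background."""
    weights = np.full(pred.shape[0], keypoint_weight)
    weights[-1] = background_weight
    per_channel = ((pred - target) ** 2).mean(axis=(1, 2))
    return float((weights * per_channel).sum() / weights.sum())
```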
Run the `train.py` script:

```bash
python scripts/train.py \
    --config configs/default.yaml \
    --data-dir /path/to/data \
    --output-dir outputs/train_runs \
    --num-workers 4
```

Key args:
- `--config`: Path to the YAML config.
- `--data-dir`: Directory with `train/` & `val/` subdirectories (as described above).
- `--output-dir`: Where to save logs & checkpoints.
- `--num-workers`: DataLoader workers (for speed if CPU cores are available).

Additional CLI overrides:
- `--batch-size`
- `--learning-rate`
- `--n-epochs`
- `--lr-decay-epoch`
- `--keypoint-weight`
- `--background-weight`

At the end, a `best_model.pth` will be saved.
Use `scripts/inference.py` to run inference on a single image:

```bash
python scripts/inference.py \
    --model outputs/train_runs/<TIMESTAMP>/best_model.pth \
    --image path/to/test_image.jpg \
    --output-dir outputs/inference \
    --config configs/default.yaml
```

This script will:
- Load your model & config.
- Resize the input image to `[1920, 1080]` (or whatever is set in the config).
- Generate predictions (one heatmap channel per keypoint, plus background).
- Visualize the predicted keypoints.
- Estimate a homography from predicted points to standard court coordinates (if you choose the perspective-aware grid).
- Project standard court lines or other points onto the image (optional steps shown in the script).
- Save the resulting visualizations in `outputs/inference/`.
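Turning predicted heatmaps into keypoint coordinates typically comes down to a per-channel argmax, scaled back up by the output stride. A minimal sketch under that assumption (`extract_keypoints` and the `threshold` value are illustrative, not the script's exact code):

```python
import numpy as np

def extract_keypoints(heatmaps, output_stride=4, threshold=0.5):
    """Per-channel argmax over the keypoint heatmaps, mapped back
    to input-image pixel coordinates."""
    points = {}
    for c in range(heatmaps.shape[0] - 1):  # skip the background channel
        ch = heatmaps[c]
        y, x = np.unravel_index(ch.argmax(), ch.shape)
        if ch[y, x] >= threshold:           # ignore weak / absent detections
            points[c] = (int(x) * output_stride, int(y) * output_stride)
    return points
```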
**Note on Detecting Degenerate Cases to Improve Training:** When the model produces a degenerate homography, we can detect this automatically by leveraging our knowledge of basketball court geometry. By reprojecting the midpoint of each half-court into court space and measuring the distance between them, we can identify likely failures: if this distance falls below a certain threshold, the points are incorrectly clustering together, which is a signature of degenerate homography estimation. Such images can then be added to a queue of images that need to be labeled.
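That check can be sketched in a few lines. The `min_dist` threshold and the function name are illustrative assumptions; the two inputs are the detected half-court midpoints in image coordinates:

```python
import numpy as np

def looks_degenerate(H, img_mid_left, img_mid_right, min_dist=200.0):
    """Flag a frame if the two half-court midpoints, reprojected into
    court space through H, land suspiciously close together."""
    pts = np.array([[*img_mid_left, 1.0], [*img_mid_right, 1.0]])
    out = pts @ H.T
    out = out[:, :2] / out[:, 2:3]
    return bool(np.linalg.norm(out[0] - out[1]) < min_dist)
```

With the 940x500 court units from the config, healthy homographies should keep the reprojected midpoints hundreds of units apart, while collapsing homographies pull them together.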
- `scripts/train.py`
  Main training script. Handles:
  - Argument parsing
  - YAML config loading
  - Data loader creation
  - Model instantiation (KaliCalibNet)
  - Training loop & validation
- `scripts/inference.py`
  Performs single-image inference with:
  - Model load
  - Keypoint heatmap extraction
  - Visualization & homography-based transformations
- `scripts/generate_single_heatmap.py`
  - Generates binary heatmaps for a single image/label pair.
  - Uses a JSON label with a few correspondences, calculates the homography, and projects a grid + baskets.
  - Can be used for creating `.npz` label data.
All major hyperparameters and data parameters are stored in configs/default.yaml.
Key sections:
- **model**: `n_keypoints`, `input_size`, `output_stride`
- **training**: epochs, LR, batch size, weighting for keypoints vs. background channels
- **data**: court dimensions, grid size, etc.
- **augmentation**: data augmentation probabilities & intensities
- **evaluation**: optional metrics, saving predictions, etc.
You can create custom configs and specify `--config your_custom.yaml`.


