MVANet Image Segmentation

A modern web application for image segmentation using the MVANet deep learning model. Features a beautiful web dashboard and FastAPI-based backend for batch processing of images with real-time progress monitoring.

Features

Web Dashboard

Modern UI: Clean, responsive interface with real-time updates
Job Submission: Easy folder path input for batch image processing
Test-Time Augmentation: Toggle TTA for improved segmentation accuracy
Real-time Console: Live processing logs with timestamps and color-coded messages
Latest Image Viewer: Preview the most recently processed image with overlays
Task Persistence: Processing continues across page refreshes - monitor from anywhere

Processing Capabilities

Recursive Folder Processing: Automatically finds and processes all 'images' folders
Dual Output: Generates both segmentation masks and overlays
GPU Acceleration: Supports CUDA for fast processing
Background Processing: Non-blocking job execution with task tracking
Error Handling: Comprehensive logging and error reporting

API Endpoints

Task submission and status tracking
Real-time log streaming
System status monitoring
Image serving and retrieval

Installation

Clone the repository:

git clone https://github.com/OpsiClear/MVANet.git
cd MVANet

Install dependencies using uv (recommended) or pip:

# Using uv (faster)
uv sync

# Or using pip
pip install -r requirements.txt

Ensure the model files are in the models/ directory:
- models/MVANet.pth - Main segmentation model
- models/swin_base_patch4_window12_384_22kto1k.pth - Backbone model

Usage

Starting the Application

Run the FastAPI server directly:

python api_app.py

The application will start on http://localhost:8001 by default.

Web Interface

Open your browser and navigate to http://localhost:8001
Enter the full path to your input folder (must contain folders named 'images')
Toggle Test-Time Augmentation if desired (improves accuracy but slower)
Click "Submit Job" to start processing
Monitor progress in real-time via the console output
View the latest processed image using the "Latest Image" button

Input Folder Structure:

your-input-folder/
├── images/           # Must be named 'images'
│   ├── image1.jpg
│   ├── image2.png
│   └── ...

Output Structure:

your-input-folder_overlay/  # Segmentation overlays
your-input-folder_masks/    # Binary masks

API Endpoints

Submit Processing Job

curl -X POST "http://localhost:8001/api/process" \
    -H "Content-Type: application/json" \
    -d '{
        "input_folder": "/path/to/folder",
        "use_tta": true
    }'

Check Task Status

curl "http://localhost:8001/api/status/{request_id}"

Get System Status

curl "http://localhost:8001/api/system/status"

Get Task Logs

curl "http://localhost:8001/api/logs/{task_id}"

Get Latest Processed Image

curl "http://localhost:8001/api/latest-image"

Technology Stack

Backend: FastAPI with async/await support
Frontend: Bootstrap 5 with vanilla JavaScript
Deep Learning: PyTorch with MVANet architecture
Model Backbone: Swin Transformer
Image Processing: PIL/Pillow, NumPy

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
models		models
src		src
static		static
.gitattributes		.gitattributes
.gitignore		.gitignore
.python-version		.python-version
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
api_app.py		api_app.py
cli.py		cli.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MVANet Image Segmentation

Features

Web Dashboard

Processing Capabilities

API Endpoints

Installation

Usage

Starting the Application

Web Interface

API Endpoints

Submit Processing Job

Check Task Status

Get System Status

Get Task Logs

Get Latest Processed Image

Technology Stack

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MVANet Image Segmentation

Features

Web Dashboard

Processing Capabilities

API Endpoints

Installation

Usage

Starting the Application

Web Interface

API Endpoints

Submit Processing Job

Check Task Status

Get System Status

Get Task Logs

Get Latest Processed Image

Technology Stack

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages