X-Ray Image Search Interface

A semantic search engine for X-ray images supporting text queries and reverse image search, powered by OpenAI CLIP.

Features

Text Search: Enter queries like "chest x-ray" or "fractured bone"
Image Search: Upload an X-ray to find visually similar images
Category Filters: Narrow results to specific X-ray types
Ranking Scores: See similarity percentages for each result
REST API: Programmatic access via FastAPI endpoints

Screenshots

Quick Start

1. Install Dependencies

# Python dependencies
pip install torch torchvision transformers pandas Pillow fastapi uvicorn python-multipart icrawler

# Frontend dependencies
cd frontend && npm install

2. Collect Dataset (Optional - already included)

python src/collect_data_icrawler.py

3. Generate Embeddings (Optional - already included)

python src/indexer.py

4. Run the Application

Start Backend (Terminal 1):

python -m uvicorn src.api:app --reload --port 8000

Start Frontend (Terminal 2):

cd frontend && npm run dev

Access:

Frontend: http://localhost:3000
API Docs: http://localhost:8000/docs

Project Structure

xray_project/
├── src/
│   ├── api.py                    # FastAPI backend
│   ├── model.py                  # CLIP model wrapper
│   ├── indexer.py                # Embedding generator
│   └── collect_data_icrawler.py  # Data collection script
├── frontend/
│   ├── src/App.jsx               # React UI
│   └── package.json
├── dataset/
│   ├── images/                   # 550 X-ray images (9 categories)
│   ├── metadata.csv              # Image metadata
│   └── embeddings.pt             # Pre-computed CLIP embeddings
├── archive/                      # Assignment details and reference
├── Screenshots/                  # Screenshots
├── README.md
├── report.md
└── walkthrough.md

Dataset

Total Images: 550
Categories: chest, spine, dental, fracture, skull, lung, knee, hand, pelvis
Source: Bing Image Search via icrawler
Metadata: dataset/metadata.csv with image_name, source_url, category

API Endpoints

Endpoint	Method	Description
`/search/text?q=query`	GET	Text-based semantic search
`/search/image`	POST	Image-based similarity search
`/categories`	GET	List all categories
`/images/{path}`	GET	Serve image files

Tech Stack

Backend: FastAPI, PyTorch, CLIP (openai/clip-vit-base-patch32)
Frontend: React, Vite, TailwindCSS
Search: Cosine similarity on CLIP embeddings

Scalability Note

"I designed the system to be compatible with Pinecone for production scaling (to handle millions of images), but kept the local .pt implementation for this assignment to ensure ease of setup for the reviewer without needing external API keys."

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

X-Ray Image Search Interface

Features

Screenshots

Quick Start

1. Install Dependencies

2. Collect Dataset (Optional - already included)

3. Generate Embeddings (Optional - already included)

4. Run the Application

Project Structure

Dataset

API Endpoints

Tech Stack

Scalability Note

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Screenshots		Screenshots
archive		archive
dataset		dataset
frontend		frontend
src		src
.gitignore		.gitignore
README.md		README.md
report.md		report.md

Folders and files

Latest commit

History

Repository files navigation

X-Ray Image Search Interface

Features

Screenshots

Quick Start

1. Install Dependencies

2. Collect Dataset (Optional - already included)

3. Generate Embeddings (Optional - already included)

4. Run the Application

Project Structure

Dataset

API Endpoints

Tech Stack

Scalability Note

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages