Image Recognition to Detect Different Vehicle Types in Riga

This project focuses on training a model for accurate vehicle identification and classification within Riga, Latvia. The model is trained to recognize and categorize the following vehicle types:

V (Passenger Vehicles): This category includes standard cars used for personal transportation. Passenger vehicles represent a significant portion of traffic and provide a valuable baseline for comparison with other vehicle types.
C (Cargo Vehicles): This encompasses all commercial vehicles, from smaller vans (C1) to large trucks with trailers (C4). Tracking cargo vehicles is crucial due to their larger size and lower speeds, which can contribute to increased traffic congestion.
S (Buses): This category specifically targets public transport buses. Monitoring buses is essential for understanding their impact on traffic flow and public transportation schedules.

Highlights

Objective: Develop a real-time vehicle classification system using live webcam feeds from Riga.
Tools Used: YOLOv8, Google Cloud Platform, FFmpeg, Albumentations, and CVAT for annotations.
Key Results:
- Base model: Precision 75.27%, Recall 78.98%, mAP@0.5: 82.12%.
- Augmented model: Precision 79.12%, Recall 79.01%, mAP@0.5: 83.79%.
- Augmentation led to ~2.2% improvement in confidence metrics and ~5.5% reduction in prediction variability.
- Demonstrated real-world applicability with evaluations on unlabeled data.

Group Proposal

Kanban Board of project

Very general progress notes

Final Presentation

Team Roles

Name	Title	Roles
Dmitrijs	Data Engineer	Business Understanding, Data Preparation, Modelling, Test & Validate
Eden	Data Analyst	Business Understanding, Data Understanding, Modelling, Communication of Insights
Gonzalo	Project Manager	Management, Support

Project Timeline

Week	Task/Deliverable	Responsible
1	Project requirements	Dmitrijs, Eden, Gonzalo
2	Data collection, data exploration	Dmitrijs, Eden
3	Data annotation, data formatting	Dmitrijs, Eden, Gonzalo
4	Model training and evaluation	Dmitrijs, Eden
5	Model improvement and tuning	Dmitrijs, Eden
6	Model testing with new data	Dmitrijs, Eden
7	Final Model adjustments and validation	Dmitrijs, Eden
8	Final presentation	Dmitrijs, Eden, Gonzalo

Detailed timeline and assigned responsibilities

Key Project Features

Data Collection: Webcam feeds processed via FFmpeg; images stored in Google Cloud.
Annotation: Used CVAT for precise labeling.
Augmentation: Applied transformations (rotations, brightness adjustments, etc.) via Albumentations.
Evaluation: Precision, recall, and mAP metrics across test sets and unlabeled data.

Challenges and Lessons Learned

Overcame restricted access to video streams using customized HTTP headers and FFmpeg.
Manual annotation was time-intensive but improved with team coordination.
Augmented datasets required higher computational and storage resources.

Project Organization

├── LICENSE            <- Open-source license if one is chosen
├── Makefile           <- Makefile with convenience commands like `make data` or `make train`
├── README.md          <- The top-level README for developers using this project.
├── data
│   ├── external       <- Data from third party sources.
│   ├── interim        <- Intermediate data that has been transformed.
│   ├── processed      <- The final, canonical data sets for modeling.
│   └── raw            <- The original, immutable data dump.
│
├── docs               <- A default mkdocs project; see www.mkdocs.org for details
│
├── models             <- Trained and serialized models, model predictions, or model summaries
│
├── notebooks          <- Jupyter notebooks. Naming convention is a number (for ordering),
│                         the creator's initials, and a short `-` delimited description, e.g.
│                         `1.0-jqp-initial-data-exploration`.
│
├── pyproject.toml     <- Project configuration file with package metadata for 
│                         ai_group_project and configuration for tools like black
│
├── references         <- Data dictionaries, manuals, and all other explanatory materials.
│
├── reports            <- Generated analysis as HTML, PDF, LaTeX, etc.
│   └── figures        <- Generated graphics and figures to be used in reporting
│
├── requirements.txt   <- The requirements file for reproducing the analysis environment, e.g.
│                         generated with `pip freeze > requirements.txt`
│
├── setup.cfg          <- Configuration file for flake8
│
└── project_packages   <- Source code for use in this project.
    │
    └── __init__.py             <- Makes ai_group_project a Python module

Final Thoughts

This project demonstrates the power of modern object detection frameworks like YOLOv8 combined with robust data pipelines. The integration of cloud storage and augmentation significantly enhanced model performance and scalability.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image Recognition to Detect Different Vehicle Types in Riga

Highlights

Team Roles

Project Timeline

Key Project Features

Challenges and Lessons Learned

Project Organization

Final Thoughts

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 110 Commits
data		data
docs		docs
models/yolov8_model		models/yolov8_model
notebooks		notebooks
project_packages		project_packages
references		references
reports		reports
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
environment.yml		environment.yml
pyproject.toml		pyproject.toml
setup.cfg		setup.cfg

Folders and files

Latest commit

History

Repository files navigation

Image Recognition to Detect Different Vehicle Types in Riga

Highlights

Team Roles

Project Timeline

Key Project Features

Challenges and Lessons Learned

Project Organization

Final Thoughts

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages