The Darknet YOLO model must be trained on a Linux platform.
- Install OpenCV from here, following the detailed installation steps.
- Install Darknet from here.
- To compile with OpenCV (a quick sanity check for this step appears after this list), edit the `Makefile` by:
  - changing `OPENCV=0` to `OPENCV=1`
  - changing line 45 from ``LDFLAGS+= `pkg-config --libs opencv` -lstdc++`` to ``LDFLAGS+= `pkg-config --libs opencv4` -lstdc++``
  - changing line 46 from ``COMMON+= `pkg-config --cflags opencv` `` to ``COMMON+= `pkg-config --cflags opencv4` ``
- Finally, `cd src` and modify `image_opencv.cpp` by:
  - adding the missing header files:
    ```cpp
    #include "opencv2/core/core_c.h"
    #include "opencv2/videoio/legacy/constants_c.h"
    #include "opencv2/highgui/highgui_c.h"
    ```
  - changing the line `IplImage ipl = m;` to `IplImage ipl = cvIplImage(m);`
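Before building, it can help to confirm that `pkg-config` actually resolves the `opencv4` module referenced in the edited Makefile lines. A minimal Python sketch (assuming `pkg-config` is on your `PATH`; this helper is not part of the repo):

```python
# Sanity check (a sketch): ask pkg-config for the version of each OpenCV
# module name used in the Makefile; "opencv4" should resolve after an
# OpenCV 4 install, while the old "opencv" name may not exist.
import subprocess

for module in ("opencv4", "opencv"):
    result = subprocess.run(
        ["pkg-config", "--modversion", module],
        capture_output=True, text=True,
    )
    if result.returncode == 0:
        print(f"{module}: version {result.stdout.strip()}")
    else:
        print(f"{module}: not found by pkg-config")
```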
- Videos for training can be found on the Aquadrone Google Drive.
- The `Timestamps` folder contains CSV files with start and end times, as well as labels, for objects that appear in the videos.
- The timestamps are used by `Data Collection/split_video.py` (from the original aquadrone-vision repo) to produce images of objects from the videos that can be used to train the neural network.

```
python split_video.py --directory <directory path to store frames> --video <path to video file> --file <path to timestamp file> [--compress] [-n <number of frames per second>]
```
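For reference, the core of a timestamp-driven frame extractor can look like the sketch below. The CSV column names (`start`, `end`, `label`) are assumptions here; the actual `split_video.py` in the repo is the authority.

```python
# A sketch of timestamp-driven frame extraction, assuming a CSV with
# "start"/"end" times in seconds and a "label" column per row.
import csv
import os
import cv2

def extract_frames(video_path, timestamp_csv, out_dir, frames_per_second=1):
    cap = cv2.VideoCapture(video_path)
    os.makedirs(out_dir, exist_ok=True)
    with open(timestamp_csv, newline="") as f:
        for row in csv.DictReader(f):
            t, end = float(row["start"]), float(row["end"])
            while t < end:
                # Seek to the timestamp (in milliseconds) and grab one frame.
                cap.set(cv2.CAP_PROP_POS_MSEC, t * 1000)
                ok, frame = cap.read()
                if not ok:
                    break
                name = f"{row['label']}_{t:.2f}.jpg"
                cv2.imwrite(os.path.join(out_dir, name), frame)
                t += 1.0 / frames_per_second
    cap.release()
```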
Before training, each image must be annotated in a corresponding .txt file with one line per object, in the following format (a conversion sketch follows the field descriptions):

```
<object_class> <x_center> <y_center> <width> <height>
```

- `object_class` is a number between 0 and (total number of classes - 1) that identifies which object the annotation belongs to.
- `x_center` is the x coordinate of the center of the object divided by the image width.
- `y_center` is the y coordinate of the center of the object divided by the image height.
- `width` is the width of the object divided by the image width.
- `height` is the height of the object divided by the image height.
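For illustration, here is a small hypothetical helper (not part of the repo) that converts a pixel-space box into a line in this format:

```python
# Convert a pixel-space bounding box (x_min, y_min, x_max, y_max) into a
# Darknet/YOLO annotation line with all values normalized to [0, 1].
def to_yolo_line(object_class, box, image_width, image_height):
    x_min, y_min, x_max, y_max = box
    x_center = (x_min + x_max) / 2 / image_width
    y_center = (y_min + y_max) / 2 / image_height
    width = (x_max - x_min) / image_width
    height = (y_max - y_min) / image_height
    return f"{object_class} {x_center:.6f} {y_center:.6f} {width:.6f} {height:.6f}"

# Example: class 0, a 100x200 px box with top-left (50, 80) in a 640x480 image
print(to_yolo_line(0, (50, 80, 150, 280), 640, 480))
# -> "0 0.156250 0.375000 0.156250 0.416667"
```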
Yolo_mark and LabelImg are two good annotation tools for producing these files.
The data augmentation app must be run on a Windows platform.
To install the required packages, run `pip install -r requirements.txt` in the root of the repo.
A GUI has been made to facilitate the process of augmenting images. It uses the Albumentations library; demos of data augmentation techniques can be found here.
The GUI supports the following data augmentation techniques (an Albumentations usage sketch follows the list):

- Horizontal Flip
- Motion Blur
- ISO Noise
- Rotate
- Cutout
- Crop
- RGB Shift
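For reference, the same kinds of augmentations can be applied with Albumentations directly; a minimal sketch (the file names and probabilities are illustrative, not from the repo):

```python
# Apply a few of the listed augmentations with Albumentations; bbox_params
# keeps YOLO-format bounding boxes in sync with the transformed image.
import albumentations as A
import cv2

transform = A.Compose(
    [
        A.HorizontalFlip(p=0.5),
        A.MotionBlur(p=0.2),
        A.ISONoise(p=0.2),
        A.Rotate(limit=15, p=0.5),
        A.RGBShift(p=0.2),
    ],
    bbox_params=A.BboxParams(format="yolo", label_fields=["class_labels"]),
)

image = cv2.imread("example.jpg")           # illustrative input image
bboxes = [(0.5, 0.375, 0.15625, 0.416667)]  # one YOLO box: x_c y_c w h
augmented = transform(image=image, bboxes=bboxes, class_labels=[0])
cv2.imwrite("example_aug.jpg", augmented["image"])
print(augmented["bboxes"])  # transformed boxes, still YOLO-normalized
```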
To run the app, navigate to the `Data Augmentation` subdirectory and run `python aug_app.py`.
The collected data must be split into three subsets: a training set (to train the model), a validation set (to evaluate the model and tune the training process), and a test set (for a final check of the trained model). The `divide_data.py` script can help automate this division.
To run the script, use:

```
python divide_data.py --directory <directory path to store image sets> --images <path to where all images and bounding boxes are stored> --test <percent of images to use in test set> --train <percent of images to use in training set> --valid <percent of images to use in the validation set>
```
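The division itself is straightforward; below is a sketch of what such a script can do (the `divide_data.py` in the repo is the authority; the file extensions and copy-vs-move behavior are assumptions):

```python
# Shuffle image/annotation pairs and copy them into train/valid/test
# subdirectories according to the requested percentages.
import os
import random
import shutil

def divide_data(images_dir, out_dir, train_pct, valid_pct):
    stems = sorted({os.path.splitext(f)[0] for f in os.listdir(images_dir)
                    if f.lower().endswith((".jpg", ".png"))})
    random.shuffle(stems)
    n_train = int(len(stems) * train_pct / 100)
    n_valid = int(len(stems) * valid_pct / 100)
    splits = {
        "train": stems[:n_train],
        "valid": stems[n_train:n_train + n_valid],
        "test": stems[n_train + n_valid:],  # remainder goes to the test set
    }
    for split, names in splits.items():
        split_dir = os.path.join(out_dir, split)
        os.makedirs(split_dir, exist_ok=True)
        for stem in names:
            # Copy the image and its matching .txt annotation, if present.
            for ext in (".jpg", ".png", ".txt"):
                src = os.path.join(images_dir, stem + ext)
                if os.path.exists(src):
                    shutil.copy(src, split_dir)
```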