Captcha Solver

This project is divided into two parts:

First, hard captcha images are preprocessed and a CNN is trained to solve them.
Then, given a set of easier captcha images, the model previously trained on the harder images is used as the feature extractor for the new, simpler task. Transfer learning is used, and only the output layer is trained.

Datasets

simple_captcha: really simple captcha

hard_captcha: similar to the previous one, but with lower case letters

Image processing

Because the captcha images are simple, preprocessing consists of extracting each character from the image using some data treatment and OpenCV. Each extracted character is then used to train a CNN.

How to run it

A detailed explanation of each step can be found in the file demo.ipynb. The notebook performs the following steps:

Data preprocessing: this creates two new folders with the characters extracted from the simple and hard captchas.
Training: two models are trained, one on the hard captcha data and one on the simple captcha data.
Evaluation: for each task, the results are visualized and evaluated.
Transfer learning: the model trained on the hard data is used as the backbone of a new model that also solves the simple captchas.
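
The transfer learning step can be sketched as follows, assuming the models are built with Keras (the README does not name the framework). Every layer of the hard-captcha model except its output layer is frozen and reused, and a new output layer is trained for the simple task. The architecture, the 32x32 grayscale input shape, the function names build_cnn and build_transfer_model, and the class counts (62 and 36) are illustrative assumptions rather than values taken from the notebook.

```python
from tensorflow import keras
from tensorflow.keras import layers


def build_cnn(num_classes, input_shape=(32, 32, 1)):
    """Small character classifier; the architecture is illustrative only."""
    return keras.Sequential([
        keras.Input(shape=input_shape),
        layers.Conv2D(32, 3, activation="relu"),
        layers.MaxPooling2D(),
        layers.Conv2D(64, 3, activation="relu"),
        layers.MaxPooling2D(),
        layers.Flatten(),
        layers.Dense(128, activation="relu"),
        layers.Dense(num_classes, activation="softmax"),
    ])


def build_transfer_model(hard_model, num_classes, input_shape=(32, 32, 1)):
    """Reuse all but the last layer of hard_model as a frozen feature extractor."""
    inputs = keras.Input(shape=input_shape)
    x = inputs
    for layer in hard_model.layers[:-1]:
        layer.trainable = False  # layers are shared, so hard_model is frozen too
        x = layer(x)
    # Only this new output layer has trainable weights.
    outputs = layers.Dense(num_classes, activation="softmax", name="simple_output")(x)
    model = keras.Model(inputs, outputs)
    model.compile(optimizer="adam",
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model


# Hypothetical class counts: 62 = digits + upper + lower case, 36 = digits + upper case.
hard_model = build_cnn(num_classes=62)
simple_model = build_transfer_model(hard_model, num_classes=36)
```

In practice hard_model would be trained on the hard captcha characters before build_transfer_model is called; calling fit on simple_model then updates only the kernel and bias of the new output layer.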

Results

Contributions

Contributions are welcome! A lot can be improved in this repository. Some suggestions include:

Defining a custom data generator to feed the training and validation sets when the fit method is called.
Instead of cropping and saving each character from the images to disk, doing all of the cropping in a preprocessing stage within that custom data generator.
Improving the hard captcha preprocessing: adding new methods to detect each character.
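
One possible shape for the suggested data generator is sketched below as a plain Python generator, which the Keras fit method accepts when it yields (inputs, targets) tuples. Nothing here exists in the repository: character_batches, extract_fn, and char_to_index are hypothetical names, and the sketch assumes extract_fn returns equally sized 2D crops for each captcha.

```python
import numpy as np


def character_batches(captchas, texts, extract_fn, char_to_index,
                      batch_size=32, seed=0):
    """Yield (x, y) batches forever, cropping characters on the fly."""
    rng = np.random.default_rng(seed)
    xs, ys = [], []
    while True:  # generators passed to fit are expected to loop indefinitely
        for i in rng.permutation(len(captchas)):
            crops = extract_fn(captchas[i])
            if len(crops) != len(texts[i]):
                continue  # segmentation did not match the label; skip this image
            for crop, char in zip(crops, texts[i]):
                xs.append(crop)
                ys.append(char_to_index[char])
                if len(xs) == batch_size:
                    # Add a channel axis and scale pixel values to [0, 1].
                    x = np.stack(xs)[..., None].astype("float32") / 255.0
                    yield x, np.array(ys)
                    xs, ys = [], []


# Toy data: each "captcha" is 8x16 and splits into two 8x8 characters.
toy_captchas = [np.full((8, 16), 255, dtype=np.uint8) for _ in range(3)]
toy_texts = ["AB", "AB", "AB"]
split_in_half = lambda img: [img[:, :8], img[:, 8:]]  # stand-in for real cropping

batches = character_batches(toy_captchas, toy_texts, split_in_half,
                            {"A": 0, "B": 1}, batch_size=4)
x, y = next(batches)
```

Because the generator never ends, a call such as model.fit(batches, steps_per_epoch=...) needs an explicit number of steps per epoch. If extract_fn fails on every captcha, the loop never yields, so a real implementation would want a guard for that case.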
