The primary reason for reimplementing MTCNN is that many repositories calling themselves "mtcnn_pytorch" do not use PyTorch throughout the pipeline. They repeatedly convert model output tensors into numpy arrays and PIL images, which hurts performance because data is constantly moved between CPU and GPU. Moreover, modules like NMS are often handwritten and do not take advantage of existing libraries like torchvision that make much better use of the GPU. These issues haunted me when I tried to find a fast, easy-to-use version of MTCNN for my latest paper.
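For instance, torchvision already ships a CUDA-accelerated NMS that keeps boxes and scores on the GPU end to end. Here is a minimal sketch with made-up boxes (the values are purely illustrative):

```python
import torch
from torchvision.ops import nms

# Illustrative candidate boxes in (x1, y1, x2, y2) format with confidence
# scores, kept on the GPU the whole time -- no numpy or PIL round-trips.
boxes = torch.tensor([[10., 10., 50., 50.],
                      [12., 12., 52., 52.],
                      [100., 100., 160., 160.]], device="cuda")
scores = torch.tensor([0.9, 0.8, 0.7], device="cuda")

# torchvision.ops.nms replaces a handwritten Python NMS loop with a single
# library call; it returns the indices of the boxes to keep.
keep = nms(boxes, scores, iou_threshold=0.5)
print(boxes[keep])
```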
Therefore, I rewrote MTCNN entirely in PyTorch, with all operations performed on the GPU, making better use of PyTorch and torchvision. I hope it helps. Please star the repo if it ever helps you. Thanks a lot! (As a CS undergraduate, stars can mean a lot on my resume... By the way, if you are interested in privacy protection against unauthorized face recognition systems, check out my latest paper and its code.)
This repo offers faster speed than another often-used implementation, mtcnn_pytorch, measured in the same environment and on the same machine (Intel Xeon w5-3415 CPU, one NVIDIA RTX 5880 Ada GPU with 48 GB of memory, and 128 GB of RAM).
The detection quality is satisfactory. (The weights and relevant hyperparameters are copied from https://github.com/TropComplique/mtcnn-pytorch.)
This repo is intended primarily to be used as a module, which is why it has minimal requirements and should work with most versions of torch, torchvision, numpy, and tqdm. That said, it also runs on its own.
- Clone the repo with the following command:
```bash
$ git clone https://github.com/Michael-wzl/mtcnn_pytorch
```
- Install the dependencies:
Adjust `requirements.txt` depending on whether you have CUDA devices, then run:
```bash
$ pip install -r requirements.txt
```
- Run the demo:
```bash
$ python demo.py
```
- Integrate the module into your own code:
```python
import torch
from mtcnn import MTCNN

device = torch.device("cuda:0")
detector = MTCNN(device=device)

# Any float tensor of shape (N, C, H, W) works; a random one is used here.
img = torch.randn(1, 3, 640, 640).to(device)
boxes, probs = detector(img)  # equivalent to detector.detect(img)
```
Refer to `demo.py` for more complete usage examples.
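As a slightly fuller sketch of the same API on a real image file (the path, the CPU fallback, and the float conversion below are illustrative; `demo.py` is the authority on the exact preprocessing the model expects):

```python
import torch
from torchvision.io import read_image
from mtcnn import MTCNN

# Fall back to CPU when no CUDA device is visible.
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
detector = MTCNN(device=device)

# read_image returns a uint8 (C, H, W) tensor; add a batch dimension,
# move it to the detector's device, and convert to float.
# NOTE: "face.jpg" is a placeholder path, and the exact scaling/normalization
# expected by the model should be taken from demo.py.
img = read_image("face.jpg").unsqueeze(0).to(device).float()

boxes, probs = detector.detect(img)
print(boxes, probs)
```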
Currently, images are still fed into the model one by one, without making full use of batched computation. The primary obstacle is dealing with zero-detection images within a batch; this is a clear point for improvement (one possible bookkeeping scheme is sketched below). Also, several operations are still suboptimal, such as the `_square` and `_calibrate` functions. Moreover, misdetections still occur, so hyperparameters like `nms_thresh` may need further tuning. Lastly, if I find that many people are interested in the repo, I will take the trouble to turn it into a library installable through `pip` and `conda` and make sure it undergoes more comprehensive testing.