Human Action Recognition

A service that predicts human action in a video using pre-trained Resnet 3D PyTorch models. Deployed on AWS Cloud using ECS, S3, EventBridge, and Lambda.

basketball.mp4

🌟 How It Works

Get S3 presigned URL from API Gateway, and upload an input video to S3 bucket using the presigned URL.

Put operation on S3 is logged with CloudTrail. Eventbridge rule recognizes the put object log and starts a new ECS task.

The triggered and initialized ECS task does the human action recognition prediction using pre-trained Pytorch model, and saves the output to S3 bucket.

🏃🏻‍♀️ Usage

I implemented a simple client side code using FastAPI. Please refer to this page for how to use this service.

💭 Thoughts and Optimization

Both input and output videos are saved in the same S3 bucket. For the EventBridge rule on S3 PUT OBJECT, input and output videos should be in separate buckets to avoid infinite cycles for the best practice.

Scalability
- Currently, ECS task is created every time an input video is uploaded to S3 bucket and exits after the Machine Learning prediction and processing is done. For higher availability of processing in ECS, deploy an ECS task so that it continuously listens to incoming requests
- Lambda function that provides presigned url and S3 bucket for video storage is highly scalable.
Latency
- With webhook url, the client side does not have to wait for the processing completion.
  - The frontend provides a webhook url as well as an input video to S3, then the rest of the processing is done asynchronously. For more detail, look at client_app.py
  - Without the webhook url, it takes at least a few minutes to complete the processing.
- Multi-part upload can improve the performance of video upload.
- Reduce the cold start time of AWS lambda with provisioned concurrency.
QPS
- max. number of concurrent lambda function invocations is 1000 per AWS Region.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
app		app
client		client
images		images
lambda_container		lambda_container
s3operators		s3operators
s3presignedurl		s3presignedurl
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Human Action Recognition

🌟 How It Works

🏃🏻‍♀️ Usage

💭 Thoughts and Optimization

💡 Things I learned

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Human Action Recognition

🌟 How It Works

🏃🏻‍♀️ Usage

💭 Thoughts and Optimization

💡 Things I learned

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages