Add demo for concurrent users #356

Merged: bebechien merged 4 commits into main from concurrent on Apr 17, 2026

Conversation

@MaartenGr
Contributor

10+ Concurrent Requests of Gemma 4 on MacBook Pro M4 Max

Adds an example showing how efficiently Gemma 4 26B A4B handles concurrent requests on a MacBook. I got it working nicely with 10 concurrent requests to the same model, at about 18 tokens/s (t/s) per request!
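To make the per-request number concrete, here is a minimal sketch of how concurrent requests and their throughput could be measured. It is not the code from this PR: `fake_generate` is a hypothetical stand-in for a streaming call to a local model server, and the token counts and delays are made up for illustration. The aggregate figure assumes all requests sustain the per-request rate at the same time, in which case 10 requests at about 18 t/s would add up to roughly 180 t/s overall.

```python
import asyncio
import time


async def fake_generate(prompt: str, n_tokens: int = 20,
                        delay_per_token: float = 0.001) -> int:
    """Hypothetical stand-in for one streaming request to a local model server."""
    for _ in range(n_tokens):
        # Simulate waiting for the next token to arrive.
        await asyncio.sleep(delay_per_token)
    return n_tokens


async def run_concurrent(n_requests: int = 10) -> dict:
    """Fire n_requests at once and report per-request and aggregate tokens/s."""
    prompts = [f"prompt {i}" for i in range(n_requests)]
    start = time.perf_counter()
    # gather() runs all requests concurrently on one event loop.
    counts = await asyncio.gather(*(fake_generate(p) for p in prompts))
    elapsed = time.perf_counter() - start
    total = sum(counts)
    return {
        "requests": n_requests,
        "total_tokens": total,
        "per_request_tps": (total / n_requests) / elapsed,
        "aggregate_tps": total / elapsed,
    }


def aggregate_tps(n_requests: int, per_request_tps: float) -> float:
    """Aggregate throughput if every request sustains per_request_tps."""
    return n_requests * per_request_tps


if __name__ == "__main__":
    print(asyncio.run(run_concurrent(10)))
    print(aggregate_tps(10, 18))
```

Swapping `fake_generate` for a real client call would keep the measurement logic unchanged; the numbers printed by the stub say nothing about actual model performance.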

demo_concurrent_github.mp4

Placed in apps/ since this is a demo to showcase the efficiency of the models (any model can be used).

NOTE: I added a .png because a .gif would be too large. I may be able to embed a video instead by uploading it here and referencing it in the README.md; let me try.

Collaborator

@bebechien left a comment

lgtm!

@bebechien merged commit 786a782 into main on Apr 17, 2026
2 checks passed