GPU/CPU Management & Inference #19
Conversation
K-rolls left a comment:
Just some small changes requested, great work buddy
Daayim left a comment:
Good stuff. I just had a few questions if you don't mind taking a look.
Daayim left a comment:
LGTM

Related Task
Changes
Documentation
Local Testing
Create GPU Machine.
Script run on machine creation to expose the Ollama port:
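The actual script isn't reproduced in this PR. As a minimal sketch, assuming the stock Ollama CLI: exposing the port typically means binding the server to 0.0.0.0 via the OLLAMA_HOST environment variable instead of the loopback-only default (11434 is Ollama's default port):

```python
# Minimal sketch, not the PR's actual script: start Ollama bound to all
# interfaces so its port is reachable from outside the machine.
import os
import subprocess

def start_ollama(port: int = 11434) -> subprocess.Popen:
    env = os.environ.copy()
    # OLLAMA_HOST sets the bind address; the default (127.0.0.1) is
    # loopback-only, so 0.0.0.0 is needed to expose the port externally.
    env["OLLAMA_HOST"] = f"0.0.0.0:{port}"
    return subprocess.Popen(["ollama", "serve"], env=env)

if __name__ == "__main__":
    start_ollama().wait()
```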

Pull Model to machine:
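The exact request used in testing isn't shown; a sketch against Ollama's standard HTTP API would look like the following, where `host` and the model name are assumptions rather than values from this PR:

```python
import requests

def pull_model(host: str, model: str) -> dict:
    """Download a model onto the Ollama server at `host`,
    e.g. http://127.0.0.1:11434."""
    resp = requests.post(
        f"{host}/api/pull",
        json={"model": model, "stream": False},  # stream=False -> one final JSON reply
        timeout=600,  # pulls can take minutes for large models
    )
    resp.raise_for_status()
    return resp.json()  # {"status": "success"} on completion
```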
Delete Model from machine:
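Deletion goes through Ollama's `/api/delete` endpoint; again a sketch, with `host` assumed:

```python
import requests

def delete_model(host: str, model: str) -> None:
    """Remove a model from the Ollama server at `host`."""
    resp = requests.delete(f"{host}/api/delete", json={"model": model})
    resp.raise_for_status()  # 200 OK with an empty body on success
```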
Get Inference URL for machine access:
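The PR doesn't show how the URL is derived, so the helper below is purely hypothetical; it only illustrates that the inference URL is the machine's address plus Ollama's port:

```python
def inference_url(machine_address: str, port: int = 11434) -> str:
    # Hypothetical helper: the project's real machine-address lookup
    # is not shown in this PR.
    return f"http://{machine_address}:{port}"

# inference_url("10.0.0.5") -> "http://10.0.0.5:11434"
```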
Inference:
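Inference itself would be a call to Ollama's `/api/generate` endpoint. A non-streaming sketch, where the host, model name, and prompt are placeholders:

```python
import requests

def generate(host: str, model: str, prompt: str) -> str:
    """Run a single non-streaming completion against the Ollama server."""
    resp = requests.post(
        f"{host}/api/generate",
        json={"model": model, "prompt": prompt, "stream": False},
        timeout=300,
    )
    resp.raise_for_status()
    return resp.json()["response"]  # the generated text

# e.g. generate("http://127.0.0.1:11434", "llama3", "Why is the sky blue?")
```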
All endpoints refactored.