Skip to content

Conversation

X1aoZEOuO
Copy link
Contributor

What this PR does / why we need it

Detailed Explanation of Commit

This commit introduces a guide for configuring serverless environments on Kubernetes, focusing on integrating Prometheus for monitoring and KEDA for autoscaling. The guide aims to optimize resource efficiency through event-driven scaling while maintaining observability for AI/ML workloads.

  • Prometheus Integration: Configured with namespaceSelector for cross-namespace monitoring
  • KEDA Autoscaling: Custom metric scaling with Prometheus triggers
  • Scale-to-Zero: Activator pattern with request buffering and CloudEvents

Which issue(s) this PR fixes

Fixes #

Special notes for your reviewer

Does this PR introduce a user-facing change?


cc @pacoxu @kerthcet

Signed-off-by: X1aoZEOuO <nizefeng2002@outlook.com>
Signed-off-by: X1aoZEOuO <nizefeng2002@outlook.com>
Signed-off-by: X1aoZEOuO <nizefeng2002@outlook.com>
@InftyAI-Agent InftyAI-Agent added needs-triage Indicates an issue or PR lacks a label and requires one. needs-priority Indicates a PR lacks a label and requires one. do-not-merge/needs-kind Indicates a PR lacks a label and requires one. labels Sep 28, 2025
@X1aoZEOuO X1aoZEOuO force-pushed the feat/1-n-keda-support branch from fea7121 to b73e6f0 Compare September 28, 2025 12:18
@X1aoZEOuO
Copy link
Contributor Author

/kind feature

@InftyAI-Agent InftyAI-Agent added feature Categorizes issue or PR as related to a new feature. and removed do-not-merge/needs-kind Indicates a PR lacks a label and requires one. labels Sep 28, 2025
@X1aoZEOuO
Copy link
Contributor Author

X1aoZEOuO commented Sep 29, 2025

@pacoxu @googs1025 @carlory @kerthcet Hello all! Could you spare a few minutes to review my PRs when you have a chance?

Other ref PRs:

@pacoxu
Copy link
Contributor

pacoxu commented Oct 9, 2025

/assign
I will take a look this week or early next week.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

feature Categorizes issue or PR as related to a new feature. needs-priority Indicates a PR lacks a label and requires one. needs-triage Indicates an issue or PR lacks a label and requires one.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants