Introducing Kite AI Agent: Conversational Operations for Kubernetes #409
Replies: 1 comment
-
|
This is a great direction — turning multi-step debugging workflows into conversations is exactly where K8s ops should be heading. A few thoughts from watching similar tool-calling agent patterns:
Feature idea: Consider a "dry-run" mode where the agent explains what it would do without actually executing. Great for learning and for users who want to understand the underlying API calls before giving full trust. Excited to see where this goes. The RBAC-scoped execution is the right foundation. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Managing Kubernetes clusters often involves a frustrating amount of context switching between a dashboard to visualize state and a terminal to actually get things done. Kite already simplifies this with a highly visual React and Go-based experience, but we wanted to take operational workflows a step further.
Today we are introducing the Kite AI Agent—a built-in, context-aware assistant powered by OpenAI and Anthropic models. This isn't just a chatbot that spits out generic Kubernetes documentation; it actively interacts with your cluster.
Why We Built It
Diagnosing a failing service usually looks like this:
The Kite AI Agent turns this multi-step process into a conversation. You can simply ask, "Why is the
auth-servicedeployment crashing?" The agent will look at the deployment state, fetch the associated pod logs, identify the problem, and suggest a fix. If you tell it, "Add the missingAPI_URLto the ConfigMap and restart the deployment," it will generate the necessary patches and apply them.What It Can Do
The agent is built leveraging LLM tool-calling (function calling). We've equipped it with a robust set of tools that allow it to safely read and mutate cluster state using standard Kubernetes APIs:
1. Contextual Diagnostics
Instead of chaining together multiple
kubectl getanddescribecommands, you can query your infrastructure using natural language:productionnamespace that are currently inCrashLoopBackOff."2. Active Remediation
The AI agent isn't strictly read-only. It can modify infrastructure directly, making it an excellent tool for rapid fixes and prototyping:
Under the Hood
The Kite AI Agent runs entirely within our Go backend (
pkg/ai). We built native integrations using the official Anthropic and OpenAI Go SDKs, giving you the flexibility to choose the model that best fits your workflow.When you prompt the agent, it translates your intent into precise
client-goAPI calls using dynamic clients (viaunstructuredtypes and discovery mapping). Tool calls likepatch_resourceorget_pod_logsare mapped directly to core Kubernetes APIs.Because giving an LLM access to your infrastructure requires strict guardrails, the agent heavily relies on Kite's existing Role-Based Access Control (RBAC) implementation. The agent operates strictly within the boundaries of the logged-in user's permissions—it cannot perform actions or access namespaces that the user is not authorized to see.
Getting Started
https://github.com/kite-org/kite/releases/tag/v0.8.0
To try out the agent, pull the latest version of Kite, navigate to the AI configuration panel, and add your API key for OpenAI or Anthropic.
We are actively expanding the agent's toolbelt to handle more advanced operational workflows, including multi-cluster diagnostics and prometheus query tools.
Beta Was this translation helpful? Give feedback.
All reactions