[MLOB-3525] add setup instructions for llm obs litellm integration (DataDog#20911)
* add setup instructions for llm obs litellm integration
* add apm section
* remove detailed llm obs and apm sections in favor of linking to llm obs public documentation
`litellm/README.md` (19 additions, 9 deletions)
# LiteLLM
## Overview
Monitor, troubleshoot, and evaluate your LLM-powered applications built using [LiteLLM][1]: a lightweight, open-source proxy and analytics layer for large language model (LLM) APIs. It enables unified access, observability, and cost control across multiple LLM providers.
Use LLM Observability to investigate the root cause of issues, monitor operational performance, and evaluate the quality, privacy, and safety of your LLM applications.
See the [LLM Observability tracing view video](https://imgix.datadoghq.com/video/products/llm-observability/expedite-troubleshooting.mp4?fm=webm&fit=max) for an example of how you can investigate a trace.
Get cost estimation, prompt and completion sampling, error tracking, performance metrics, and more out of [LiteLLM][1] Python library requests using Datadog metrics and APM.
Key metrics such as request/response counts, latency, error rates, token usage, and spend per provider or deployment are monitored. This data enables customers to track usage patterns, detect anomalies, control costs, and troubleshoot issues quickly, ensuring efficient and reliable LLM operations through LiteLLM's health check and Prometheus endpoints.
## Setup
### LLM Observability: Get end-to-end visibility into your LLM application using LiteLLM
See the [LiteLLM integration docs][12] for details on how to get started with LLM Observability for LiteLLM.
### Agent Check: LiteLLM
Follow the instructions below to install and configure this check for an Agent running on a host. For containerized environments, see the [Autodiscovery Integration Templates][3] for guidance on applying these instructions.
#### Installation
Starting from Agent 7.68.0, the LiteLLM check is included in the [Datadog Agent][2] package. No additional installation is needed on your server.
#### Configuration
This integration collects metrics through the Prometheus endpoint exposed by the LiteLLM Proxy. This feature is only available for enterprise users of LiteLLM. By default, the metrics are exposed on the `/metrics` endpoint. If connecting locally, the default port is 4000. For more information, see the [LiteLLM Prometheus documentation][10].
Note: The listed metrics can only be collected if they are available. Some metrics are generated only after certain actions occur. For example, the `litellm.auth.failed_requests.count` metric might only be exposed after a failed authentication request has occurred.
##### Host-based
1. Edit the `litellm.d/conf.yaml` file in the `conf.d/` folder at the root of your Agent's configuration directory to start collecting your LiteLLM performance data. See the [sample litellm.d/conf.yaml][4] for all available configuration options. Example config:
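   The body of the example is elided from this diff. Below is a minimal sketch, assuming the check uses the standard OpenMetrics option names and the default LiteLLM Proxy port of 4000; see the [sample litellm.d/conf.yaml][4] for the authoritative options:

   ```yaml
   init_config:

   instances:
     # The Prometheus endpoint exposed by the LiteLLM Proxy.
     # Port 4000 is the LiteLLM default; adjust for your deployment.
     - openmetrics_endpoint: http://localhost:4000/metrics
   ```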
2. [Restart the Agent][5].
##### Kubernetes-based
For LiteLLM Proxy running on Kubernetes, you can configure the check through pod annotations. See the example below:
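The annotated pod spec is elided from this diff. A minimal sketch using Datadog Autodiscovery annotations, assuming the default LiteLLM Proxy port; the pod name, container name, image tag, and check options here are illustrative:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: litellm
  annotations:
    # Autodiscovery annotation: Datadog applies this check config to the
    # container named "litellm" in this pod.
    ad.datadoghq.com/litellm.checks: |
      {
        "litellm": {
          "init_config": {},
          "instances": [
            {"openmetrics_endpoint": "http://%%host%%:4000/metrics"}
          ]
        }
      }
spec:
  containers:
    - name: litellm
      image: ghcr.io/berriai/litellm:main-latest  # illustrative image tag
      ports:
        - containerPort: 4000
```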
For more information and alternative ways to configure the check in Kubernetes-based environments, see the [Kubernetes Integration Setup documentation][3].
##### Logs
LiteLLM can send logs to Datadog through its callback system. You can configure various logging settings in LiteLLM to customize log formatting and delivery to Datadog for ingestion. For detailed configuration options and setup instructions, refer to the [LiteLLM Logging Documentation][11].
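As a sketch of what this can look like in the LiteLLM proxy config (option names follow LiteLLM's callback settings; verify them against the linked documentation, and set `DD_API_KEY` and `DD_SITE` in the proxy's environment):

```yaml
litellm_settings:
  # Forward request logs to Datadog through LiteLLM's callback system.
  success_callback: ["datadog"]
  failure_callback: ["datadog"]
```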
#### Validation
Run the Agent's status subcommand ([see documentation][6]) and look for `litellm` under the Checks section.
Need help? Contact [Datadog support][9].