add guide on adapters (#8671)

chenmoneygithub · web-flow · commit 1b90b4b7410c · 2025-08-16T08:31:50.000-04:00
diff --git a/docs/docs/learn/programming/adapters.md b/docs/docs/learn/programming/adapters.md
@@ -0,0 +1,307 @@
+# Understanding DSPy Adapters
+
+## What are Adapters?
+
+Adapters are the bridge between `dspy.Predict` and the actual Language Model (LM). When you call a DSPy module, the
+adapter takes your signature, user inputs, and other attributes like `demos` (few-shot examples) and converts them
+into multi-turn messages that get sent to the LM.
+
+The adapter system is responsible for:
+
+- Translating DSPy signatures into system messages that define the task and request/response structure.
+- Formatting input data according to the request structure outlined in DSPy signatures.
+- Parsing LM responses back into structured DSPy outputs, such as `dspy.Prediction` instances.
+- Managing conversation history and function calls.
+- Converting pre-built DSPy types into LM prompt messages, e.g., `dspy.Tool`, `dspy.Image`, etc.
+
+## Configure Adapters
+
+You can use `dspy.configure(adapter=...)` to choose the adapter for the entire Python process, or
+`with dspy.context(adapter=...):` to only affect a certain namespace.
+
+If no adapter is specified in the DSPy workflow, each `dspy.Predict.__call__` defaults to using the `dspy.ChatAdapter`. Thus, the two code snippets below are equivalent:
+
+```python
+import dspy
+
+dspy.configure(lm=dspy.LM("openai/gpt-4o-mini"))
+
+predict = dspy.Predict("question -> answer")
+result = predict(question="What is the capital of France?")
+```
+
+```python
+import dspy
+
+dspy.configure(
+    lm=dspy.LM("openai/gpt-4o-mini"),
+    adapter=dspy.ChatAdapter(),  # This is the default value
+)
+
+predict = dspy.Predict("question -> answer")
+result = predict(question="What is the capital of France?")
+```
+
+## Where Adapters Fit in the System
+
+The flow works as follows:
+
+1. The user calls their DSPy agent, typically a `dspy.Module` with inputs.
+2. The inner `dspy.Predict` is invoked to obtain the LM response.
+3. `dspy.Predict` calls **Adapter.format()**, which converts its signature, inputs, and demos into multi-turn messages sent to the `dspy.LM`. `dspy.LM` is a thin wrapper around `litellm`, which communicates with the LM endpoint.
+4. The LM receives the messages and generates a response.
+5. **Adapter.parse()** converts the LM response into structured DSPy outputs, as specified in the signature.
+6. The caller of `dspy.Predict` receives the parsed outputs.
+
+You can explicitly call `Adapter.format()` to view the messages sent to the LM.
+
+```python
+# Simplified flow example
+signature = dspy.Signature("question -> answer")
+inputs = {"question": "What is 2+2?"}
+demos = [{"question": "What is 1+1?", "answer": "2"}]
+
+adapter = dspy.ChatAdapter()
+print(adapter.format(signature, demos, inputs))
+```
+
+The output should resemble:
+
+```
+{'role': 'system', 'content': 'Your input fields are:\n1. `question` (str):\nYour output fields are:\n1. `answer` (str):\nAll interactions will be structured in the following way, with the appropriate values filled in.\n\n[[ ## question ## ]]\n{question}\n\n[[ ## answer ## ]]\n{answer}\n\n[[ ## completed ## ]]\nIn adhering to this structure, your objective is: \n        Given the fields `question`, produce the fields `answer`.'}
+{'role': 'user', 'content': '[[ ## question ## ]]\nWhat is 1+1?'}
+{'role': 'assistant', 'content': '[[ ## answer ## ]]\n2\n\n[[ ## completed ## ]]\n'}
+{'role': 'user', 'content': '[[ ## question ## ]]\nWhat is 2+2?\n\nRespond with the corresponding output fields, starting with the field `[[ ## answer ## ]]`, and then ending with the marker for `[[ ## completed ## ]]`.'}
+```
+
+## Types of Adapters
+
+DSPy offers several adapter types, each tailored for specific use cases:
+
+### ChatAdapter
+
+**ChatAdapter** is the default adapter and works with all language models. It uses a field-based format with special markers.
+
+#### Format Structure
+
+ChatAdapter uses `[[ ## field_name ## ]]` markers to delineate fields. For fields of non-primitive Python types, it includes the JSON schema of the type. Below, we use `dspy.inspect_history()` to display the formatted messages by `dspy.ChatAdapter` clearly.
+
+```python
+import dspy
+import pydantic
+
+dspy.configure(lm=dspy.LM("openai/gpt-4o-mini"), adapter=dspy.ChatAdapter())
+
+
+class ScienceNews(pydantic.BaseModel):
+    text: str
+    scientists_involved: list[str]
+
+
+class NewsQA(dspy.Signature):
+    """Get news about the given science field"""
+
+    science_field: str = dspy.InputField()
+    year: int = dspy.InputField()
+    num_of_outputs: int = dspy.InputField()
+    news: list[ScienceNews] = dspy.OutputField(desc="science news")
+
+
+predict = dspy.Predict(NewsQA)
+predict(science_field="Computer Theory", year=2022, num_of_outputs=1)
+dspy.inspect_history()
+```
+
+The output looks like:
+
+```
+[2025-08-15T22:24:29.378666]
+
+System message:
+
+Your input fields are:
+1. `science_field` (str):
+2. `year` (int):
+3. `num_of_outputs` (int):
+Your output fields are:
+1. `news` (list[ScienceNews]): science news
+All interactions will be structured in the following way, with the appropriate values filled in.
+
+[[ ## science_field ## ]]
+{science_field}
+
+[[ ## year ## ]]
+{year}
+
+[[ ## num_of_outputs ## ]]
+{num_of_outputs}
+
+[[ ## news ## ]]
+{news}        # note: the value you produce must adhere to the JSON schema: {"type": "array", "$defs": {"ScienceNews": {"type": "object", "properties": {"scientists_involved": {"type": "array", "items": {"type": "string"}, "title": "Scientists Involved"}, "text": {"type": "string", "title": "Text"}}, "required": ["text", "scientists_involved"], "title": "ScienceNews"}}, "items": {"$ref": "#/$defs/ScienceNews"}}
+
+[[ ## completed ## ]]
+In adhering to this structure, your objective is:
+        Get news about the given science field
+
+
+User message:
+
+[[ ## science_field ## ]]
+Computer Theory
+
+[[ ## year ## ]]
+2022
+
+[[ ## num_of_outputs ## ]]
+1
+
+Respond with the corresponding output fields, starting with the field `[[ ## news ## ]]` (must be formatted as a valid Python list[ScienceNews]), and then ending with the marker for `[[ ## completed ## ]]`.
+
+
+Response:
+
+[[ ## news ## ]]
+[
+    {
+        "scientists_involved": ["John Doe", "Jane Smith"],
+        "text": "In 2022, researchers made significant advancements in quantum computing algorithms, demonstrating their potential to solve complex problems faster than classical computers. This breakthrough could revolutionize fields such as cryptography and optimization."
+    }
+]
+
+[[ ## completed ## ]]
+```
+
+!!! info "Practice: locate Signature information in the printed LM history"
+
+    Try adjusting the signature, and observe how the changes are reflected in the printed LM message.
+
+
+Each field is preceded by a marker `[[ ## field_name ## ]]`. If an output field has non-primitive types, the instruction includes the type's JSON schema, and the output is formatted accordingly. Because the output field is structured as defined by ChatAdapter, it can be automatically parsed into structured data.
+
+#### When to Use ChatAdapter
+
+`ChatAdapter` offers the following advantages:
+
+- **Universal compatibility**: Works with all language models, though smaller models may generate responses that do not match the required format.
+- **Fallback protection**: If `ChatAdapter` fails, it automatically retries with `JSONAdapter`.
+
+In general, `ChatAdapter` is a reliable choice if you don't have specific requirements.
+
+#### When Not to Use ChatAdapter
+
+Avoid using `ChatAdapter` if you are:
+
+- **Latency sensitive**: `ChatAdapter` includes more boilerplate output tokens compared to other adapters, so if you're building a system sensitive to latency, consider using a different adapter.
+
+### JSONAdapter
+
+**JSONAdapter** prompts the LM to return JSON data containing all output fields as specified in the signature. It is effective for models that support structured output via the `response_format` parameter, leveraging native JSON generation capabilities for more reliable parsing.
+
+#### Format Structure
+
+The input part of the prompt formatted by `JSONAdapter` is similar to `ChatAdapter`, but the output part differs, as shown below:
+
+```python
+import dspy
+import pydantic
+
+dspy.configure(lm=dspy.LM("openai/gpt-4o-mini"), adapter=dspy.JSONAdapter())
+
+
+class ScienceNews(pydantic.BaseModel):
+    text: str
+    scientists_involved: list[str]
+
+
+class NewsQA(dspy.Signature):
+    """Get news about the given science field"""
+
+    science_field: str = dspy.InputField()
+    year: int = dspy.InputField()
+    num_of_outputs: int = dspy.InputField()
+    news: list[ScienceNews] = dspy.OutputField(desc="science news")
+
+
+predict = dspy.Predict(NewsQA)
+predict(science_field="Computer Theory", year=2022, num_of_outputs=1)
+dspy.inspect_history()
+```
+
+```
+System message:
+
+Your input fields are:
+1. `science_field` (str):
+2. `year` (int):
+3. `num_of_outputs` (int):
+Your output fields are:
+1. `news` (list[ScienceNews]): science news
+All interactions will be structured in the following way, with the appropriate values filled in.
+
+Inputs will have the following structure:
+
+[[ ## science_field ## ]]
+{science_field}
+
+[[ ## year ## ]]
+{year}
+
+[[ ## num_of_outputs ## ]]
+{num_of_outputs}
+
+Outputs will be a JSON object with the following fields.
+
+{
+  "news": "{news}        # note: the value you produce must adhere to the JSON schema: {\"type\": \"array\", \"$defs\": {\"ScienceNews\": {\"type\": \"object\", \"properties\": {\"scientists_involved\": {\"type\": \"array\", \"items\": {\"type\": \"string\"}, \"title\": \"Scientists Involved\"}, \"text\": {\"type\": \"string\", \"title\": \"Text\"}}, \"required\": [\"text\", \"scientists_involved\"], \"title\": \"ScienceNews\"}}, \"items\": {\"$ref\": \"#/$defs/ScienceNews\"}}"
+}
+In adhering to this structure, your objective is:
+        Get news about the given science field
+
+
+User message:
+
+[[ ## science_field ## ]]
+Computer Theory
+
+[[ ## year ## ]]
+2022
+
+[[ ## num_of_outputs ## ]]
+1
+
+Respond with a JSON object in the following order of fields: `news` (must be formatted as a valid Python list[ScienceNews]).
+
+
+Response:
+
+{
+  "news": [
+    {
+      "text": "In 2022, researchers made significant advancements in quantum computing algorithms, demonstrating that quantum systems can outperform classical computers in specific tasks. This breakthrough could revolutionize fields such as cryptography and complex system simulations.",
+      "scientists_involved": [
+        "Dr. Alice Smith",
+        "Dr. Bob Johnson",
+        "Dr. Carol Lee"
+      ]
+    }
+  ]
+}
+```
+
+#### When to Use JSONAdapter
+
+`JSONAdapter` is good at:
+
+- **Structured output support**: When the model supports the `response_format` parameter.
+- **Low latency**: Minimal boilerplate in the LM response results in faster responses.
+
+#### When Not to Use JSONAdapter
+
+Avoid using `JSONAdapter` if you are:
+
+- Using a model that does not natively support structured output, such as a small open-source model hosted on Ollama.
+
+## Summary
+
+Adapters are a crucial component of DSPy that bridge the gap between structured DSPy signatures and language model APIs.
+Understanding when and how to use different adapters will help you build more reliable and efficient DSPy programs.
diff --git a/docs/mkdocs.yml b/docs/mkdocs.yml
@@ -17,6 +17,7 @@ nav:
             - Language Models: learn/programming/language_models.md
             - Signatures: learn/programming/signatures.md
             - Modules: learn/programming/modules.md
+            - Adapters: learn/programming/adapters.md
         - DSPy Evaluation:
             - Evaluation Overview: learn/evaluation/overview.md
             - Data Handling: learn/evaluation/data.md