diff --git a/libs/oci/test_parallel_tool_calling_integration.py b/libs/oci/tests/integration_tests/chat_models/test_parallel_tool_calling_integration.py
rename from libs/oci/test_parallel_tool_calling_integration.py
rename to libs/oci/tests/integration_tests/chat_models/test_parallel_tool_calling_integration.py
diff --git a/libs/oci/OCI_API_GAP_ANALYSIS.md b/libs/oci/OCI_API_GAP_ANALYSIS.md
@@ -0,0 +1,281 @@
+# OCI Generative AI API vs langchain-oci SDK Gap Analysis
+
+## Executive Summary
+
+This document compares the OCI Generative AI API specification with the current langchain-oci implementation to identify missing functionality and parameters.
+
+**Last Updated:** 2025-10-30
+**OCI SDK Version Analyzed:** 2.162.0
+**langchain-oci Version:** Current main branch
+
+---
+
+## Current Architecture
+
+The langchain-oci implementation passes parameters through this flow:
+1. **Class-level:** `model_kwargs` dict in `ChatOCIGenAI.__init__()`
+2. **Method-level:** `kwargs` passed to `invoke()`, `stream()`, or `bind_tools()`
+3. **Merged:** Combined into `chat_params` at line 1186
+4. **Passed:** Directly to `oci_chat_request(**chat_params)` at line 1198
+
+This means **most API parameters can technically be used via `model_kwargs`**, but they lack:
+- Type validation
+- Documentation
+- IDE autocomplete
+- First-class API support
+- Discoverability (users do not know the parameters exist)
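
The merge step in the flow above can be illustrated with a minimal sketch. The names `merge_chat_params` and `fake_chat_request` are hypothetical stand-ins, not functions from langchain-oci, and the precedence rule (call-level values override class-level ones) is an assumption made for the example.

```python
from typing import Any, Dict, Optional


def merge_chat_params(
    model_kwargs: Optional[Dict[str, Any]],
    call_kwargs: Dict[str, Any],
) -> Dict[str, Any]:
    """Combine class-level and call-level parameters into one dict.

    Assumption: call-level values win when both dicts define the same key.
    """
    merged = dict(model_kwargs or {})
    merged.update(call_kwargs)
    return merged


def fake_chat_request(**chat_params: Any) -> Dict[str, Any]:
    """Stand-in for the request constructor; it performs no validation."""
    return chat_params


# Class-level defaults, then per-call overrides.
request = fake_chat_request(
    **merge_chat_params(
        {"temperature": 0.2, "seed": 42},
        {"temperature": 0.7, "top_p": 0.9},
    )
)
print(request)
```

Because the merged dict is forwarded as-is, nothing in this path checks key names or value types before the request object is built, which is the gap the list above describes.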
+
+---
+
+## Gap Categories
+
+### ✅ **Fully Implemented**
+Parameters with first-class support and documentation
+
+### ⚠️ **Partially Implemented**
+Parameters that work via `model_kwargs` but lack documentation/validation
+
+### ❌ **Not Implemented**
+Parameters that the SDK does not accept or pass through to the API
+
+### 🔧 **Needs Fix**
+Features with known issues reported by users
+
+---
+
+## GenericChatRequest Parameters (Meta Llama, Grok, OpenAI, Mistral)
+
+### Core Parameters
+
+| Parameter | OCI API | langchain-oci | Status | Notes |
+|-----------|---------|---------------|--------|-------|
+| `messages` | ✅ Required | ✅ Implemented | ✅ Fully Implemented | Via LangChain message types |
+| `api_format` | ✅ Required | ✅ Implemented | ✅ Fully Implemented | Auto-set to "GENERIC" |
+| `is_stream` | ✅ Optional | ✅ Implemented | ✅ Fully Implemented | Via `stream()` method |
+| `tools` | ✅ Optional | ✅ Implemented | ✅ Fully Implemented | Via `bind_tools()` |
+| `tool_choice` | ✅ Optional | ✅ Implemented | ✅ Fully Implemented | Via `bind_tools(tool_choice=...)` |
+
+### Generation Control Parameters
+
+| Parameter | OCI API | langchain-oci | Status | Notes |
+|-----------|---------|---------------|--------|-------|
+| `temperature` | ✅ Optional (float) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | Works but not documented as first-class param |
+| `max_tokens` | ✅ Optional (int) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | Works but not validated |
+| `max_completion_tokens` | ✅ Optional (int) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | Recommended for OpenAI models |
+| `top_k` | ✅ Optional (int) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | No validation |
+| `top_p` | ✅ Optional (float) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | No validation |
+| `stop` | ✅ Optional (list[str]) | ✅ Implemented | ✅ Fully Implemented | Via `invoke(stop=...)` |
+| `num_generations` | ✅ Optional (int) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | Not documented |
+
+### Quality & Control Parameters
+
+| Parameter | OCI API | langchain-oci | Status | Priority | Notes |
+|-----------|---------|---------------|--------|----------|-------|
+| `seed` | ✅ Optional (int) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **HIGH** | Critical for reproducibility |
+| `frequency_penalty` | ✅ Optional (float) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **HIGH** | Common quality control |
+| `presence_penalty` | ✅ Optional (float) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **HIGH** | Common quality control |
+| `reasoning_effort` | ✅ Optional (enum) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **MEDIUM** | For reasoning models |
+| `verbosity` | ✅ Optional (enum) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **LOW** | Response length control |
+
+### Advanced Parameters
+
+| Parameter | OCI API | langchain-oci | Status | Priority | Notes |
+|-----------|---------|---------------|--------|----------|-------|
+| `response_format` | ✅ Optional (object) | 🔧 **Broken** | 🔧 Needs Fix | **HIGH** | Issue #33 - Users need structured output |
+| `logit_bias` | ✅ Optional (dict) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **LOW** | Advanced use case |
+| `log_probs` | ✅ Optional (int) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **LOW** | Debugging/analysis |
+| `is_echo` | ✅ Optional (bool) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **LOW** | Include prompt in response |
+| `metadata` | ✅ Optional (object) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **LOW** | Custom metadata |
+| `prediction` | ✅ Optional (object) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **LOW** | Prediction configuration |
+
+### Streaming Parameters
+
+| Parameter | OCI API | langchain-oci | Status | Priority | Notes |
+|-----------|---------|---------------|--------|----------|-------|
+| `stream_options` | ✅ Optional (object) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **MEDIUM** | Streaming configuration |
+
+### Specialized Features
+
+| Parameter | OCI API | langchain-oci | Status | Priority | Notes |
+|-----------|---------|---------------|--------|----------|-------|
+| `web_search_options` | ✅ Optional (object) | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **MEDIUM** | Web search integration |
+| `is_parallel_tool_calls` | ✅ Optional (bool) | ✅ Implemented | ✅ Fully Implemented | N/A | Added in PR #59 |
+
+---
+
+## CohereChatRequest Parameters (Cohere Models)
+
+### Core Parameters
+
+| Parameter | OCI API | langchain-oci | Status | Notes |
+|-----------|---------|---------------|--------|-------|
+| `message` | ✅ Required (str) | ✅ Implemented | ✅ Fully Implemented | Converted from LangChain messages |
+| `chat_history` | ✅ Optional | ✅ Implemented | ✅ Fully Implemented | Via message history |
+| `api_format` | ✅ Required | ✅ Implemented | ✅ Fully Implemented | Auto-set to "COHERE" |
+| `is_stream` | ✅ Optional | ✅ Implemented | ✅ Fully Implemented | Via `stream()` method |
+| `tools` | ✅ Optional | ✅ Implemented | ✅ Fully Implemented | Via `bind_tools()` |
+
+### Generation Control
+
+| Parameter | OCI API | langchain-oci | Status | Notes |
+|-----------|---------|---------------|--------|-------|
+| `temperature` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | Works but undocumented |
+| `max_tokens` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | Works but undocumented |
+| `max_input_tokens` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | Cohere-specific |
+| `top_k` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | Works but undocumented |
+| `top_p` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | Works but undocumented |
+| `stop_sequences` | ✅ Optional | ✅ Implemented | ✅ Fully Implemented | Via `invoke(stop=...)` |
+
+### Quality & Control
+
+| Parameter | OCI API | langchain-oci | Status | Priority | Notes |
+|-----------|---------|---------------|--------|----------|-------|
+| `seed` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **HIGH** | Reproducibility |
+| `frequency_penalty` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **HIGH** | Quality control |
+| `presence_penalty` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **HIGH** | Quality control |
+
+### Cohere-Specific Features
+
+| Parameter | OCI API | langchain-oci | Status | Priority | Notes |
+|-----------|---------|---------------|--------|----------|-------|
+| `documents` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **MEDIUM** | RAG support |
+| `response_format` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **HIGH** | Cohere structured output |
+| `is_search_queries_only` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **LOW** | Search query generation |
+| `preamble_override` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **MEDIUM** | System prompt override |
+| `prompt_truncation` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **LOW** | Prompt handling |
+| `tool_results` | ✅ Optional | ✅ Implemented | ✅ Fully Implemented | N/A | Via ToolMessage |
+| `is_force_single_step` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **MEDIUM** | Multi-step control |
+| `is_raw_prompting` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **LOW** | Raw prompt mode |
+| `is_echo` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **LOW** | Echo prompt |
+| `citation_quality` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **LOW** | Citation mode |
+| `safety_mode` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **MEDIUM** | Safety controls |
+| `stream_options` | ✅ Optional | ⚠️ Via model_kwargs | ⚠️ Partially Implemented | **MEDIUM** | Streaming config |
+
+---
+
+## Known Issues (GitHub)
+
+### 🔧 Critical Bugs
+
+1. **Issue #52: Tool Parameter Parsing**
+   - **Problem:** Llama models return escaped JSON strings `"{\\"key\\": \\"value\\"}"` instead of JSON objects
+   - **Impact:** Multi-turn tool conversations fail
+   - **Status:** Open, needs robust JSON unescaping
+   - **Priority:** **HIGH**
+
+2. **Issue #33: response_format Not Supported**
+   - **Problem:** `TypeError: Unrecognized keyword argument: response_format`
+   - **Impact:** Can't use `json_mode` with `with_structured_output()`
+   - **Status:** Open, parameter exists in API but not exposed
+   - **Priority:** **HIGH**
+
+3. **Issue #28: Llama Tool Response Integration**
+   - **Problem:** Llama calls tools but doesn't integrate results into final answer
+   - **Impact:** Tool calling workflows broken for Llama
+   - **Status:** Open, works with Cohere, fails with Llama
+   - **Priority:** **HIGH**
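
For Issue #52, one possible shape for the unescaping logic is to decode repeatedly until the value stops being a string. This is a sketch only: `parse_tool_arguments` and its `max_depth` limit are hypothetical and are not part of langchain-oci.

```python
import json
from typing import Any, Dict


def parse_tool_arguments(raw: Any, max_depth: int = 3) -> Dict[str, Any]:
    """Decode tool-call arguments that may be JSON-encoded more than once.

    A dict is returned unchanged. A string is passed through json.loads up to
    max_depth times, which handles both '{"key": "value"}' and the
    double-encoded form described in Issue #52.
    """
    value = raw
    for _ in range(max_depth):
        if not isinstance(value, str):
            break
        value = json.loads(value)
    if not isinstance(value, dict):
        raise ValueError(f"tool arguments did not decode to an object: {value!r}")
    return value


plain = '{"key": "value"}'
double_encoded = json.dumps(plain)  # a JSON string that itself contains JSON

print(parse_tool_arguments(plain))
print(parse_tool_arguments(double_encoded))
```

Bounding the number of decode passes keeps a malformed payload from being silently accepted: anything that is still not an object after `max_depth` passes raises `ValueError`.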
+
+### ⚠️ Medium Priority Issues
+
+4. **Issue #40: Gemini Structured Output**
+   - **Problem:** `with_structured_output()` doesn't work with Gemini 2.5
+   - **Status:** Open, provider-specific handling needed
+   - **Priority:** **MEDIUM**
+
+5. **Issue #37: OpenAI Tool Schema**
+   - **Problem:** Incomplete tool schema for OpenAI models; array arguments fail
+   - **Status:** Open
+   - **Priority:** **MEDIUM**
+
+6. **Issue #45: Invalid Function Schema**
+   - **Problem:** Tool schema validation failures
+   - **Status:** Open
+   - **Priority:** **MEDIUM**
+
+### 📦 Feature Requests
+
+7. **Issue #55: LangChain 1.0 Support**
+   - **Status:** Open
+   - **Priority:** **MEDIUM**
+
+8. **Issue #5: Tool Description Required**
+   - **Status:** Open
+   - **Priority:** **LOW**
+
+9. **Issue #4: InMemorySaver Checkpointer**
+   - **Status:** Open
+   - **Priority:** **LOW**
+
+---
+
+## Recommendations
+
+### Immediate Priorities (High Impact, Low Effort)
+
+1. **✅ DONE: `is_parallel_tool_calls`** - PR #59 merged
+2. **Fix Issue #33: `response_format` Support**
+   - Expose as first-class parameter
+   - Add validation for GenericChatRequest and CohereChatRequest
+   - Document usage examples
+3. **Fix Issue #52: Robust JSON Parsing**
+   - Add JSON unescape logic for tool arguments
+   - Handle both `'{"key": "value"}'` and `'"{\\"key\\": \\"value\\"}"'`
+4. **Add First-Class Parameters:**
+   - `seed` (reproducibility)
+   - `frequency_penalty` (quality)
+   - `presence_penalty` (quality)
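
As a rough illustration of what "first-class" could mean for `seed`, `frequency_penalty`, and `presence_penalty`, the sketch below type-checks the three values before they are merged into the request. `SamplingParams` and `to_chat_params` are hypothetical names, and the sketch deliberately checks types only, since accepted value ranges are not stated in this analysis.

```python
from dataclasses import dataclass
from typing import Any, Dict, Optional


def _is_number(value: Any) -> bool:
    # bool is a subclass of int, so exclude it explicitly.
    return isinstance(value, (int, float)) and not isinstance(value, bool)


@dataclass
class SamplingParams:
    """Hypothetical container for the three high-priority parameters."""

    seed: Optional[int] = None
    frequency_penalty: Optional[float] = None
    presence_penalty: Optional[float] = None

    def __post_init__(self) -> None:
        if self.seed is not None and (
            isinstance(self.seed, bool) or not isinstance(self.seed, int)
        ):
            raise TypeError("seed must be an int")
        for name in ("frequency_penalty", "presence_penalty"):
            value = getattr(self, name)
            if value is not None and not _is_number(value):
                raise TypeError(f"{name} must be a number")

    def to_chat_params(self) -> Dict[str, Any]:
        """Return only the parameters that were explicitly set."""
        return {key: value for key, value in vars(self).items() if value is not None}


params = SamplingParams(seed=7, frequency_penalty=0.5)
print(params.to_chat_params())
```

Declaring the fields explicitly also gives IDE autocomplete and a place to hang documentation, two of the gaps listed under Current Architecture.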
+
+### Medium-Term Improvements
+
+5. **Fix Issue #28: Llama Tool Integration**
+   - Debug message history handling
+   - Ensure tool results are properly incorporated
+6. **Documentation Enhancement**
+   - Document all `model_kwargs` parameters
+   - Add examples for each provider
+   - Create migration guide from undocumented to first-class params
+
+### Long-Term Enhancements
+
+7. **Provider-Specific Features**
+   - Cohere: `documents`, `preamble_override`, `safety_mode`
+   - Generic: `reasoning_effort`, `web_search_options`
+8. **Advanced Features**
+   - `logit_bias` support
+   - `stream_options` configuration
+   - `prediction` support
+
+---
+
+## Implementation Status Summary
+
+| Category | Total Params | Fully Implemented | Partially Implemented | Not Implemented | Broken |
+|----------|--------------|-------------------|----------------------|----------------|--------|
+| **GenericChatRequest** | 26 | 7 (27%) | 18 (69%) | 0 (0%) | 1 (4%) |
+| **CohereChatRequest** | 26 | 7 (27%) | 19 (73%) | 0 (0%) | 0 (0%) |
+| **Total** | 52 | 14 (27%) | 37 (71%) | 0 (0%) | 1 (2%) |
+
+### Key Insight
+
+**About 71% of parameters (37 of 52) work via `model_kwargs` but lack documentation.**
+
+Most parameters are technically usable, but users don't know about them because the parameters are:
+- Not in function signatures
+- Not in documentation
+- Not type-validated
+- Not covered by examples
+
+---
+
+## Next Steps
+
+1. ✅ **Completed:** Add `is_parallel_tool_calls` support (PR #59)
+2. **Fix:** `response_format` parameter (Issue #33)
+3. **Fix:** Tool argument parsing bug (Issue #52)
+4. **Add:** First-class support for `seed`, `frequency_penalty`, `presence_penalty`
+5. **Document:** All working `model_kwargs` parameters
+6. **Fix:** Llama tool response integration (Issue #28)
+
+---
+
+*This gap analysis will be updated as features are implemented and new issues are discovered.*