feat: Implement keyword search tool and pattern learning system #11
base: main

Conversation
- Added `KeywordSearchTool` for semantic search over Robot Framework keywords.
- Introduced `QueryPatternMatcher` for learning and predicting keyword usage patterns.
- Developed `SmartKeywordProvider` to orchestrate keyword retrieval with a hybrid architecture.
- Configured centralized logging for optimization components in `logging_config.py`.
- Enhanced `RobotTasks` to provide minimal keyword guidelines for the planning phase.
- Updated `requirements.txt` to include dependencies for ChromaDB and sentence-transformers.
- Modified the workflow service to learn from successful test executions and store patterns.
- Updated the frontend to store and pass the original user query for pattern learning during execution.
Walkthrough

Introduces a comprehensive optimization system for CrewAI that reduces token usage via a hybrid knowledge architecture. Implements core rules, semantic keyword search (ChromaDB), pattern learning from executed code, context pruning, and workflow metrics tracking across backend configuration, agent initialization, optimization components, and frontend integration.

Changes
Sequence Diagram(s)

```mermaid
sequenceDiagram
    actor User
    participant Frontend as Frontend<br/>script.js
    participant API as API<br/>endpoints.py
    participant Crew as Crew<br/>crew.py
    participant Optimization as Optimization<br/>Components
    participant Agents as Agent Planner<br/>Identifier, etc.
    participant Workflow as Workflow<br/>Service

    User->>Frontend: Submit query
    Frontend->>Frontend: Store currentUserQuery
    Frontend->>API: POST /generate + /execute-test

    Note over Crew,Optimization: Optimization Phase
    API->>Crew: run_crew (with settings)
    Crew->>Optimization: Initialize if OPTIMIZATION_ENABLED
    Optimization->>Optimization: Build core_rules for all agents
    Optimization->>Optimization: Load pattern predictions (Tier 2)
    Optimization->>Optimization: Fallback to zero-context+tool (Tier 2)
    Optimization->>Optimization: Fallback to full context (Tier 3)
    Crew->>Agents: Pass optimized_context + keyword_search_tool

    Note over Agents: Agent Execution
    Agents->>Agents: Generate code
    Workflow->>Workflow: Execute test

    Note over Workflow: Pattern Learning Phase
    Workflow->>Workflow: Check if test passed + user_query exists
    Workflow->>Optimization: Initialize optimization components
    Optimization->>Optimization: Extract keywords from code
    Optimization->>Optimization: Learn pattern: query→keywords
    Optimization->>Optimization: Update stats in SQLite
    Frontend->>User: Display result
```
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~60 minutes
Pre-merge checks and finishing touches: ✅ Passed checks (3 passed)
Actionable comments posted: 6
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (2)
src/backend/crew_ai/agents.py (2)
80-112: Planner context is never used

SmartKeywordProvider now supplies a trimmed planner_context, but step_planner_agent still builds guidance solely from library_context. As a result the optimization work never reaches the agent and we keep sending the heavy fallback context. Please honor the optimized planner_context first and only fall back to the legacy guidance when it is missing.
```diff
- library_name = self.library_context.library_name if self.library_context else 'Robot Framework'
-
- # Step Planner needs MINIMAL context - just library name and core principles
- # It doesn't need keyword details - that's for the Code Assembler
- library_guidance = ""
- if self.library_context:
-     library_guidance = f"""
+ library_name = self.library_context.library_name if self.library_context else 'Robot Framework'
+
+ # Prefer optimized planner context when provided, otherwise fall back to minimal library guidance
+ planner_guidance = ""
+ if self.planner_context:
+     planner_guidance = f"\n\n{self.planner_context.strip()}"
+ elif self.library_context:
+     planner_guidance = f"""
@@
-     "8. Create HIGH-LEVEL steps - the Code Assembler will handle keyword details."
-     f"{library_guidance}"
+     "8. Create HIGH-LEVEL steps - the Code Assembler will handle keyword details."
+     f"{planner_guidance}"
```
118-134: Identifier context is ignored

The optimized identifier_context passed into RobotAgents is never used, so the element identifier agent still operates on the legacy backstory. That defeats the optimization pipeline's attempt to shrink context for this role. Please insert the identifier_context when provided before falling back to the static text.
```diff
- def element_identifier_agent(self) -> Agent:
-     return Agent(
+ def element_identifier_agent(self) -> Agent:
+     identifier_guidance = ""
+     if self.identifier_context:
+         identifier_guidance = f"\n\n{self.identifier_context.strip()}"
+
+     return Agent(
@@
-     "Benefits: Browser opens once (3-5x faster), full context awareness, intelligent popup handling, validated locators."
- ),
+     "Benefits: Browser opens once (3-5x faster), full context awareness, intelligent popup handling, validated locators."
+     f"{identifier_guidance}"
+ ),
```
🧹 Nitpick comments (12)
.gitignore (1)
40-40: Use trailing slash for directory pattern consistency.

Line 40 ignores `logs/temp_metrics` without a trailing slash. For consistency with standard gitignore directory patterns (e.g., `chroma_db/` on line 38), use a trailing slash to explicitly denote this as a directory:

```diff
-logs/temp_metrics
+logs/temp_metrics/
```

This prevents accidental matching of files named `temp_metrics` at different levels.

src/backend/requirements.txt (1)
26-34: Validate new ML dependency versions against your Python/runtime targets

The new `chromadb==0.4.22`, `sentence-transformers==2.2.2`, and `numpy==1.24.3` pins look reasonable, but they are heavy and somewhat opinionated:

- `sentence-transformers` will pull in substantial ML dependencies (e.g., torch), which will noticeably increase image size and cold-start time.
- `numpy==1.24.3` may not be compatible with newer Python runtimes (e.g., Python 3.12 prefers a newer NumPy).

I'd suggest double-checking that:
- These versions are supported on the Python version you ship in Docker/production.
- The footprint/performance impact is acceptable (or consider putting these behind an extra or separate image if not always needed; a sketch of the lazy-import variant follows below).
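For illustration, one minimal sketch of the "keep it optional" idea: guard the heavy imports behind the feature flag so a deployment with `OPTIMIZATION_ENABLED=false` never pays for them. The helper name `load_optimization_deps` is hypothetical, not something in this PR:

```python
def load_optimization_deps():
    # Hypothetical lazy-import guard: only pull in the heavy ML stack
    # (torch via sentence-transformers, chromadb) when optimization is on.
    try:
        import chromadb
        from sentence_transformers import SentenceTransformer
    except ImportError as exc:
        raise RuntimeError(
            "Optimization extras are not installed; either install "
            "chromadb/sentence-transformers or set OPTIMIZATION_ENABLED=false."
        ) from exc
    return chromadb, SentenceTransformer
```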
src/backend/.env.example (1)
69-107: Consider aligning the example OPTIMIZATION_ENABLED value with the documented default

The comment says the default is `false` (disabled until fully tested), but `.env.example` ships with:

```
OPTIMIZATION_ENABLED=true
```

Given many users will `cp .env.example .env`, this effectively enables the new optimization system by default, which may surprise them if they haven't read the docs yet. I'd consider either:

- Changing the example to `OPTIMIZATION_ENABLED=false` and letting the docs show how to turn it on, or
- Adjusting the comment to clarify that the example enables optimization even though the code default is `false`.

src/backend/core/config.py (1)
39-47: Optimization config looks solid; consider adding TOP_K validation and tightening error handling

The new optimization settings and confidence-threshold validator are well-structured and match the env/example usage. Two minor improvement ideas:

**Enforce the documented range for `OPTIMIZATION_KEYWORD_SEARCH_TOP_K`**

Docs and `.env.example` describe a valid range of 1–10, but the config doesn't enforce it. Adding a small validator would prevent misconfiguration, e.g.:

```python
@validator("OPTIMIZATION_KEYWORD_SEARCH_TOP_K")
def validate_keyword_search_top_k(cls, v: int) -> int:
    if not 1 <= v <= 10:
        raise ValueError(f"OPTIMIZATION_KEYWORD_SEARCH_TOP_K must be between 1 and 10, got {v}")
    return v
```

**Error message length (Ruff TRY003)**

Ruff flags the relatively long error message in `validate_confidence_threshold`. This is purely stylistic; if you want to appease it, you could shorten the message or factor it into a constant, but functionally it's fine as-is.

Also note: `OPTIMIZATION_CONTEXT_PRUNING_THRESHOLD` here defaults to `0.6`, while `docs/OPTIMIZATION.md` currently states a default of `0.8`; worth reconciling so operators don't get conflicting information.

Also applies to: 77-82
src/backend/crew_ai/optimization/logging_config.py (1)
23-47: Simplify logger naming to guarantee hierarchy and narrow the exception catch around the file handler

Nice centralized logging surface; a couple of small robustness points:

**`get_optimization_logger` hierarchy can be brittle**

Current behavior depends on whether `name` starts with or contains `"optimization"`. If callers follow the doc and pass `__name__`, modules with names like `src.backend.crew_ai.optimization.keyword_search_tool` will get loggers outside the `crew_ai.optimization` tree, so they won't automatically inherit the handlers configured on `OPTIMIZATION_LOGGER_NAME`. A simpler, more predictable pattern is:

```python
def get_optimization_logger(name: str) -> logging.Logger:
    if name.startswith(OPTIMIZATION_LOGGER_NAME):
        logger_name = name
    else:
        logger_name = f"{OPTIMIZATION_LOGGER_NAME}.{name}"
    return logging.getLogger(logger_name)
```

and then pass a short component name (e.g. `"keyword_search"`) or `__name__` if you really want the fully qualified name under that prefix.

**Catching bare `Exception` when configuring file logging (Ruff BLE001)**

Functionally it's acceptable to treat any failure as "log a warning and continue", but to keep linters quiet and make intent clearer, you may want to narrow this to `OSError`/`IOError`/`PermissionError`, which covers the usual file-handler failures (a sketch follows below).

Also applies to: 90-99
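For illustration, a hypothetical shape for that narrower catch; the handler wiring below is a sketch, not the module's actual code:

```python
import logging

def attach_file_handler(logger: logging.Logger, log_path: str) -> None:
    # Illustrative only: treat file-system failures as non-fatal,
    # but let genuine programmer errors propagate.
    try:
        handler = logging.FileHandler(log_path)
        handler.setFormatter(logging.Formatter("%(asctime)s %(name)s %(levelname)s: %(message)s"))
        logger.addHandler(handler)
    except OSError as exc:  # PermissionError and most I/O failures are OSError subclasses
        logger.warning("File logging disabled, continuing with console only: %s", exc)
```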
src/backend/core/workflow_metrics.py (1)
52-91: Optimization metrics wiring looks good; minor serialization and docstring nits

Overall, the new optimization metrics surface is well thought out: defaults via `__post_init__`, dedicated `track_*` helpers, and backward-compatible parsing in `from_dict` all look solid. A couple of small things you might want to tweak:

**Duplicate exposure of optimization fields in `to_dict`**

`to_dict()` uses `asdict(self)` (which already includes `token_usage`, `keyword_search_stats`, etc. at the top level) and then adds an `optimization` sub-dict that nests the same values. The JSON shape shown in `docs/OPTIMIZATION.md` only uses the nested `optimization` section. If you want the external JSON to match the docs and avoid redundancy, you could `pop` the top-level keys before adding `optimization`, for example:

```python
def to_dict(self) -> Dict[str, Any]:
    data = asdict(self)
    data["timestamp"] = self.timestamp.isoformat()

    token_usage = data.pop("token_usage", None)
    keyword_search = data.pop("keyword_search_stats", None)
    pattern_learning = data.pop("pattern_learning_stats", None)
    context_reduction = data.pop("context_reduction", None)

    data["optimization"] = {
        "token_usage": token_usage,
        "keyword_search": keyword_search,
        "pattern_learning": pattern_learning,
        "context_reduction": context_reduction,
    }
    return data
```

`from_dict` is already set up to consume an `optimization` section, so this would align the serialized shape with what you document.

**`count_tokens` docstring example doesn't match implementation**

Given `words = text.split()` and `estimated_tokens = int(len(words) * 1.33)`, the example:

```python
>>> count_tokens("Hello world, this is a test")
8
```

actually returns `7` with the current heuristic (6 words * 1.33 → 7 after `int`). Either adjust the example value or tweak the multiplier if you want the example to be exact.

Also applies to: 92-147, 150-161, 221-235, 386-414
src/frontend/script.js (1)
26-28: User query tracking across generate/execute looks consistent

Storing `currentUserQuery` on generation, resetting it on `clearAll`, and forwarding it in the `/execute-test` payload is coherent and matches the backend contract for the optional `user_query`. This keeps the original query bound to the generated code and avoids leakage across sessions.

One behavioral nuance to be aware of: if a user generates once, then manually replaces the code without pressing "New Test", executions will still send the original `currentUserQuery`. If you'd rather only learn from queries that directly produced the executed code, consider clearing `currentUserQuery` when users paste or heavily edit code after generation.

Also applies to: 387-389, 536-538, 648-650
src/backend/crew_ai/library_context/dynamic_context.py (1)
179-213: Minimal planning context implementation is sound; consider tightening exception scope
`get_minimal_planning_context` correctly reuses `get_library_documentation`, includes the version in the banner when available, and degrades gracefully when documentation loading fails.

The broad `except Exception` is acceptable here for resiliency, but if you want to satisfy BLE001 and avoid hiding programmer errors, you could narrow it to the expected failure modes (e.g., `ImportError`, `OSError`, `json.JSONDecodeError`) while letting unexpected exceptions surface.

docs/OPTIMIZATION_DEVELOPER_GUIDE.md (1)
1-2233: Address markdownlint issues for better tooling compatibility

The guide is thorough and well-structured. markdownlint is flagging a few mechanical issues:

- Some fenced code blocks lack a language spec (e.g., around lines 26, 54, 861, 1933, 1946, etc.). Consider adding `bash`, `python`, `json`, etc. to those fences.
- Several lines use emphasis (`**...**`) in places where a heading level (e.g., `### ...`) would be more appropriate (MD036).

These don't affect rendering much, but fixing them will reduce noise from docs linters and improve IDE support.
src/backend/services/workflow_service.py (3)
81-83: Unused `validation_output` and `optimization_metrics` from `run_crew`

`run_crew` now returns three values, but `validation_output` and `optimization_metrics` are never used in `run_agentic_workflow`. This is harmless but flagged by Ruff and slightly misleading. If you don't plan to use them here, consider marking them as intentionally unused:

```diff
- validation_output, crew_with_results, optimization_metrics = run_crew(
+ _result, crew_with_results, _optimization_metrics = run_crew(
      natural_language_query, model_provider, model_name, library_type=None, workflow_id=workflow_id)
```

If you do intend to surface optimization metrics later, wiring them into the unified `WorkflowMetrics` would be a good follow-up.
449-457: Type hint for `user_query` is slightly non-idiomatic

`stream_execute_only` uses `user_query: str = None`, which is valid at runtime but violates PEP 484 style and triggers RUF013. Consider switching to an explicit optional type for clarity:

```diff
-async def stream_execute_only(robot_code: str, user_query: str = None) -> Generator[str, None, None]:
+async def stream_execute_only(robot_code: str, user_query: str | None = None) -> Generator[str, None, None]:
```

(Similarly, you can use `Optional[str]` if you prefer the older syntax.)
489-520: Broad `except Exception` around learning is acceptable but could be narrowed

Both learning blocks wrap all errors in a generic `except Exception` and log a warning. This is reasonable for a non-critical sidecar that must not break execution, but it also swallows programming errors (e.g., misconfigurations) the same way as transient environment failures. If you want stricter behavior, consider:

- Narrowing to expected runtime failures (e.g., `ImportError`, `OSError`, `chromadb.errors.*`), or
- Re-raising on clearly programmer-error types while continuing to log and swallow transient ones.
Not urgent, but worth considering once the main wiring is stable.
Also applies to: 593-621
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (24)
- .gitignore (1 hunks)
- docs/OPTIMIZATION.md (1 hunks)
- docs/OPTIMIZATION_DEVELOPER_GUIDE.md (1 hunks)
- src/backend/.env.example (1 hunks)
- src/backend/api/endpoints.py (2 hunks)
- src/backend/core/config.py (2 hunks)
- src/backend/core/workflow_metrics.py (3 hunks)
- src/backend/crew_ai/agents.py (6 hunks)
- src/backend/crew_ai/crew.py (5 hunks)
- src/backend/crew_ai/library_context/base.py (1 hunks)
- src/backend/crew_ai/library_context/browser_context.py (3 hunks)
- src/backend/crew_ai/library_context/dynamic_context.py (1 hunks)
- src/backend/crew_ai/library_context/selenium_context.py (4 hunks)
- src/backend/crew_ai/optimization/__init__.py (1 hunks)
- src/backend/crew_ai/optimization/chroma_store.py (1 hunks)
- src/backend/crew_ai/optimization/context_pruner.py (1 hunks)
- src/backend/crew_ai/optimization/keyword_search_tool.py (1 hunks)
- src/backend/crew_ai/optimization/logging_config.py (1 hunks)
- src/backend/crew_ai/optimization/pattern_learning.py (1 hunks)
- src/backend/crew_ai/optimization/smart_keyword_provider.py (1 hunks)
- src/backend/crew_ai/tasks.py (1 hunks)
- src/backend/requirements.txt (1 hunks)
- src/backend/services/workflow_service.py (4 hunks)
- src/frontend/script.js (4 hunks)
🧰 Additional context used
🧬 Code graph analysis (15)
src/backend/crew_ai/optimization/logging_config.py (1)
- src/backend/crew_ai/llm_output_cleaner.py (2): LLMFormattingMonitor (366-413), LLMOutputCleaner (31-363)

src/backend/services/workflow_service.py (5)
- src/backend/crew_ai/crew.py (1): run_crew (53-277)
- src/backend/crew_ai/optimization/smart_keyword_provider.py (2): SmartKeywordProvider (20-335), learn_from_execution (323-335)
- src/backend/crew_ai/optimization/pattern_learning.py (2): QueryPatternMatcher (22-308), learn_from_execution (145-195)
- src/backend/crew_ai/optimization/chroma_store.py (1): KeywordVectorStore (18-378)
- src/backend/crew_ai/library_context/__init__.py (1): get_library_context (21-43)

src/backend/crew_ai/library_context/base.py (2)
- src/backend/crew_ai/library_context/browser_context.py (1): core_rules (52-96)
- src/backend/crew_ai/library_context/selenium_context.py (1): core_rules (189-234)

src/backend/crew_ai/optimization/pattern_learning.py (2)
- src/backend/crew_ai/optimization/chroma_store.py (1): get_or_create_pattern_collection (84-104)
- src/backend/crew_ai/optimization/smart_keyword_provider.py (1): learn_from_execution (323-335)

src/backend/crew_ai/library_context/browser_context.py (3)
- src/backend/crew_ai/library_context/base.py (4): core_rules (123-140), planning_context (33-42), code_assembly_context (46-56), validation_context (60-69)
- src/backend/crew_ai/library_context/selenium_context.py (4): core_rules (189-234), planning_context (35-43), code_assembly_context (46-154), validation_context (237-246)
- src/backend/crew_ai/library_context/dynamic_context.py (1): get_minimal_planning_context (179-212)

src/backend/crew_ai/optimization/chroma_store.py (4)
- src/backend/core/config.py (1): Settings (11-87)
- src/backend/crew_ai/library_context/browser_context.py (1): library_name (27-28)
- src/backend/crew_ai/library_context/selenium_context.py (1): library_name (27-28)
- src/backend/crew_ai/library_context/dynamic_context.py (2): DynamicLibraryDocumentation (23-233), get_library_documentation (41-89)

src/backend/crew_ai/optimization/__init__.py (6)
- src/backend/crew_ai/optimization/chroma_store.py (1): KeywordVectorStore (18-378)
- src/backend/crew_ai/optimization/keyword_search_tool.py (1): KeywordSearchTool (18-169)
- src/backend/crew_ai/optimization/pattern_learning.py (1): QueryPatternMatcher (22-308)
- src/backend/crew_ai/optimization/smart_keyword_provider.py (1): SmartKeywordProvider (20-335)
- src/backend/crew_ai/optimization/context_pruner.py (1): ContextPruner (17-204)
- src/backend/crew_ai/optimization/logging_config.py (6): get_optimization_logger (23-47), configure_optimization_logging (50-104), LogMessages (108-146), log_fallback (150-166), log_critical_failure (169-184), log_performance_metric (187-207)

src/backend/api/endpoints.py (1)
- src/backend/services/workflow_service.py (1): stream_execute_only (449-524)

src/backend/crew_ai/agents.py (3)
- src/backend/crew_ai/library_context/base.py (3): library_name (21-23), code_assembly_context (46-56), validation_context (60-69)
- src/backend/crew_ai/library_context/browser_context.py (3): library_name (27-28), code_assembly_context (110-205), validation_context (208-216)
- src/backend/crew_ai/library_context/selenium_context.py (3): library_name (27-28), code_assembly_context (46-154), validation_context (237-246)

src/backend/crew_ai/tasks.py (3)
- src/backend/crew_ai/library_context/base.py (1): planning_context (33-42)
- src/backend/crew_ai/library_context/browser_context.py (1): planning_context (99-107)
- src/backend/crew_ai/library_context/selenium_context.py (1): planning_context (35-43)

src/backend/crew_ai/library_context/selenium_context.py (3)
- src/backend/crew_ai/library_context/dynamic_context.py (1): get_minimal_planning_context (179-212)
- src/backend/crew_ai/library_context/base.py (3): code_assembly_context (46-56), core_rules (123-140), validation_context (60-69)
- src/backend/crew_ai/library_context/browser_context.py (3): code_assembly_context (110-205), core_rules (52-96), validation_context (208-216)

src/backend/crew_ai/optimization/keyword_search_tool.py (2)
- src/backend/crew_ai/optimization/chroma_store.py (1): search (197-243)
- src/backend/core/workflow_metrics.py (1): track_keyword_search (106-121)

src/backend/crew_ai/library_context/dynamic_context.py (3)
- src/backend/crew_ai/library_context/base.py (1): library_name (21-23)
- src/backend/crew_ai/library_context/browser_context.py (1): library_name (27-28)
- src/backend/crew_ai/library_context/selenium_context.py (1): library_name (27-28)

src/backend/crew_ai/optimization/smart_keyword_provider.py (8)
- src/backend/crew_ai/optimization/pattern_learning.py (2): get_relevant_keywords (197-257), learn_from_execution (145-195)
- src/backend/crew_ai/optimization/chroma_store.py (2): KeywordVectorStore (18-378), search (197-243)
- src/backend/crew_ai/optimization/keyword_search_tool.py (1): KeywordSearchTool (18-169)
- src/backend/crew_ai/optimization/context_pruner.py (4): ContextPruner (17-204), classify_query (94-142), prune_keywords (144-175), get_pruning_stats (177-204)
- src/backend/crew_ai/library_context/base.py (5): library_name (21-23), core_rules (123-140), planning_context (33-42), code_assembly_context (46-56), validation_context (60-69)
- src/backend/crew_ai/library_context/browser_context.py (5): library_name (27-28), core_rules (52-96), planning_context (99-107), code_assembly_context (110-205), validation_context (208-216)
- src/backend/crew_ai/library_context/selenium_context.py (5): library_name (27-28), core_rules (189-234), planning_context (35-43), code_assembly_context (46-154), validation_context (237-246)
- src/backend/core/workflow_metrics.py (1): track_pattern_learning (123-134)

src/backend/crew_ai/crew.py (5)
- src/backend/core/workflow_metrics.py (3): WorkflowMetrics (18-236), count_tokens (386-414), track_context_reduction (136-148)
- src/backend/crew_ai/optimization/chroma_store.py (2): KeywordVectorStore (18-378), ensure_collection_ready (361-378)
- src/backend/crew_ai/optimization/pattern_learning.py (1): QueryPatternMatcher (22-308)
- src/backend/crew_ai/optimization/smart_keyword_provider.py (2): get_agent_context (210-280), get_keyword_search_tool (310-321)
- src/backend/crew_ai/agents.py (1): RobotAgents (52-237)
🪛 dotenv-linter (4.0.0)
src/backend/.env.example
[warning] 68-68: [ExtraBlankLine] Extra blank line detected
(ExtraBlankLine)
🪛 markdownlint-cli2 (0.18.1)
docs/OPTIMIZATION_DEVELOPER_GUIDE.md
26-26: Fenced code blocks should have a language specified
(MD040, fenced-code-language)
54-54: Fenced code blocks should have a language specified
(MD040, fenced-code-language)
534-534: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
557-557: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
572-572: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
588-588: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
617-617: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
633-633: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
660-660: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
679-679: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
721-721: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
748-748: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
761-761: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
782-782: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
839-839: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
861-861: Fenced code blocks should have a language specified
(MD040, fenced-code-language)
870-870: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
950-950: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
1064-1064: Emphasis used instead of a heading
(MD036, no-emphasis-as-heading)
1933-1933: Fenced code blocks should have a language specified
(MD040, fenced-code-language)
1946-1946: Fenced code blocks should have a language specified
(MD040, fenced-code-language)
1974-1974: Fenced code blocks should have a language specified
(MD040, fenced-code-language)
2001-2001: Fenced code blocks should have a language specified
(MD040, fenced-code-language)
2027-2027: Fenced code blocks should have a language specified
(MD040, fenced-code-language)
2056-2056: Fenced code blocks should have a language specified
(MD040, fenced-code-language)
docs/OPTIMIZATION.md
289-289: Fenced code blocks should have a language specified
(MD040, fenced-code-language)
371-371: Fenced code blocks should have a language specified
(MD040, fenced-code-language)
🪛 Ruff (0.14.4)
src/backend/crew_ai/optimization/context_pruner.py
27-52: Mutable class attributes should be annotated with typing.ClassVar
(RUF012)
src/backend/crew_ai/optimization/logging_config.py
98-98: Do not catch blind exception: Exception
(BLE001)
src/backend/services/workflow_service.py
81-81: Unpacked variable validation_output is never used
Prefix it with an underscore or any other dummy variable pattern
(RUF059)
81-81: Unpacked variable optimization_metrics is never used
Prefix it with an underscore or any other dummy variable pattern
(RUF059)
449-449: PEP 484 prohibits implicit Optional
Convert to T | None
(RUF013)
515-515: Do not catch blind exception: Exception
(BLE001)
616-616: Undefined name natural_language_query
(F821)
618-618: Do not catch blind exception: Exception
(BLE001)
src/backend/crew_ai/optimization/pattern_learning.py
253-253: Consider moving this statement to an else block
(TRY300)
285-285: Consider moving this statement to an else block
(TRY300)
304-304: Consider moving this statement to an else block
(TRY300)
src/backend/crew_ai/optimization/chroma_store.py
56-56: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
78-78: Consider moving this statement to an else block
(TRY300)
81-81: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
100-100: Consider moving this statement to an else block
(TRY300)
103-103: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
161-161: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
193-193: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
239-239: Consider moving this statement to an else block
(TRY300)
241-241: Do not catch blind exception: Exception
(BLE001)
242-242: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
264-264: Consider moving this statement to an else block
(TRY300)
266-266: Do not catch blind exception: Exception
(BLE001)
285-285: Do not catch blind exception: Exception
(BLE001)
314-314: Consider moving this statement to an else block
(TRY300)
316-316: Do not catch blind exception: Exception
(BLE001)
334-334: Do not catch blind exception: Exception
(BLE001)
341-341: Local variable collection is assigned to but never used
Remove assignment to unused variable collection
(F841)
358-358: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
377-377: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
src/backend/crew_ai/optimization/__init__.py
26-38: __all__ is not sorted
Apply an isort-style sorting to __all__
(RUF022)
src/backend/core/config.py
81-81: Avoid specifying long messages outside the exception class
(TRY003)
src/backend/api/endpoints.py
57-57: f-string without any placeholders
Remove extraneous f prefix
(F541)
src/backend/crew_ai/optimization/keyword_search_tool.py
128-128: Consider moving this statement to an else block
(TRY300)
130-130: Do not catch blind exception: Exception
(BLE001)
131-131: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
src/backend/crew_ai/library_context/dynamic_context.py
196-196: Do not catch blind exception: Exception
(BLE001)
src/backend/crew_ai/optimization/smart_keyword_provider.py
69-69: Unused method argument: agent_role
(ARG002)
114-114: Unused method argument: agent_role
(ARG002)
165-165: Do not catch blind exception: Exception
(BLE001)
249-249: Do not catch blind exception: Exception
(BLE001)
261-261: Do not catch blind exception: Exception
(BLE001)
275-275: Do not catch blind exception: Exception
(BLE001)
276-276: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
334-334: Do not catch blind exception: Exception
(BLE001)
335-335: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
src/backend/crew_ai/crew.py
99-99: Local variable optimized_context is assigned to but never used
Remove assignment to unused variable optimized_context
(F841)
136-136: Do not catch blind exception: Exception
(BLE001)
187-187: Do not catch blind exception: Exception
(BLE001)
188-188: Use logging.exception instead of logging.error
Replace with exception
(TRY400)
198-198: String contains ambiguous ℹ (INFORMATION SOURCE). Did you mean i (LATIN SMALL LETTER I)?
(RUF001)
260-260: Consider moving this statement to an else block
(TRY300)
🔇 Additional comments (5)
.gitignore (3)
26-32: Gitignore updates align well with PR objectives.

The additions to ignore AI-generated instruction files (including `.github/copilot-instructions.md`) and the expanded comment clarity are appropriate and support the development environment.

37-39: New ignore rules are well-aligned with optimization system components.

The new directory ignores (`chroma_db/`, `data/`) appropriately exclude persistent storage and generated data from the PR's keyword optimization system (ChromaDB semantic search, pattern learning, and metrics), preventing unnecessary repository bloat.

43-43: Verified: The .gitignore path change is correct and safe.

The change renames a gitignored local development directory from `tools/browser_service_local/*` to `tools/browser_service/*`. This is a safe refinement because:

- Both `tools/browser_service_local/` and `tools/browser_service/` are local development directories that are gitignored by design, so they are never checked in; they are generated or populated locally during development.
- The external `browser-service>=1.0.0` pip package in requirements.txt is the actual production dependency; the gitignored directories are only for local artifacts.
- Important checked-in source code (including imports from the `browser_service` module in `tools/browser_use_service.py` and documented in `ARCHITECTURE.md`) remains unaffected by this gitignore change.

No issues found.
src/backend/crew_ai/tasks.py (1)
31-46: Minimal keyword guidelines align with the new planning context strategy

Switching `_get_keyword_guidelines` to use `library_context.planning_context` and falling back to high-level action types is consistent with the new minimal planning model and keeps token usage low while preserving intent.

src/backend/crew_ai/library_context/base.py (1)

121-141: New `core_rules` contract is well-defined and consistent

Adding the abstract `core_rules` property with a focused docstring gives a clear, centralized contract for "must-include" library rules that optimization components can rely on. As long as all `LibraryContext` implementations (e.g., `BrowserLibraryContext`, `SeleniumLibraryContext`, any custom contexts) implement this property, the change is safe.

If you have any out-of-tree `LibraryContext` subclasses, double-check they've been updated to avoid instantiation errors; a minimal sketch of what such a subclass must now provide follows.
```diff
         # Get library-specific context if available (OPTIMIZED - minimal rules)
         library_knowledge = ""
         if self.library_context:
             library_knowledge = f"\n\n{self.library_context.validation_context}"

         return Agent(
             role="Robot Framework Linter and Quality Assurance Engineer",
-            goal=f"Validate the generated Robot Framework code for correctness and adherence to {self.library_context.library_name if self.library_context else 'Robot Framework'} rules, and delegate fixes to Code Assembly Agent if errors are found.",
+            goal=f"Validate Robot Framework code for {self.library_context.library_name if self.library_context else 'Robot Framework'} correctness. Delegate fixes if errors found.",
             backstory=(
-                "You are an expert Robot Framework linter. Your sole task is to validate the provided "
-                "Robot Framework code for syntax errors, correct keyword usage, and adherence to critical rules. "
-                "You must be thorough and provide a clear validation result.\n\n"
-                "**DELEGATION WORKFLOW:**\n"
-                "When you find errors in the code, you MUST follow this workflow:\n"
-                "1. Identify and document all syntax errors, incorrect keyword usage, and rule violations\n"
-                "2. Create a detailed fix request with:\n"
-                "   - Specific line numbers where errors occur\n"
-                "   - Clear description of each error\n"
-                "   - Examples of correct syntax for each issue\n"
-                "   - Relevant Robot Framework rules being violated\n"
-                "3. Delegate the fix request to the Code Assembly Agent with clear, actionable instructions\n"
-                "4. The Code Assembly Agent will regenerate the code incorporating your feedback\n"
-                "5. You will then validate the regenerated code and repeat if necessary\n\n"
-                "**CRITICAL DELEGATION INSTRUCTIONS:**\n"
-                "When you find errors, create a detailed fix request and delegate to Code Assembly Agent.\n"
-                "Your delegation message should include:\n"
-                "- A summary of all errors found\n"
-                "- Specific corrections needed for each error\n"
-                "- Code examples showing the correct implementation\n"
-                "- Priority ranking if multiple errors exist (fix critical syntax errors first)\n\n"
-                "**VALIDATION CRITERIA:**\n"
-                "- Syntax correctness (indentation, spacing, structure)\n"
-                "- Correct keyword usage for the target library\n"
-                "- Proper variable assignments for keywords that return values\n"
-                "- Valid locator formats\n"
-                "- Correct test case structure\n\n"
-                "If the code is valid, clearly state 'VALID' and provide a brief summary. "
-                "If errors are found, immediately delegate to Code Assembly Agent with detailed fix instructions."
+                "Expert Robot Framework validator. Check: syntax, keyword usage, variable assignments, locator formats, test structure. "
+                "If VALID: Return JSON {\"valid\": true, \"reason\": \"...\"}. "
+                "If INVALID: Document errors with line numbers, then delegate to Code Assembly Agent with fix instructions."
                 f"{library_knowledge}"
             ),
             llm=self.llm,
```
Validator context should override defaults
`validator_context` is calculated upstream but `code_validator_agent` still ignores it and always expands `library_context.validation_context`. That keeps the large baseline payload and prevents any token savings for the validator. Please prefer `validator_context` when present and fall back to the legacy context otherwise.
```diff
-        library_knowledge = ""
-        if self.library_context:
-            library_knowledge = f"\n\n{self.library_context.validation_context}"
+        library_knowledge = ""
+        if self.validator_context:
+            library_knowledge = f"\n\n{self.validator_context.strip()}"
+        elif self.library_context:
+            library_knowledge = f"\n\n{self.library_context.validation_context}"
```

🤖 Prompt for AI Agents
In src/backend/crew_ai/agents.py around lines 219 to 233, the agent builds
library_knowledge from library_context.validation_context and never uses
validator_context; change the construction to prefer validator_context when
present and fall back to library_context.validation_context (or empty string) so
the validator can use the smaller, precomputed validator_context. Update the
library_knowledge assignment to check self.validator_context first, then
self.library_context.validation_context, and ensure the backstory string
concatenates the chosen context.
```python
            embedding = self.model.encode([description])[0]
            self.category_embeddings[category] = embedding
            logger.debug(f"Category '{category}' embedding shape: {embedding.shape}")

        logger.info(f"Pre-computed embeddings for {len(self.category_embeddings)} categories")

    def classify_query(self, user_query: str, confidence_threshold: float = 0.8) -> List[str]:
        """
        Classify query into action categories using semantic similarity.

        Computes similarity between the query and each category description.
        Returns categories that meet the confidence threshold, or all categories
        if no category meets the threshold (graceful degradation).

        Args:
            user_query: User's natural language query
            confidence_threshold: Minimum similarity for category inclusion (0.0-1.0)

        Returns:
            List of relevant category names (e.g., ["input", "interaction"])
            Returns all categories if confidence too low (fallback)
        """
        logger.debug(f"Classifying query: {user_query[:50]}...")

        # Encode query
        query_embedding = self.model.encode([user_query])[0]

        # Compute similarity with each category
        similarities = {}
        for category, category_embedding in self.category_embeddings.items():
            # Cosine similarity using dot product (embeddings are normalized)
            similarity = np.dot(query_embedding, category_embedding)
```
🛠️ Refactor suggestion | 🟠 Major
Normalize embeddings before applying cosine threshold
`SentenceTransformer.encode` does not normalize by default, so the dot product here is an unbounded inner product. With the fixed threshold of 0.8 we'll routinely fail to meet the cutoff (or exceed 1.0), causing spurious fallbacks to "all categories" and defeating pruning. Please normalize both the category and query embeddings (pass `normalize_embeddings=True`) before taking the dot product.
```diff
- embedding = self.model.encode([description])[0]
+ embedding = self.model.encode(
+     [description],
+     normalize_embeddings=True,
+ )[0]
...
- query_embedding = self.model.encode([user_query])[0]
+ query_embedding = self.model.encode(
+     [user_query],
+     normalize_embeddings=True,
+ )[0]
```
+ )[0]📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
```python
            embedding = self.model.encode(
                [description],
                normalize_embeddings=True,
            )[0]
            self.category_embeddings[category] = embedding
            logger.debug(f"Category '{category}' embedding shape: {embedding.shape}")

        logger.info(f"Pre-computed embeddings for {len(self.category_embeddings)} categories")

    def classify_query(self, user_query: str, confidence_threshold: float = 0.8) -> List[str]:
        """
        Classify query into action categories using semantic similarity.

        Computes similarity between the query and each category description.
        Returns categories that meet the confidence threshold, or all categories
        if no category meets the threshold (graceful degradation).

        Args:
            user_query: User's natural language query
            confidence_threshold: Minimum similarity for category inclusion (0.0-1.0)

        Returns:
            List of relevant category names (e.g., ["input", "interaction"])
            Returns all categories if confidence too low (fallback)
        """
        logger.debug(f"Classifying query: {user_query[:50]}...")

        # Encode query
        query_embedding = self.model.encode(
            [user_query],
            normalize_embeddings=True,
        )[0]

        # Compute similarity with each category
        similarities = {}
        for category, category_embedding in self.category_embeddings.items():
            # Cosine similarity using dot product (embeddings are normalized)
            similarity = np.dot(query_embedding, category_embedding)
```
🤖 Prompt for AI Agents
In src/backend/crew_ai/optimization/context_pruner.py around lines 88 to 119,
the code computes cosine similarity by dot product but the
SentenceTransformer.encode calls do not normalize embeddings, so the dot product
is an unbounded inner product and the fixed 0.8 threshold is invalid; fix this
by ensuring both category and query embeddings are normalized before similarity:
either pass normalize_embeddings=True to the model.encode calls when computing
and storing category embeddings and when encoding the query, or manually
L2-normalize each embedding (divide by its norm) immediately after encoding;
then continue using np.dot for similarity.
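As a quick numeric sanity check of that point, a standalone sketch (the vectors are invented, not real model outputs):

```python
import numpy as np

# Two made-up, unnormalized embedding vectors standing in for model outputs.
a = np.array([3.0, 4.0])   # norm 5.0
b = np.array([6.0, 8.0])   # norm 10.0, same direction as a

print(np.dot(a, b))        # 50.0: unbounded inner product, meaningless against a 0.8 cutoff

# L2-normalize first, then the dot product is a true cosine similarity in [-1, 1].
a_hat = a / np.linalg.norm(a)
b_hat = b / np.linalg.norm(b)
print(np.dot(a_hat, b_hat))  # 1.0: identical direction
```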
This shall use normalised Euclidean distance, and by default this shall be done using ChromaDB's implemented methods
@Devasy23, thank you for the clarification! I want to make sure I understand your architectural intent correctly.
Currently, the ContextPruner class performs local classification by:

- Pre-computing category embeddings using SentenceTransformer
- Computing similarity using `np.dot` (which CodeRabbit correctly flagged as potentially unnormalized)

You're suggesting to use ChromaDB's distance calculation methods instead. I see a few possible approaches:

Option 1: Store category descriptions in ChromaDB
Create a separate ChromaDB collection for category descriptions and use its built-in normalized distance calculations:

- Store the 6 category descriptions with their metadata
- Query against this collection during `classify_query`
- Leverage ChromaDB's normalized cosine distance (or L2 if you prefer Euclidean)

Option 2: Use ChromaDB's embedding function directly
Use the same `SentenceTransformerEmbeddingFunction` from ChromaDB that's used in `chroma_store.py`, which handles normalization consistently.

Option 3: Use normalized Euclidean (L2) distance
If you specifically want Euclidean distance rather than cosine similarity, we should normalize embeddings and compute the L2 distance: `np.linalg.norm(query_embedding - category_embedding)`.

Questions:

1. Should we refactor `ContextPruner` to store category descriptions in a ChromaDB collection and use its query methods?
2. Do you prefer normalized Euclidean distance over cosine similarity for category classification? (Note: cosine is typically better for semantic similarity)
3. Should we use the same embedding function setup as `KeywordVectorStore` for consistency?
Could you clarify which approach aligns with your vision? I'm happy to provide implementation guidance once I understand the direction.
1. Yes, 2. Normalized cosine might be fine as the 0 to 1 range is crucial here, 3. Yes
🧩 Analysis chain
🏁 Script executed:
```bash
#!/bin/bash
# Check the KeywordVectorStore implementation to understand the ChromaDB pattern
cat src/backend/crew_ai/optimization/chroma_store.py | head -100
```
Length of output: 3238
@Devasy23, perfect! Let me provide the refactored implementation that aligns ContextPruner with the KeywordVectorStore architecture.
Refactored Implementation
Here's the updated ContextPruner class that uses ChromaDB with normalized cosine similarity:
"""
Context Pruner for Smart Keyword Filtering
This module classifies user queries into action categories and prunes
keyword context to include only relevant keywords, reducing token usage
while maintaining code generation accuracy.
"""
import logging
from typing import List, Dict
import chromadb
from chromadb.config import Settings
from chromadb.utils import embedding_functions
logger = logging.getLogger(__name__)
class ContextPruner:
"""
Classify queries and prune context to relevant keyword categories.
Uses ChromaDB for semantic similarity to classify queries into action
categories (navigation, input, interaction, extraction, assertion, wait)
and filters keywords to only those in relevant categories.
"""
# Keyword category mappings
KEYWORD_CATEGORIES = {
"navigation": [
"New Browser", "New Page", "Go To", "Go Back", "Go Forward",
"Close Browser", "Close Page", "Switch Page", "New Context"
],
"input": [
"Fill Text", "Input Text", "Type Text", "Press Keys",
"Upload File", "Type Secret", "Clear Text"
],
"interaction": [
"Click", "Click Element", "Hover", "Drag And Drop",
"Select Options By", "Check Checkbox", "Uncheck Checkbox"
],
"extraction": [
"Get Text", "Get Attribute", "Get Element Count",
"Get Property", "Get Style", "Get Url", "Get Title"
],
"assertion": [
"Should Be Equal", "Should Contain", "Should Be Visible",
"Should Not Be Visible", "Should Be Enabled", "Should Be Disabled"
],
"wait": [
"Wait For Elements State", "Wait Until Element Is Visible",
"Wait For Condition", "Wait For Load State", "Sleep"
]
}
def __init__(
self,
model_name: str = "all-MiniLM-L6-v2",
persist_directory: str = "./chroma_db"
):
"""
Initialize with ChromaDB for semantic classification.
Args:
model_name: Name of sentence-transformers model to use
persist_directory: Path to ChromaDB storage directory
"""
logger.info(f"Initializing ContextPruner with ChromaDB at {persist_directory}")
try:
# Initialize ChromaDB client (same pattern as KeywordVectorStore)
self.client = chromadb.PersistentClient(
path=persist_directory,
settings=Settings(
anonymized_telemetry=False,
allow_reset=True
)
)
# Initialize embedding function
self.embedding_function = embedding_functions.SentenceTransformerEmbeddingFunction(
model_name=model_name
)
# Create or get category collection
self.collection = self.client.get_or_create_collection(
name="category_descriptions",
embedding_function=self.embedding_function,
metadata={"type": "query_categories"}
)
# Initialize category descriptions in ChromaDB
self._init_category_collection()
logger.info("ContextPruner initialized successfully with ChromaDB")
except Exception as e:
logger.error(f"Failed to initialize ContextPruner: {e}")
raise
def _init_category_collection(self):
"""
Initialize ChromaDB collection with category descriptions.
Stores semantic representations of each category for fast
similarity comparison during query classification.
"""
logger.debug("Initializing category descriptions in ChromaDB")
# Define category descriptions for semantic matching
category_descriptions = {
"navigation": "open browser navigate to website go to page url address",
"input": "type text fill form input data enter information write",
"interaction": "click button press element hover drag drop select",
"extraction": "get text retrieve data extract information read content",
"assertion": "verify check validate assert should be equal confirm",
"wait": "wait for element visible ready loaded appear timeout"
}
# Check if collection is already populated
existing_count = self.collection.count()
if existing_count == len(category_descriptions):
logger.debug(f"Category collection already populated with {existing_count} entries")
return
# Add category descriptions to ChromaDB
try:
ids = list(category_descriptions.keys())
documents = list(category_descriptions.values())
metadatas = [{"category": cat} for cat in ids]
# Upsert to handle re-initialization
self.collection.upsert(
ids=ids,
documents=documents,
metadatas=metadatas
)
logger.info(f"Initialized {len(category_descriptions)} category descriptions in ChromaDB")
except Exception as e:
logger.error(f"Failed to initialize category collection: {e}")
raise
def classify_query(
self,
user_query: str,
confidence_threshold: float = 0.8
) -> List[str]:
"""
Classify query into action categories using ChromaDB semantic search.
Queries the ChromaDB collection to find similar categories based on
normalized cosine similarity. Returns categories that meet the
confidence threshold, or all categories if no category meets the
threshold (graceful degradation).
Args:
user_query: User's natural language query
confidence_threshold: Minimum similarity for category inclusion (0.0-1.0)
Returns:
List of relevant category names (e.g., ["input", "interaction"])
Returns all categories if confidence too low (fallback)
"""
logger.debug(f"Classifying query: {user_query[:50]}...")
try:
# Query ChromaDB for similar categories
# ChromaDB returns normalized cosine distance (0 = identical, 2 = opposite)
# We need to convert to similarity: similarity = 1 - (distance / 2)
results = self.collection.query(
query_texts=[user_query],
n_results=len(self.KEYWORD_CATEGORIES)
)
# Extract categories and convert distances to similarities
similarities = {}
if results['ids'] and len(results['ids'][0]) > 0:
for idx, category_id in enumerate(results['ids'][0]):
distance = results['distances'][0][idx]
# Convert cosine distance to similarity (0-1 range)
# ChromaDB cosine distance range: [0, 2]
# Similarity = 1 - (distance / 2) gives us [0, 1] range
similarity = 1.0 - (distance / 2.0)
similarities[category_id] = similarity
logger.debug(f"Category '{category_id}': distance={distance:.4f}, similarity={similarity:.4f}")
# Filter categories by confidence threshold
relevant_categories = [
cat for cat, sim in similarities.items()
if sim >= confidence_threshold
]
if relevant_categories:
logger.info(
f"Classified query into {len(relevant_categories)} categories: "
f"{relevant_categories} (threshold={confidence_threshold})"
)
return relevant_categories
else:
# Graceful degradation: return all categories if none meet threshold
all_categories = list(self.KEYWORD_CATEGORIES.keys())
logger.warning(
f"No categories met threshold {confidence_threshold}. "
f"Highest similarity: {max(similarities.values()):.4f}. "
f"Falling back to all categories."
)
return all_categories
except Exception as e:
logger.error(f"Classification failed: {e}. Falling back to all categories.")
return list(self.KEYWORD_CATEGORIES.keys())
def prune_keywords(
self,
all_keywords: List[Dict],
categories: List[str]
) -> List[Dict]:
"""
Filter keywords to only those in relevant categories.
Args:
all_keywords: List of keyword dicts with 'name' field
categories: List of relevant category names
Returns:
Filtered list of keyword dicts
"""
logger.debug(f"Pruning keywords for categories: {categories}")
# Build set of relevant keyword names
relevant_names = set()
for category in categories:
if category in self.KEYWORD_CATEGORIES:
relevant_names.update(self.KEYWORD_CATEGORIES[category])
# Filter keywords
pruned = [
kw for kw in all_keywords
if kw.get("name") in relevant_names
]
logger.info(
f"Pruned {len(all_keywords)} keywords to {len(pruned)} "
f"({len(pruned)/len(all_keywords)*100:.1f}% retained)"
)
return pruned
def get_pruning_stats(
self,
original_count: int,
pruned_count: int
) -> Dict[str, float]:
"""
Calculate pruning statistics.
Args:
original_count: Number of keywords before pruning
pruned_count: Number of keywords after pruning
Returns:
Dict with retention_rate and reduction_rate
"""
if original_count == 0:
return {"retention_rate": 0.0, "reduction_rate": 0.0}
retention = pruned_count / original_count
reduction = 1.0 - retention
return {
"retention_rate": retention,
"reduction_rate": reduction
}Key Changes
-
ChromaDB Integration: Uses
PersistentClientandSentenceTransformerEmbeddingFunction(same asKeywordVectorStore) -
Normalized Cosine Similarity: ChromaDB automatically normalizes embeddings and returns cosine distance in the range [0, 2]. The conversion formula is:
similarity = 1.0 - (distance / 2.0) # Maps [0, 2] → [1, 0]
This gives us the 0-1 similarity range you need, where:
similarity = 1.0means identical (distance = 0)similarity = 0.0means opposite (distance = 2)
-
Persistent Storage: Category descriptions are stored in a
category_descriptionscollection, avoiding re-computation on each instantiation -
Consistent Architecture: Follows the same patterns as
KeywordVectorStorefor maintainability -
Updated Constructor: Added
persist_directoryparameter (defaults to same location asKeywordVectorStore)
Usage in smart_keyword_provider.py
Update the initialization to pass the ChromaDB path:
context_pruner = ContextPruner(
persist_directory=settings.OPTIMIZATION_CHROMA_DB_PATH
)This ensures all optimization components share the same ChromaDB instance and storage location.
Let me know if you'd like me to also update the instantiation points in smart_keyword_provider.py and crew.py!
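And a small, hypothetical end-to-end check of that API (the query string and threshold are illustrative, not tested values):

```python
pruner = ContextPruner(persist_directory=settings.OPTIMIZATION_CHROMA_DB_PATH)

# Expect "input" and "interaction" to rank highest for this query.
categories = pruner.classify_query("type the username and click login", confidence_threshold=0.5)

keywords = [{"name": "Fill Text"}, {"name": "Click"}, {"name": "Get Title"}]
pruned = pruner.prune_keywords(keywords, categories)
print(categories, pruned)  # "Get Title" is dropped unless "extraction" also cleared the threshold

print(pruner.get_pruning_stats(len(keywords), len(pruned)))
```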
```python
logger.info(
    f"Pruned keywords: {len(all_keywords)} -> {len(pruned_keywords)} "
    f"({len(pruned_keywords)/len(all_keywords)*100:.1f}% retained)"
)
```
Avoid divide-by-zero when no keywords are provided
If `all_keywords` is empty, we hit a `ZeroDivisionError` in the retention log formatting, crashing the caller instead of gracefully returning an empty list. Please guard the division.
```diff
- logger.info(
-     f"Pruned keywords: {len(all_keywords)} -> {len(pruned_keywords)} "
-     f"({len(pruned_keywords)/len(all_keywords)*100:.1f}% retained)"
- )
+ original_count = len(all_keywords)
+ pruned_count = len(pruned_keywords)
+
+ retention_pct = (pruned_count / original_count * 100) if original_count else 0.0
+
+ logger.info(
+     f"Pruned keywords: {original_count} -> {pruned_count} "
+     f"({retention_pct:.1f}% retained)"
+ )
```
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

original_count = len(all_keywords)
pruned_count = len(pruned_keywords)

retention_pct = (pruned_count / original_count * 100) if original_count else 0.0

logger.info(
    f"Pruned keywords: {original_count} -> {pruned_count} "
    f"({retention_pct:.1f}% retained)"
)
🤖 Prompt for AI Agents

In src/backend/crew_ai/optimization/context_pruner.py around lines 170 to 173, the logging expression divides by len(all_keywords) and will raise ZeroDivisionError when all_keywords is empty; guard the division by computing the retention percentage only when len(all_keywords) > 0 (e.g. retained_pct = 0.0 if len(all_keywords) == 0 else len(pruned_keywords)/len(all_keywords)*100) and use retained_pct in the log message so the function doesn't crash when no keywords are provided.
# Pattern learning: ONLY learn from PASSED tests
# This ensures we only learn from validated, working code
if result.get('test_status') == 'passed':
    if user_query:
        try:
            from src.backend.core.config import settings
            if settings.OPTIMIZATION_ENABLED:
                # Initialize optimization components to learn from this successful execution
                from src.backend.crew_ai.optimization import SmartKeywordProvider, QueryPatternMatcher, KeywordVectorStore
                from src.backend.crew_ai.library_context import get_library_context

                logging.info("📚 Test PASSED - Learning from successful execution...")

                # Initialize components
                library_context = get_library_context(settings.ROBOT_LIBRARY)
                chroma_store = KeywordVectorStore(persist_directory=settings.OPTIMIZATION_CHROMA_DB_PATH)
                pattern_matcher = QueryPatternMatcher(db_path=settings.OPTIMIZATION_PATTERN_DB_PATH)
                smart_provider = SmartKeywordProvider(
                    library_context=library_context,
                    pattern_matcher=pattern_matcher,
                    vector_store=chroma_store
                )

                # Learn from the successful execution
                smart_provider.learn_from_execution(user_query, robot_code)
                logging.info("✅ Pattern learning completed - learned from PASSED test")
        except Exception as e:
            logging.warning(f"⚠️ Failed to learn from execution: {e}")
    else:
        logging.info("⏭️ Test PASSED but skipping pattern learning - no user query provided")
else:
    logging.info(f"⏭️ Skipping pattern learning - test status: {result.get('test_status', 'unknown')}")
🧩 Analysis chain

Pattern learning sidecar has two functional bugs (missing Chroma store, undefined variable)

The post-execution pattern learning blocks introduce two concrete issues:

1. QueryPatternMatcher is created without a Chroma store

   In both stream_execute_only and stream_generate_and_run, you do:

   chroma_store = KeywordVectorStore(persist_directory=settings.OPTIMIZATION_CHROMA_DB_PATH)
   pattern_matcher = QueryPatternMatcher(db_path=settings.OPTIMIZATION_PATTERN_DB_PATH)

   but QueryPatternMatcher relies on the provided chroma_store to create pattern_collection. Without it, learn_from_execution will only update SQLite statistics; no patterns are stored in ChromaDB, and get_relevant_keywords will always return []. This effectively disables semantic pattern-based predictions for executions learned through these paths (a sketch of the constructor guard follows this list).

2. Legacy stream_generate_and_run uses an undefined natural_language_query

   In the legacy flow, the learning block calls:

   smart_provider.learn_from_execution(natural_language_query, robot_code)

   but the enclosing function has a user_query parameter, not natural_language_query. At runtime this raises a NameError inside the try block, which is caught by except Exception and logged as a warning, meaning pattern learning always fails in this path.
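For illustration, a minimal sketch of the constructor guard described in item 1 (the real implementation lives in pattern_learning.py; the collection-creation helper here is hypothetical):

class QueryPatternMatcher:
    def __init__(self, db_path: str, chroma_store=None):
        self.db_path = db_path
        if chroma_store is not None:
            # Patterns can be stored and queried semantically in ChromaDB.
            self.pattern_collection = chroma_store.get_or_create_collection("query_patterns")  # hypothetical helper
        else:
            # No Chroma store: only SQLite stats are updated, and
            # get_relevant_keywords will always return [].
            self.pattern_collection = None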
To fix both issues and align with how run_crew initializes these components, consider:
@@ async def stream_execute_only(robot_code: str, user_query: str | None = None) -> Generator[str, None, None]:
- chroma_store = KeywordVectorStore(persist_directory=settings.OPTIMIZATION_CHROMA_DB_PATH)
- pattern_matcher = QueryPatternMatcher(db_path=settings.OPTIMIZATION_PATTERN_DB_PATH)
+ chroma_store = KeywordVectorStore(
+ persist_directory=settings.OPTIMIZATION_CHROMA_DB_PATH
+ )
+ pattern_matcher = QueryPatternMatcher(
+ db_path=settings.OPTIMIZATION_PATTERN_DB_PATH,
+ chroma_store=chroma_store,
+ )
@@ async def stream_generate_and_run(user_query: str, model_provider: str, model_name: str) -> Generator[str, None, None]:
- chroma_store = KeywordVectorStore(persist_directory=settings.OPTIMIZATION_CHROMA_DB_PATH)
- pattern_matcher = QueryPatternMatcher(db_path=settings.OPTIMIZATION_PATTERN_DB_PATH)
+ chroma_store = KeywordVectorStore(
+ persist_directory=settings.OPTIMIZATION_CHROMA_DB_PATH
+ )
+ pattern_matcher = QueryPatternMatcher(
+ db_path=settings.OPTIMIZATION_PATTERN_DB_PATH,
+ chroma_store=chroma_store,
+ )
@@
- # Learn from the successful execution
- smart_provider.learn_from_execution(natural_language_query, robot_code)
+ # Learn from the successful execution
+ smart_provider.learn_from_execution(user_query, robot_code)

With these changes, successful executions will correctly populate the query_patterns collection and use the actual user query string, so the optimization system can learn and later serve predictions as designed.
Also applies to: 593-621
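To make the learn/serve loop concrete, reusing the objects from the excerpt above (method names come from this review; the exact argument to get_relevant_keywords is an assumption):

# Learning side: runs only after a PASSED test, as in the excerpt above.
smart_provider.learn_from_execution(user_query, robot_code)

# Serving side: a later, similar query can now be answered from learned patterns.
predicted = pattern_matcher.get_relevant_keywords(user_query)
# With chroma_store wired in, this returns learned keyword names instead of [].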
I'll verify the claims in this review comment by inspecting the codebase.
🏁 Verification (condensed): a series of rg, ast-grep, sed, and Python AST scripts located the QueryPatternMatcher and SmartKeywordProvider definitions, inspected the __init__ and learn_from_execution signatures, listed every instantiation of QueryPatternMatcher and KeywordVectorStore, and dumped src/backend/services/workflow_service.py around lines 489-520 and 593-621 alongside the run_crew initialization for comparison. The confirmed findings are summarized below.
Two critical bugs confirmed: missing chroma_store parameter and undefined variable

Both issues identified in the review are real and will break functionality:

1. Lines 505 and 608: QueryPatternMatcher instantiated without the chroma_store parameter
   - QueryPatternMatcher.__init__ accepts an optional chroma_store (src/backend/crew_ai/optimization/pattern_learning.py:28)
   - When None, pattern_collection is set to None and learning is disabled (lines 46–50)
   - The correct pattern in crew.py (line 126) passes chroma_store=vector_store

2. Line 616: stream_generate_and_run calls learn_from_execution(natural_language_query, ...) but the parameter is user_query
   - Function signature (line 527): async def stream_generate_and_run(user_query: str, ...)
   - natural_language_query is undefined in this scope and causes a NameError at runtime
   - Line 513 in stream_execute_only correctly uses user_query
@@ src/backend/services/workflow_service.py:505
- pattern_matcher = QueryPatternMatcher(db_path=settings.OPTIMIZATION_PATTERN_DB_PATH)
+ pattern_matcher = QueryPatternMatcher(
+ db_path=settings.OPTIMIZATION_PATTERN_DB_PATH,
+ chroma_store=chroma_store
+ )
@@ src/backend/services/workflow_service.py:608
- pattern_matcher = QueryPatternMatcher(db_path=settings.OPTIMIZATION_PATTERN_DB_PATH)
+ pattern_matcher = QueryPatternMatcher(
+ db_path=settings.OPTIMIZATION_PATTERN_DB_PATH,
+ chroma_store=chroma_store
+ )
@@ src/backend/services/workflow_service.py:616
- smart_provider.learn_from_execution(natural_language_query, robot_code)
+ smart_provider.learn_from_execution(user_query, robot_code)

Committable suggestion skipped: line range outside the PR's diff.
🧰 Tools
🪛 Ruff (0.14.4)
515-515: Do not catch blind exception: Exception (BLE001)
🤖 Prompt for AI Agents
In src/backend/services/workflow_service.py around lines 489-520 (and related usages at ~505 and ~616), fix two issues: instantiate QueryPatternMatcher with the chroma_store passed (e.g., QueryPatternMatcher(..., chroma_store=chroma_store)) so pattern_collection is initialized and learning is enabled, and replace the undefined natural_language_query argument with the correct user_query when calling learn_from_execution (use user_query consistently). Ensure both changes follow the existing variable name (the chroma_store created from KeywordVectorStore) and update any other calls to QueryPatternMatcher or learn_from_execution in this file to the same signature and variable name for consistency.



Release Notes
New Features
Configuration
Documentation
Metrics & Monitoring