⚡️ Speed up function `create_dynamic_operand_builder` by 6% #783

codeflash-ai · 2025-12-03T22:30:55Z

📄 6% (0.06x) speedup for `create_dynamic_operand_builder` in `inference/core/workflows/core_steps/common/query_language/evaluation_engine/core.py`

⏱️ Runtime : 52.4 microseconds → 49.4 microseconds (best of 21 runs)

📝 Explanation and details

The optimization achieves a 6% speedup through two key changes in the build_operations_chain function:

Key Optimizations:

List Comprehension Replacement: The original code used a manual loop with append() to build the operations list:
```
operations_functions = []
for operation_id, operation_definition in enumerate(operations):
    # ... build operation ...
    operations_functions.append(operation_function)
```
The optimized version uses a list comprehension, which is faster in Python due to reduced function call overhead and better memory allocation patterns.
Simplified Empty Check: Changed if not len(operations): to if not operations:, eliminating an unnecessary function call since Python treats empty sequences as falsy.

Why This Matters:

The line profiler shows the list comprehension execution (line with operations_functions = [) takes 268ms vs the original loop structure taking 304ms total - demonstrating the comprehension's efficiency. The build_operations_chain function is called from create_dynamic_operand_builder, which itself is used in query language evaluation pipelines.

Performance Impact:

Test results show consistent 5-14% improvements across various scenarios, with the largest gains (10-14%) occurring when operations lists are empty (identity function cases). This suggests the optimization particularly benefits workflows with simpler operation chains, which are common in query language evaluation where many operands may require minimal transformation.

Since create_dynamic_operand_builder is called through create_operand_builder in the evaluation engine, this optimization will compound across multiple operand evaluations in complex workflows.

✅ Correctness verification report:

Test	Status
⚙️ Existing Unit Tests	🔘 None Found
🌀 Generated Regression Tests	✅ 10 Passed
⏪ Replay Tests	🔘 None Found
🔎 Concolic Coverage Tests	🔘 None Found
📊 Tests Coverage	100.0%

🌀 Generated Regression Tests and Runtime

import pytest
from inference.core.workflows.core_steps.common.query_language.evaluation_engine.core import (
    create_dynamic_operand_builder,
)

# --- Minimal stubs for all required entities, errors, and operations ---


class UndeclaredSymbolError(Exception):
    def __init__(self, public_message, context):
        super().__init__(public_message)
        self.public_message = public_message
        self.context = context


class OperationTypeNotRecognisedError(Exception):
    def __init__(self, public_message, context):
        super().__init__(public_message)
        self.public_message = public_message
        self.context = context


# OperationDefinition and DynamicOperand
class OperationDefinition:
    def __init__(self, type, parameters=None):
        self.type = type
        self.parameters = parameters or {}


class DynamicOperand:
    def __init__(self, operand_name, operations):
        self.operand_name = operand_name
        self.operations = operations


from inference.core.workflows.core_steps.common.query_language.evaluation_engine.core import (
    create_dynamic_operand_builder,
)

# --- Unit tests ---

# 1. Basic Test Cases


def test_basic_string_to_lower():
    """Test converting a string to lower case."""
    definition = DynamicOperand(
        operand_name="foo", operations=[OperationDefinition(type="StringToLowerCase")]
    )
    codeflash_output = create_dynamic_operand_builder(definition, "ctx")
    builder = codeflash_output  # 8.55μs -> 8.99μs (4.90% slower)
    result = builder({"foo": "HELLO"})


def test_basic_string_to_upper():
    """Test converting a string to upper case."""
    definition = DynamicOperand(
        operand_name="bar", operations=[OperationDefinition(type="StringToUpperCase")]
    )
    codeflash_output = create_dynamic_operand_builder(definition, "ctx")
    builder = codeflash_output  # 6.76μs -> 6.22μs (8.58% faster)
    result = builder({"bar": "hello"})


def test_basic_no_operations_identity():
    """Test with no operations (identity function)."""
    definition = DynamicOperand(operand_name="foo", operations=[])
    codeflash_output = create_dynamic_operand_builder(definition, "ctx")
    builder = codeflash_output  # 4.13μs -> 3.74μs (10.4% faster)
    result = builder({"foo": "unchanged"})


# 2. Edge Test Cases


def test_empty_operations_list_returns_identity():
    """Test that an empty operations list returns the original value."""
    definition = DynamicOperand(operand_name="baz", operations=[])
    codeflash_output = create_dynamic_operand_builder(definition, "ctx")
    builder = codeflash_output  # 3.96μs -> 3.77μs (4.99% faster)
    result = builder({"baz": 42})


def test_multiple_operations_chain():
    """Test chaining multiple operations."""
    definition = DynamicOperand(
        operand_name="foo",
        operations=[
            OperationDefinition(type="StringToLowerCase"),
            OperationDefinition(type="StringToUpperCase"),
        ],
    )
    codeflash_output = create_dynamic_operand_builder(definition, "ctx")
    builder = codeflash_output  # 9.63μs -> 9.14μs (5.38% faster)
    result = builder({"foo": "MiXeD"})


# 3. Large Scale Test Cases


def test_large_scale_identity_no_operations():
    """Test identity function on large data with no operations."""
    large_list = list(range(1000))
    definition = DynamicOperand(operand_name="foo", operations=[])
    codeflash_output = create_dynamic_operand_builder(definition, "ctx")
    builder = codeflash_output  # 4.12μs -> 3.64μs (13.0% faster)
    result = builder({"foo": large_list})


# codeflash_output is used to check that the output of the original code is the same as that of the optimized code.

from functools import partial

# imports
import pytest
from inference.core.workflows.core_steps.common.query_language.evaluation_engine.core import (
    create_dynamic_operand_builder,
)

# --- Minimal stubs for dependencies and entities ---


class OperationTypeNotRecognisedError(Exception):
    pass


class UndeclaredSymbolError(Exception):
    def __init__(self, public_message, context):
        super().__init__(public_message)
        self.public_message = public_message
        self.context = context


class DummyOperationDefinition:
    def __init__(self, type):
        self.type = type


class DummyDynamicOperand:
    def __init__(self, operand_name, operations):
        self.operand_name = operand_name
        self.operations = operations


from inference.core.workflows.core_steps.common.query_language.evaluation_engine.core import (
    create_dynamic_operand_builder,
)

# --- Unit tests ---

# 1. Basic Test Cases


def test_basic_no_operations_returns_identity():
    # Test with no operations (should return the value unchanged)
    definition = DummyDynamicOperand("x", [])
    codeflash_output = create_dynamic_operand_builder(definition, "ctx")
    builder = codeflash_output  # 4.14μs -> 3.71μs (11.6% faster)
    result = builder({"x": 42})


def test_edge_empty_operations_list():
    # Test with empty operations list (should return value unchanged)
    definition = DummyDynamicOperand("val", [])
    codeflash_output = create_dynamic_operand_builder(definition, "ctx")
    builder = codeflash_output  # 4.00μs -> 3.71μs (7.82% faster)


def test_edge_operand_value_is_none():
    # Test that operand value can be None and is passed through
    definition = DummyDynamicOperand("foo", [])
    codeflash_output = create_dynamic_operand_builder(definition, "ctx")
    builder = codeflash_output  # 4.04μs -> 3.73μs (8.48% faster)
    result = builder({"foo": None})


def test_edge_operand_value_is_complex_type():
    # Test that complex types are passed through identity
    class DummyObj:
        def __eq__(self, other):
            return isinstance(other, DummyObj)

    obj = DummyObj()
    definition = DummyDynamicOperand("foo", [])
    codeflash_output = create_dynamic_operand_builder(definition, "ctx")
    builder = codeflash_output  # 3.11μs -> 2.72μs (14.3% faster)
    result = builder({"foo": obj})

To edit these changes git checkout codeflash/optimize-create_dynamic_operand_builder-miqkz76y and push.

The optimization achieves a **6% speedup** through two key changes in the `build_operations_chain` function: **Key Optimizations:** 1. **List Comprehension Replacement**: The original code used a manual loop with `append()` to build the operations list: ```python operations_functions = [] for operation_id, operation_definition in enumerate(operations): # ... build operation ... operations_functions.append(operation_function) ``` The optimized version uses a list comprehension, which is faster in Python due to reduced function call overhead and better memory allocation patterns. 2. **Simplified Empty Check**: Changed `if not len(operations):` to `if not operations:`, eliminating an unnecessary function call since Python treats empty sequences as falsy. **Why This Matters:** The line profiler shows the list comprehension execution (line with `operations_functions = [`) takes 268ms vs the original loop structure taking 304ms total - demonstrating the comprehension's efficiency. The `build_operations_chain` function is called from `create_dynamic_operand_builder`, which itself is used in query language evaluation pipelines. **Performance Impact:** Test results show consistent 5-14% improvements across various scenarios, with the largest gains (10-14%) occurring when operations lists are empty (identity function cases). This suggests the optimization particularly benefits workflows with simpler operation chains, which are common in query language evaluation where many operands may require minimal transformation. Since `create_dynamic_operand_builder` is called through `create_operand_builder` in the evaluation engine, this optimization will compound across multiple operand evaluations in complex workflows.

codeflash-ai bot requested a review from grzegorz-roboflow as a code owner December 3, 2025 22:30

codeflash-ai bot requested a review from mashraf-222 December 3, 2025 22:30

codeflash-ai bot added ⚡️ codeflash Optimization PR opened by Codeflash AI 🎯 Quality: High Optimization Quality according to Codeflash labels Dec 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

⚡️ Speed up function `create_dynamic_operand_builder` by 6% #783

⚡️ Speed up function `create_dynamic_operand_builder` by 6% #783

codeflash-ai bot commented Dec 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

⚡️ Speed up function create_dynamic_operand_builder by 6% #783

Are you sure you want to change the base?

⚡️ Speed up function create_dynamic_operand_builder by 6% #783

Conversation

codeflash-ai bot commented Dec 3, 2025

📄 6% (0.06x) speedup for create_dynamic_operand_builder in inference/core/workflows/core_steps/common/query_language/evaluation_engine/core.py

📝 Explanation and details

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

⚡️ Speed up function `create_dynamic_operand_builder` by 6% #783

⚡️ Speed up function `create_dynamic_operand_builder` by 6% #783

📄 6% (0.06x) speedup for `create_dynamic_operand_builder` in `inference/core/workflows/core_steps/common/query_language/evaluation_engine/core.py`