Build self-optimizing AI agents with the OpenAI Agents SDK and automated prompt optimization with GEPA
This is a complete working example showing how to build AI agents using the official OpenAI Agents SDK and optimize them automatically with GEPA (Genetic-Pareto) and a lite version of SuperOptiX.
- Complete Code Reviewer Agent - Production-ready agent that detects security issues, memory leaks, and performance problems
- GEPA Optimization Demo - Automated prompt engineering example (custom implementation)
- Agent Spec Scenarios (Agent Evals) - Measurable, scripted evaluation of agent behavior
- 100% Local OR Cloud - Works with Ollama (free!) or cloud models (OpenAI, Anthropic, Google)
- Auto-Loading - Optimized prompts load automatically after training
- Standalone - Includes minimal SuperOptiX components (no external dependencies)
- Cost-Conscious - Clear warnings and estimates for cloud API costs
Note: This demo includes `superoptix_lite` - the minimal components needed for GEPA-style optimization. For production use with full SuperOptiX features, install: `pip install superoptix[frameworks-openai]`
```bash
# Install Ollama from https://ollama.com

# Pull required models
ollama pull gpt-oss:20b   # For agent execution
ollama pull llama3.1:8b   # For GEPA optimization
```

System Requirements:
- Python 3.11+
- 32GB+ RAM recommended if running 20b models
- Ollama running locally
```bash
# Clone repository
git clone https://github.com/SuperagenticAI/superoptix-lite-openai.git
cd superoptix-lite-openai

# Create virtual environment
python3 -m venv venv
source venv/bin/activate   # Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

# Try the live code review demo with FREE local Ollama models
python demo_local.py
```

Method 1: Using .env file (recommended)
```bash
# Copy the example file
cp .env.example .env

# Edit .env and add your API key
# OPENAI_API_KEY=sk-...

# Load environment variables and run
source .env
python demo_cloud.py
```

Method 2: Direct export
```bash
# Set your API key (choose one)
export OPENAI_API_KEY=sk-...
# OR
export ANTHROPIC_API_KEY=sk-ant-...
# OR
export GOOGLE_API_KEY=...

# Run cloud demo
python demo_cloud.py
```

Both demos show the agent reviewing 4 code samples and detecting:
- ✅ SQL injection vulnerabilities
- ✅ Memory leaks
- ✅ Missing error handling
- ✅ Performance issues (O(n²) algorithms)
```python
import asyncio
from openai_gepa.agents.code_reviewer.pipelines.code_reviewer_openai_pipeline import CodeReviewerPipeline

# Initialize agent
pipeline = CodeReviewerPipeline(
    'openai_gepa/openai_gepa/agents/code_reviewer/playbook/code_reviewer_playbook.yaml'
)

# Review some code
code = """
def get_user(username):
    query = "SELECT * FROM users WHERE name = '" + username + "'"
    return db.execute(query)
"""
result = asyncio.run(pipeline.run(code=code, language="python"))
print(result["review"])
```

Output:
```
⚠️ Critical Security Issue: SQL Injection Vulnerability

This code is vulnerable to SQL injection. The username parameter is directly
concatenated into the SQL query without sanitization...

Recommendations:
1. Use parameterized queries: cursor.execute("SELECT * FROM users WHERE name = ?", (username,))
2. Consider using an ORM (SQLAlchemy, Django ORM)
3. Validate and sanitize all user input
...
```
```bash
# Optimize with FREE local Ollama models
python optimize_local.py
```

Method 1: Using .env file (recommended)
```bash
# Copy and edit .env file
cp .env.example .env
# Edit .env and add your API key

# Load environment variables and run
source .env
python optimize_cloud.py
```

Method 2: Direct export
```bash
# Set your API key (choose one)
export OPENAI_API_KEY=sk-...        # Uses gpt-5
# OR
export ANTHROPIC_API_KEY=sk-ant-... # Uses claude-sonnet-4.5
# OR
export GOOGLE_API_KEY=...           # Uses gemini-pro-2.5

# Run cloud optimization
python optimize_cloud.py
```

GEPA will:
- ✅ Evaluate baseline performance (runs all Agent Spec scenarios)
- ✅ Analyze failures and identify missing keywords
- ✅ Generate improved instructions
- ✅ Test the improved version
- ✅ Save optimized weights (if improved)
After optimization, the agent automatically loads improved instructions on next run.
ℹ️ The optimized instructions are saved to `openai_gepa/openai_gepa/agents/code_reviewer/optimized/code_reviewer_openai_optimized.json`. The pipeline will load from this location on subsequent runs.
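You can inspect the saved file directly. A minimal sketch, using a throwaway file with hypothetical contents: `best_variable` is the field the pipeline reads, while `best_score` is an assumed name for the recorded score and may differ in the actual JSON.

```python
import json
import pathlib
import tempfile

# Hypothetical contents standing in for a real optimization result
sample = {
    "best_variable": "You are a meticulous code reviewer. Always check for SQL injection...",
    "best_score": 1.0,  # assumed field name
}

opt_file = pathlib.Path(tempfile.mkdtemp()) / "code_reviewer_openai_optimized.json"
opt_file.write_text(json.dumps(sample, indent=2))

# Mirror what the pipeline does on its next run: load and report
data = json.loads(opt_file.read_text())
print(f"score: {data.get('best_score', 0.0):.2%}")
print(data["best_variable"][:40])
```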
```
superoptix-lite-openai/
├── README.md                        # This file
├── LICENSE                          # MIT License
├── requirements.txt                 # Python dependencies
│
├── demo_local.py                    # Live demo with LOCAL models (FREE)
├── demo_cloud.py                    # Live demo with CLOUD models
├── optimize_local.py                # GEPA optimization with LOCAL models (FREE)
├── optimize_cloud.py                # GEPA optimization with CLOUD models
│
└── openai_gepa/                     # Main project
    ├── pyproject.toml               # Package configuration
    └── openai_gepa/                 # Python package
        └── agents/
            ├── code_reviewer/       # ⭐ Code reviewer agent
            │   ├── playbook/
            │   │   └── code_reviewer_playbook.yaml          # Config + Agent Spec scenarios
            │   ├── pipelines/
            │   │   └── code_reviewer_openai_pipeline.py     # Agent code
            │   └── optimized/
            │       └── code_reviewer_openai_optimized.json  # GEPA results
            │
            └── assistant_openai/    # Simple Q&A assistant
                ├── playbook/
                ├── pipelines/
                └── optimized/
```
Written using official OpenAI Agents SDK patterns:
```python
from agents import Agent, Runner, OpenAIChatCompletionsModel
from openai import AsyncOpenAI

class CodeReviewerAgent:
    def __init__(self, instructions, model, api_base):
        # Initialize Ollama-compatible model
        self.model = OpenAIChatCompletionsModel(
            model=model,
            openai_client=AsyncOpenAI(
                base_url=f"{api_base}/v1",
                api_key="ollama"  # Dummy key for Ollama
            )
        )

        # Create agent
        self.agent = Agent(
            name="Code Reviewer",
            instructions=instructions,  # ← GEPA optimizes this!
            model=self.model
        )

    async def review_code(self, code, language):
        result = await Runner.run(self.agent, input=code)
        return result.final_output
```

Wrapped in BaseComponent for GEPA compatibility:
```python
# Using minimal BaseComponent from included superoptix_lite package
from openai_gepa.superoptix_lite import BaseComponent

class CodeReviewerComponent(BaseComponent):
    def __init__(self, instructions=None, model_config=None):
        super().__init__(
            name="code_reviewer",
            variable=instructions,  # ← GEPA optimizes this variable
            variable_type="instructions",
            framework="openai"
        )
```

Note: This demo includes `superoptix_lite` - a minimal implementation of SuperOptiX's BaseComponent. For production use with full GEPA capabilities, install the complete SuperOptiX framework.
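To illustrate the idea, a BaseComponent along these lines would satisfy the constructor call shown above. This is a hypothetical sketch, not the actual `superoptix_lite` implementation: the key point is that the component exposes one named variable (the instructions) that the optimizer can read and rewrite.

```python
class BaseComponent:
    """Sketch of an optimizable component: it exposes a named variable
    (e.g. the instructions string) that a GEPA-style optimizer can rewrite."""

    def __init__(self, name="component", variable=None,
                 variable_type="instructions", framework=None):
        self.name = name
        self.variable = variable            # the text GEPA optimizes
        self.variable_type = variable_type
        self.framework = framework

    def get_variable(self):
        return self.variable

    def set_variable(self, new_value):
        # The optimizer writes candidate instructions back through this setter
        self.variable = new_value


reviewer = BaseComponent(name="code_reviewer",
                         variable="Review code for security issues.",
                         variable_type="instructions",
                         framework="openai")
reviewer.set_variable("Review code for security issues and memory leaks.")
print(reviewer.get_variable())
```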
Performance measured with Agent Spec scenarios in the playbook:
```yaml
feature_specifications:
  scenarios:
    - name: SQL Injection Detection
      input:
        code: |
          def get_user(username):
              query = "SELECT * FROM users WHERE name = '" + username + "'"
              return db.execute(query)
        language: python
      expected_output:
        expected_keywords:
          - SQL injection
          - vulnerability
          - parameterized
```

- Baseline: Evaluates agent with original instructions
- Analysis: Identifies which test scenarios fail and why
- Improvement: Generates enhanced instructions with explicit requirements
- Testing: Validates improved version against all scenarios
- Saving: Stores optimized instructions for automatic loading
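The five steps above can be sketched as a single evaluate-improve loop. This toy version scores candidates by expected-keyword coverage (the same signal the Agent Spec scenarios use); the agent and the improvement step are stubbed out with plain functions, whereas the real optimizer calls an LLM for both.

```python
def keyword_score(review_text, expected_keywords):
    """Fraction of expected keywords found in the agent's review."""
    text = review_text.lower()
    hits = sum(1 for kw in expected_keywords if kw.lower() in text)
    return hits / len(expected_keywords)

def optimize(instructions, scenarios, run_agent, improve, rounds=3):
    """Keep the best-scoring instructions across a few improvement rounds."""
    def evaluate(instr):
        scores = [keyword_score(run_agent(instr, s["input"]), s["expected_keywords"])
                  for s in scenarios]
        return sum(scores) / len(scores)

    best, best_score = instructions, evaluate(instructions)  # baseline
    for _ in range(rounds):
        candidate = improve(best)        # in GEPA this is an LLM reflection step
        candidate_score = evaluate(candidate)
        if candidate_score > best_score:  # save only if improved
            best, best_score = candidate, candidate_score
    return best, best_score

# Stubs so the sketch runs without any model:
scenarios = [{"input": "...", "expected_keywords": ["SQL injection", "parameterized"]}]
run_agent = lambda instr, _inp: f"Review per: {instr}"
improve = lambda instr: instr + " Always mention SQL injection and parameterized queries."

best, score = optimize("Review the code.", scenarios, run_agent, improve)
print(f"best score: {score:.0%}")
```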
On the next initialization, optimized instructions load transparently:

```python
# Check for optimization file (happens automatically in __init__)
optimized_file = playbook_dir / "optimized" / "code_reviewer_openai_optimized.json"

if optimized_file.exists():
    opt_data = json.load(open(optimized_file))
    optimized_instructions = opt_data['best_variable']
    score = opt_data.get('best_score', 0.0)  # assumed field name for the stored score
    print(f"✅ Loaded optimized instructions (score: {score:.2%})")
    self.is_trained = True
```

If you can't run local models with Ollama, you can use cloud-based models. We've made this super simple with dedicated scripts for each mode.
Step 1: Get an API key from one of these providers:
- OpenAI: https://platform.openai.com/api-keys
- Anthropic: https://console.anthropic.com/
- Google AI: https://makersuite.google.com/app/apikey
Step 2: Set your API key using either method:
Option A: .env file (recommended)

```bash
cp .env.example .env
# Edit .env and uncomment/add your API key
source .env
```

Option B: Direct export
```bash
export OPENAI_API_KEY=sk-...
# OR
export ANTHROPIC_API_KEY=sk-ant-...
# OR
export GOOGLE_API_KEY=...
```

Step 3: Run the cloud scripts:
```bash
# Demo with cloud models (auto-detects provider from API key)
python demo_cloud.py

# Optimize with cloud models (⚠️ will incur API costs)
python optimize_cloud.py
```

That's it! The scripts automatically:
- ✅ Detect which cloud provider to use based on your API key
- ✅ Use the latest models (gpt-5, claude-sonnet-4.5, gemini-pro-2.5)
- ✅ Configure everything for you
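The provider auto-detection can be approximated with a simple environment-variable check. This is an illustrative sketch, not the scripts' actual logic; the priority order is an assumption and the model names mirror the ones listed in this README.

```python
import os

def detect_provider(env=None):
    """Pick a cloud provider from whichever API key is set (first match wins)."""
    env = os.environ if env is None else env
    if env.get("OPENAI_API_KEY"):
        return "openai", "gpt-5"
    if env.get("ANTHROPIC_API_KEY"):
        return "anthropic", "claude-sonnet-4.5"
    if env.get("GOOGLE_API_KEY"):
        return "google", "gemini-pro-2.5"
    raise RuntimeError("No cloud API key found; try the free local demo instead.")

# Example: only an Anthropic key is set
provider, model = detect_provider({"ANTHROPIC_API_KEY": "sk-ant-demo"})
print(provider, model)  # anthropic claude-sonnet-4.5
```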
Cloud optimization uses APIs and will incur costs! The optimization process:
- Evaluates test scenarios multiple times
- Makes many API calls to improve the agent
- Costs vary by provider and model
Tips to reduce costs:
- ✅ Use local models (Ollama) - completely free! (Recommended)
- ✅ Test with `demo_cloud.py` first (cheaper) before running `optimize_cloud.py`
- ✅ Only run optimization when needed
- ✅ Consider using local optimization: `python optimize_local.py` (FREE!)
| Provider | Latest Model | Script Auto-Detects |
|---|---|---|
| OpenAI | gpt-5 | ✅ From OPENAI_API_KEY |
| Anthropic | claude-sonnet-4.5 | ✅ From ANTHROPIC_API_KEY |
| Google | gemini-pro-2.5 | ✅ From GOOGLE_API_KEY |
| Aspect | Local (Ollama) | Cloud (APIs) |
|---|---|---|
| Setup | Download models (one-time) | Get API key (instant) |
| Demo | FREE | Uses API credits |
| Optimization | FREE | Uses API credits |
| Hardware | 16GB+ RAM required | No requirements |
| Privacy | 100% local | Data sent to API |
| Commands | `python demo_local.py`<br>`python optimize_local.py` | `python demo_cloud.py`<br>`python optimize_cloud.py` |
Recommendation: Use local Ollama models for development and testing. Only use cloud models if you can't run local models or need specific cloud capabilities.
```yaml
# my_agent_playbook.yaml
apiVersion: agent/v1
kind: AgentSpec
metadata:
  name: my_agent
spec:
  target_framework: openai
  language_model:
    provider: ollama
    model: ollama:gpt-oss:20b
    api_base: http://localhost:11434
  feature_specifications:
    scenarios:
      - name: Test Case 1
        input:
          text: "Sample input"
        expected_output:
          expected_keywords:
            - keyword1
            - keyword2
```

```python
from agents import Agent, Runner, OpenAIChatCompletionsModel

class MyAgent:
    def __init__(self, instructions, model, api_base):
        self.model = OpenAIChatCompletionsModel(...)
        self.agent = Agent(
            name="My Agent",
            instructions=instructions,
            model=self.model
        )

    async def execute(self, text):
        result = await Runner.run(self.agent, input=text)
        return result.final_output
```

```python
# Using minimal BaseComponent from superoptix_lite (included in this demo)
from openai_gepa.superoptix_lite import BaseComponent

class MyComponent(BaseComponent):
    def __init__(self, instructions=None):
        super().__init__(
            variable=instructions or "Default instructions",
            variable_type="instructions",
            framework="openai"
        )
```

```python
class MyPipeline:
    def __init__(self, playbook_path):
        self.component = MyComponent(
            instructions=self._load_optimization()  # Auto-load if exists
        )
        self.test_scenarios = self._load_bdd_scenarios()

    def evaluate(self):
        # Run all Agent Spec scenarios and measure the keyword pass rate
        pass
```

```bash
# Verify Ollama is running
ollama list

# Test a model
ollama run gpt-oss:20b "Hello"

# Check API endpoint
curl http://localhost:11434/api/tags
```

```bash
# Ensure you're in the right directory and venv is activated
cd superoptix-lite-openai
source venv/bin/activate

# Reinstall dependencies
pip install -r requirements.txt

# Verify openai-agents is installed
pip show openai-agents
```

These warnings are safe to ignore when using Ollama:
```
OPENAI_API_KEY is not set, skipping trace export
[non-fatal] Tracing client error 401
```

Results vary depending on the models used.
| Metric | Value |
|---|---|
| Test Pass Rate | 4/4 scenarios (100%) |
| Scenarios Tested | SQL injection, memory leaks, error handling, performance |
| Detection Accuracy | Identifies all critical issues |
| Model | Ollama gpt-oss:20b |
GEPA Optimization:
- Original instructions: ~900 characters
- Optimized instructions: ~1,900 characters (+106%)
- Adds explicit detection requirements for common issues
- Ensures consistent use of technical terminology
This demo includes superoptix_lite - a minimal, standalone implementation of the key SuperOptiX components needed for agent optimization:
- BaseComponent - Base class for creating optimizable agents
- Variable management - Interface for GEPA-style optimization
- Framework integration - Compatible with OpenAI Agents SDK
- ❌ Full GEPA optimizer (UniversalGEPA)
- ❌ Multi-framework compilation (DSPy, CrewAI, Google ADK, Microsoft, DeepAgents)
- ❌ Advanced RAG optimization
- ❌ MCP protocol optimization
- ❌ Memory system optimization
- ❌ Orchestra (multi-agent coordination)
- ❌ CLI tools (`super` command)
- ❌ Observability integrations (MLflow, Langfuse, W&B)
For production deployments with full optimization capabilities:
```bash
pip install superoptix[frameworks-openai]
```

Then use the full framework:

```python
from superoptix.core.base_component import BaseComponent
from superoptix.optimizers.universal_gepa import UniversalGEPA
```

See the SuperOptiX documentation for complete features.
- OpenAI Agents SDK - Official SDK documentation
- GEPA - Official GitHub repository
- SuperOptiX - Official website
- SuperOptiX Docs - Documentation for the full-stack agent optimization framework
- Ollama - Local LLM inference
For a complete step-by-step tutorial on building agents with OpenAI SDK and optimizing with GEPA, see the SuperOptiX documentation.
Contributions welcome! Feel free to:
- Report bugs via GitHub Issues
- Suggest new agent examples
- Submit pull requests for improvements
- Share your own agents built with this template
MIT License - see LICENSE file for details.
- OpenAI Agents SDK - Excellent agent framework
- GEPA - State-of-the-art reflective prompt optimization
- SuperOptiX - GEPA optimization and Agent Framework orchestration capabilities
- Ollama - Making local LLM inference accessible
Made with ❤️ using OpenAI Agents SDK, GEPA and SuperOptiX
Give it a ⭐ if you find this helpful!