Implement comprehensive real-time agent reference design for Azure OpenAI and AI Foundry #1

Copilot · 2025-09-10T09:20:10Z

This PR implements a complete reference design for building real-time AI agents using Azure OpenAI or AI Foundry services, addressing the requirements outlined in the repository's problem statement.

🎯 Overview

The implementation provides a production-ready Python framework for developing real-time conversational AI agents with streaming response capabilities, comprehensive event handling, and support for multiple AI providers.

🏗️ Architecture & Design

Core Components

RealTimeAgent: Main orchestrator handling conversation flow and lifecycle management
Provider Abstraction: Pluggable architecture supporting Azure OpenAI and AI Foundry
Event-Driven System: Comprehensive event handling for messages, connections, and errors
Configuration Management: Environment-based configuration with validation

Key Features

Streaming Responses: Real-time token-by-token response generation for immediate user feedback
Multi-Provider Support: Seamless switching between Azure OpenAI and AI Foundry
Type Safety: Full type annotations throughout the codebase
Production Ready: Robust error handling, logging, retry logic, and connection management
Conversation Management: History tracking, export capabilities, and context maintenance

📝 Implementation Details

Project Structure

src/real_time_agent/
├── core/                    # Core agent implementation
│   ├── agent.py            # Main RealTimeAgent class
│   ├── config.py           # Configuration management
│   └── interfaces.py       # Abstract base classes and protocols
├── integrations/           # Provider implementations
│   ├── azure_openai.py     # Azure OpenAI integration
│   └── ai_foundry.py       # AI Foundry integration
├── utils/                  # Utility functions
│   └── helpers.py          # Export, validation, and helper utilities
└── examples/               # Usage examples

Provider Integration

The system uses a clean provider abstraction pattern:

# Azure OpenAI usage
config = AgentConfig(
    provider="azure_openai",
    azure_openai_endpoint="https://your-resource.openai.azure.com/",
    azure_openai_deployment="gpt-4"
)
provider = AzureOpenAIProvider(config)
agent = RealTimeAgent(config, provider)

# AI Foundry usage - same interface
config = AgentConfig(
    provider="ai_foundry",
    ai_foundry_endpoint="https://your-foundry-endpoint.com/"
)
provider = AIFoundryProvider(config)
agent = RealTimeAgent(config, provider)

Streaming Response Handling

A critical fix was implemented for conversation history tracking in streaming mode. The original implementation had an issue where conversation history wasn't being updated when async generators were consumed with early breaks:

# Fixed: History update happens BEFORE yielding the final chunk
if chunk.is_final:
    final_message = Message(...)
    self.conversation_history.append(final_message)  # ← Moved before yield
    
yield chunk  # Consumer may break here

if chunk.is_final:
    break

This ensures conversation context is preserved even when consumers don't fully exhaust the generator.

📚 Documentation & Examples

Comprehensive Documentation

Architecture Guide: Detailed explanation of real-time agent concepts and design patterns
Setup Instructions: Complete installation and configuration guide with troubleshooting
API Documentation: Full documentation of classes, methods, and usage patterns

Working Examples

Basic Usage: Simple examples for both Azure OpenAI and AI Foundry
Advanced Streaming: Demonstrates real-time response handling and event processing
Complete Workflow: Full-featured demo with mock provider for testing without API keys

Configuration Management

Environment-based configuration with validation:

# .env configuration
AZURE_OPENAI_ENDPOINT=https://your-resource.openai.azure.com/
AZURE_OPENAI_API_KEY=your-api-key
AZURE_OPENAI_DEPLOYMENT=gpt-4
REAL_TIME_AGENT_STREAMING=true
REAL_TIME_AGENT_MAX_TOKENS=1000

🧪 Testing & Quality

Type Safety: Full mypy compatibility with comprehensive type annotations
Error Handling: Robust exception handling with retry logic and graceful degradation
Validation: Input validation and configuration verification
Testing: Basic test suite validating core functionality

🚀 Production Readiness

The implementation includes production-grade features:

Authentication: Support for API keys and Azure Managed Identity
Monitoring: Comprehensive logging with configurable levels
Scalability: Stateless design enabling horizontal scaling
Security: Secure credential handling and input validation
Performance: Async/await throughout for optimal performance

📈 Usage Example

import asyncio
from real_time_agent import RealTimeAgent, AgentConfig, AzureOpenAIProvider

async def main():
    # Configure agent
    config = AgentConfig(
        provider="azure_openai",
        streaming=True,
        max_tokens=500
    )
    
    # Initialize provider and agent
    provider = AzureOpenAIProvider(config)
    agent = RealTimeAgent(config, provider)
    await agent.initialize()
    
    # Send message with streaming response
    async for response in await agent.send_message("Hello!"):
        print(response.message.content, end='', flush=True)
        if response.is_final:
            break
    
    await agent.close()

asyncio.run(main())

This implementation transforms the repository from a basic placeholder into a comprehensive, production-ready reference implementation that developers can use as a foundation for building real-time AI applications.

💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.

…nality Co-authored-by: marvinbuss <34542414+marvinbuss@users.noreply.github.com>

… implementation Co-authored-by: marvinbuss <34542414+marvinbuss@users.noreply.github.com>

Initial plan

28e9eaa

Copilot AI assigned Copilot and marvinbuss Sep 10, 2025

Copilot started work on behalf of marvinbuss September 10, 2025 09:20 View session

Copilot AI and others added 2 commits September 10, 2025 09:40

Implement complete real-time agent reference design with core functio…

194afd8

…nality Co-authored-by: marvinbuss <34542414+marvinbuss@users.noreply.github.com>

Fix conversation history tracking in streaming responses and complete…

2a37e51

… implementation Co-authored-by: marvinbuss <34542414+marvinbuss@users.noreply.github.com>

Copilot AI requested a review from marvinbuss September 10, 2025 09:48

Copilot finished work on behalf of marvinbuss September 10, 2025 09:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement comprehensive real-time agent reference design for Azure OpenAI and AI Foundry #1

Implement comprehensive real-time agent reference design for Azure OpenAI and AI Foundry #1

Uh oh!

Copilot AI commented Sep 10, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Implement comprehensive real-time agent reference design for Azure OpenAI and AI Foundry #1

Are you sure you want to change the base?

Implement comprehensive real-time agent reference design for Azure OpenAI and AI Foundry #1

Uh oh!

Conversation

Copilot AI commented Sep 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🎯 Overview

🏗️ Architecture & Design

Core Components

Key Features

📝 Implementation Details

Project Structure

Provider Integration

Streaming Response Handling

📚 Documentation & Examples

Comprehensive Documentation

Working Examples

Configuration Management

🧪 Testing & Quality

🚀 Production Readiness

📈 Usage Example

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Sep 10, 2025 •

edited

Loading