An OpenEnv benchmark testing the ability of AI agents to act as Site Reliability Engineers (SREs) by diagnosing and filtering raw production failure logs.
-
Updated
Apr 8, 2026 - Python
An OpenEnv benchmark testing the ability of AI agents to act as Site Reliability Engineers (SREs) by diagnosing and filtering raw production failure logs.
An RL environment where an LLM agent learns to curate talking-head video clips for AV LoRA training. No labels exposed, rewards only.
A production-grade OpenEnv environment for benchmarking RL agents on real-world data cleaning and schema engineering tasks.
OpenEnv-compliant RL environment for SQL query debugging. Built for META x PyTorch x SST OpenEnv Hackathon.
An OpenEnv based RL environment that allows agents to learn to clean datasets across 3 levels of difficulties.
Data Cleaning Agent for Cleaning Unorganised Dataset
An OpenEnv-compliant reinforcement learning environment for personalized AI tutoring — simulating real-world EdTech dynamics with psychometric student modeling and multi-objective pedagogical optimization.
Deterministic reinforcement learning environment for simulating open-source issue triage workflows
Fault-injecting OpenEnv training environment for vibe-coded SaaS incidents. 30 scenarios grounded in 2025-26 production failures. Drop-in OpenClaw-RL pool server. Claude Code skill included.
Government Scheme Eligibility Matching - OpenEnv Environment
A production-oriented OpenEnv-style environment for evaluating tool-using agents on customer support ticket triage.
a reinforcement learning agent built with OpenEnv and Stable-Baselines3 that learns to intelligently manage email workflows. The agent handles tasks ranging from spam filtering to drafting meeting invitations and resolving ambiguous client requests.
An OpenEnv environment where AI agents triage satellite intelligence reports, classify threats, and make real-time defense decisions.
Reinforcement Learning system for smart irrigation of Punjab rice farms. Built for the OpenEnv Hackathon.
A real-world RL environment where AI agents learn to maintain and update test suites when code changes. Includes tasks for unit testing, bug detection, and regression auditing with structured reward signals.
OpenEnv code review environment for AI agents.
ShopOps Env is a realistic OpenEnv environment that simulates daily operations of an e-commerce support and operations team. In this environment, an AI agent acts as an operations associate responsible for handling a stream of customer cases such as refund requests, delivery issues, wrong item complaints, and fraud signals. Each episode represent
Add a description, image, and links to the openenv-environment topic page so that developers can more easily learn about it.
To associate your repository with the openenv-environment topic, visit your repo's landing page and select "manage topics."