Render videos eval mode #274
base: 3.0
Conversation
docs/src/visualizer.md (Outdated)
```markdown
Adjust the screen size and color depth as needed. The `xvfb-run` wrapper allows Raylib to render without an attached display, which is convenient for servers and CI jobs.

## Evaluation Rendering
You can batch render videos for multiple maps using the evaluation mode. This renders the first `num_maps` maps (capped at the number of maps in the directory) from `map_dir` in parallel using the `visualize` binary.
```
can we randomize over this? It seems bad to always use the exact same maps
Oh, so this PR intends to render a batch of videos (whose scenarios the user already knows) for a given policy, so I figured it would be best to be predictable rather than pick a random set of videos. Maybe you are confusing this PR with the async render mode PR (#269).
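For reference, if randomized selection were ever preferred, a minimal sketch (reusing the `map_dir` and `num_maps` values the eval branch already reads; the `seed` argument is an assumption, not an existing config key) could look like this:

```python
import os
import random

def select_render_maps(map_dir, num_maps, seed=None):
    """Pick num_maps .bin files from map_dir, optionally in a reproducible random order."""
    candidates = sorted(f for f in os.listdir(map_dir) if f.endswith(".bin"))
    if seed is not None:
        random.Random(seed).shuffle(candidates)  # deterministic shuffle for a given seed
    return [os.path.join(map_dir, f) for f in candidates[:num_maps]]
```

Without a seed the selection stays in sorted order, matching the current deterministic behavior.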
```ini
; "both", "topdown", "agent"; Other args are passed from train confs
render_view_mode = "both"
; Policy bin file used for rendering videos
render_policy_path = "resources/drive/puffer_drive_weights.bin"
```
this seems dangerous? Presumably we want to render the policies we are actually training
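One way to address this concern, sketched only (the `checkpoints` directory and `.bin` export naming are assumptions, not the repo's actual layout), would be to prefer the most recently exported policy and fall back to the configured default:

```python
import glob
import os

def resolve_render_policy(configured_path, checkpoint_dir="checkpoints"):
    """Prefer the newest exported policy .bin over the hardcoded default path."""
    exported = glob.glob(os.path.join(checkpoint_dir, "*.bin"))  # hypothetical export location
    if exported:
        return max(exported, key=os.path.getmtime)  # most recently written checkpoint
    return configured_path  # fall back to render_policy_path from drive.ini
```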
```c
void init_goal_positions(Drive *env);
float clipSpeed(float speed);
void sample_new_goal(Drive *env, int agent_idx);
int check_lane_aligned(Entity *car, Entity *lane, int geometry_idx);
```
this seems like it should not be here?
Oh, I added it because I was getting a compile error on my Mac due to the missing forward declaration. I can make it a separate PR, but it's just a one-liner.
Greptile Overview

Greptile Summary

This PR implements a parallel video rendering evaluation mode using ThreadPool to batch render videos for multiple maps in the drive environment.

Key Changes:
Issues Found:
Confidence Score: 4/5
Important Files Changed
Sequence Diagram

```mermaid
sequenceDiagram
participant User
participant eval() as eval()
participant Config as Config (drive.ini)
participant ThreadPool
participant Worker as Worker Thread
participant Binary as visualize binary
participant FS as File System
User->>eval(): puffer eval puffer_drive
eval()->>Config: Load render_videos_eval settings
Config-->>eval(): num_maps, map_dir, view_mode, num_workers
eval()->>FS: os.listdir(map_dir)
FS-->>eval(): List of .bin files
eval()->>eval(): Filter and sort .bin files
eval()->>FS: os.makedirs(output_dir)
eval()->>eval(): ensure_drive_binary()
eval()->>ThreadPool: Create ThreadPool(num_workers)
loop For each map in render_maps
ThreadPool->>Worker: Assign render_task(map_path)
Worker->>Worker: Build command with args
Worker->>Binary: subprocess.run(["xvfb-run", "./visualize", ...])
Binary->>FS: Write topdown_{map_name}.mp4
Binary->>FS: Write agent_{map_name}.mp4
Binary-->>Worker: Return code
alt returncode != 0
Worker->>User: Print error message
end
end
ThreadPool-->>eval(): All tasks complete
eval()->>User: Print completion message
```
4 files reviewed, 2 comments
Pull request overview
This PR adds an eval-mode path to batch-render PufferDrive videos in parallel using the `visualize` binary, plus a small C header update and configuration/docs wiring. It enables `puffer eval puffer_drive` to render multiple maps concurrently based on config values.
Changes:
- Introduced `render_videos_eval` mode in `pufferlib.pufferl.eval`, using a `ThreadPool` to invoke the `visualize` binary across multiple `.bin` maps and write MP4s to `resources/drive/render_videos`.
- Updated the `drive.ini` eval configuration to include switches and options for video rendering (toggle, view mode, and policy path).
- Added a forward declaration for `check_lane_aligned` in `drive.h` and extended `docs/src/visualizer.md` to describe the new evaluation rendering workflow.
Reviewed changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| `pufferlib/pufferl.py` | Adds a new `elif render_videos_enabled` branch to `eval` that collects map binaries, ensures the `visualize` binary is built, and runs parallel rendering jobs via `ThreadPool`. |
| `pufferlib/ocean/drive/drive.h` | Adds a forward declaration for `check_lane_aligned` to match its implementation and usage elsewhere in the file. |
| `pufferlib/config/ocean/drive.ini` | Wires new eval config flags (`render_videos_eval`, `render_view_mode`, `render_policy_path`) used by the eval rendering mode. |
| `docs/src/visualizer.md` | Documents the new eval-based batch rendering flow, including how it uses `eval.num_maps`, `render_videos_eval`, and `vec.num_workers`. |
```python
elif render_videos_enabled:
    # Renders first num_maps from map_dir using visualize binary
    map_dir = args["eval"]["map_dir"]
    num_maps = args["eval"]["num_maps"]
    view_mode = args["eval"].get("render_view_mode", "both")
    render_policy_path = args["eval"].get("render_policy_path", None)
    num_workers = args["vec"]["num_workers"]

    if num_maps > len(os.listdir(map_dir)):
        num_maps = len(os.listdir(map_dir))

    render_maps = [os.path.join(map_dir, f) for f in sorted(os.listdir(map_dir)) if f.endswith(".bin")][:num_maps]
    output_dir = "resources/drive/render_videos"
    os.makedirs(output_dir, exist_ok=True)

    # Rebuild visualize binary
    ensure_drive_binary()

    def render_task(map_path):
        base_cmd = (
            ["./visualize"]
            if sys.platform == "darwin"
            else ["xvfb-run", "-a", "-s", "-screen 0 1280x720x24", "./visualize"]
        )
        cmd = base_cmd.copy()
        cmd.extend(["--map-name", map_path])
        train_configs = args["train"]
        if train_configs.get("show_grid", False):
            cmd.append("--show-grid")
        if train_configs.get("obs_only", False):
            cmd.append("--obs-only")
        if train_configs.get("show_lasers", False):
            cmd.append("--lasers")
        if train_configs.get("show_human_logs", False):
            cmd.append("--show-human-logs")
        if train_configs.get("zoom_in", False):
            cmd.append("--zoom-in")
        if train_configs.get("frame_skip", 1) > 1:
            cmd.extend(["--frame-skip", str(train_configs["frame_skip"])])
        cmd.extend(["--view", view_mode])
        if render_policy_path is not None:
            cmd.extend(["--policy-name", render_policy_path])

        map_name = os.path.basename(map_path).replace(".bin", "")

        if view_mode == "topdown" or view_mode == "both":
            cmd.extend(["--output-topdown", output_dir + f"/topdown_{map_name}.mp4"])
        if view_mode == "agent" or view_mode == "both":
            cmd.extend(["--output-agent", output_dir + f"/agent_{map_name}.mp4"])

        env_vars = os.environ.copy()
        # exitcode=0 keeps AddressSanitizer findings from turning a finished render into a nonzero exit
        env_vars["ASAN_OPTIONS"] = "exitcode=0"
        result = subprocess.run(cmd, cwd=os.getcwd(), capture_output=True, text=True, timeout=600, env=env_vars)
        if result.returncode != 0:
            print(f"Error rendering {map_name}: {result.stderr}")

    if render_maps:
        print(f"Rendering {len(render_maps)} from {map_dir} with {num_workers} workers...")
        with ThreadPool(num_workers) as pool:
            pool.map(render_task, render_maps)
        print(f"Finished rendering videos to {output_dir}")
```
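One caveat with the branch above, not raised in the review: `subprocess.run(..., timeout=600)` raises `TimeoutExpired` rather than returning a result, and an exception inside a worker propagates out of `pool.map`, aborting the remaining renders. A minimal defensive sketch, assuming the same `cmd`, `map_name`, and `env_vars` built in `render_task`:

```python
import subprocess

def run_render(cmd, map_name, env_vars, timeout=600):
    """Run one visualize invocation and report failures without killing the pool."""
    try:
        result = subprocess.run(
            cmd, capture_output=True, text=True, timeout=timeout, env=env_vars
        )
    except subprocess.TimeoutExpired:
        print(f"Timed out rendering {map_name} after {timeout}s")
        return False
    if result.returncode != 0:
        print(f"Error rendering {map_name}: {result.stderr}")
        return False
    return True
```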
Copilot AI · Feb 1, 2026
This new render_videos_eval branch introduces non-trivial behavior (building the visualize binary, constructing per-map subprocess commands, and dispatching them via a ThreadPool), but there are no tests covering this path; if it regresses, failures may only surface at runtime. Consider adding an automated test (potentially modeled after tests/test_drive_render.py) that exercises this eval mode end-to-end for a small number of maps to validate command construction and successful completion.
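A sketch of what such a test might check, assuming the argv construction were factored out of `render_task` into a pure helper (`build_render_cmd` below is hypothetical and does not exist in this PR):

```python
def test_build_render_cmd_topdown(tmp_path):
    # build_render_cmd is a hypothetical refactor of render_task's argv construction
    cmd = build_render_cmd(
        map_path="maps/example.bin",
        view_mode="topdown",
        output_dir=str(tmp_path),
        train_configs={"frame_skip": 2, "show_grid": True},
    )
    assert "--map-name" in cmd and "maps/example.bin" in cmd
    assert cmd[cmd.index("--view") + 1] == "topdown"
    assert cmd[cmd.index("--frame-skip") + 1] == "2"
    assert any(arg.endswith("topdown_example.mp4") for arg in cmd)
    assert "--output-agent" not in cmd  # agent view was not requested
```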
Implemented parallel video rendering eval mode using ThreadPool. Renders videos for the first num_maps (eval) maps in map_dir (eval) and saves them in resources/drive/render_videos.