[codex] Add reproducible minimal PPO WSL workflow by HC-Seaple · Pull Request #484 · Emerge-Lab/PufferDrive

HC-Seaple · 2026-06-14T02:55:40Z

What changed

Add a self-contained continuous-action PPO trainer with vectorized rollouts, GAE, clipped updates, checkpointing, and deterministic evaluation.
Add Windows/WSL setup and launch scripts for the Linux-native Raylib build.
Add generic WOMD JSON-to-map preparation without committing datasets or generated binaries.
Add native third-person checkpoint visualization and JSON metrics.
Ensure complete renderer frames are written to ffmpeg.
Document clone, setup, map preparation, training, visualization, and handoff.
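For readers unfamiliar with the GAE and clipped-update steps named in the first item, here is a minimal pure-Python sketch of both. It is not the trainer added in this PR: the function names, the `gamma`, `lam`, and `clip_eps` defaults, and the list-based rollout layout are all assumptions chosen for illustration.

```python
import math


def compute_gae(rewards, values, dones, last_value, gamma=0.99, lam=0.95):
    """Generalized Advantage Estimation over one rollout of length T.

    rewards, values, dones are per-step lists; last_value is the value
    estimate for the state after the final step (used for bootstrapping).
    Names and defaults are illustrative, not taken from the PR.
    """
    steps = len(rewards)
    advantages = [0.0] * steps
    gae = 0.0
    next_value = last_value
    for t in reversed(range(steps)):
        # A terminal step cuts off both the bootstrap and the GAE recursion.
        not_done = 0.0 if dones[t] else 1.0
        delta = rewards[t] + gamma * next_value * not_done - values[t]
        gae = delta + gamma * lam * not_done * gae
        advantages[t] = gae
        next_value = values[t]
    returns = [a + v for a, v in zip(advantages, values)]
    return advantages, returns


def clipped_surrogate(new_logp, old_logp, advantage, clip_eps=0.2):
    """PPO clipped objective for a single sample (to be maximized)."""
    ratio = math.exp(new_logp - old_logp)
    clipped_ratio = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
    # Taking the minimum removes any incentive to push the ratio outside
    # the clip range in the direction that would improve the objective.
    return min(ratio * advantage, clipped_ratio * advantage)
```

A real trainer would batch these operations over vectorized environments with a tensor library; the scalar version only makes the recursion and the clipping rule explicit.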

Validation

Python scripts pass python -m py_compile.
WSL launchers pass bash -n.
The staged change set passes git diff --check.
The end-to-end workflow was previously exercised in WSL with a 10,112-step checkpoint and a 92-frame native render.
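The first check above can also be driven from Python through the standard-library `py_compile` module, which is what `python -m py_compile` uses. The sketch below is only an illustration: `find_compile_errors` is a hypothetical helper, not a script from this PR, and the paths passed to it are up to the caller.

```python
import py_compile


def find_compile_errors(paths):
    """Return {path: message} for every file that fails to byte-compile.

    Hypothetical helper: mirrors what `python -m py_compile <files>` checks,
    but collects all failures instead of stopping at the first one.
    """
    errors = {}
    for path in paths:
        try:
            # doraise=True turns syntax errors into PyCompileError
            # instead of printing them to stderr.
            py_compile.compile(path, doraise=True)
        except py_compile.PyCompileError as exc:
            errors[path] = exc.msg
    return errors
```

An empty result means every listed file compiled. As with the command-line form, this catches syntax errors only, not import or runtime failures.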

Current limitation

This is a smoke-test training architecture. Reward shaping still needs route-progress reward, reverse-motion penalties, and stronger collision/off-road costs before scaling.
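To make the missing reward terms concrete, here is one way they could be combined. This is a sketch under stated assumptions, not the reward used or planned in this PR: the function name, the input signals, and all four weights are hypothetical placeholders that have not been tuned.

```python
def shaped_reward(progress_m, forward_speed_mps, collided, off_road,
                  w_progress=1.0, w_reverse=0.5,
                  w_collision=10.0, w_off_road=5.0):
    """Combine the reward terms listed above into one per-step scalar.

    progress_m:        distance advanced along the route this step (assumed input)
    forward_speed_mps: signed speed along the heading; negative means reversing
    collided/off_road: boolean event flags for this step
    All weights are illustrative placeholders, not values from the PR.
    """
    reward = w_progress * progress_m
    if forward_speed_mps < 0.0:
        # Penalize reverse motion in proportion to how fast the agent backs up.
        reward -= w_reverse * (-forward_speed_mps)
    if collided:
        reward -= w_collision
    if off_road:
        reward -= w_off_road
    return reward
```

Keeping each term behind its own weight makes it straightforward to ablate one cost at a time before scaling training.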

eugenevinitsky · 2026-06-15T15:57:57Z

Hi! Thanks for the PR. We can review and discuss this once it's off draft status, but one quick thing to mention: this probably needs a slightly different folder structure, since otherwise it pollutes the script folder with a lot of unstructured code.

nyx7ck added 2 commits June 13, 2026 22:27

Add reproducible minimal PPO WSL workflow

96b306a

Document workflow and add scenario visualization

111adb4

HC-Seaple wants to merge 2 commits into Emerge-Lab:2.0 from HC-Seaple:codex/minimal-ppo-wsl

3 participants
