baconhead


a roblox bot that watches you play, learns game states from your gameplay, and takes over when you go idle.

uses a vision model to understand what's on screen and plan actions, and trains a local ViT model (GameSense) to detect death screens, danger zones, and menus. no hardcoded game knowledge, so it works on any roblox game.


demo

baconhead demo


how it works

  1. you play roblox normally
  2. after N seconds idle, the bot takes over
  3. claude looks at your screen and decides what to do
  4. GameSense (trained from your gameplay) detects deaths, danger, menus
  5. you press any key and you're back in control
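the idle hand-off above can be sketched as a tiny state machine; the class and method names here are illustrative, not the repo's actual API:

```python
import time

class TakeoverStateMachine:
    """tracks who is in control: the player or the bot (illustrative sketch)."""

    def __init__(self, idle_threshold=3.0):
        self.idle_threshold = idle_threshold  # seconds of silence before takeover (--idle)
        self.last_input = time.monotonic()
        self.bot_active = False

    def on_player_input(self, now=None):
        # any keypress or mouse event hands control straight back to the player
        self.last_input = now if now is not None else time.monotonic()
        self.bot_active = False

    def tick(self, now=None):
        # called every frame; flips to bot control once the idle window elapses
        now = now if now is not None else time.monotonic()
        if not self.bot_active and now - self.last_input >= self.idle_threshold:
            self.bot_active = True
        return self.bot_active

sm = TakeoverStateMachine(idle_threshold=3.0)
sm.on_player_input(now=0.0)
print(sm.tick(now=1.0))   # → False (still the player's turn)
print(sm.tick(now=3.5))   # → True  (idle long enough: bot takes over)
sm.on_player_input(now=3.6)
print(sm.tick(now=3.7))   # → False (one keypress and you're back)
```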

setup

git clone https://github.com/simonshimengyang/baconhead.git
cd baconhead
python -m venv .venv && source .venv/bin/activate
pip install -r requirements.txt
cp .env.example .env
# add your ANTHROPIC_API_KEY to .env

macos only for now. needs accessibility permission for keyboard/mouse control: System Settings → Privacy & Security → Accessibility → add your terminal app.


train GameSense (classification)

collect data by playing roblox (auto-labels frames from your gameplay):

python -m vision.collect --seconds 120 --out game_data
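auto-labeling means tagging frames by what the screen looks like, not by game-specific hooks. a toy sketch of the idea, where the brightness heuristic and thresholds are invented for illustration and are not what `vision.collect` actually does:

```python
def auto_label(frame):
    """frame: 2D list of grayscale pixel values in [0, 255].

    toy heuristic: death screens are mostly dark overlays, menus are
    mostly bright UI panels, anything in between counts as 'playing'.
    (thresholds are illustrative only.)
    """
    pixels = [p for row in frame for p in row]
    mean = sum(pixels) / len(pixels)
    if mean < 40:
        return "dead"
    if mean > 200:
        return "menu"
    return "playing"

dark = [[10] * 8 for _ in range(8)]
bright = [[230] * 8 for _ in range(8)]
mid = [[120] * 8 for _ in range(8)]
print(auto_label(dark), auto_label(bright), auto_label(mid))
# → dead menu playing
```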

train the model:

python -m vision.train --data game_data --out game_sense.pt --epochs 10

prints per-class precision/recall when done. more data = better model.
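per-class precision/recall can be computed from parallel label lists like this; a generic sketch, not the repo's actual evaluation code:

```python
from collections import Counter

CLASSES = ["playing", "danger", "menu", "dead"]

def per_class_metrics(y_true, y_pred):
    """return {class: (precision, recall)} from parallel label lists."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1          # correct prediction for class t
        else:
            fp[p] += 1          # predicted p, was actually t
            fn[t] += 1          # missed a true t
    out = {}
    for c in CLASSES:
        prec = tp[c] / (tp[c] + fp[c]) if tp[c] + fp[c] else 0.0
        rec = tp[c] / (tp[c] + fn[c]) if tp[c] + fn[c] else 0.0
        out[c] = (prec, rec)
    return out

m = per_class_metrics(
    ["playing", "playing", "dead", "menu"],
    ["playing", "dead", "dead", "menu"],
)
# "dead" was predicted twice but correct once → precision 0.5;
# every true "dead" frame was found → recall 1.0
```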


DPO fine-tuning (bake reward into weights)

after collecting gameplay data, run DPO to fold the reward signal directly into the model weights — no separate reward model needed at inference time.

the preference signal comes from observed gameplay outcomes: playing > danger > menu > dead (chosen = frames from better states, rejected = frames from worse states).
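that ranking turns frames into preference pairs mechanically. a minimal sketch, assuming frames arrive as `(frame_id, state)` tuples (the real dataset format may differ):

```python
# higher score = better outcome, per the ordering playing > danger > menu > dead
STATE_RANK = {"playing": 3, "danger": 2, "menu": 1, "dead": 0}

def preference_pairs(frames):
    """frames: list of (frame_id, state). returns (chosen, rejected) id pairs
    for every frame whose state strictly outranks another frame's state."""
    pairs = []
    for a_id, a_state in frames:
        for b_id, b_state in frames:
            if STATE_RANK[a_state] > STATE_RANK[b_state]:
                pairs.append((a_id, b_id))
    return pairs

pairs = preference_pairs([("f1", "playing"), ("f2", "dead"), ("f3", "menu")])
# → [("f1", "f2"), ("f1", "f3"), ("f3", "f2")]
```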

# fine-tune existing game_sense.pt in-place
python run_dpo.py --data episode_data --model game_sense.pt --epochs 3

# save to a separate file instead
python run_dpo.py --data episode_data --model game_sense.pt \
    --out game_sense_dpo.pt --epochs 5 --beta 0.1

# smoke-test without saving
python run_dpo.py --data episode_data --model game_sense.pt --dry-run

the resulting checkpoint embeds a frozen reference model alongside the policy, so future DPO rounds stay anchored to the original distribution (--beta controls how far the policy can drift).
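for reference, the standard DPO objective on a single preference pair, showing how `--beta` scales the pull away from the frozen reference model; a generic sketch, not the repo's training loop:

```python
import math

def dpo_loss(policy_chosen, policy_rejected, ref_chosen, ref_rejected, beta=0.1):
    """standard DPO loss on log-probabilities of one (chosen, rejected) pair:
    -log sigmoid(beta * ((pi_c - ref_c) - (pi_r - ref_r))).

    beta scales how strongly the policy is pushed relative to the frozen
    reference model: small beta keeps it anchored, large beta lets it drift.
    """
    margin = (policy_chosen - ref_chosen) - (policy_rejected - ref_rejected)
    return -math.log(1.0 / (1.0 + math.exp(-beta * margin)))

# the more the policy prefers the chosen frame relative to the reference,
# the smaller the loss:
small_gap = dpo_loss(-1.0, -1.2, -1.0, -1.0)
big_gap = dpo_loss(-0.5, -2.0, -1.0, -1.0)
assert big_gap < small_gap
```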

use the DPO-tuned model with the bot exactly as before:

python run_takeover.py --model game_sense.pt

run

# basic — takes over after 3s idle
python run_takeover.py

# custom idle time + trained model
python run_takeover.py --idle 10 --model game_sense.pt

# monitor decisions to a log file
python run_takeover.py --idle 7 --model game_sense.pt --monitor bot_log.tsv

# no claude (random actions, for testing)
python run_takeover.py --no-scout

# capture only (no bot, just screen grab)
python run_capture.py --report --seconds 30
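the `--monitor` log is tab-separated, so it's easy to post-process. a sketch assuming hypothetical `timestamp`/`state`/`action` columns, since the real format may differ:

```python
import csv
import io

# assumed column layout for illustration; check your bot_log.tsv header
SAMPLE = "timestamp\tstate\taction\n1.0\tplaying\tjump\n2.0\tdanger\tretreat\n"

def read_monitor_log(text):
    """parse a tab-separated decision log into a list of row dicts."""
    return list(csv.DictReader(io.StringIO(text), delimiter="\t"))

rows = read_monitor_log(SAMPLE)
states = [r["state"] for r in rows]
# → ["playing", "danger"]
```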

for the obby-beating implementation, use the ObbyBeater branch:

git checkout ObbyBeater

contribute

  1. fork this repo
  2. create a branch (git checkout -b my-feature)
  3. make your changes
  4. open a PR

todo

  • collect more training data across different roblox games
  • self-improving loop (bot labels its own gameplay data while running)
  • add temporal context to GameSense (sequence of frames, not just one)
  • obstacle avoidance from learned danger predictions
  • multi-platform support (windows, linux — currently macos only)
  • web dashboard for monitoring bot decisions live
  • support custom action spaces beyond WASD
  • fine-tune claude prompts based on per-game performance metrics
  • replay buffer for offline RL training
