OCLA — Open Cyber LLM Arena

Local-first benchmarking for evaluating LLMs on cybersecurity tasks, with optional anonymous sharing to a public leaderboard.

Quickstart

Install deps: npm install
Run dev server: npm run dev
Open: http://localhost:3000

Database (optional uploads + leaderboard)

Uploads and the public leaderboard require Postgres.

Copy .env.example → .env.local and set DATABASE_URL (keep .env.local private)
Create tables: npm run db:push

Neon tip: use the pooled connection string for DATABASE_URL. If npm run db:push fails because of connection-pooling (PgBouncer) limitations, also set DATABASE_URL_UNPOOLED to the unpooled connection string.
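
To catch a swapped pair of connection strings before running db:push, a small check along these lines may help. It is only a sketch, not part of the repo. It assumes Neon's naming convention, in which the pooled hostname carries a "-pooler" suffix on its first label. The function names isPooledNeonHost and checkNeonEnv are hypothetical.

```javascript
// Hypothetical sanity check, not part of OCLA.
// Assumption: Neon marks pooled endpoints with a "-pooler" suffix on the
// first hostname label (e.g. ep-example-123456-pooler.<region>.aws.neon.tech).
function isPooledNeonHost(connectionString) {
  const { hostname } = new URL(connectionString);
  return hostname.split(".")[0].endsWith("-pooler");
}

// Returns a list of human-readable warnings; an empty list means the two
// variables look consistent with the tip above.
function checkNeonEnv(env) {
  const warnings = [];
  if (env.DATABASE_URL && !isPooledNeonHost(env.DATABASE_URL)) {
    warnings.push("DATABASE_URL does not look like a pooled Neon URL");
  }
  if (env.DATABASE_URL_UNPOOLED && isPooledNeonHost(env.DATABASE_URL_UNPOOLED)) {
    warnings.push("DATABASE_URL_UNPOOLED looks like a pooled Neon URL");
  }
  return warnings;
}
```

Calling checkNeonEnv(process.env) after loading .env.local returns the warnings as an array, so the caller decides whether to print them or abort.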

Offline runner (integrity-hash included)

The repo includes a downloadable Node.js runner at public/ocla-runner.mjs.

Prompt pack: public/prompt-packs/ocla-safe-v1.json
Runner hash pin (generated): public/ocla-runner.sha256

Example:

node public/ocla-runner.mjs --base-url http://localhost:11434/v1 --model llama3.1 --prompt-pack public/prompt-packs/ocla-safe-v1.json
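
Before running a downloaded copy of the runner, its SHA-256 digest can be compared with the pinned value. The helper below is a minimal sketch, not something the repo ships. It assumes the pin file stores the hex digest as its first whitespace-delimited token (the layout sha256sum produces); the actual format of public/ocla-runner.sha256 may differ. The function names are hypothetical.

```javascript
// Hypothetical verification helper, not part of the repo.
import { createHash } from "node:crypto";
import { readFile } from "node:fs/promises";

// Lowercase hex SHA-256 digest of a Buffer or string.
function sha256Hex(data) {
  return createHash("sha256").update(data).digest("hex");
}

// Assumption: the digest is the first whitespace-delimited token in the pin
// file, optionally followed by a file name.
function parsePin(pinText) {
  return pinText.trim().split(/\s+/)[0].toLowerCase();
}

// Resolves to true when the runner's digest matches the pinned digest.
async function verifyRunner(runnerPath, pinPath) {
  const [runner, pin] = await Promise.all([
    readFile(runnerPath),
    readFile(pinPath, "utf8"),
  ]);
  return sha256Hex(runner) === parsePin(pin);
}
```

For example, verifyRunner("public/ocla-runner.mjs", "public/ocla-runner.sha256") resolves to true when the file matches the pin. A false result suggests the runner was modified or the pin is stale.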

Notes

The in-browser benchmark runner is designed to keep API keys in the browser (never sent to OCLA server routes).
The repo ships with a safe example prompt pack. You can import your own prompt packs for internal testing.
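
The first note above can be illustrated with a sketch of a request that goes straight from the browser to the model provider, so the key never passes through an OCLA route. This is not OCLA's actual code. It assumes an OpenAI-compatible /chat/completions endpoint under the base URL, and the names buildChatRequest and askModel are hypothetical.

```javascript
// Hypothetical sketch, not OCLA's implementation.
// Assumption: the provider exposes an OpenAI-compatible chat completions API.
function buildChatRequest({ baseUrl, model, apiKey, prompt }) {
  // Strip trailing slashes so the path joins cleanly.
  const url = `${baseUrl.replace(/\/+$/, "")}/chat/completions`;
  const headers = { "Content-Type": "application/json" };
  // The key is attached only to the request sent to the provider.
  if (apiKey) headers.Authorization = `Bearer ${apiKey}`;
  return {
    url,
    init: {
      method: "POST",
      headers,
      body: JSON.stringify({
        model,
        messages: [{ role: "user", content: prompt }],
      }),
    },
  };
}

// Sends the request directly to the provider and returns the reply text.
async function askModel(options) {
  const { url, init } = buildChatRequest(options);
  const res = await fetch(url, init);
  if (!res.ok) throw new Error(`Provider returned HTTP ${res.status}`);
  const data = await res.json();
  return data.choices?.[0]?.message?.content ?? "";
}
```

Keeping request construction in a pure function makes it easy to confirm that the only URL receiving the Authorization header is the provider's own.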
