Oxabl

High performance oxidized tooling for Progress ABL, written in Rust.

Unofficial

No affiliation with Progress.

Status

The first library will be oxabl_parser.

Requirements:

oxabl_lexer: MVP has been completed in crates/oxabl_lexer.
- Produces tokens against all known ABL keywords, primitive datatypes, operators, and identifiers.
- Comprehensive test coverage.
- Run against a realistc 390kb syntactically correct ABL file and correctly tokenized it.
- Benchmarks and token dumps in crates/oxabl_lexer/benches and crates/oxabl_lexer/examples using a test file in resources/bench_keywords.abl.
source_map: MVP has been completed in crates/oxabl_common.
- It's able to produce line and column numbers from byte offsets stored in tokens.
- Test coverage
- Used in our token dumps and benchmarks, appears to be accurate.
oxabl_ast: Implemented in crates/oxabl_ast
- Defines literals, statements, expressions, variable definitions, control flow, and data types.
oxabl_parser: Actively developed in crates/oxabl_parser with 147 tests
- Parses expressions with proper operator precedence
- Parses statements: DEFINE VARIABLE, VAR, assignments, DO blocks (with counting), IF/THEN/ELSE, REPEAT, LEAVE, NEXT, RETURN, CASE, FIND, FOR EACH, PROCEDURE
- Parses postfix operations: method calls, member access, array access, field access

Current Work: RUN statement (in progress), DISPLAY and MESSAGE statements next.

Roadmap

Goal: A high performance suite of command line tools and libraries to make ABL development blazingly fast and more effective.

oxabl_parser - foundation for understanding ABL code.
- oxabl_lexer and the oxabl_ast make up the foundation of the foundation.
oxabl_fmt - CLI tool for formatting ABL code.
oxabl_lint - CLI tool for linting ABL code.
oxabl_minify - CLI tool for removing dead code, shortening syntax, and code obfuscation.
oxabl_build - CLI tool for assisting in the compilation of ABL.
oxabl_run - CLI tool for assisting in the running of ABL.
oxabl_test - CLI tool for assisting in the testing of ABL.

Disclaimer: There is no long-term plan to take all of these stand-alone libraries and executables and create a cohesive experience. For now, it will be duct-taping things together. Perhaps an oxabl CLI?

Assisting?: Some of these are stand alone executables or libraries that assist the developer working with ABL, they don't do everything. ABL is closed source, and you cannot compile ABL to byte code without the ABL compiler. That being said, you can make the process faster and more enjoyable. Because you need the AVM and compiler at the end of the day, what Oxabl can accomplish is limited.

Benchmarks

As a high performance oriented library, Oxabl is focused on hitting low numbers and keeping them low across versions.

Benchmarks are run with cargo bench -p <crate>. Each crate has its own benchmark so we can track the performance of individual components in the toolset.

These are not sanitized benchmarks — they were run on real hardware with normal background processes, similar to how a developer would actually use the tools.

Intel i7-8550U Laptop

Hardware: Intel Core i7-8550U (8) @ 4.00 GHz, 15.37 GiB RAM, Linux 6.19.10-arch1-1

Source Map (`oxabl_common`)

Benchmark	Time (min)	Time (avg)	Time (max)	Throughput (avg)
source_map/construction	22.341 µs	22.636 µs	23.002 µs	728.99 MiB/s
source_map/lookup	98.447 ns	98.853 ns	99.361 ns	50.580 Melem/s

Lexer (`oxabl_lexer`)

Benchmark	Time (min)	Time (avg)	Time (max)	Throughput (avg)
lexer/keywords	248.71 µs	249.82 µs	250.95 µs	66.053 MiB/s
lexer/strings	24.602 µs	24.727 µs	24.862 µs	81.301 MiB/s
lexer/comments	13.988 µs	14.026 µs	14.069 µs	175.70 MiB/s
lexer/numeric	23.294 µs	23.396 µs	23.523 µs	71.701 MiB/s
lexer/preprocessor	31.775 µs	31.925 µs	32.079 µs	80.476 MiB/s

Parser (`oxabl_parser`)

Benchmark	Time (min)	Time (avg)	Time (max)	Throughput (avg)
parser/full_program	305.47 µs	306.08 µs	306.75 µs	53.913 MiB/s
parser/expressions	93.108 µs	93.943 µs	94.915 µs	24.506 MiB/s
parser/declarations	54.445 µs	54.707 µs	55.016 µs	46.928 MiB/s
parser/control_flow	82.733 µs	83.156 µs	83.626 µs	28.075 MiB/s
parser/oo_abl	76.691 µs	76.973 µs	77.287 µs	51.281 MiB/s
parser/temp_tables	50.913 µs	51.169 µs	51.424 µs	50.303 MiB/s
parser/procs_funcs	87.470 µs	88.238 µs	88.983 µs	39.352 MiB/s
parser/datasets	46.195 µs	46.292 µs	46.405 µs	54.531 MiB/s

Token Dumps

Full token dump: cargo run -p oxabl_lexer --example dump_tokens

Just errors cargo run -p oxabl_lexer --example dump_tokens -- --errors

Just summary cargo run -p oxabl_lexer --example dump_tokens -- --summary

Optimizations

I don't know anything about these techniques! But I'm excited to learn.

I consider the Lexer "production-grade" with the current benchmarks, it's more than within the realm of being usable for developer tooling, if you ran it on-save in your editor, it would only be ms to tokenize the entire file, which is more-or-less instant in an editor. Still, I love optimizing things, so we're certainly going to aim for better. Why not tokenize the entire codebase on save?? (Jokes)

Here's what's on the roadmap for the lexer:

Perfect Hash Table
- create a "perfect hash table", which could drop our 1600+ keyword comparison (which eats up 93% of our lexing time) to a 1-2 hash lookups + bounds check.
- Priority: High, impacts the process we spend the most amount of time in.
Skip case conversion
- ABL treats upper and lowercase as valid for keywords, so we are converting everything to lowercase, which requires an allocation.
- We could inline a case-insensitive comparison.
- Priority: High, impacts the process we spend the 2nd most amount of time in.
Arena Allocation
- Allocate many small objects into a single buffer and free everything at once instead of individual deallocations.
- Tokens are short lived, so this eliminates an allocation and deallocation for every single token, and improves caching.
- Instead of pushing a new token, allocate all tokens into an Arena, then drop the whole thing after parsing.
- Priority: Medium, not as complex as some optimizations while still offering decent returns.
SIMD scanning
- Process more bytes at once using CPU vector instructions instead of looping byte-by-byte.
- Priority: Low, could be another significant speed-up, but after implementation, lexer will be harder to maintain, so leave it for now.
Branchless state machines
- Replace if/match with lookup tables
- Build a table of transitions[state][byte] -> next_state and index directly, `current_state = table[current_state][byte]
- Priority: None, might not be worth our effort.

Contributing

CI

Every push and PR to master runs the following checks (all must pass):

cargo check — compilation
cargo test — full test suite
cargo fmt --check — formatting
cargo clippy -D warnings — lints

Commit Messages

Use Conventional Commits format. This drives the automated release process:

feat: add X — new feature (bumps minor version)
fix: correct Y — bug fix (bumps patch version)
feat!: breaking change — breaking change (bumps major, or minor while pre-1.0)
chore:, docs:, refactor:, test: — won't trigger a release, but appear in the changelog

Releases

Releases are fully automated via Release Please:

Merge PRs with conventional commit messages into master
Release Please accumulates commits and maintains an open release PR with a generated changelog and version bumps across all Cargo.toml files
When the release PR is merged, a GitHub Release and git tag are created automatically

No manual version bumping is needed.

CodeGen

We generate code for all the keywords and operators to use within the project. Use these commands to generate the code:

cargo run -p oxabl_codegen -- <command>

Valid commands are:

kind
- generates the kind.rs file for the lexer.
atoms
- generates the build.rs file for the lexer.
summary
- outputs status and usage
No command
- generates all files

Commands write generated files directly to their target locations. Generated files include a "DO NOT EDIT" header. No manual file redirection is needed.

Name		Name	Last commit message	Last commit date
Latest commit History 169 Commits
.claude		.claude
.github/workflows		.github/workflows
crates		crates
docs		docs
resources		resources
.gitattributes		.gitattributes
.gitignore		.gitignore
.release-please-manifest.json		.release-please-manifest.json
CLAUDE.md		CLAUDE.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
README.md		README.md
atoms_snippet.rs		atoms_snippet.rs
release-please-config.json		release-please-config.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Oxabl

Unofficial

Status

Roadmap

Benchmarks

Intel i7-8550U Laptop

Source Map (`oxabl_common`)

Lexer (`oxabl_lexer`)

Parser (`oxabl_parser`)

Token Dumps

Optimizations

Contributing

CI

Commit Messages

Releases

CodeGen

About

Uh oh!

Releases 18

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Oxabl

Unofficial

Status

Roadmap

Benchmarks

Intel i7-8550U Laptop

Source Map (oxabl_common)

Lexer (oxabl_lexer)

Parser (oxabl_parser)

Token Dumps

Optimizations

Contributing

CI

Commit Messages

Releases

CodeGen

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 18

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Source Map (`oxabl_common`)

Lexer (`oxabl_lexer`)

Parser (`oxabl_parser`)

Packages