Add Rust tool-calling bindings for llama.cpp#977

Open
MegalithOfficial wants to merge 8 commits into utilityai:main from MegalithOfficial:main

Conversation

@MegalithOfficial MegalithOfficial commented Mar 30, 2026

This PR addresses #864 by filling in the missing Rust-side tool-calling pieces that llama.cpp already supports.

It adds:

  • typed tool definitions
  • chat template application with tools
  • typed OpenAI-compatible options and tool choice handling
  • typed parsed responses and streaming deltas
  • JSON schema to grammar conversion helper
  • tests and example updates

The goal here was to make tool calling usable from Rust without forcing everything through raw JSON strings, while still keeping the raw JSON APIs available for users who prefer them.
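As a rough illustration of the shape involved (this is not the PR's actual API; the `Tool` struct and its method are invented for the sketch), a typed tool definition ultimately renders down to the OpenAI-compatible tool JSON that llama.cpp's chat templates expect:

```rust
// Illustrative only: a typed tool definition and the OpenAI-compatible
// JSON it would serialize to. Plain string formatting keeps the sketch
// free of serde; none of these names are the crate's real API.
struct Tool {
    name: &'static str,
    description: &'static str,
    parameters_schema: &'static str, // raw JSON Schema for the arguments
}

impl Tool {
    fn to_json(&self) -> String {
        format!(
            r#"{{"type":"function","function":{{"name":"{}","description":"{}","parameters":{}}}}}"#,
            self.name, self.description, self.parameters_schema
        )
    }
}

fn main() {
    let tool = Tool {
        name: "get_weather",
        description: "Get the current weather for a city",
        parameters_schema:
            r#"{"type":"object","properties":{"city":{"type":"string"}},"required":["city"]}"#,
    };
    let json = tool.to_json();
    assert!(json.contains(r#""name":"get_weather""#));
    println!("{json}");
}
```

The point of the typed layer is exactly this: users write the struct, not the nested JSON by hand, while the raw-string path stays available underneath.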

For verification, I ran cargo test -p llama-cpp-2 --lib for the library changes and used cargo check on the updated examples to make sure the typed APIs and example flows still compiled cleanly. I also exercised the reasoning and tool-calling path directly with:

cargo run --release --example tools_reasoning -- --continous hf-model unsloth/Qwen3.5-4B-GGUF Qwen3.5-4B-Q4_K_M.gguf

I also added a --continous flag to the reasoning example so it can take the model’s initial tool call, inject a mock tool result back into the conversation, and then generate the follow-up assistant response in the same run.

@MegalithOfficial MegalithOfficial marked this pull request as draft March 30, 2026 13:17
@MegalithOfficial MegalithOfficial marked this pull request as ready for review March 30, 2026 13:20
Comment on lines +18 to +19
serde = { version = "1.0", features = ["derive"] }
serde_json = "1.0"
Contributor

these are pretty big deps (compile time wise) to drag in by default, put this behind a feature flag.
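One way to address this (a sketch, assuming the feature is simply named `serde`; the PR may have chosen a different gate) is Cargo's optional-dependency syntax, so neither crate compiles unless the user opts in:

```toml
# Hypothetical Cargo.toml fragment: gate the typed helpers behind a
# `serde` feature so the deps stay out of default builds.
[features]
serde = ["dep:serde", "dep:serde_json"]

[dependencies]
serde = { version = "1.0", features = ["derive"], optional = true }
serde_json = { version = "1.0", optional = true }
```

With this layout, `cargo build` stays serde-free and users who want the typed layer enable it with `--features serde`.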

Contributor

MarcusDunn commented Mar 30, 2026

looks good. throw this behind a feature flag (I don't want serde pulled in by default) or better yet, just accept string refs. We don't do anything with the json to justify the dep beyond better types.

Contributor

@MarcusDunn MarcusDunn Mar 30, 2026


I do not think this belongs in this crate. Others can implement provider-specific logic.

Author


Okay. When I first created this, I thought it would be better to provide typed functions on the Rust side instead of making users build and parse JSON strings by hand. This is especially important since the whole reason for this PR was #864 and the underlying llama.cpp support is already there. I agree that this started mixing two layers: the actual missing bindings needed for tool calling, and a more opinionated Rust/OpenAI convenience layer on top.

I've updated the PR to focus on the binding surface needed for #864. The typed serde-based helpers are no longer part of the library API, but the raw JSON/string-based path (passing tools into chat templates, getting prompts and grammar back, and parsing OpenAI-compatible responses) is still there. I also removed the serde/serde_json dependencies from the main crate and updated the examples to use the raw flow instead.
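The raw flow described above can be sketched end to end. Every function here is a placeholder standing in for the real binding calls (none of these names are the crate's actual API); only the order of operations mirrors the PR: tools in, prompt and grammar out, then parse the model's reply.

```rust
// Placeholder sketch of the raw string-based tool-calling flow.
fn render_prompt(messages_json: &str, tools_json: &str) -> String {
    // Real code: apply the model's chat template with the tools attached.
    format!("{messages_json}\n[available tools: {tools_json}]")
}

fn grammar_for(_tools_json: &str) -> String {
    // Real code: convert each tool's JSON schema to a llama.cpp grammar.
    String::from("root ::= tool-call")
}

fn parse_tool_call(raw_reply: &str) -> Option<&str> {
    // Real code: parse the OpenAI-compatible response; here we only
    // check that the reply looks like a JSON object.
    let trimmed = raw_reply.trim();
    trimmed.starts_with('{').then_some(trimmed)
}

fn main() {
    let tools = r#"[{"type":"function","function":{"name":"get_weather"}}]"#;
    let prompt = render_prompt(r#"[{"role":"user","content":"Weather in Paris?"}]"#, tools);
    let grammar = grammar_for(tools);
    let reply = r#"{"name":"get_weather","arguments":{"city":"Paris"}}"#;
    assert!(prompt.contains("available tools"));
    assert!(!grammar.is_empty());
    assert!(parse_tool_call(reply).is_some());
}
```

Because every step takes and returns plain strings, callers can bring their own JSON library (serde_json or otherwise), which is what keeps serde out of the crate's default dependency set.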

@MegalithOfficial
Author

I have updated the READMEs.

@MarcusDunn
Contributor

lgtm. am I correct in thinking this adds very little to the core lib? the main contributions are better docs/examples and the test?
