Add llama.cpp LLM backend for LLM player with JSON action enforcing by GiuliaForasassi · Pull Request #24 · rl-language/rlc

GiuliaForasassi · 2026-05-25T21:18:02Z

This PR adds:

a llama.cpp backend for llmplayer
type-based regex generation in rlc
a JSON generation strategy for LLM actions
constrained LLM generation from a given regex or grammar
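
To illustrate the idea behind type-based regex generation, here is a minimal Python sketch, not the code from this PR. It assumes an action is described by a name and a set of bounded integer arguments, and builds a regex that accepts only compact JSON objects encoding a valid call. The helper names (`int_range_regex`, `action_regex`) and the `mark(x, y)` action are hypothetical.

```python
import json
import re


def int_range_regex(lo: int, hi: int) -> str:
    """Regex matching exactly the integers lo..hi, written as alternatives."""
    return "(?:" + "|".join(str(v) for v in range(lo, hi + 1)) + ")"


def action_regex(name: str, args: dict[str, tuple[int, int]]) -> str:
    """Regex for a compact JSON object describing one action call.

    Assumption: arguments are bounded integers and appear in a fixed order.
    """
    fields = ['"action":"' + re.escape(name) + '"']
    for arg_name, (lo, hi) in args.items():
        fields.append('"' + re.escape(arg_name) + '":' + int_range_regex(lo, hi))
    return r"\{" + ",".join(fields) + r"\}"


# Hypothetical action: mark(x, y) on a 3x3 board.
pattern = action_regex("mark", {"x": (0, 2), "y": (0, 2)})

# A well-formed action, serialized without whitespace to match the regex.
valid = json.dumps({"action": "mark", "x": 1, "y": 2}, separators=(",", ":"))
```

A regex of this shape can then be handed to a constrained decoder so that the model can only emit strings that parse as a legal action. Enumerating alternatives keeps the sketch simple; large ranges would need a digit-based pattern instead.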

Example command to start llama.cpp server:

./build/bin/llama-server --host 0.0.0.0 --port 8000 --n-gpu-layers 999 --alias model --hf-repo unsloth/gemma-4-E2B-it-GGUF --hf-file gemma-4-E2B-it-Q4_K_M.gguf --reasoning-budget 768 --reasoning-budget-message "...\n<<<MAX BUDGET REACHED, REASONING INTERRUPTED, ANSWER IMMEDIATELY>>>" --ctx-size 131072

(requires previous installation/compilation of llama.cpp)
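
Once the server is running, a client can ask it for a JSON-constrained action. The following is a rough sketch under stated assumptions, not the backend added in this PR: it posts to the llama.cpp server's `/completion` endpoint on port 8000 with a `json_schema` field, skips chat templating and reasoning handling entirely, and uses a hypothetical `mark(x, y)` action schema. Schema keyword support depends on the llama.cpp version in use.

```python
import json
import urllib.request

# Hypothetical schema for a single action; enums keep it within basic JSON Schema.
ACTION_SCHEMA = {
    "type": "object",
    "properties": {
        "action": {"enum": ["mark"]},
        "x": {"enum": [0, 1, 2]},
        "y": {"enum": [0, 1, 2]},
    },
    "required": ["action", "x", "y"],
    "additionalProperties": False,
}


def build_payload(prompt: str, schema: dict, max_tokens: int = 256) -> dict:
    """Build the request body for the server's /completion endpoint."""
    return {"prompt": prompt, "n_predict": max_tokens, "json_schema": schema}


def request_action(payload: dict,
                   url: str = "http://localhost:8000/completion") -> dict:
    """Send the payload and parse the generated text as a JSON action."""
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    # The generated text is returned in the "content" field of the response.
    return json.loads(body["content"])


payload = build_payload("Choose your next move as JSON.", ACTION_SCHEMA)
```

Calling `request_action(payload)` against a live server should return a dictionary such as `{"action": "mark", "x": ..., "y": ...}`; the prompt text here is only a placeholder.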

GiuliaForasassi added 4 commits May 25, 2026 23:07

Add llama.cpp LLM backend for LLM player with JSON action enforcing

beab404

Clean up code and add some comments

7da859f

add xgrammar dependency

2f4dd6f

Add variables to control reasoning and regex

42576a4

GiuliaForasassi wants to merge 4 commits into rl-language:master from GiuliaForasassi:llm-player
