Skip to content

Add llama.cpp LLM backend for LLM player with JSON action enforcing#24

Open
GiuliaForasassi wants to merge 4 commits into
rl-language:masterfrom
GiuliaForasassi:llm-player
Open

Add llama.cpp LLM backend for LLM player with JSON action enforcing#24
GiuliaForasassi wants to merge 4 commits into
rl-language:masterfrom
GiuliaForasassi:llm-player

Conversation

@GiuliaForasassi
Copy link
Copy Markdown

This PR adds:

  • llama.cpp backend for llmplayer
  • type-based regex generation in rlc
  • JSON generation strategy for LLM actions
  • force LLM generation given regex/grammar

Example command to start llama.cpp server:

./build/bin/llama-server --host 0.0.0.0 --port 8000 --ctx-size 4096 --n-gpu-layers 999 --alias model --hf-repo unsloth/gemma-4-E2B-it-GGUF  --hf-file gemma-4-E2B-it-Q4_K_M.gguf --reasoning-budget 768 --reasoning-budget-message "...\n<<<MAX BUDGET REACHED, REASONING INTERRUPTED, ANSWER IMMEDIATELY>>>" --ctx-size 131072

(requires previous installation/compilation of llama.cpp)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant