Open
Description
Prerequisites
- I am running the latest code. Mention the version if possible as well.
- I carefully followed the README.md.
- I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
- I reviewed the Discussions, and have a new and useful enhancement to share.
Feature Description
Add a built-in way to reset the context without reloading the model, such as:
/reset command in llama-cli interactive mode.
HTTP API endpoint (e.g., POST /reset) for llama-server.
Use Case:
Chat applications needing fresh sessions.
Testing prompts without residual context.
Long-running servers where model reloads are costly.
Motivation
Currently, to start a new conversation with a clean context (no prior chat history), users must restart llama-cli (reloading the model, which is slow).
Possible Implementation
No response