Question about the paper and the code #3

From the paper, I understood that the RL‑tuned model decides on an action (e.g., delete, update) and then executes it.
However, in the code, the model appears to learn only to choose the action, not to perform it. Is that correct?

Also, the _get_analysis_prompt function in memory_server.py looks like part of the memory extraction process.
Is this step handled by the RL‑tuned model or by a larger base model?

