-
Notifications
You must be signed in to change notification settings - Fork 18
Open
Description
I understood that the RL‑tuned model determines an action and then executes that action (e.g., delete, update, etc.).
However, from the code, it seems like the model only learns to choose the action, not to perform it. Is that correct?
Also, in memory_server.py, the _get_analysis_prompt function looks like part of the memory extraction process.
Is this step handled by the RL‑tuned model, or is it done by a larger base model?
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels