是否有Qwen3-4B LoRA版本在Personal Agent Track上的训练结果？ #112

Open

opened

on May 11, 2026

想尝试LoRA版本，但似乎复现出来的三种强化学习方案RL,OPD,Combined和论文报告的全参模型相差很大

Metadata

Assignees

No one assigned

Labels

No labels

No labels

Type

No type

Fields

No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests