-
Notifications
You must be signed in to change notification settings - Fork 40
Description
Hello,
First of all, congratulations on the excellent work with UniPic2 and Skywork-EditReward — it’s truly inspiring research.
I was particularly impressed by the reinforcement learning stage for image editing, where you mention employing GPT-4.1 with carefully designed evaluation templates to provide multi-dimensional scoring (instruction-following accuracy, image quality, etc.) before training Skywork-EditReward.
I would like to kindly ask:
-
During the GRPO (reinforcement learning) stage for image editing, did you use GPT-4.1 directly as the rule-based evaluator to provide reward signals?
-
If possible, could you share (or point me to) the evaluation template / rule prompt you used with GPT-4.1 for scoring the generated images?
Having access to the prompt design would greatly help me and others in the community better understand and potentially reproduce the evaluation pipeline you proposed.
Thank you very much for your time and for making this valuable work available.
Best regards,