Inquiry about GPT-4.1 Rule Prompt for Image Editing GRPO Stage #28

Hello,
First of all, congratulations on the excellent work with UniPic2 and Skywork-EditReward. It is truly inspiring research.
I was particularly impressed by the reinforcement learning stage for image editing, where you mention employing GPT-4.1 with carefully designed evaluation templates to provide multi-dimensional scoring (instruction-following accuracy, image quality, etc.) before training Skywork-EditReward.

I would like to ask:

1. During the GRPO (reinforcement learning) stage for image editing, did you use GPT-4.1 directly as the rule-based evaluator to provide reward signals?

2. If possible, could you share (or point me to) the evaluation template / rule prompt you used with GPT-4.1 for scoring the generated images?
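
To make question 1 concrete, this is roughly the setup I have in mind. It is only a sketch of my own assumptions, not something taken from your paper or code: the JSON reply format, the dimension names, the weights, and the 0-10 scale are all hypothetical, and the call to GPT-4.1 itself is left out. It assumes the judge returns one score per dimension, that the scores are combined into a scalar reward, and that GRPO then normalizes rewards within each group of samples for the same prompt.

```python
import json
import statistics

# Hypothetical weights for the two dimensions named in the paper; the values are a guess.
WEIGHTS = {"instruction_following": 0.6, "image_quality": 0.4}


def parse_scores(judge_reply: str) -> dict:
    """Parse an assumed JSON reply, e.g. '{"instruction_following": 8, "image_quality": 7}'."""
    scores = json.loads(judge_reply)
    return {name: float(scores[name]) for name in WEIGHTS}


def scalar_reward(scores: dict, max_score: float = 10.0) -> float:
    """Weighted average of the per-dimension scores, scaled to [0, 1]."""
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS) / max_score


def group_advantages(rewards: list, eps: float = 1e-6) -> list:
    """Group-relative advantage as in GRPO: (reward - group mean) / (group std + eps)."""
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)
    return [(r - mean) / (std + eps) for r in rewards]


# Two hypothetical judge replies for two edits sampled from the same instruction.
replies = [
    '{"instruction_following": 8, "image_quality": 7}',
    '{"instruction_following": 4, "image_quality": 6}',
]
rewards = [scalar_reward(parse_scores(reply)) for reply in replies]
advantages = group_advantages(rewards)
```

If the actual pipeline differs from this (for example, a different scale, more dimensions, or a non-linear combination), knowing where it differs would already be very helpful.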

Having access to the prompt design would greatly help me and others in the community better understand and potentially reproduce the evaluation pipeline you proposed.
Thank you very much for your time and for making this valuable work available.

Best regards,
