Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation
Xiangyu Zhao, Peiyuan Zhang, Junming Lin, Tianhao Liang et al.
52.40/100
🫥 Mediocre
Incremental, thin
Content 52.4 · Citation bonus +0.0 · no citation data
💡 This paper proposes FIRM, a framework that builds tailored scoring datasets and a benchmark for image editing/generation reward models, uses a Base-and-Bonus reward strategy to improve RL-based image
#奖励模型去幻觉#RL图像编辑优化#指令对齐新基准#开源数据集贡献#Reward Model Dehallucina#RL Image Editing Boost#Instruction Alignment Be#Open Dataset Dump
Score breakdown
Novelty5.0 / 10
Rigor6.0 / 10
Significance7.0 / 10
Clarity8.0 / 10
Reproducibility8.0 / 10
This tone hasn't been generated yet — roast it again to create it.