🔥 GitHub Roast
← Back to the board
Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key
Zhihe Yang, Xufang Luo, Dongqi Han, Yunjian Xu et al.
67.08/100
📘 Readable
Decent, has merit
Content 61.2 · Citation bonus +5.9 · 67 citations

💡 This paper identifies that the performance variation of existing DPO-based LVLM hallucination mitigation methods stems from on-policy data alignment, proposes OPA-DPO which outperforms prior 16k-sampl

#LVLM幻觉治理#DPO玄学破局#on-policy数据#专家反馈#高效对齐#LVLM Hallucination Fix#DPO Black-box Break#On-Policy Data#Expert Feedback#Efficient Alignment

Score breakdown

Novelty7.0 / 10
Rigor7.0 / 10
Significance8.0 / 10
Clarity9.0 / 10
Reproducibility8.0 / 10

This tone hasn't been generated yet — roast it again to create it.