GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?
Tongxu Luo, Rongsheng Wang, Jiaxi Bi, Chenming Xu et al.
60.40/100
🫥 Mediocre
Incremental, thin
Content 60.4 · Citation bonus +0.0 · no citation data
💡 This paper proposes GameCraft-Bench, the first end-to-end game generation benchmark based on the open-source Godot engine, with 140 tasks across 15 game families, revealing that the top frontier codin
#游戏生成基准#编码智能体评估#Godot引擎#端到端交互任务#game generation benchmar#coding agent evaluation#Godot engine#end-to-end interactive t
Score breakdown
Novelty7.0 / 10
Rigor6.0 / 10
Significance8.0 / 10
Clarity9.0 / 10
Reproducibility9.0 / 10
This tone hasn't been generated yet — roast it again to create it.