Beta · arXiv only

arXiv Paper Roast

Drop an arXiv link or ID to get a strict score plus roast/praise commentary.

🏆 Hall of Fame

1 · Attention Is All You Need · #Transformer seminal work · 🥇 80.00
2 · Deep Residual Learning for Image Recognition · #Residual Skip · 🥇 80.00
3 · MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models · #MLLM Exam · 📘 78.81
4 · Denoising Diffusion Probabilistic Models · #diffusion model origin · 📘 75.60
5 · Absolute Zero: Reinforced Self-play Reasoning with Zero Data · #zero-data RL · 📘 73.95
6 · Mean Flows for One-step Generative Modeling · #one-shot gen · 📘 71.43
7 · Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore · #LLM-generated text detec · 📘 71.20
8 · OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments · #GUI Agent Benchmark · 📘 70.00
9 · DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios · #LLM Detection Benchmark · 📘 69.87
10 · WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation · #Sandbox Debunker · 📘 68.83