arXiv Paper Roast
Drop an arXiv link or ID to get a strict score plus roast/praise commentary.
Hall of Fame
- 1. 80.00 Attention Is All You Need #Transformer seminal work
- 2. 80.00 Deep Residual Learning for Image Recognition #Residual Skip
- 3. 78.81 MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models #MLLM Exam
- 4. 75.60 Denoising Diffusion Probabilistic Models #diffusion model origin
- 5. 73.95 Absolute Zero: Reinforced Self-play Reasoning with Zero Data #zero-data RL
- 6. 71.43 Mean Flows for One-step Generative Modeling #one-shot gen
- 7. 71.20 Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore #LLM-generated text detec
- 8. 70.00 OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments #GUI Agent Benchmark
- 9. 69.87 DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios #LLM Detection Benchmark
- 10. 68.83 WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation #Sandbox Debunker