🔥 毒舌 GitHub

🏆 神作榜

  1. 1
    Attention Is All You Need
    Ashish Vaswani, Noam Shazeer, Niki Parmar · #Transformer开
    🥇 80.00
  2. 2
    Deep Residual Learning for Image Recognition
    Kaiming He, Xiangyu Zhang, Shaoqing Ren · #残差跳线
    🥇 80.00
  3. 3
    MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models
    Chaoyou Fu, Peixian Chen, Yunhang Shen · #MLLM考卷
    📘 78.81
  4. 4
    Denoising Diffusion Probabilistic Models
    Jonathan Ho, Ajay Jain, Pieter Abbeel · #扩散模型开山之作
    📘 75.60
  5. 5
    Absolute Zero: Reinforced Self-play Reasoning with Zero Data
    Andrew Zhao, Yiran Wu, Yang Yue · #零数据RL
    📘 73.95
  6. 6
    Mean Flows for One-step Generative Modeling
    Zhengyang Geng, Mingyang Deng, Xingjian Bai · #一步出图
    📘 71.43
  7. 7
    Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore
    Junchao Wu, Runzhe Zhan, Derek F. Wong · #LLM生成文本检测
    📘 71.20
  8. 8
    OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
    Tianbao Xie, Danyang Zhang, Jixuan Chen · #GUI智能体基准
    📘 70.00
  9. 9
    DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios
    Junchao Wu, Runzhe Zhan, Derek F. Wong · #LLM检测基准
    📘 69.87
  10. 10
    WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation
    Shuangrui Ding, Xuanlang Dai, Long Xing · #沙盒打假人
    📘 68.83
  11. 11
    StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding
    Junming Lin, Zheng Fang, Chi Chen · #流式视频理解
    📘 68.64
  12. 12
    d$^2$Cache: Accelerating Diffusion-Based LLMs via Dual Adaptive Caching
    Yuchu Jiang, Yue Cai, Xiangzhong Luo · #扩散LLM推理加速
    📘 65.97
  13. 13
    Tranception: protein fitness prediction with autoregressive transformers and inference-time retrieval
    Pascal Notin, Mafalda Dias, Jonathan Frazer · #蛋白适应度预测
    📘 65.60
  14. 14
    SkillOpt: Executive Strategy for Self-Evolving Agent Skills
    Yifan Yang, Ziyang Gong, Weiquan Huang · #智能体技能炼丹
    🫥 64.80
  15. 15
    NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents
    Jingzhe Ding, Shengda Long, Changxin Pu · #coding agent
    🫥 63.60
  16. 16
    VideoRoPE: What Makes for Good Video Rotary Position Embedding?
    Xilin Wei, Xiaoran Liu, Yuhang Zang · #视频位置编码
    🫥 63.20
  17. 17
    Balanced Aggregation: Understanding and Fixing Aggregation Bias in GRPO
    Zhiyuan Zeng, Jiameng Huang, Zhangyue Yin · #GRPO聚合玄学破解
    🫥 63.20
  18. 18
    Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing
    Xiangyu Zhao, Peiyuan Zhang, Kexian Tang · #视觉编辑新基准
    🫥 62.40
  19. 19
    MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly
    Zhaowei Wang, Wenhao Yu, Xiyu Ren · #长上下文多模态评测
    🫥 62.40
  20. 20
    Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games
    Shengyuan Ding, Xilin Wei, Xinyu Fang · #多模态大模型评估
    🫥 62.00
  21. 21
    Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context
    Zhaowei Wang, Lishu Luo, Haodong Duan · #长上下文视觉语言模型
    🫥 62.00
  22. 22
    SetCon: Towards Open-Ended Referring Segmentation via Set-Level Concept Prediction
    Zhixiong Zhang, Yizhuo Li, Shuangrui Ding · #LVLM终于知道多目标是
    🫥 60.80
  23. 23
    GameCraft-Bench: Can Agents Build Playable Games End-to-End in a Real Game Engine?
    Tongxu Luo, Rongsheng Wang, Jiaxi Bi · #游戏生成基准
    🫥 60.40
  24. 24
    DetectRL-X: Towards Reliable Multilingual and Real-World LLM-Generated Text Detection
    Junchao Wu, Yefeng Liu, Chenyu Zhu · #多语言文本检测
    🫥 59.60
  25. 25
    MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs
    Jiarui Zhang, Mahyar Khayatkhoei, Prateek Chhikara · #MLLM眼脑分离
    🫥 58.00
  26. 26
    Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training
    Wenyu Du, Tongxu Luo, Zihan Qiu · #模型生长实用指南
    🫥 57.20
  27. 27
    Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances
    Yi Yu, Botao Ren, Peiyuan Zhang · #点监督有向检测
    🫥 56.40
  28. 28
    SeHDR: Single-Exposure HDR Novel View Synthesis via 3D Gaussian Bracketing
    Yiyu Li, Haoyuan Wang, Ke Xu · #单曝光HDR合成
    🫥 55.60
  29. 29
    Agentifying Patient Dynamics within LLMs through Interacting with Clinical World Model
    Minghao Wu, Yuting Yan, Zhenyang Cai · #脓毒症智能决策
    🫥 55.60
  30. 30
    ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning
    Shengyuan Ding, Xinyu Fang, Ziyu Liu · #多模态奖励模型
    🫥 54.40
  31. 31
    Knowledge Index of Noah's Ark
    Sheng Jin, Minghao Liu, Yunze Xiao · #LLM知识评估
    🫥 54.40
  32. 32
    GenExam: A Multidisciplinary Text-to-Image Exam
    Zhaokai Wang, Penghao Yin, Xiangyu Zhao · #文生图评估
    🫥 54.40
  33. 33
    OneRec Technical Report
    Guorui Zhou, Jiaxin Deng, Jinghao Zhang · #工业级推荐系统
    🫥 53.60
  34. 34
    The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning
    Qiguang Chen, Yantao Du, Ziniu Li · #长链思维分析
    🫥 53.20
  35. 35
    Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix It
    Yupu Hao, Zhuoran Jin, Huanxuan Liao · #工具调用RL
    🫥 52.40
  36. 36
    SS-MAE: Spatial-Spectral Masked Auto-Encoder for Multi-Source Remote Sensing Image Classification
    Junyan Lin, Feng Gao, Xiaocheng Shi · #遥感分类
    🫥 52.40
  37. 37
    Fast Large Language Model Collaborative Decoding via Speculation
    Jiale Fu, Yuchu Jiang, Junkai Chen · #LLM加速
    🫥 52.40
  38. 38
    Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation
    Xiangyu Zhao, Peiyuan Zhang, Junming Lin · #奖励模型去幻觉
    🫥 52.40
  39. 39
    DebCSE: Rethinking Unsupervised Contrastive Sentence Embedding Learning in the Debiasing Perspective
    Pu Miao, Zeyao Du, Junlin Zhang · #句子嵌入去偏
    🫥 52.40
  40. 40
    RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns
    Xin Chen, Junchao Wu, Shu Yang · #AI生成文本检测
    🫥 51.20
  41. 41
    Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
    Zhaoyang Wang, Canwen Xu, Boyi Liu · #agent环境救星
    🫥 50.40
  42. 42
    Learning from Peers in Reasoning Models
    Tongxu Luo, Wenyu Du, Jiaxi Bi · #前缀陷阱观察
    🫥 50.40
  43. 43
    Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices
    Junyan Lin, Haoran Chen, Yue Fan · #多模态大模型
    🫥 50.40
  44. 44
    Qwen-AgentWorld: Language World Models for General Agents
    Yuxin Zuo, Zikai Xiao, Li Sheng · #世界模型炼丹
    🫥 50.00
  45. 45
    Kwai Keye-VL-2.0 Technical Report
    Kwai Keye Team, Bin Wen, Changyi Liu · #长视频理解
    🫥 50.00
  46. 46
    Generative Modeling via Drifting
    Mingyang Deng, He Li, Tianhong Li · #一步生成
    🫥 50.00
  47. 47
    DynamicFace: High-Quality and Consistent Face Swapping for Image and Video using Composable 3D Facial Priors
    Runqi Wang, Yang Chen, Sijie Xu · #面部替换
    🫥 49.60
  48. 48
    MM-IFEngine: Towards Multimodal Instruction Following
    Shengyuan Ding, Shenxi Wu, Xiangyu Zhao · #多模态大模型
    🫥 49.60
  49. 49
    MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization
    Xiangyu Zhao, Junming Lin, Tianhao Liang · #多模态大模型
    🫥 49.44
  50. 50
    Hard to Read, Easy to Jailbreak: How Visual Degradation Bypasses MLLM Safety Alignment
    Zhixue Song, Boyan Han, Yiwei Wang · #多模态安全
    🫥 48.00