💧 Filler board
- 1💧 24.80 · Gate-level boolean evolutionary geometric attention neural networks · Xianshuai Shi, Jianfeng Zhu, Leibo Liu · #concept patchwork
- 2💧 32.80 · Agent Learning via Early Experience · Kai Zhang, Xiangchao Chen, Bo Liu · #agent learning paradigm
- 3💧 33.20 · Partial Weakly-Supervised Oriented Object Detection · Mingxin Liu, Peiyuan Zhang, Yuan Liu · #partially weakly supervi
- 4💧 34.00 · LLM Evaluation Based on Aerospace Manufacturing Expertise: Automated Generation and Multi-Model Question Answering · Beiming Liu, Zhizhuo Cui, Siteng Hu · #aerospace LLM aptitude t
- 5💧 36.80 · SELU: Self-Learning Embodied MLLMs in Unknown Environments · Boyu Li, Haobin Jiang, Ziluo Ding · #Embodied AI Self-Learnin
- 6💧 37.20 · Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation · Zekai Zhang, Jiahao Li, Jie Zhang · #T2I context completion
- 7💧 37.60 · Visual Grounding for Object-Level Generalization in Reinforcement Learning · Haobin Jiang, Zongqing Lu · #VLM-wrapped RL
- 8💧 38.40 · Scaling Latent Reasoning via Looped Language Models · Rui-Jie Zhu, Zixuan Wang, Kai Hua · #Implicit Reasoning
- 9💧 39.60 · Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining · Zhiyuan Zeng, Yichi Zhang, Yong Shan · #code pre-training pie-in
- 10💧 40.00 · Deep Differentiable Logic Gate Networks · Felix Petersen, Christian Borgelt, Hilde Kuehne · #Differentiable Logic Gat
- 11💧 40.40 · MARS: Unleashing the Power of Variance Reduction for Training Large Models · Huizhuo Yuan, Yifeng Liu, Shuang Wu · #variance reduction
- 12💧 40.80 · MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild · Peng Xia, Jianwen Chen, Xinyu Yang · #LLM Agent Continual Lear
- 13💧 40.80 · Advancing Complex Video Object Segmentation via Progressive Concept Construction · Zhixiong Zhang, Shuangrui Ding, Xiaoyi Dong · #Video Object Segmentatio
- 14💧 42.00 · Kimi K2.5: Visual Agentic Intelligence · Kimi Team, Tongtong Bai, Yifan Bai · #multimodal agent
- 15💧 42.08 · Can Recommender Systems Teach Themselves? A Recursive Self-Improving Framework with Fidelity Control · Luankang Zhang, Hao Wang, Zhongzhou Liu · #self-bootstrapping recom
- 16💧 43.20 · PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection · Peiyuan Zhang, Junwei Luo, Xue Yang · #single point-supervised
- 17💧 43.30 · YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection · Xu Lin, Jinlong Peng, Zhenye Gan · #YOLO-stacking
- 18💧 43.60 · OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference · Xiangyu Zhao, Shengyuan Ding, Zicheng Zhang · #multi-modal alignment in
- 19💧 43.60 · Rethinking LLM Ensembling from the Perspective of Mixture Models · Jiale Fu, Yuchu Jiang, Peijun Wu · #LLM Ensembling
- 20💧 44.40 · ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation · Zihao Huang, Jundong Zhou, Xingwei Qu · #MoE architecture improve
- 21💧 44.40 · OneReason Technical Report · OneRec Team, Biao Yang, Boyang Ding · #Generative Recommendatio
- 22💧 44.72 · BubbleSpec: Turning Long-Tail Bubbles into Speculative Rollout Drafts for Synchronous Reinforcement Learning · Yuhang Xu, Kaibin Tian, Yang Tian · #RL Training Acceleration
- 23🫥 45.20 · A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions · Junchao Wu, Shu Yang, Runzhe Zhan · #LLM-generated text detec
- 24🫥 45.60 · ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations · Junke Wang, Xiao Wang, Jiacheng Pan · #Autoregressive Multimoda
- 25🫥 46.40 · Agentopia: Long-Term Life Simulation and Learning in Agent Societies · Xintao Wang, Sirui Zheng, Hongqiu Wu · #LLM Society Simulation
- 26🫥 46.80 · Reverse-Engineered Reasoning for Open-Ended Generation · Haozhe Wang, Haoran Que, Qixin Xu · #open-ended reasoning
- 27🫥 46.80 · Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models · Xiaomin Yu, Yi Xin, Yuhui Zhang · #modality gap alignment
- 28🫥 47.60 · SENTINEL: A Fully End-to-End Language-Action Model for Humanoid Whole Body Control · Yuxuan Wang, Haobin Jiang, Shiqing Yao · #humanoid control
- 29🫥 47.60 · GraphIC: A Graph-Based In-Context Example Retrieval Model for Multi-Step Reasoning · Jiale Fu, Yaqing Wang, Simeng Han · #In-Context Learning Retr
- 30🫥 47.60 · CoTJudger: A Graph-Driven Framework for Automatic Evaluation of Chain-of-Thought Efficiency and Redundancy in LRMs · Siyi Li, Jiajun Shi, Shiwen Ni · #LLM Evaluation
- 31🫥 47.60 · Workflow-GYM: Towards Long-Horizon Evaluation of Computer-use Agentic tasks in Real-World Professional Fields · Liya Zhu, Jingzhe Ding, Jian Zhang · #GUI Agent Benchmark
- 32🫥 47.60 · HiVLA: A Visual-Grounded-Centric Hierarchical Embodied Manipulation System · Tianshuo Yang, Guanyu Chen, Yutian Chen · #VLA fine-tuning PTSD
- 33🫥 47.60 · Color Shift Estimation-and-Correction for Image Enhancement · Yiyu Li, Ke Xu, Gerhard Petrus Hancke · #over/under-exposure fix
- 34🫥 48.00 · Hard to Read, Easy to Jailbreak: How Visual Degradation Bypasses MLLM Safety Alignment · Zhixue Song, Boyan Han, Yiwei Wang · #multimodal safety
- 35🫥 49.44 · MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization · Xiangyu Zhao, Junming Lin, Tianhao Liang · #Multimodal LLMs
- 36🫥 49.60 · DynamicFace: High-Quality and Consistent Face Swapping for Image and Video using Composable 3D Facial Priors · Runqi Wang, Yang Chen, Sijie Xu · #face swapping
- 37🫥 49.60 · MM-IFEngine: Towards Multimodal Instruction Following · Shengyuan Ding, Shenxi Wu, Xiangyu Zhao · #Multimodal LLM
- 38🫥 50.00 · Qwen-AgentWorld: Language World Models for General Agents · Yuxin Zuo, Zikai Xiao, Li Sheng · #world model training
- 39🫥 50.00 · Kwai Keye-VL-2.0 Technical Report · Kwai Keye Team, Bin Wen, Changyi Liu · #Long-video Understanding
- 40🫥 50.00 · Generative Modeling via Drifting · Mingyang Deng, He Li, Tianhong Li · #one-step generation
- 41🫥 50.40 · Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning · Zhaoyang Wang, Canwen Xu, Boyi Liu · #agent-env-savior
- 42🫥 50.40 · Learning from Peers in Reasoning Models · Tongxu Luo, Wenyu Du, Jiaxi Bi · #prefix trap observation
- 43🫥 50.40 · Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices · Junyan Lin, Haoran Chen, Yue Fan · #Multimodal LLM
- 44🫥 51.20 · RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns · Xin Chen, Junchao Wu, Shu Yang · #AI-generated text detect
- 45🫥 52.40 · Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix It · Yupu Hao, Zhuoran Jin, Huanxuan Liao · #Tool-use RL
- 46🫥 52.40 · SS-MAE: Spatial-Spectral Masked Auto-Encoder for Multi-Source Remote Sensing Image Classification · Junyan Lin, Feng Gao, Xiaocheng Shi · #Remote Sensing Classific
- 47🫥 52.40 · Fast Large Language Model Collaborative Decoding via Speculation · Jiale Fu, Yuchu Jiang, Junkai Chen · #LLM Acceleration
- 48🫥 52.40 · Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation · Xiangyu Zhao, Peiyuan Zhang, Junming Lin · #Reward Model Dehallucina
- 49🫥 52.40 · DebCSE: Rethinking Unsupervised Contrastive Sentence Embedding Learning in the Debiasing Perspective · Pu Miao, Zeyao Du, Junlin Zhang · #Sentence Embedding Debia
- 50🫥 53.20 · The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning · Qiguang Chen, Yantao Du, Ziniu Li · #Long CoT analysis