Last updated: 2025-05-19 23:30:27. Maintained by Weisen Jiang.

citation publish date title (pdf) review authors
380 2024-10-23 WorldSimBench: Towards Video Generation Models as World Simulators link Yiran Qin, Zhelun Shi,..., Ruimao Zhang
375 2021-05-19 Self-supervised Heterogeneous Graph Neural Network with Optimal Transport link Yanbei Liu, Chongxu Wang,..., Zhitao Xiao
95 2024-12-30 Do NOT Think That Much for 2+3=? On the
Overthinking of o1-Like LLMs
link Xingyu Chen, Jiahao Xu,..., Dong Yu
92 2024-07-05 Learning to (Learn at Test Time): RNNs with Expressive
Hidden States
link Yu Sun, Xinhao Li,..., Carlos Guestrin
78 2025-01-08 rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved
Deep Thinking
link Xinyu Guan, Li Lyna Zhang,..., Mao Yang
66 2024-06-28 MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose
Guidance
link Yuang Zhang, Jiaxi Gu,..., FangYuan Zou
63 2025-02-05 Demystifying Long Chain-of-Thought Reasoning link Edward Yeo, Yuxuan Tong,..., Xiang Yue
58 2025-01-28 SFT Memorizes, RL Generalizes: A Comparative Study of Foundation
Model Post-training
link Tianzhe Chu, Yuexiang Zhai,..., Yi Ma
57 2024-04-29 DPO Meets PPO: Reinforced Token Optimization for RLHF link Han Zhong, Zikang Shan,..., Liwei Wang
56 2024-04-21 AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs link Anselm Paulus, Arman Zharmagambetov,..., Yuandong Tian
49 2024-10-22 LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding link Xiaoqian Shen, Yunyang Xiong,..., Vikas Chandra
42 2024-01-03 Theoretical guarantees on the best-of-n alignment policy link Ahmad Beirami, Alekh Agarwal,..., Ananda Suresh
41 2024-12-02 Free Process Rewards without Process Labels link Lifan Yuan, Wendi Li,..., Hao Peng
40 2024-11-04 How Far Is Video Generation from World Model: A
Physical Law Perspective
link Bingyi Kang, Yang Yue,..., Jiashi Feng
37 2024-10-06 SparseVLM: Visual Token Sparsification for Efficient Vision Language Models
Inference
link Yuan Zhang, Chun-Kai Fan,..., Shanghang Zhang
35 2024-10-14 Agent-as-a-Judge: Evaluate Agents with Agents link Mingchen Zhuge, Changsheng Zhao,..., Jürgen Schmidhuber
35 2024-06-12 What If We Recaption Billions of Web Images with
LLaMA-3?
link Xianhang Li, Haoqin Tu,..., Cihang Xie
33 2024-05-31 OR-Bench: An Over-Refusal Benchmark for Large Language Models link Jiaxing Cui, Wei-Lin Chiang,..., Cho-Jui Hsieh
31 2023-11-30 LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language
Models
link Marwa Abdulhai, Isadora White,..., Sergey Levine
30 2024-02-13 Mitigating Object Hallucination in Large Vision-Language Models via Image-Grounded
Guidance
link Linxi Zhao, Yihe Deng,..., Quanquan Gu
26 2024-11-01 Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model
with Frozen LLM
link Xiong Wang, Yangze Li,..., Long Ma
26 2024-09-12 Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale link Rogerio Bonatti, Dan Zhao,..., Zheng Hui
24 2024-11-22 RE-Bench: Evaluating Frontier AI R&D Capabilities of Language Model
Agents against Human Experts
link Hjalmar Wijk, Tao Lin,..., Elizabeth Barnes
24 2024-10-07 Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video
Generation
link Fanqing Meng, Jiaqi Liao,..., Ping Luo
23 2023-01-11 An Analysis of Quantile Temporal-Difference Learning link Mark Rowland, Remi Munos,..., Will Dabney
23 2022-10-22 Discrepancy Minimization in Input-Sparsity Time link Yichuan Deng, Xiaoyu Li,..., OMRI WEINSTEIN
23 2024-12-30 Training Software Engineering Agents and Verifiers with SWE-Gym link Jiayi Pan, Xingyao Wang,..., Yizhe Zhang
23 2024-06-13 GuardAgent: Safeguard LLM Agents via Knowledge-Enabled Reasoning link Zhen Xiang, Linzhi Zheng,..., Bo Li
22 2024-12-12 Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning link Zhenni Bi, Kai Han,..., Yunhe Wang
22 2024-11-07 Taming Rectified Flow for Inversion and Editing link Jiangshan Wang, Junfu Pu,..., Ying Shan
21 2024-10-02 RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement
Learning
link Jonas Gehring, Kunhao Zheng,..., Gabriel Synnaeve
21 2023-04-03 Empirical Design in Reinforcement Learning link Andrew Patterson, Samuel F Neumann,..., Adam White
21 2025-02-13 MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness,
and Efficiency
link Hongsheng Li, Yu Qi,..., Shen Yan
21 2022-08-15 AI for Global Climate Cooperation: Modeling Global Climate Negotiations,
Agreements, and Long-Term Cooperation in RICE-N
link Tianyu Zhang, Andrew Williams,..., Stephan Zheng
21 2024-09-11 Agent Workflow Memory link Zhiruo Wang, Jiayuan Mao,..., Graham Neubig
21 2023-06-18 Position: AI Evaluation Should Learn from How We Test
Humans
link Yan Zhuang, Qi Liu,..., Enhong Chen
20 2024-03-05 Cradle: Empowering Foundation Agents towards General Computer Control link Weihao Tan, Wentao Zhang,..., Zongqing Lu
20 2025-01-13 Imagine While Reasoning in Space: Multimodal Visualization-of-Thought link Chengzu Li, Wenshan Wu,..., Furu Wei
20 2025-01-20 Advancing Language Model Reasoning through Reinforcement Learning and Inference
Scaling
link Zhenyu Hou, Xin Lv,..., Yuxiao Dong
19 2024-10-27 Mind Your Step (by Step): Chain-of-Thought can Reduce Performance
on Tasks where Thinking Makes Humans Worse
link Ryan Liu, Jiayi Geng,..., Thomas Griffiths
18 2025-02-17 Scaling Test-Time Compute Without Verification or RL is Suboptimal link Amrith Setlur, Nived Rajaraman,..., Aviral Kumar
18 2025-02-17 SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World
Freelance Software Engineering?
link Samuel Miserendino, Michele Wang,..., Johannes Heidecke
18 2024-08-30 VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series
Forecasters
link Mouxiang Chen, Lefei Shen,..., Chenghao Liu
16 2024-08-18 Antidote: Post-fine-tuning Safety Alignment for Large Language Models against
Harmful Fine-tuning
link Tiansheng Huang, Gautam Bhattacharya,..., Ling Liu
16 2023-10-26 High-Dimensional Prediction for Sequential Decision Making link Georgy Noarov, Ramya Ramalingam,..., Stephan Xie
16 2024-06-06 Why Has Predicting Downstream Capabilities of Frontier AI Models
with Scale Remained Elusive?
link Rylan Schaeffer, Hailey Schoelkopf,..., Sanmi Koyejo
16 2024-10-28 One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation link Zhendong Wang, Max Li,..., Yu Zeng
16 2024-10-28 ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM
Inference
link Hanshi Sun, Li-Wen Chang,..., Beidi Chen
16 2025-01-09 Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal
ReAsoning Benchmark
link Yunzhuo Hao, Jiawei Gu,..., Yu Cheng
15 2024-04-05 Nonparametric Modern Hopfield Models link Jerry Yao-Chieh Hu, Bo-Yu Chen,..., Han Liu
15 2024-10-14 Thinking LLMs: General Instruction Following with Thought Generation link Tianhao Wu, Janice Lan,..., Sainbayar Sukhbaatar
15 2024-05-23 SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models link Wei Huang, Haotong Qin,..., XIAOJUAN QI
15 2024-09-25 Programming Every Example: Lifting Pre-training Data Quality Like Experts
at Scale
link Fan Zhou, Zengzhi Wang,..., Pengfei Liu
15 2024-11-17 SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread
INT4 Quantization
link Jintao Zhang, Haofeng Huang,..., Jianfei Chen
15 2024-12-05 Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction link Yiheng Xu, Zekun Wang,..., Caiming Xiong
15 2025-01-12 A General Framework for Inference-time Scaling and Steering of
Diffusion Models
link raghav singhal, Zachary Horvitz,..., Rajesh Ranganath
14 2024-03-09 AutoEval Done Right: Using Synthetic Data for Model Evaluation link Pierre Boyeau, Anastasios Angelopoulos,..., Michael Jordan
14 2025-01-28 AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders link Zhengxuan Wu, Aryaman Arora,..., Christopher Potts
14 2024-12-19 FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching link Sucheng Ren, Qihang Yu,..., Liang-Chieh Chen
14 2025-01-30 SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute
in Linear Diffusion Transformer
link Enze Xie, Junsong Chen,..., Song Han
14 2024-11-07 DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot
Planning
link Gaoyue Zhou, Hengkai Pan,..., Lerrel Pinto
13 2024-06-12 WMAdapter: Adding WaterMark Control to Latent Diffusion Models link Hai Ci, Yiren Song,..., Mike Zheng Shou
13 2024-10-15 MoH: Multi-Head Attention as Mixture-of-Head Attention link Peng Jin, Bo Zhu,..., Shuicheng YAN
13 2025-03-10 Optimizing Test-Time Compute via Meta Reinforcement Finetuning link Yuxiao Qu, Matthew Yang,..., Aviral Kumar
13 2021-12-14 On the Impact of Hard Adversarial Instances on Overfitting
in Adversarial Training
link Chen Liu, Zhichao Huang,..., Sabine Süsstrunk
12 2024-10-02 FlipAttack: Jailbreak LLMs via Flipping link Yue Liu, Xiaoxin He,..., Bryan Hooi
12 2024-12-28 An analytic theory of creativity in convolutional diffusion models link Mason Kamb, Surya Ganguli
11 2024-10-22 Collapse or Thrive: Perils and Promises of Synthetic Data
in a Self-Generating World
link Joshua Kazdan, Rylan Schaeffer,..., Sanmi Koyejo
11 2024-10-17 Automatically Interpreting Millions of Features in Large Language Models link Gonçalo Paulo, Alex Mallen,..., Nora Belrose
11 2024-10-03 AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML link Patara Trirat, Wonyong Jeong, Sung Ju Hwang
11 2024-12-23 Diving into Self-Evolve Training for Multimodal Reasoning link Wei Liu, Junlong Li,..., Junxian He
11 2024-11-29 Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM’s Reasoning
Capability
link Zicheng Lin, Tian Liang,..., Zhaopeng Tu
11 2023-10-12 MCU: An Evaluation Framework for Open-Ended Game Agents link Xinyue Zheng, Haowei Lin,..., Yitao Liang
11 2025-02-14 MM-RLHF: The Next Step Forward in Multimodal LLM Alignment link Yi-Fan Zhang, Tao Yu,..., Rong Jin
10 2024-10-24 Context is Key: A Benchmark for Forecasting with Essential
Textual Information
link Andrew Williams, Arjun Ashok,..., Alexandre Drouin
10 2025-02-13 Long-Term TalkingFace Generation via Motion-Prior Conditional Diffusion Model link SHEN FEI, Cong Wang,..., Tat-Seng Chua
10 2025-01-14 Diffusion Adversarial Post-Training for One-Step Video Generation link Shanchuan Lin, Xin Xia,..., Lu Jiang
10 2025-01-31 Reward-Guided Speculative Decoding for Efficient LLM Reasoning link Baohao Liao, Yuhui Xu,..., Caiming Xiong
10 2025-02-13 EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven
Embodied Agents
link Rui Yang, Hanyang(Jeremy) Chen,..., Tong Zhang
10 2025-02-04 VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in
Video Models
link Hila Chefer, Uriel Singer,..., Shelly Sheynin
10 2025-02-05 Token Assorted: Mixing Latent and Text Tokens for Improved
Language Model Reasoning
link DiJia Su, Hanlin Zhu,..., Qinqing Zheng
10 2025-01-30 MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding link Yuxin Zuo, Shang Qu,..., Bowen Zhou
9 2025-02-24 Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs link Jan Betley, Daniel Tan,..., Owain Evans
9 2024-11-26 Star Attention: Efficient LLM Inference over Long Sequences link Shantanu Acharya, Fei Jia, Boris Ginsburg
9 2024-03-06 Diffusion on language model encodings for protein sequence generation link Viacheslav Meshchaninov, Pavel Strashnov,..., Dmitry Vetrov
9 2025-02-11 CodeIO: Condensing Reasoning Patterns via Code Input-Output Prediction link Junlong Li, Daya Guo,..., Junxian He
9 2024-05-22 How to set AdamW's weight decay as you scale
model and dataset size
link Xi Wang, Laurence Aitchison
9 2025-02-03 Sparse Video-Gen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity link Haocheng Xi, Shuo Yang,..., Song Han
9 2024-05-27 Unisolver: PDE-Conditional Transformers Are Universal Neural PDE Solvers link Hang Zhou, Yuezhou Ma,..., Mingsheng Long
9 2024-05-29 FourierMamba: Fourier Learning Integration with State Space Models for
Image Deraining
link Dong Li, Yidi Liu,..., Zheng-Jun Zha
9 2024-10-03 NETS: A Non-equilibrium Transport Sampler link Michael Albergo, Eric Vanden-Eijnden
8 2024-12-04 From Language Models over Tokens to Language Models over
Characters
link Tim Vieira, Benjamin LeBrun,..., Ryan Cotterell
8 2024-05-14 Addressing Misspecification in Simulation-based Inference through Data-driven Calibration link Antoine Wehenkel, Juan L. Gamella,..., Marco Cuturi
8 2025-02-03 Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling
Verification
link Eric Zhao, Pranjal Awasthi, Sreenivas Gollapudi
8 2024-12-10 FireFlow: Fast Inversion of Rectified Flow for Image Semantic
Editing
link Yingying Deng, Xiangyu He,..., Fan Tang
8 2024-10-02 Automated Red Teaming with GOAT: the Generative Offensive Agent
Tester
link Maya Pavlova, Erik Brinkman,..., Aaron Grattafiori
8 2024-11-18 Understanding Chain-of-Thought in LLMs through Information Theory link Jean-Francois Ton, Muhammad Faaiz Taufiq, Yang Liu
8 2024-10-14 Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention link Dejia Xu, Yifan Jiang,..., Hao Tang
8 2024-12-16 SepLLM: Accelerate Large Language Models by Compressing One Segment
into One Separator
link Guoxuan Chen, Han Shi,..., Chao Huang
8 2025-02-03 ZebraLogic: On the Scaling Limits of LLMs for Logical
Reasoning
link Yuchen Lin, Ronan Le Bras,..., Yejin Choi
8 2025-03-06 Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding
and Expert Reasoning Abilities
link Sreyan Ghosh, Zhifeng Kong,..., Bryan Catanzaro
8 2023-11-21 Multi-Session Budget Optimization for Forward Auction-based Federated Learning link Xiaoli Tang, Han Yu,..., Xiaoxiao Li
8 2024-12-19 Video Prediction Policy: A Generalist Robot Policy with Predictive
Visual Representations
link Yucheng Hu, Yanjiang Guo,..., Jianyu Chen
8 2025-02-25 SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference link Jintao Zhang, Chendong Xiang,..., Jianfei Chen
8 2025-02-08 AnyEdit: Edit Any Knowledge Encoded in Language Models link Houcheng Jiang, Junfeng Fang,..., Tat-Seng Chua
8 2023-11-16 Flow-field inference from neural data using deep recurrent networks link Tim Kim, Thomas Luo,..., Carlos Brody
7 2024-10-04 Look Twice Before You Answer: Memory-Space Visual Retracing for
Hallucination Mitigation in Multimodal Large Language Models
link Xin Zou, Yizhou WANG,..., Xuming Hu
7 2024-06-20 Adversaries Can Misuse Combinations of Safe Models link Erik Jones, Anca Dragan, Jacob Steinhardt
7 2024-07-29 Emergence in non-neural models: grokking modular arithmetic via average
gradient outer product
link Neil Mallinar, Daniel Beaglehole,..., Misha Belkin
7 2024-10-11 PoisonBench: Assessing Large Language Model Vulnerability to Poisoned Preference
Data
link Tingchen Fu, Mrinank Sharma,..., Fazl Barez
7 2025-02-19 Which Attention Heads Matter for In-Context Learning? link Kayo Yin, Jacob Steinhardt
7 2024-07-03 Universal Length Generalization with Turing Programs link Kaiying Hou, Eran Malach,..., Sham Kakade
7 2024-12-16 The dark side of the forces: assessing non-conservative force
models for atomistic machine learning
link Filippo Bigi, Marcel Langer, Michele Ceriotti
7 2025-02-26 Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models link Lucy Xiaoyang Shi, brian ichter,..., Chelsea Finn
7 2024-09-24 Interactive Tools Substantially Assist LM Agents in Finding Security
Vulnerabilities
link Talor Abramovich, Meet Udeshi,..., Ofir Press
7 2025-02-06 Universal Sparse Autoencoders: Interpretable Cross-Model Concept Alignment link Harrish Thasarathan, Julian Forsyth,..., Konstantinos Derpanis
7 2023-12-27 Adaptive Message Passing: A General Framework to Mitigate Oversmoothing,
Oversquashing, and Underreaching
link Federico Errica, Henrik Christiansen,..., Francesco Alesiani
7 2024-04-07 Shortcut-connected Expert Parallelism for Accelerating Mixture of Experts link Weilin Cai, Juyong Jiang,..., Jiayi Huang
7 2024-10-02 Stochastic Deep Restoration Priors for Imaging Inverse Problems link Yuyang Hu, Albert Peng,..., Ulugbek Kamilov
7 2025-02-10 History-Guided Video Diffusion link Kiwhan Song, Boyuan Chen,..., Vincent Sitzmann
7 2024-10-15 G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks link Guibin Zhang, Yanwei Yue,..., Dawei Cheng
7 2024-10-03 Contrastive Localized Language-Image Pre-Training link Hong-You Chen, Zhengfeng Lai,..., Zhe Gan
7 2024-10-18 Diverging Preferences: When do Annotators Disagree and do Models
Know?
link Michael Zhang, Zhilin Wang,..., Valentina Pyatkin
7 2025-02-02 Dissecting Submission Limit in Desk-Rejections: A Mathematical Analysis of
Fairness in AI Conference Policies
link Yuefan Cao, Xiaoyu Li,..., Jiahao Zhang
7 2024-12-17 Proposer-Agent-Evaluator (PAE): Autonomous Skill Discovery For Foundation Model Internet
Agents
link Yifei Zhou, Qianlan Yang,..., Li Li
7 2024-11-15 MARS: Unleashing the Power of Variance Reduction for Training
Large Models
link Huizhuo Yuan, Yifeng Liu,..., Quanquan Gu
7 2025-02-10 MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations link Kaixuan Huang, Jiacheng Guo,..., Mengdi Wang
7 2024-02-22 Subobject-level Image Tokenization link Delong Chen, Samuel Cahyawijaya,..., Pascale FUNG
6 2024-05-10 No-Regret is not enough! Bandits with General Constraints through
Adaptive Regret Minimization
link Martino Bernasconi, Matteo Castiglioni, Andrea Celli
6 None Understanding Mode Connectivity via Parameter Space Symmetry link Bo Zhao, Nima Dehmamy,..., Rose Yu
6 2024-10-21 A Simple Model of Inference Scaling Laws link Noam Levi
6 2025-02-19 FlexTok: Resampling Images into 1D Token Sequences of Flexible
Length
link Roman Bachmann, Jesse Allardice,..., Afshin Dehghan
6 2024-11-07 Scaling Laws for Pre-training Agents and World Models link Tim Pearce, Tabish Rashid,..., Katja Hofmann
6 2024-04-30 Synthetic Face Datasets Generation via Latent Space Exploration from
Brownian Identity Diffusion
link David Geissbühler, Hatef Otroshi Shahreza, Sébastien Marcel
6 2025-01-29 SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse
Autoencoders
link Bartosz Cywiński, Kamil Deja
6 2025-01-24 Leveraging Online Olympiad-Level Math Problems for LLMs Training and
Contamination-Resistant Evaluation
link Seyed Mohammad Sadegh Mahdavi, Muchen Li,..., Renjie Liao
6 2025-02-16 A Physics-Informed Machine Learning Framework for Safe and Optimal
Control of Autonomous Systems
link Manan Tayal, Aditya Singh,..., Somil Bansal
6 2023-11-21 Limitations of measure-first protocols in quantum machine learning link Casper Gyurik, Riccardo Molteni, Vedran Dunjko
6 2025-02-05 Robust Autonomy Emerges from Self-Play link Marco Cusumano-Towner, David Hafner,..., Vladlen Koltun
6 2024-10-06 SITCOM: Step-wise Triple-Consistent Diffusion Sampling For Inverse Problems link Ismail Alkhouri, Shijun Liang,..., Rongrong Wang
6 2025-04-02 PaperBench: Evaluating AI’s Ability to Replicate AI Research link Giulio Starace, Oliver Jaffe,..., Tejal Patwardhan
6 2024-10-15 A Hitchhiker's Guide to Scaling Law Estimation link Leshem Choshen, Yang Zhang, Jacob Andreas
6 2025-02-04 Layer by Layer: Uncovering Hidden Representations in Language Models link Oscar Skean, Md Rifat Arefin,..., Ravid Shwartz-Ziv
6 2025-02-04 TabPFN Unleashed: A Scalable and Effective Solution to Tabular
Classification Problems
link Si-Yang Liu, Han-Jia Ye
6 2025-02-06 Fast Video Generation with Sliding Tile Attention link Peiyuan Zhang, Yongqi Chen,..., Hao Zhang
6 2024-11-06 Self-Consistency Preference Optimization link Archiki Prasad, Weizhe Yuan,..., Jane Dwivedi-Yu
6 2023-10-10 Self-Discriminative Modeling for Anomalous Graph Detection link Jinyu Cai, Yunhe Zhang, Jicong Fan
6 2024-12-29 EraseAnything: Enabling Concept Erasure in Rectified Flow Transformers link Daiheng Gao, Shilin Lu,..., Weiming Zhang
6 2024-03-15 DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers link Xuanlei Zhao, Shenggan Cheng,..., Yang You
6 2025-01-28 Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling link Hongzhi Huang, Defa Zhu,..., zhou Xun
6 2025-02-05 Teaching Language Models to Critique via Reinforcement Learning link Zhihui Xie, Jie chen,..., Lingpeng Kong
6 2025-02-05 Masked Autoencoders Are Effective Tokenizers for Diffusion Models link Hao Chen, Yujin Han,..., Bhiksha Raj
6 2024-10-31 Scalable Reinforcement Post-Training Beyond Static Human Prompts link Ziyu Ye, Rishabh Agarwal,..., Yuan Liu
6 2024-10-04 Compute or Load KV Cache? Why Not Both? link Shuowei Jin, Xueshen Liu,..., Zhuoqing Morley Mao
6 2024-12-12 EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via
Multimodal LLM
link Zhuofan Zong, Dongzhi Jiang,..., Hongsheng Li
6 2025-02-03 Massive Values in Self-Attention Modules are the Key to
Contextual Knowledge Understanding
link Mingyu Jin, Kai Mei,..., Yongfeng Zhang
6 2025-02-04 Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via
Autoregressive Search
link Maohao Shen, Guangtao Zeng,..., Chuang Gan
6 2024-06-07 A Manifold Perspective on the Statistical Generalization of Graph
Neural Networks
link Zhiyang Wang, Juan Cervino, Alejandro Ribeiro
5 2024-10-16 Mechanistic Unlearning: Robust Knowledge Unlearning and Editing via Mechanistic
Localization
link Phillip Guo, Aaquib Syed,..., Gintare Karolina Dziugaite
5 2023-11-15 One-Shot Heterogeneous Federated Learning with Local Model-Guided Diffusion Models link Mingzhao Yang, Shangchao Su,..., Xiangyang Xue
5 2024-08-27 How transformers learn structured data: insights from hierarchical filtering link Jerome Garnier-Brun, Marc Mezard,..., Luca Saglietti
5 2025-02-08 TabICL: A Tabular Foundation Model for In-Context Learning on
Large Data
link Jingang QU, David Holzmüller,..., Marine Le Morvan
5 2025-02-07 NoLiMa: Long-Context Evaluation Beyond Literal Matching link Ali Modarressi, Hanieh Deilamsalehy,..., Hinrich Schuetze
5 2024-10-29 Auditing $f$-differential privacy in one run link Saeed Mahloujifar, Luca Melis, Kamalika Chaudhuri
5 2023-02-20 Depth Degeneracy in Neural Networks: Vanishing Angles in Fully
Connected ReLU Networks on Initialization
link Cameron Jakub, Mihai Nica
5 2025-01-23 GeoPixel: Pixel Grounding Large Multimodal Model in Remote Sensing link Akashah Shabbir, Ilmuz Zaman Mohammed Zumri,..., Salman Khan
5 2025-02-04 MedRAX: Medical Reasoning Agent for Chest X-ray link Adibvafa Fallahpour, Jun Ma,..., BO WANG
5 2025-03-04 Wyckoff Transformer: Generation of Symmetric Crystals link Nikita Kazeev, Wei Nong,..., Kedar Hippalgaonkar
5 2024-12-09 AlphaVerus: Bootstrapping Formally Verified Code Generation through Self-Improving Translation
and Treefinement
link Pranjal Aggarwal, Bryan Parno, Sean Welleck
5 2025-02-23 Are Sparse Autoencoders Useful? A Case Study in Sparse
Probing
link Subhash Kantamneni, Josh Engels,..., Neel Nanda
5 2025-02-13 EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image
Modeling
link Theodoros Kouzelis, Ioannis Kakogeorgiou,..., Nikos Komodakis
5 2024-12-09 Normalizing Flows are Capable Generative Models link Shuangfei Zhai, Ruixiang ZHANG,..., Joshua M Susskind
5 2025-03-05 OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction link Huang Huang, Fangchen Liu,..., Pieter Abbeel
5 2024-10-05 Equivariant Polynomial Functional Networks link Thieu Vo, Viet Hoang Tran,..., Tan Nguyen
5 2025-02-01 Position: Evaluating Generative AI Systems is a Social Science
Measurement Challenge
link Hanna Wallach, Meera Desai,..., Abigail Z. Jacobs
5 2024-10-29 A Large Recurrent Action Model: xLSTM enables Fast Inference
for Robotics Tasks
link Thomas Schmied, Thomas Adler,..., Sepp Hochreiter
5 2024-12-02 LMAct: A Benchmark for In-Context Imitation Learning with Long
Multimodal Demonstrations
link Anian Ruoss, Fabio Pardo,..., Tim Genewein
5 2025-01-21 Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for
Mixture-of-Experts Language Models
link Samira Abnar, Harshay Shah,..., Vimal Thilak
5 2022-06-06 Goal-Space Planning with Subgoal Models link Chunlok Lo, Kevin Roice,..., Martha White
5 2021-10-12 Causal Discovery from Conditionally Stationary Time Series link Carles Balsells-Rodas, Xavier Sumba,..., Yingzhen Li
5 2025-01-13 Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards link Yangsibo Huang, Milad Nasr,..., Chiyuan Zhang
5 2025-01-05 Generalizing from SIMPLE to HARD Visual Reasoning: Can We
Mitigate Modality Imbalance in VLMs?
link Simon Park, Abhishek Panigrahi,..., Sanjeev Arora
5 2025-02-04 DAMO: Data- and Model-aware Alignment of Multi-modal LLMs link Jinda Lu, Junkang Wu,..., Xiangnan He
5 2025-03-05 MA-LoT: Multi-Agent Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem
Proving
link Ruida WANG, Rui Pan,..., Tong Zhang
5 2024-08-02 On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty
Agents
link Jen-Tse Huang, Jiaxu Zhou,..., Maarten Sap
5 2024-06-07 CTBench: A Library and Benchmark for Certified Training link Yuhao Mao, Stefan Balauca, Martin Vechev
5 2025-01-28 P(all-atom) Is Unlocking New Path For Protein Design link Wei Qu, Jiawei Guan,..., haobo Wang
5 2024-10-29 AAAR-1.0: Assessing AI’s Potential to Assist Research link Renze Lou, Hanzi Xu,..., Wenpeng Yin
5 2025-02-03 Scalable Language Models with Posterior Inference of Latent Thought
Vectors
link Deqian Kong, Minglu Zhao,..., Ying Nian Wu
5 2025-02-19 AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence link Yuliang Liu, Junjie Lu,..., Zhouhan Lin
5 2023-12-31 GraphGPT: Generative Pre-trained Graph Eulerian Transformer link Qifang Zhao, Weidong Ren,..., Xiaoxiao Xu
5 2025-02-05 DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct
Preference Optimization
link Zhenglin Zhou, Xiaobo Xia,..., Tat-Seng Chua
5 2025-02-18 Is Noise Conditioning Necessary for Denoising Generative Models? link Zhicheng Jiang, Qiao Sun,..., Kaiming He
5 2024-11-28 Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads link Siqi Kou, Jiachun Jin,..., Zhijie Deng
5 2024-12-05 ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality link Yefei He, Feng Chen,..., Bohan Zhuang
5 2025-02-09 Reinforced Lifelong Editing for Language Models link Zherui Li, Houcheng Jiang,..., Xiang Wang
5 2024-06-20 Raising the Bar: Investigating the Values of Large Language
Models via Generative Evolving Testing
link Han Jiang, Xiaoyuan Yi,..., Xing Xie
5 2025-01-28 Optimizing Large Language Model Training Using FP4 Quantization link Ruizhe Wang, Yeyun Gong,..., Peng CHENG
5 2024-06-17 Compress then Serve: Serving Thousands of LoRA Adapters with
Little Overhead
link Rickard Gabrielsson, Jiacheng Zhu,..., Justin Solomon
5 2024-12-19 How to Synthesize Text Data without Model Collapse? link Xuekai Zhu, Daixuan Cheng,..., Bowen Zhou
4 2025-01-29 Think Smarter not Harder: Adaptive Reasoning with Inference Aware
Optimization
link Zishun Yu, Tengyu Xu,..., Han Fang
4 2024-12-19 HashAttention: Semantic Sparsity for Faster Inference link Aditya Desai, Shuo Yang,..., Ion Stoica
4 2025-02-10 Implicit Language Models are RNNs: Balancing Parallelization and Expressivity link Mark Schoene, Babak Rahmani,..., Jannes Gladrow
4 2024-06-09 Flow of Reasoning: Training LLMs for Divergent Reasoning with
Minimal Examples
link Fangxu Yu, Lai Jiang,..., Lianhui Qin
4 2024-11-04 ViTally Consistent: Scaling Biological Representation Learning for Cell Microscopy link Kian Kenyon-Dean, Zitong Jerry Wang,..., Oren Kraus
4 2025-02-20 From RAG to Memory: Non-Parametric Continual Learning for Large
Language Models
link Bernal Jimenez Gutierrez, Yiheng Shu,..., Yu Su
4 2024-06-16 ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under
Domain Shifts
link Samar Khanna, Medhanie Irgau,..., Stefano Ermon
4 2022-04-23 Spherical Rotation Dimension Reduction with Geometric Loss Functions link Hengrui Luo, Jeremy E. Purvis, Didong Li
4 2024-05-29 Does learning the right latent variables necessarily improve in-context
learning?
link Sarthak Mittal, Eric Elmoznino,..., Dhanya Sridhar
4 2024-12-12 Lexico: Extreme KV Cache Compression via Sparse Coding over
Universal Dictionaries
link Junhyuck Kim, Jong Ho Park,..., Dimitris Papailiopoulos
4 2024-11-20 GraphCL: Graph-based Clustering for Semi-Supervised Medical Image Segmentation link Mengzhu Wang, houcheng su,..., Jingcai Guo
4 2024-06-06 The Brain's Bitter Lesson: Scaling Speech Decoding With Self-Supervised
Learning
link Dulhan Jayalath, Gilad Landau,..., ʻŌiwi Parker Jones
4 2024-08-08 Risk and cross validation in ridge regression with correlated
samples
link Alexander Atanasov, Jacob A Zavatone-Veth, Cengiz Pehlevan
4 2025-02-05 Detecting Strategic Deception with Linear Probes link Nicholas Goldowsky-Dill, Bilal Chughtai,..., Marius Hobbhahn
4 2025-01-31 Fixing the Double Penalty in Data-Driven Weather Forecasting Through
a Modified Spherical Harmonic Loss Function
link Christopher Subich, Syed Husain,..., Jing Yang
4 2024-10-01 Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And
Input View Curation
link Junlin Han, Jianyuan Wang,..., Filippos Kokkinos
4 2025-02-06 Discovering Symbolic Cognitive Models from Human and Animal Behavior link Pablo Samuel Castro, Nenad Tomasev,..., Kimberly Stachenfeld
4 2025-02-12 Distillation Scaling Laws link Dan Busbridge, Amitis Shidani,..., Russell Webb
4 2024-07-26 Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under
Preference Drift
link Seongho Son, William Bankes,..., Ilija Bogunovic
4 2024-02-09 Where is the Truth? The Risk of Getting Confounded
in a Continual World
link Florian Peter Busch, ROSHNI KAMATH,..., Martin Mundt
4 2024-03-10 A Reductions Approach to Risk-Sensitive Reinforcement Learning with Optimized
Certainty Equivalents
link Kaiwen Wang, Dawen Liang,..., Wen Sun
4 2024-11-01 KAN-AD: Time Series Anomaly Detection with Kolmogorov–Arnold Networks link Quan Zhou, Changhua Pei,..., HanJing
4 2024-10-20 PICI: Efficient Position-Independent Context Caching for Serving Large Language
Models
link JUNHAO HU, Wenrui Huang,..., Tao Xie
4 2024-11-03 Autoformulation of Mathematical Optimization Models Using LLMs link Nicolás Astorga, Tennison Liu,..., M van der Schaar
4 2025-05-01 PertEval-scFM: Benchmarking Single-Cell Foundation Models for Perturbation Effect Prediction link Aaron Wenteler, Martina Occhetta,..., Amaya Gallagher-Syed
4 2025-02-13 CoSER: Coordinating LLM-Based Persona Simulation of Established Roles link Xintao Wang, Heng Wang,..., Shuchang Zhou
4 2025-01-30 SAM2Act: Integrating Visual Foundation Model with A Memory Architecture
for Robotic Manipulation
link Haoquan Fang, Markus Grotz,..., Jiafei Duan
4 2023-10-25 Sum-of-Parts: Self-Attributing Neural Networks with End-to-End Learning of Feature
Groups
link Weiqiu You, Helen Qu,..., Eric Wong
4 2025-02-08 From Mechanistic Interpretability to Mechanistic Biology: Training, Evaluating, and
Interpreting Sparse Autoencoders on Protein Language Models
link Etowah Adams, Liam Bai,..., Mohammed AlQuraishi
4 2024-10-11 Parameter-Efficient Fine-Tuning of State Space Models link Kevin Galim, Wonjun Kang,..., Kangwook Lee
4 2025-02-14 HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension
and Generation via Heterogeneous Knowledge Adaptation
link Tianwei Lin, Wenqiao Zhang,..., Beng Chin Ooi
4 2025-02-03 Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges link Nayoung Lee, Jack Cai,..., Dimitris Papailiopoulos
4 2024-11-17 Learn from Downstream and Be Yourself in Multimodal Large
Language Models Fine-Tuning
link Wenke Huang, Jian Liang,..., Mang Ye
4 2024-10-08 Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing link Zhuoran Zhang, Yongxiang Li,..., Di Wang
4 2024-10-12 FlatQuant: Flatness Matters for LLM Quantization link Yuxuan Sun, Ruikang Liu,..., Jun Yao
4 2025-02-05 Position: Editing Large Language Models Poses Serious Safety Risks link Paul Youssef, Zhixue Zhao,..., Christin Seifert
4 2024-06-04 S2-Track: A Simple yet Strong Approach for End-to-End 3D
Multi-Object Tracking
link Tao Tang, Lijun Zhou,..., Xiaodan Liang
4 2025-02-06 Scaling Laws in Patchification: An Image Is Worth 50,176
Tokens And More
link Feng Wang, Yaodong Yu,..., Cihang Xie
4 2025-02-17 video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model link Guangzhi Sun, Yudong Yang,..., Chao Zhang
4 2025-02-01 OrcaLoca: An LLM Agent Framework for Software Issue Localization link Zhongming Yu, Hejia Zhang,..., Jishen Zhao
4 2024-02-10 Clients Collaborate: Flexible Differentially Private Federated Learning with Guaranteed
Improvement of Utility-Privacy Trade-off
link Yuecheng Li, Lele Fu,..., Chuan Chen
4 2024-06-09 Expressive Power of Graph Neural Networks for (Mixed-Integer) Quadratic
Programs
link Ziang Chen, Xiaohan Chen,..., Wotao Yin
4 2024-12-24 Orient Anything: Learning Robust Object Orientation Estimation from Rendering
3D Models
link Zehan Wang, Ziang Zhang,..., Zhou Zhao
4 2024-10-16 Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large
Language Models
link Linhao Luo, Zicheng Zhao,..., Shirui Pan
4 2025-01-02 Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent link Yongxian Wei, Anke Tang,..., Xiaochun Cao
4 2025-02-17 Idiosyncrasies in Large Language Models link Mingjie Sun, Yida Yin,..., Zhuang Liu
4 2025-01-26 Rethinking External Slow-Thinking: From Snowball Errors to Probability of
Correct Reasoning
link Zeyu Gan, Yun Liao, Yong Liu
3 2024-08-09 EasyInv: Toward Fast and Better DDIM Inversion link Ziyue Zhang, Mingbao Lin,..., Rongrong Ji