380 |
2024-10-23 |
WorldSimBench: Towards Video Generation Models as World Simulators |
link |
Yiran Qin, Zhelun Shi,..., Ruimao Zhang |
375 |
2021-05-19 |
Self-supervised Heterogeneous Graph Neural Network with Optimal Transport |
link |
Yanbei Liu, Chongxu Wang,..., Zhitao Xiao |
95 |
2024-12-30 |
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs |
link |
Xingyu Chen, Jiahao Xu,..., Dong Yu |
92 |
2024-07-05 |
Learning to (Learn at Test Time): RNNs with Expressive Hidden States |
link |
Yu Sun, Xinhao Li,..., Carlos Guestrin |
78 |
2025-01-08 |
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking |
link |
Xinyu Guan, Li Lyna Zhang,..., Mao Yang |
66 |
2024-06-28 |
MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance |
link |
Yuang Zhang, Jiaxi Gu,..., FangYuan Zou |
63 |
2025-02-05 |
Demystifying Long Chain-of-Thought Reasoning |
link |
Edward Yeo, Yuxuan Tong,..., Xiang Yue |
58 |
2025-01-28 |
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training |
link |
Tianzhe Chu, Yuexiang Zhai,..., Yi Ma |
57 |
2024-04-29 |
DPO Meets PPO: Reinforced Token Optimization for RLHF |
link |
Han Zhong, Zikang Shan,..., Liwei Wang |
56 |
2024-04-21 |
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs |
link |
Anselm Paulus, Arman Zharmagambetov,..., Yuandong Tian |
49 |
2024-10-22 |
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding |
link |
Xiaoqian Shen, Yunyang Xiong,..., Vikas Chandra |
42 |
2024-01-03 |
Theoretical guarantees on the best-of-n alignment policy |
link |
Ahmad Beirami, Alekh Agarwal,..., Ananda Suresh |
41 |
2024-12-02 |
Free Process Rewards without Process Labels |
link |
Lifan Yuan, Wendi Li,..., Hao Peng |
40 |
2024-11-04 |
How Far Is Video Generation from World Model: A Physical Law Perspective |
link |
Bingyi Kang, Yang Yue,..., Jiashi Feng |
37 |
2024-10-06 |
SparseVLM: Visual Token Sparsification for Efficient Vision Language Models Inference |
link |
Yuan Zhang, Chun-Kai Fan,..., Shanghang Zhang |
35 |
2024-10-14 |
Agent-as-a-Judge: Evaluate Agents with Agents |
link |
Mingchen Zhuge, Changsheng Zhao,..., Jürgen Schmidhuber |
35 |
2024-06-12 |
What If We Recaption Billions of Web Images with LLaMA-3? |
link |
Xianhang Li, Haoqin Tu,..., Cihang Xie |
33 |
2024-05-31 |
OR-Bench: An Over-Refusal Benchmark for Large Language Models |
link |
Jiaxing Cui, Wei-Lin Chiang,..., Cho-Jui Hsieh |
31 |
2023-11-30 |
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models |
link |
Marwa Abdulhai, Isadora White,..., Sergey Levine |
30 |
2024-02-13 |
Mitigating Object Hallucination in Large Vision-Language Models via Image-Grounded Guidance |
link |
Linxi Zhao, Yihe Deng,..., Quanquan Gu |
26 |
2024-11-01 |
Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM |
link |
Xiong Wang, Yangze Li,..., Long Ma |
26 |
2024-09-12 |
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale |
link |
Rogerio Bonatti, Dan Zhao,..., Zheng Hui |
24 |
2024-11-22 |
RE-Bench: Evaluating Frontier AI R&D Capabilities of Language Model Agents against Human Experts |
link |
Hjalmar Wijk, Tao Lin,..., Elizabeth Barnes |
24 |
2024-10-07 |
Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation |
link |
Fanqing Meng, Jiaqi Liao,..., Ping Luo |
23 |
2023-01-11 |
An Analysis of Quantile Temporal-Difference Learning |
link |
Mark Rowland, Remi Munos,..., Will Dabney |
23 |
2022-10-22 |
Discrepancy Minimization in Input-Sparsity Time |
link |
Yichuan Deng, Xiaoyu Li,..., OMRI WEINSTEIN |
23 |
2024-12-30 |
Training Software Engineering Agents and Verifiers with SWE-Gym |
link |
Jiayi Pan, Xingyao Wang,..., Yizhe Zhang |
23 |
2024-06-13 |
GuardAgent: Safeguard LLM Agents via Knowledge-Enabled Reasoning |
link |
Zhen Xiang, Linzhi Zheng,..., Bo Li |
22 |
2024-12-12 |
Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning |
link |
Zhenni Bi, Kai Han,..., Yunhe Wang |
22 |
2024-11-07 |
Taming Rectified Flow for Inversion and Editing |
link |
Jiangshan Wang, Junfu Pu,..., Ying Shan |
21 |
2024-10-02 |
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning |
link |
Jonas Gehring, Kunhao Zheng,..., Gabriel Synnaeve |
21 |
2023-04-03 |
Empirical Design in Reinforcement Learning |
link |
Andrew Patterson, Samuel F Neumann,..., Adam White |
21 |
2025-02-13 |
MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency |
link |
Hongsheng Li, Yu Qi,..., Shen Yan |
21 |
2022-08-15 |
AI for Global Climate Cooperation: Modeling Global Climate Negotiations, Agreements, and Long-Term Cooperation in RICE-N |
link |
Tianyu Zhang, Andrew Williams,..., Stephan Zheng |
21 |
2024-09-11 |
Agent Workflow Memory |
link |
Zhiruo Wang, Jiayuan Mao,..., Graham Neubig |
21 |
2023-06-18 |
Position: AI Evaluation Should Learn from How We Test Humans |
link |
Yan Zhuang, Qi Liu,..., Enhong Chen |
20 |
2024-03-05 |
Cradle: Empowering Foundation Agents towards General Computer Control |
link |
Weihao Tan, Wentao Zhang,..., Zongqing Lu |
20 |
2025-01-13 |
Imagine While Reasoning in Space: Multimodal Visualization-of-Thought |
link |
Chengzu Li, Wenshan Wu,..., Furu Wei |
20 |
2025-01-20 |
Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling |
link |
Zhenyu Hou, Xin Lv,..., Yuxiao Dong |
19 |
2024-10-27 |
Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse |
link |
Ryan Liu, Jiayi Geng,..., Thomas Griffiths |
18 |
2025-02-17 |
Scaling Test-Time Compute Without Verification or RL is Suboptimal |
link |
Amrith Setlur, Nived Rajaraman,..., Aviral Kumar |
18 |
2025-02-17 |
SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering? |
link |
Samuel Miserendino, Michele Wang,..., Johannes Heidecke |
18 |
2024-08-30 |
VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters |
link |
Mouxiang Chen, Lefei Shen,..., Chenghao Liu |
16 |
2024-08-18 |
Antidote: Post-fine-tuning Safety Alignment for Large Language Models against Harmful Fine-tuning |
link |
Tiansheng Huang, Gautam Bhattacharya,..., Ling Liu |
16 |
2023-10-26 |
High-Dimensional Prediction for Sequential Decision Making |
link |
Georgy Noarov, Ramya Ramalingam,..., Stephan Xie |
16 |
2024-06-06 |
Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? |
link |
Rylan Schaeffer, Hailey Schoelkopf,..., Sanmi Koyejo |
16 |
2024-10-28 |
One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation |
link |
Zhendong Wang, Max Li,..., Yu Zeng |
16 |
2024-10-28 |
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference |
link |
Hanshi Sun, Li-Wen Chang,..., Beidi Chen |
16 |
2025-01-09 |
Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark |
link |
Yunzhuo Hao, Jiawei Gu,..., Yu Cheng |
15 |
2024-04-05 |
Nonparametric Modern Hopfield Models |
link |
Jerry Yao-Chieh Hu, Bo-Yu Chen,..., Han Liu |
15 |
2024-10-14 |
Thinking LLMs: General Instruction Following with Thought Generation |
link |
Tianhao Wu, Janice Lan,..., Sainbayar Sukhbaatar |
15 |
2024-05-23 |
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models |
link |
Wei Huang, Haotong Qin,..., XIAOJUAN QI |
15 |
2024-09-25 |
Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale |
link |
Fan Zhou, Zengzhi Wang,..., Pengfei Liu |
15 |
2024-11-17 |
SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization |
link |
Jintao Zhang, Haofeng Huang,..., Jianfei Chen |
15 |
2024-12-05 |
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction |
link |
Yiheng Xu, Zekun Wang,..., Caiming Xiong |
15 |
2025-01-12 |
A General Framework for Inference-time Scaling and Steering of Diffusion Models |
link |
raghav singhal, Zachary Horvitz,..., Rajesh Ranganath |
14 |
2024-03-09 |
AutoEval Done Right: Using Synthetic Data for Model Evaluation |
link |
Pierre Boyeau, Anastasios Angelopoulos,..., Michael Jordan |
14 |
2025-01-28 |
AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders |
link |
Zhengxuan Wu, Aryaman Arora,..., Christopher Potts |
14 |
2024-12-19 |
FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching |
link |
Sucheng Ren, Qihang Yu,..., Liang-Chieh Chen |
14 |
2025-01-30 |
SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer |
link |
Enze Xie, Junsong Chen,..., Song Han |
14 |
2024-11-07 |
DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning |
link |
Gaoyue Zhou, Hengkai Pan,..., Lerrel Pinto |
13 |
2024-06-12 |
WMAdapter: Adding WaterMark Control to Latent Diffusion Models |
link |
Hai Ci, Yiren Song,..., Mike Zheng Shou |
13 |
2024-10-15 |
MoH: Multi-Head Attention as Mixture-of-Head Attention |
link |
Peng Jin, Bo Zhu,..., Shuicheng YAN |
13 |
2025-03-10 |
Optimizing Test-Time Compute via Meta Reinforcement Finetuning |
link |
Yuxiao Qu, Matthew Yang,..., Aviral Kumar |
13 |
2021-12-14 |
On the Impact of Hard Adversarial Instances on Overfitting in Adversarial Training |
link |
Chen Liu, Zhichao Huang,..., Sabine Süsstrunk |
12 |
2024-10-02 |
FlipAttack: Jailbreak LLMs via Flipping |
link |
Yue Liu, Xiaoxin He,..., Bryan Hooi |
12 |
2024-12-28 |
An analytic theory of creativity in convolutional diffusion models |
link |
Mason Kamb, Surya Ganguli |
11 |
2024-10-22 |
Collapse or Thrive: Perils and Promises of Synthetic Data in a Self-Generating World |
link |
Joshua Kazdan, Rylan Schaeffer,..., Sanmi Koyejo |
11 |
2024-10-17 |
Automatically Interpreting Millions of Features in Large Language Models |
link |
Gonçalo Paulo, Alex Mallen,..., Nora Belrose |
11 |
2024-10-03 |
AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML |
link |
Patara Trirat, Wonyong Jeong, Sung Ju Hwang |
11 |
2024-12-23 |
Diving into Self-Evolve Training for Multimodal Reasoning |
link |
Wei Liu, Junlong Li,..., Junxian He |
11 |
2024-11-29 |
Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM’s Reasoning Capability |
link |
Zicheng Lin, Tian Liang,..., Zhaopeng Tu |
11 |
2023-10-12 |
MCU: An Evaluation Framework for Open-Ended Game Agents |
link |
Xinyue Zheng, Haowei Lin,..., Yitao Liang |
11 |
2025-02-14 |
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment |
link |
Yi-Fan Zhang, Tao Yu,..., Rong Jin |
10 |
2024-10-24 |
Context is Key: A Benchmark for Forecasting with Essential Textual Information |
link |
Andrew Williams, Arjun Ashok,..., Alexandre Drouin |
10 |
2025-02-13 |
Long-Term TalkingFace Generation via Motion-Prior Conditional Diffusion Model |
link |
SHEN FEI, Cong Wang,..., Tat-Seng Chua |
10 |
2025-01-14 |
Diffusion Adversarial Post-Training for One-Step Video Generation |
link |
Shanchuan Lin, Xin Xia,..., Lu Jiang |
10 |
2025-01-31 |
Reward-Guided Speculative Decoding for Efficient LLM Reasoning |
link |
Baohao Liao, Yuhui Xu,..., Caiming Xiong |
10 |
2025-02-13 |
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents |
link |
Rui Yang, Hanyang(Jeremy) Chen,..., Tong Zhang |
10 |
2025-02-04 |
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models |
link |
Hila Chefer, Uriel Singer,..., Shelly Sheynin |
10 |
2025-02-05 |
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning |
link |
DiJia Su, Hanlin Zhu,..., Qinqing Zheng |
10 |
2025-01-30 |
MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding |
link |
Yuxin Zuo, Shang Qu,..., Bowen Zhou |
9 |
2025-02-24 |
Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs |
link |
Jan Betley, Daniel Tan,..., Owain Evans |
9 |
2024-11-26 |
Star Attention: Efficient LLM Inference over Long Sequences |
link |
Shantanu Acharya, Fei Jia, Boris Ginsburg |
9 |
2024-03-06 |
Diffusion on language model encodings for protein sequence generation |
link |
Viacheslav Meshchaninov, Pavel Strashnov,..., Dmitry Vetrov |
9 |
2025-02-11 |
CodeIO: Condensing Reasoning Patterns via Code Input-Output Prediction |
link |
Junlong Li, Daya Guo,..., Junxian He |
9 |
2024-05-22 |
How to set AdamW's weight decay as you scale model and dataset size |
link |
Xi Wang, Laurence Aitchison |
9 |
2025-02-03 |
Sparse Video-Gen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity |
link |
Haocheng Xi, Shuo Yang,..., Song Han |
9 |
2024-05-27 |
Unisolver: PDE-Conditional Transformers Are Universal Neural PDE Solvers |
link |
Hang Zhou, Yuezhou Ma,..., Mingsheng Long |
9 |
2024-05-29 |
FourierMamba: Fourier Learning Integration with State Space Models for Image Deraining |
link |
Dong Li, Yidi Liu,..., Zheng-Jun Zha |
9 |
2024-10-03 |
NETS: A Non-equilibrium Transport Sampler |
link |
Michael Albergo, Eric Vanden-Eijnden |
8 |
2024-12-04 |
From Language Models over Tokens to Language Models over Characters |
link |
Tim Vieira, Benjamin LeBrun,..., Ryan Cotterell |
8 |
2024-05-14 |
Addressing Misspecification in Simulation-based Inference through Data-driven Calibration |
link |
Antoine Wehenkel, Juan L. Gamella,..., Marco Cuturi |
8 |
2025-02-03 |
Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification |
link |
Eric Zhao, Pranjal Awasthi, Sreenivas Gollapudi |
8 |
2024-12-10 |
FireFlow: Fast Inversion of Rectified Flow for Image Semantic Editing |
link |
Yingying Deng, Xiangyu He,..., Fan Tang |
8 |
2024-10-02 |
Automated Red Teaming with GOAT: the Generative Offensive Agent Tester |
link |
Maya Pavlova, Erik Brinkman,..., Aaron Grattafiori |
8 |
2024-11-18 |
Understanding Chain-of-Thought in LLMs through Information Theory |
link |
Jean-Francois Ton, Muhammad Faaiz Taufiq, Yang Liu |
8 |
2024-10-14 |
Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention |
link |
Dejia Xu, Yifan Jiang,..., Hao Tang |
8 |
2024-12-16 |
SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator |
link |
Guoxuan Chen, Han Shi,..., Chao Huang |
8 |
2025-02-03 |
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning |
link |
Yuchen Lin, Ronan Le Bras,..., Yejin Choi |
8 |
2025-03-06 |
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities |
link |
Sreyan Ghosh, Zhifeng Kong,..., Bryan Catanzaro |
8 |
2023-11-21 |
Multi-Session Budget Optimization for Forward Auction-based Federated Learning |
link |
Xiaoli Tang, Han Yu,..., Xiaoxiao Li |
8 |
2024-12-19 |
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations |
link |
Yucheng Hu, Yanjiang Guo,..., Jianyu Chen |
8 |
2025-02-25 |
SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference |
link |
Jintao Zhang, Chendong Xiang,..., Jianfei Chen |
8 |
2025-02-08 |
AnyEdit: Edit Any Knowledge Encoded in Language Models |
link |
Houcheng Jiang, Junfeng Fang,..., Tat-Seng Chua |
8 |
2023-11-16 |
Flow-field inference from neural data using deep recurrent networks |
link |
Tim Kim, Thomas Luo,..., Carlos Brody |
7 |
2024-10-04 |
Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models |
link |
Xin Zou, Yizhou WANG,..., Xuming Hu |
7 |
2024-06-20 |
Adversaries Can Misuse Combinations of Safe Models |
link |
Erik Jones, Anca Dragan, Jacob Steinhardt |
7 |
2024-07-29 |
Emergence in non-neural models: grokking modular arithmetic via average gradient outer product |
link |
Neil Mallinar, Daniel Beaglehole,..., Misha Belkin |
7 |
2024-10-11 |
PoisonBench: Assessing Large Language Model Vulnerability to Poisoned Preference Data |
link |
Tingchen Fu, Mrinank Sharma,..., Fazl Barez |
7 |
2025-02-19 |
Which Attention Heads Matter for In-Context Learning? |
link |
Kayo Yin, Jacob Steinhardt |
7 |
2024-07-03 |
Universal Length Generalization with Turing Programs |
link |
Kaiying Hou, Eran Malach,..., Sham Kakade |
7 |
2024-12-16 |
The dark side of the forces: assessing non-conservative force models for atomistic machine learning |
link |
Filippo Bigi, Marcel Langer, Michele Ceriotti |
7 |
2025-02-26 |
Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models |
link |
Lucy Xiaoyang Shi, brian ichter,..., Chelsea Finn |
7 |
2024-09-24 |
Interactive Tools Substantially Assist LM Agents in Finding Security Vulnerabilities |
link |
Talor Abramovich, Meet Udeshi,..., Ofir Press |
7 |
2025-02-06 |
Universal Sparse Autoencoders: Interpretable Cross-Model Concept Alignment |
link |
Harrish Thasarathan, Julian Forsyth,..., Konstantinos Derpanis |
7 |
2023-12-27 |
Adaptive Message Passing: A General Framework to Mitigate Oversmoothing, Oversquashing, and Underreaching |
link |
Federico Errica, Henrik Christiansen,..., Francesco Alesiani |
7 |
2024-04-07 |
Shortcut-connected Expert Parallelism for Accelerating Mixture of Experts |
link |
Weilin Cai, Juyong Jiang,..., Jiayi Huang |
7 |
2024-10-02 |
Stochastic Deep Restoration Priors for Imaging Inverse Problems |
link |
Yuyang Hu, Albert Peng,..., Ulugbek Kamilov |
7 |
2025-02-10 |
History-Guided Video Diffusion |
link |
Kiwhan Song, Boyuan Chen,..., Vincent Sitzmann |
7 |
2024-10-15 |
G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks |
link |
Guibin Zhang, Yanwei Yue,..., Dawei Cheng |
7 |
2024-10-03 |
Contrastive Localized Language-Image Pre-Training |
link |
Hong-You Chen, Zhengfeng Lai,..., Zhe Gan |
7 |
2024-10-18 |
Diverging Preferences: When do Annotators Disagree and do Models Know? |
link |
Michael Zhang, Zhilin Wang,..., Valentina Pyatkin |
7 |
2025-02-02 |
Dissecting Submission Limit in Desk-Rejections: A Mathematical Analysis of Fairness in AI Conference Policies |
link |
Yuefan Cao, Xiaoyu Li,..., Jiahao Zhang |
7 |
2024-12-17 |
Proposer-Agent-Evaluator (PAE): Autonomous Skill Discovery For Foundation Model Internet Agents |
link |
Yifei Zhou, Qianlan Yang,..., Li Li |
7 |
2024-11-15 |
MARS: Unleashing the Power of Variance Reduction for Training Large Models |
link |
Huizhuo Yuan, Yifeng Liu,..., Quanquan Gu |
7 |
2025-02-10 |
MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations |
link |
Kaixuan Huang, Jiacheng Guo,..., Mengdi Wang |
7 |
2024-02-22 |
Subobject-level Image Tokenization |
link |
Delong Chen, Samuel Cahyawijaya,..., Pascale FUNG |
6 |
2024-05-10 |
No-Regret is not enough! Bandits with General Constraints through Adaptive Regret Minimization |
link |
Martino Bernasconi, Matteo Castiglioni, Andrea Celli |
6 |
None |
Understanding Mode Connectivity via Parameter Space Symmetry |
link |
Bo Zhao, Nima Dehmamy,..., Rose Yu |
6 |
2024-10-21 |
A Simple Model of Inference Scaling Laws |
link |
Noam Levi |
6 |
2025-02-19 |
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length |
link |
Roman Bachmann, Jesse Allardice,..., Afshin Dehghan |
6 |
2024-11-07 |
Scaling Laws for Pre-training Agents and World Models |
link |
Tim Pearce, Tabish Rashid,..., Katja Hofmann |
6 |
2024-04-30 |
Synthetic Face Datasets Generation via Latent Space Exploration from Brownian Identity Diffusion |
link |
David Geissbühler, Hatef Otroshi Shahreza, Sébastien Marcel |
6 |
2025-01-29 |
SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders |
link |
Bartosz Cywiński, Kamil Deja |
6 |
2025-01-24 |
Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation |
link |
Seyed Mohammad Sadegh Mahdavi, Muchen Li,..., Renjie Liao |
6 |
2025-02-16 |
A Physics-Informed Machine Learning Framework for Safe and Optimal Control of Autonomous Systems |
link |
Manan Tayal, Aditya Singh,..., Somil Bansal |
6 |
2023-11-21 |
Limitations of measure-first protocols in quantum machine learning |
link |
Casper Gyurik, Riccardo Molteni, Vedran Dunjko |
6 |
2025-02-05 |
Robust Autonomy Emerges from Self-Play |
link |
Marco Cusumano-Towner, David Hafner,..., Vladlen Koltun |
6 |
2024-10-06 |
SITCOM: Step-wise Triple-Consistent Diffusion Sampling For Inverse Problems |
link |
Ismail Alkhouri, Shijun Liang,..., Rongrong Wang |
6 |
2025-04-02 |
PaperBench: Evaluating AI’s Ability to Replicate AI Research |
link |
Giulio Starace, Oliver Jaffe,..., Tejal Patwardhan |
6 |
2024-10-15 |
A Hitchhiker's Guide to Scaling Law Estimation |
link |
Leshem Choshen, Yang Zhang, Jacob Andreas |
6 |
2025-02-04 |
Layer by Layer: Uncovering Hidden Representations in Language Models |
link |
Oscar Skean, Md Rifat Arefin,..., Ravid Shwartz-Ziv |
6 |
2025-02-04 |
TabPFN Unleashed: A Scalable and Effective Solution to Tabular Classification Problems |
link |
Si-Yang Liu, Han-Jia Ye |
6 |
2025-02-06 |
Fast Video Generation with Sliding Tile Attention |
link |
Peiyuan Zhang, Yongqi Chen,..., Hao Zhang |
6 |
2024-11-06 |
Self-Consistency Preference Optimization |
link |
Archiki Prasad, Weizhe Yuan,..., Jane Dwivedi-Yu |
6 |
2023-10-10 |
Self-Discriminative Modeling for Anomalous Graph Detection |
link |
Jinyu Cai, Yunhe Zhang, Jicong Fan |
6 |
2024-12-29 |
EraseAnything: Enabling Concept Erasure in Rectified Flow Transformers |
link |
Daiheng Gao, Shilin Lu,..., Weiming Zhang |
6 |
2024-03-15 |
DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers |
link |
Xuanlei Zhao, Shenggan Cheng,..., Yang You |
6 |
2025-01-28 |
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling |
link |
Hongzhi Huang, Defa Zhu,..., zhou Xun |
6 |
2025-02-05 |
Teaching Language Models to Critique via Reinforcement Learning |
link |
Zhihui Xie, Jie chen,..., Lingpeng Kong |
6 |
2025-02-05 |
Masked Autoencoders Are Effective Tokenizers for Diffusion Models |
link |
Hao Chen, Yujin Han,..., Bhiksha Raj |
6 |
2024-10-31 |
Scalable Reinforcement Post-Training Beyond Static Human Prompts |
link |
Ziyu Ye, Rishabh Agarwal,..., Yuan Liu |
6 |
2024-10-04 |
Compute or Load KV Cache? Why Not Both? |
link |
Shuowei Jin, Xueshen Liu,..., Zhuoqing Morley Mao |
6 |
2024-12-12 |
EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM |
link |
Zhuofan Zong, Dongzhi Jiang,..., Hongsheng Li |
6 |
2025-02-03 |
Massive Values in Self-Attention Modules are the Key to Contextual Knowledge Understanding |
link |
Mingyu Jin, Kai Mei,..., Yongfeng Zhang |
6 |
2025-02-04 |
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search |
link |
Maohao Shen, Guangtao Zeng,..., Chuang Gan |
6 |
2024-06-07 |
A Manifold Perspective on the Statistical Generalization of Graph Neural Networks |
link |
Zhiyang Wang, Juan Cervino, Alejandro Ribeiro |
5 |
2024-10-16 |
Mechanistic Unlearning: Robust Knowledge Unlearning and Editing via Mechanistic Localization |
link |
Phillip Guo, Aaquib Syed,..., Gintare Karolina Dziugaite |
5 |
2023-11-15 |
One-Shot Heterogeneous Federated Learning with Local Model-Guided Diffusion Models |
link |
Mingzhao Yang, Shangchao Su,..., Xiangyang Xue |
5 |
2024-08-27 |
How transformers learn structured data: insights from hierarchical filtering |
link |
Jerome Garnier-Brun, Marc Mezard,..., Luca Saglietti |
5 |
2025-02-08 |
TabICL: A Tabular Foundation Model for In-Context Learning on Large Data |
link |
Jingang QU, David Holzmüller,..., Marine Le Morvan |
5 |
2025-02-07 |
NoLiMa: Long-Context Evaluation Beyond Literal Matching |
link |
Ali Modarressi, Hanieh Deilamsalehy,..., Hinrich Schuetze |
5 |
2024-10-29 |
Auditing $f$-differential privacy in one run |
link |
Saeed Mahloujifar, Luca Melis, Kamalika Chaudhuri |
5 |
2023-02-20 |
Depth Degeneracy in Neural Networks: Vanishing Angles in Fully Connected ReLU Networks on Initialization |
link |
Cameron Jakub, Mihai Nica |
5 |
2025-01-23 |
GeoPixel: Pixel Grounding Large Multimodal Model in Remote Sensing |
link |
Akashah Shabbir, Ilmuz Zaman Mohammed Zumri,..., Salman Khan |
5 |
2025-02-04 |
MedRAX: Medical Reasoning Agent for Chest X-ray |
link |
Adibvafa Fallahpour, Jun Ma,..., BO WANG |
5 |
2025-03-04 |
Wyckoff Transformer: Generation of Symmetric Crystals |
link |
Nikita Kazeev, Wei Nong,..., Kedar Hippalgaonkar |
5 |
2024-12-09 |
AlphaVerus: Bootstrapping Formally Verified Code Generation through Self-Improving Translation and Treefinement |
link |
Pranjal Aggarwal, Bryan Parno, Sean Welleck |
5 |
2025-02-23 |
Are Sparse Autoencoders Useful? A Case Study in Sparse Probing |
link |
Subhash Kantamneni, Josh Engels,..., Neel Nanda |
5 |
2025-02-13 |
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling |
link |
Theodoros Kouzelis, Ioannis Kakogeorgiou,..., Nikos Komodakis |
5 |
2024-12-09 |
Normalizing Flows are Capable Generative Models |
link |
Shuangfei Zhai, Ruixiang ZHANG,..., Joshua M Susskind |
5 |
2025-03-05 |
OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction |
link |
Huang Huang, Fangchen Liu,..., Pieter Abbeel |
5 |
2024-10-05 |
Equivariant Polynomial Functional Networks |
link |
Thieu Vo, Viet Hoang Tran,..., Tan Nguyen |
5 |
2025-02-01 |
Position: Evaluating Generative AI Systems is a Social Science Measurement Challenge |
link |
Hanna Wallach, Meera Desai,..., Abigail Z. Jacobs |
5 |
2024-10-29 |
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks |
link |
Thomas Schmied, Thomas Adler,..., Sepp Hochreiter |
5 |
2024-12-02 |
LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations |
link |
Anian Ruoss, Fabio Pardo,..., Tim Genewein |
5 |
2025-01-21 |
Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models |
link |
Samira Abnar, Harshay Shah,..., Vimal Thilak |
5 |
2022-06-06 |
Goal-Space Planning with Subgoal Models |
link |
Chunlok Lo, Kevin Roice,..., Martha White |
5 |
2021-10-12 |
Causal Discovery from Conditionally Stationary Time Series |
link |
Carles Balsells-Rodas, Xavier Sumba,..., Yingzhen Li |
5 |
2025-01-13 |
Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards |
link |
Yangsibo Huang, Milad Nasr,..., Chiyuan Zhang |
5 |
2025-01-05 |
Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs? |
link |
Simon Park, Abhishek Panigrahi,..., Sanjeev Arora |
5 |
2025-02-04 |
DAMO: Data- and Model-aware Alignment of Multi-modal LLMs |
link |
Jinda Lu, Junkang Wu,..., Xiangnan He |
5 |
2025-03-05 |
MA-LoT: Multi-Agent Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving |
link |
Ruida WANG, Rui Pan,..., Tong Zhang |
5 |
2024-08-02 |
On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents |
link |
Jen-Tse Huang, Jiaxu Zhou,..., Maarten Sap |
5 |
2024-06-07 |
CTBench: A Library and Benchmark for Certified Training |
link |
Yuhao Mao, Stefan Balauca, Martin Vechev |
5 |
2025-01-28 |
P(all-atom) Is Unlocking New Path For Protein Design |
link |
Wei Qu, Jiawei Guan,..., haobo Wang |
5 |
2024-10-29 |
AAAR-1.0: Assessing AI’s Potential to Assist Research |
link |
Renze Lou, Hanzi Xu,..., Wenpeng Yin |
5 |
2025-02-03 |
Scalable Language Models with Posterior Inference of Latent Thought Vectors |
link |
Deqian Kong, Minglu Zhao,..., Ying Nian Wu |
5 |
2025-02-19 |
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence |
link |
Yuliang Liu, Junjie Lu,..., Zhouhan Lin |
5 |
2023-12-31 |
GraphGPT: Generative Pre-trained Graph Eulerian Transformer |
link |
Qifang Zhao, Weidong Ren,..., Xiaoxiao Xu |
5 |
2025-02-05 |
DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization |
link |
Zhenglin Zhou, Xiaobo Xia,..., Tat-Seng Chua |
5 |
2025-02-18 |
Is Noise Conditioning Necessary for Denoising Generative Models? |
link |
Zhicheng Jiang, Qiao Sun,..., Kaiming He |
5 |
2024-11-28 |
Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads |
link |
Siqi Kou, Jiachun Jin,..., Zhijie Deng |
5 |
2024-12-05 |
ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality |
link |
Yefei He, Feng Chen,..., Bohan Zhuang |
5 |
2025-02-09 |
Reinforced Lifelong Editing for Language Models |
link |
Zherui Li, Houcheng Jiang,..., Xiang Wang |
5 |
2024-06-20 |
Raising the Bar: Investigating the Values of Large Language Models via Generative Evolving Testing |
link |
Han Jiang, Xiaoyuan Yi,..., Xing Xie |
5 |
2025-01-28 |
Optimizing Large Language Model Training Using FP4 Quantization |
link |
Ruizhe Wang, Yeyun Gong,..., Peng CHENG |
5 |
2024-06-17 |
Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead |
link |
Rickard Gabrielsson, Jiacheng Zhu,..., Justin Solomon |
5 |
2024-12-19 |
How to Synthesize Text Data without Model Collapse? |
link |
Xuekai Zhu, Daixuan Cheng,..., Bowen Zhou |
4 |
2025-01-29 |
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization |
link |
Zishun Yu, Tengyu Xu,..., Han Fang |
4 |
2024-12-19 |
HashAttention: Semantic Sparsity for Faster Inference |
link |
Aditya Desai, Shuo Yang,..., Ion Stoica |
4 |
2025-02-10 |
Implicit Language Models are RNNs: Balancing Parallelization and Expressivity |
link |
Mark Schoene, Babak Rahmani,..., Jannes Gladrow |
4 |
2024-06-09 |
Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples |
link |
Fangxu Yu, Lai Jiang,..., Lianhui Qin |
4 |
2024-11-04 |
ViTally Consistent: Scaling Biological Representation Learning for Cell Microscopy |
link |
Kian Kenyon-Dean, Zitong Jerry Wang,..., Oren Kraus |
4 |
2025-02-20 |
From RAG to Memory: Non-Parametric Continual Learning for Large Language Models |
link |
Bernal Jimenez Gutierrez, Yiheng Shu,..., Yu Su |
4 |
2024-06-16 |
ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under Domain Shifts |
link |
Samar Khanna, Medhanie Irgau,..., Stefano Ermon |
4 |
2022-04-23 |
Spherical Rotation Dimension Reduction with Geometric Loss Functions |
link |
Hengrui Luo, Jeremy E. Purvis, Didong Li |
4 |
2024-05-29 |
Does learning the right latent variables necessarily improve in-context learning? |
link |
Sarthak Mittal, Eric Elmoznino,..., Dhanya Sridhar |
4 |
2024-12-12 |
Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries |
link |
Junhyuck Kim, Jong Ho Park,..., Dimitris Papailiopoulos |
4 |
2024-11-20 |
GraphCL: Graph-based Clustering for Semi-Supervised Medical Image Segmentation |
link |
Mengzhu Wang, houcheng su,..., Jingcai Guo |
4 |
2024-06-06 |
The Brain's Bitter Lesson: Scaling Speech Decoding With Self-Supervised Learning |
link |
Dulhan Jayalath, Gilad Landau,..., ʻŌiwi Parker Jones |
4 |
2024-08-08 |
Risk and cross validation in ridge regression with correlated samples |
link |
Alexander Atanasov, Jacob A Zavatone-Veth, Cengiz Pehlevan |
4 |
2025-02-05 |
Detecting Strategic Deception with Linear Probes |
link |
Nicholas Goldowsky-Dill, Bilal Chughtai,..., Marius Hobbhahn |
4 |
2025-01-31 |
Fixing the Double Penalty in Data-Driven Weather Forecasting Through a Modified Spherical Harmonic Loss Function |
link |
Christopher Subich, Syed Husain,..., Jing Yang |
4 |
2024-10-01 |
Flex3D: Feed-Forward 3D Generation With Flexible Reconstruction Model And Input View Curation |
link |
Junlin Han, Jianyuan Wang,..., Filippos Kokkinos |
4 |
2025-02-06 |
Discovering Symbolic Cognitive Models from Human and Animal Behavior |
link |
Pablo Samuel Castro, Nenad Tomasev,..., Kimberly Stachenfeld |
4 |
2025-02-12 |
Distillation Scaling Laws |
link |
Dan Busbridge, Amitis Shidani,..., Russell Webb |
4 |
2024-07-26 |
Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift |
link |
Seongho Son, William Bankes,..., Ilija Bogunovic |
4 |
2024-02-09 |
Where is the Truth? The Risk of Getting Confounded in a Continual World |
link |
Florian Peter Busch, ROSHNI KAMATH,..., Martin Mundt |
4 |
2024-03-10 |
A Reductions Approach to Risk-Sensitive Reinforcement Learning with Optimized Certainty Equivalents |
link |
Kaiwen Wang, Dawen Liang,..., Wen Sun |
4 |
2024-11-01 |
KAN-AD: Time Series Anomaly Detection with Kolmogorov–Arnold Networks |
link |
Quan Zhou, Changhua Pei,..., HanJing |
4 |
2024-10-20 |
PICI: Efficient Position-Independent Context Caching for Serving Large Language Models |
link |
JUNHAO HU, Wenrui Huang,..., Tao Xie |
4 |
2024-11-03 |
Autoformulation of Mathematical Optimization Models Using LLMs |
link |
Nicolás Astorga, Tennison Liu,..., M van der Schaar |
4 |
2025-05-01 |
PertEval-scFM: Benchmarking Single-Cell Foundation Models for Perturbation Effect Prediction |
link |
Aaron Wenteler, Martina Occhetta,..., Amaya Gallagher-Syed |
4 |
2025-02-13 |
CoSER: Coordinating LLM-Based Persona Simulation of Established Roles |
link |
Xintao Wang, Heng Wang,..., Shuchang Zhou |
4 |
2025-01-30 |
SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation |
link |
Haoquan Fang, Markus Grotz,..., Jiafei Duan |
4 |
2023-10-25 |
Sum-of-Parts: Self-Attributing Neural Networks with End-to-End Learning of Feature Groups |
link |
Weiqiu You, Helen Qu,..., Eric Wong |
4 |
2025-02-08 |
From Mechanistic Interpretability to Mechanistic Biology: Training, Evaluating, and Interpreting Sparse Autoencoders on Protein Language Models |
link |
Etowah Adams, Liam Bai,..., Mohammed AlQuraishi |
4 |
2024-10-11 |
Parameter-Efficient Fine-Tuning of State Space Models |
link |
Kevin Galim, Wonjun Kang,..., Kangwook Lee |
4 |
2025-02-14 |
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation |
link |
Tianwei Lin, Wenqiao Zhang,..., Beng Chin Ooi |
4 |
2025-02-03 |
Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges |
link |
Nayoung Lee, Jack Cai,..., Dimitris Papailiopoulos |
4 |
2024-11-17 |
Learn from Downstream and Be Yourself in Multimodal Large Language Models Fine-Tuning |
link |
Wenke Huang, Jian Liang,..., Mang Ye |
4 |
2024-10-08 |
Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing |
link |
Zhuoran Zhang, Yongxiang Li,..., Di Wang |
4 |
2024-10-12 |
FlatQuant: Flatness Matters for LLM Quantization |
link |
Yuxuan Sun, Ruikang Liu,..., Jun Yao |
4 |
2025-02-05 |
Position: Editing Large Language Models Poses Serious Safety Risks |
link |
Paul Youssef, Zhixue Zhao,..., Christin Seifert |
4 |
2024-06-04 |
S2-Track: A Simple yet Strong Approach for End-to-End 3D Multi-Object Tracking |
link |
Tao Tang, Lijun Zhou,..., Xiaodan Liang |
4 |
2025-02-06 |
Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More |
link |
Feng Wang, Yaodong Yu,..., Cihang Xie |
4 |
2025-02-17 |
video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model |
link |
Guangzhi Sun, Yudong Yang,..., Chao Zhang |
4 |
2025-02-01 |
OrcaLoca: An LLM Agent Framework for Software Issue Localization |
link |
Zhongming Yu, Hejia Zhang,..., Jishen Zhao |
4 |
2024-02-10 |
Clients Collaborate: Flexible Differentially Private Federated Learning with Guaranteed Improvement of Utility-Privacy Trade-off |
link |
Yuecheng Li, Lele Fu,..., Chuan Chen |
4 |
2024-06-09 |
Expressive Power of Graph Neural Networks for (Mixed-Integer) Quadratic Programs |
link |
Ziang Chen, Xiaohan Chen,..., Wotao Yin |
4 |
2024-12-24 |
Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models |
link |
Zehan Wang, Ziang Zhang,..., Zhou Zhao |
4 |
2024-10-16 |
Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models |
link |
Linhao Luo, Zicheng Zhao,..., Shirui Pan |
4 |
2025-01-02 |
Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent |
link |
Yongxian Wei, Anke Tang,..., Xiaochun Cao |
4 |
2025-02-17 |
Idiosyncrasies in Large Language Models |
link |
Mingjie Sun, Yida Yin,..., Zhuang Liu |
4 |
2025-01-26 |
Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning |
link |
Zeyu Gan, Yun Liao, Yong Liu |
3 |
2024-08-09 |
EasyInv: Toward Fast and Better DDIM Inversion |
link |
Ziyue Zhang, Mingbao Lin,..., Rongrong Ji |