Last updated: 2025-05-19 23:37:51. Maintained by Weisen Jiang.

citation publish date title (pdf) review authors
915 2024-05-23 YOLOv10: Real-Time End-to-End Object Detection link Ao Wang, Hui Chen,..., Guiguang Ding
612 2024-01-18 VMamba: Visual State Space Model link Yue Liu, Yunjie Tian,..., Yunfan Liu
512 2023-05-24 Gorilla: Large Language Model Connected with Massive APIs link Shishir G Patil, Tianjun Zhang,..., Joseph E. Gonzalez
446 2023-11-06 CogVLM: Visual Expert for Pretrained Language Models link Weihan Wang, Qingsong Lv,..., Jie Tang
349 2024-05-23 SimPO: Simple Preference Optimization with a Reference-Free Reward link Yu Meng, Mengzhou Xia, Danqi Chen
323 2024-06-13 Depth Anything V2 link Lihe Yang, Bingyi Kang,..., Hengshuang Zhao
279 2024-06-24 Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs link Shengbang Tong, Ellis L Brown II,..., Saining Xie
251 2024-04-03 Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction link Keyu Tian, Yi Jiang,..., Liwei Wang
221 2024-03-29 Are We on the Right Way for Evaluating Large
Vision-Language Models?
link Lin Chen, Jinsong Li,..., Feng Zhao
203 2023-12-04 Tree of Attacks: Jailbreaking Black-Box LLMs Automatically link Anay Mehrotra, Manolis Zampetakis,..., Amin Karbasi
191 2024-05-06 SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering link John Yang, Carlos E Jimenez,..., Ofir Press
185 2023-11-28 LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and
200+ FPS
link Zhiwen Fan, Kevin Wang,..., Zhangyang Wang
177 2024-01-31 KVQuant: Towards 10 Million Context Length LLM Inference with
KV Cache Quantization
link Coleman Richard Charles Hooper, Sehoon Kim,..., Amir Gholami
174 2024-06-17 Autoregressive Image Generation without Vector Quantization link Tianhong Li, Yonglong Tian,..., Kaiming He
156 2024-04-15 LLM Evaluators Recognize and Favor Their Own Generations link Arjun Panickssery, Samuel R. Bowman, Shi Feng
156 2024-05-03 What matters when building vision-language models? link Hugo Laurençon, Leo Tronchon,..., Victor Sanh
155 2024-05-07 xLSTM: Extended Long Short-Term Memory link Maximilian Beck, Korbinian Pöppel,..., Sepp Hochreiter
153 2024-04-22 SnapKV: LLM Knows What You are Looking for Before
Generation
link Yuhong Li, Yingbing Huang,..., Deming Chen
150 2024-05-16 CAT3D: Create Anything in 3D with Multi-View Diffusion Models link Ruiqi Gao, Aleksander Holynski,..., Ben Poole
135 2024-06-06 ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search link Dan Zhang, Sining Zhoubian,..., Jie Tang
133 2024-06-17 Refusal in Language Models Is Mediated by a Single
Direction
link Andy Arditi, Oscar Balcells Obeso,..., Neel Nanda
133 2024-03-30 QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs link Saleh Ashkboos, Amirkeivan Mohtashami,..., James Hensman
113 2024-07-11 FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision link Jay Shah, Ganesh Bikshandi,..., Tri Dao
112 2024-04-09 InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from
336 Pixels to 4K HD
link Xiaoyi Dong, Pan Zhang,..., Jiaqi Wang
111 2024-04-30 Iterative Reasoning Preference Optimization link Richard Yuanzhe Pang, Weizhe Yuan,..., Jason E Weston
110 2023-12-12 SGLang: Efficient Execution of Structured Language Model Programs link Lianmin Zheng, Liangsheng Yin,..., Ying Sheng
110 2023-10-14 Large Language Model Unlearning link Yuanshun Yao, Xiaojun Xu, Yang Liu
109 None Many-shot Jailbreaking link Cem Anil, Esin DURMUS,..., David Duvenaud
101 2024-02-15 Chain-of-Thought Reasoning Without Prompting link Xuezhi Wang, Denny Zhou
96 2024-04-17 Many-Shot In-Context Learning link Rishabh Agarwal, Avi Singh,..., Hugo Larochelle
94 2024-05-23 Improved Distribution Matching Distillation for Fast Image Synthesis link Tianwei Yin, Michaël Gharbi,..., William T. Freeman
91 2024-02-16 PointMamba: A Simple State Space Model for Point Cloud
Analysis
link Dingkang Liang, Xin Zhou,..., Xiang Bai
88 2024-05-02 StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation link Yupeng Zhou, Daquan Zhou,..., Qibin Hou
87 2024-04-16 VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time link Sicheng Xu, Guojun Chen,..., Baining Guo
85 2024-05-06 MAmmoTH2: Scaling Instructions from the Web link Xiang Yue, Tianyu Zheng,..., Wenhu Chen
83 2024-07-02 MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic
Sparse Attention
link Huiqiang Jiang, YUCHENG LI,..., Lili Qiu
82 2024-05-06 AlphaMath Almost Zero: Process Supervision without Process link Guoxin Chen, Minpeng Liao,..., Kai Fan
81 2024-06-11 An Image is Worth 32 Tokens for Reconstruction and
Generation
link Qihang Yu, Mark Weber,..., Liang-Chieh Chen
75 2024-05-27 Vista: A Generalizable Driving World Model with High Fidelity
and Versatile Controllability
link Shenyuan Gao, Jiazhi Yang,..., Hongyang Li
73 2024-07-01 Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion link Boyuan Chen, Diego Martí Monsó,..., Vincent Sitzmann
72 2024-01-30 Robust Prompt Optimization for Defending Language Models Against Jailbreaking
Attacks
link Andy Zhou, Bo Li, Haohan Wang
72 2024-06-06 Improving Alignment and Robustness with Circuit Breakers link Andy Zou, Long Phan,..., Dan Hendrycks
70 2024-04-03 PiSSA: Principal Singular Values and Singular Vectors Adaptation of
Large Language Models
link Fanxu Meng, Zhaohui Wang, Muhan Zhang
70 2024-06-11 Simple and Effective Masked Diffusion Language Models link Subham Sekhar Sahoo, Marianne Arriola,..., Volodymyr Kuleshov
70 2024-02-12 G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question
Answering
link Xiaoxin He, Yijun Tian,..., Bryan Hooi
64 2024-04-18 Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing link Ye Tian, Baolin Peng,..., Dong Yu
63 2024-04-21 Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis link Yuxi Ren, Xin Xia,..., Xuefeng Xiao
62 2024-06-06 Simplified and Generalized Masked Diffusion for Discrete Data link Jiaxin Shi, Kehang Han,..., Michalis Titsias
62 2024-02-26 Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts link Mikayel Samvelyan, Sharath Chandra Raparthy,..., Roberta Raileanu
61 2024-02-29 Humanoid Locomotion as Next Token Prediction link Ilija Radosavovic, Bike Zhang,..., Jitendra Malik
59 2024-05-16 Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement
Learning
link Yuexiang Zhai, Hao Bai,..., Sergey Levine
58 2024-06-04 Guiding a Diffusion Model with a Bad Version of
Itself
link Tero Karras, Miika Aittala,..., Samuli Laine
58 2024-04-04 ReFT: Representation Finetuning for Language Models link Zhengxuan Wu, Aryaman Arora,..., Christopher Potts
58 2024-06-10 Parallelizing Linear Transformers with the Delta Rule over Sequence
Length
link Songlin Yang, Bailin Wang,..., Yoon Kim
58 2024-04-11 Applying Guidance in a Limited Interval Improves Sample and
Distribution Quality in Diffusion Models
link Tuomas Kynkäänniemi, Miika Aittala,..., Jaakko Lehtinen
58 2024-04-24 PuLID: Pure and Lightning ID Customization via Contrastive Alignment link Zinan Guo, Yanze Wu,..., Qian HE
57 2024-07-22 Discrete Flow Matching link Itai Gat, Tal Remez,..., Yaron Lipman
57 2023-12-06 Scaling transformer neural networks for skillful and reliable medium-range
weather forecasting
link Tung Nguyen, Rohan Shah,..., Aditya Grover
54 2024-04-25 Make Your LLM Fully Utilize the Context link Shengnan An, Zexiong Ma,..., Weizhu Chen
53 2024-02-17 Watch Out for Your Agents! Investigating Backdoor Threats to
LLM-Based Agents
link Wenkai Yang, Xiaohan Bi,..., Xu Sun
53 2024-07-25 Recursive Introspection: Teaching Language Model Agents How to Self-Improve link Yuxiao Qu, Tianjun Zhang,..., Aviral Kumar
53 2024-02-07 Can Large Language Model Agents Simulate Human Trust Behavior? link Chengxing Xie, Canyu Chen,..., Guohao Li
52 2024-05-08 You Only Cache Once: Decoder-Decoder Architectures for Language Models link Yutao Sun, Li Dong,..., Furu Wei
52 2024-02-29 How do Large Language Models Handle Multilingualism? link Yiran Zhao, Wenxuan Zhang,..., Lidong Bing
52 2024-03-14 Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision link Zhiqing Sun, Longhui Yu,..., Chuang Gan
52 2024-05-30 Unique3D: High-Quality and Efficient 3D Mesh Generation from a
Single Image
link Kailu Wu, Fangfu Liu,..., Kaisheng Ma
52 2024-05-24 Defensive Unlearning with Adversarial Training for Robust Concept Erasure
in Diffusion Models
link Yimeng Zhang, Xin Chen,..., Sijia Liu
51 2024-06-12 One-Step Effective Diffusion Network for Real-World Image Super-Resolution link Rongyuan Wu, Lingchen Sun,..., Lei Zhang
50 2024-07-02 RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs link Yue Yu, Wei Ping,..., Bryan Catanzaro
49 2024-04-15 3D Gaussian Splatting as Markov Chain Monte Carlo link Shakiba Kheradmand, Daniel Rebain,..., Kwang Moo Yi
49 2023-06-02 Invisible Image Watermarks Are Provably Removable Using Generative AI link Xuandong Zhao, Kexun Zhang,..., Lei Li
49 2024-05-26 Demystify Mamba in Vision: A Linear Attention Perspective link Dongchen Han, Ziyi Wang,..., Gao Huang
48 2024-06-05 Scaling Laws for Reward Model Overoptimization in Direct Alignment
Algorithms
link Rafael Rafailov, Yaswanth Chittepu,..., Scott Niekum
48 2024-06-27 OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding link Tao Zhang, Xiangtai Li,..., Shuicheng YAN
48 2024-06-17 Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging link Zhenyi Lu, Chenghao Fan,..., Yu Cheng
48 2024-06-12 VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model
for Hundreds of Vision-Language Tasks
link Jiannan Wu, Muyan Zhong,..., Jifeng Dai
47 2024-02-06 SELF-DISCOVER: Large Language Models Self-Compose Reasoning Structures link Pei Zhou, Jay Pujara,..., Steven Zheng
47 2024-07-17 AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge
Bases
link Zhaorun Chen, Zhen Xiang,..., Bo Li
47 2023-12-18 Cascade Speculative Drafting for Even Faster LLM Inference link Ziyi Chen, Xiaocong Yang,..., Jie Huang
47 2024-05-23 Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer link Shuang Wu, Youtian Lin,..., Yao Yao
46 2024-03-23 Understanding Emergent Abilities of Language Models from the Loss
Perspective
link Zhengxiao Du, Aohan Zeng,..., Jie Tang
46 2024-06-03 Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via
Multi-Agent Collaboration
link Junyang Wang, Haiyang Xu,..., Jitao Sang
46 2024-02-02 ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution link Haoran Ye, Jiarui Wang,..., Guojie Song
46 2024-02-29 TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous
Variables
link Yuxuan Wang, Haixu Wu,..., Mingsheng Long
45 2024-05-24 The Road Less Scheduled link Aaron Defazio, Xingyu Alice Yang,..., Ashok Cutkosky
45 2024-06-22 Are Language Models Actually Useful for Time Series Forecasting? link Mingtian Tan, Mike A Merrill,..., Thomas Hartvigsen
45 2024-04-04 No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines
Multimodal Model Performance
link Vishaal Udandarao, Ameya Prabhu,..., Matthias Bethge
45 2024-06-20 RL on Incorrect Synthetic Data Scales the Efficiency of
LLM Math Reasoning by Eight-Fold
link Amrith Setlur, Saurabh Garg,..., Aviral Kumar
45 2024-06-26 WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer
Language Models
link Liwei Jiang, Kavel Rao,..., Nouha Dziri
45 2023-12-19 Large Language Models Play StarCraft II:Benchmarks and A Chain
of Summarization Approach
link Weiyu Ma, Qirui Mi,..., Haifeng Zhang
45 2024-07-18 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies link Chaofan Tao, Qian Liu,..., Ngai Wong
45 2024-03-27 Long-form factuality in large language models link Jerry Wei, Chengrun Yang,..., Quoc V Le
44 2024-05-24 Efficient Adversarial Training in LLMs with Continuous Attacks link Sophie Xhonneux, Alessandro Sordoni,..., Leo Schwinn
44 2024-02-28 Keeping LLMs Aligned After Fine-tuning: The Crucial Role of
Prompt Templates
link Kaifeng Lyu, Haoyu Zhao,..., Sanjeev Arora
44 2024-06-05 Lumina-Next : Making Lumina-T2X Stronger and Faster with Next-DiT link Le Zhuo, Ruoyi Du,..., Peng Gao
44 2024-06-03 SpatialRGPT: Grounded Spatial Reasoning in Vision-Language Models link An-Chieh Cheng, Hongxu Yin,..., Sifei Liu
43 2023-12-13 Chat-Scene: Bridging 3D Scene and Large Language Models with
Object Identifiers
link Haifeng Huang, Yilun Chen,..., Zhou Zhao
42 2024-06-14 Regularizing Hidden States Enables Learning Generalizable Reward Model for
LLMs
link Rui Yang, Ruomeng Ding,..., Tong Zhang
42 2024-02-16 The Evolution of Statistical Induction Heads: In-Context Learning Markov
Chains
link Ezra Edelman, Nikolaos Tsilivis,..., Surbhi Goel
42 2024-05-26 Provably Mitigating Overoptimization in RLHF: Your SFT Loss is
Implicitly an Adversarial Regularizer
link Zhihan Liu, Miao Lu,..., Zhaoran Wang
41 2024-04-19 MoVA: Adapting Mixture of Vision Experts to Multimodal Context link Zhuofan Zong, Bingqi Ma,..., Yu Liu
41 2024-02-26 Why Transformers Need Adam: A Hessian Perspective link Yushun Zhang, Congliang Chen,..., Zhi-Quan Luo
40 2024-05-20 Diffusion for World Modeling: Visual Details Matter in Atari link Eloi Alonso, Adam Jelley,..., François Fleuret
40 2024-06-25 MotionBooth: Motion-Aware Customized Text-to-Video Generation link Jianzong Wu, Xiangtai Li,..., Kai Chen
40 2024-05-21 Reducing Transformer Key-Value Cache Size with Cross-Layer Attention link William Brandon, Mayank Mishra,..., Jonathan Ragan-Kelley
40 2024-05-13 PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator link Hanshu Yan, Xingchao Liu,..., Jiashi Feng
40 2024-04-23 Rethinking LLM Memorization through the Lens of Adversarial Compression link Avi Schwarzschild, Zhili Feng,..., J Zico Kolter
40 2024-05-25 Streaming Long Video Understanding with Large Language Models link Rui Qian, Xiaoyi Dong,..., Jiaqi Wang
40 2024-05-19 Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention link Peng Li, Yuan Liu,..., Yike Guo
40 2024-07-11 WildGaussians: 3D Gaussian Splatting In the Wild link Jonas Kulhanek, Songyou Peng,..., Torsten Sattler
39 2024-06-14 DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning link Hao Bai, Yifei Zhou,..., Aviral Kumar
38 2023-11-22 SegVol: Universal and Interactive Volumetric Medical Image Segmentation link Yuxin Du, Fan BAI,..., Bo Zhao
38 2024-06-13 Unpacking DPO and PPO: Disentangling Best Practices for Learning
from Preference Feedback
link Hamish Ivison, Yizhong Wang,..., Hannaneh Hajishirzi
38 2024-06-13 Visual Sketchpad: Sketching as a Visual Chain of Thought
for Multimodal Language Models
link Yushi Hu, Weijia Shi,..., Ranjay Krishna
38 2024-05-08 Chain of Thoughtlessness? An Analysis of CoT in Planning link Kaya Stechly, Karthik Valmeekam, Subbarao Kambhampati
38 2024-06-21 Is A Picture Worth A Thousand Words? Delving Into
Spatial Reasoning for Vision Language Models
link Jiayu Wang, Yifei Ming,..., Neel Joshi
38 2024-02-08 Noise Contrastive Alignment of Language Models with Explicit Rewards link Huayu Chen, Guande He,..., Jun Zhu
38 2024-01-18 ChatQA: Surpassing GPT-4 on Conversational QA and RAG link Zihan Liu, Wei Ping,..., Bryan Catanzaro
38 2023-05-23 Decoupled Kullback-Leibler Divergence Loss link Jiequan Cui, Zhuotao Tian,..., Hanwang Zhang
38 2024-05-26 Diffusion4D: Fast Spatial-temporal Consistent 4D generation via Video Diffusion
Models
link HANWEN LIANG, Yuyang Yin,..., Yunchao Wei
38 2024-10-08 Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating,
Segmenting, Editing
link Hao Fei, Shengqiong Wu,..., Shuicheng YAN
38 2024-05-31 MeshXL: Neural Coordinate Field for Generative 3D Foundation Models link Sijin Chen, Xin Chen,..., Tao Chen
37 2024-05-28 Aligning to Thousands of Preferences via System Message Generalization link Seongyun Lee, Sue Hyun Park,..., Minjoon Seo
37 2024-05-17 Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning link Dan Braun, Jordan Taylor,..., Lee Sharkey
37 2024-07-19 Compact Language Models via Pruning and Knowledge Distillation link Saurav Muralidharan, Sharath Turuvekere Sreenivas,..., Pavlo Molchanov
37 2024-02-14 Soft Prompt Threats: Attacking Safety Alignment and Unlearning in
Open-Source LLMs through the Embedding Space
link Leo Schwinn, David Dobre,..., Stephan Günnemann
37 2024-02-17 OneBit: Towards Extremely Low-bit Large Language Models link Yuzhuang Xu, Xu Han,..., Wanxiang Che
37 2024-06-14 L4GM: Large 4D Gaussian Reconstruction Model link Jiawei Ren, Kevin Xie,..., Huan Ling
36 2024-03-26 LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model
Fine-Tuning
link Rui Pan, Xiang Liu,..., Tong Zhang
36 2024-03-26 MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution link Wei Tao, Yucheng Zhou,..., Yu Cheng
36 2024-06-02 BoNBoN Alignment for Large Language Models and the Sweetness
of Best-of-n Sampling
link Lin Gui, Cristina Garbacea, Victor Veitch
36 2024-06-03 MixEval: Deriving Wisdom of the Crowd from LLM Benchmark
Mixtures
link Jinjie Ni, Fuzhao Xue,..., Yang You
36 2023-04-26 The Closeness of In-Context Learning and Weight Shifting for
Softmax Regression
link Shuai Li, Zhao Song,..., Tianyi Zhou
36 2024-05-30 Enhancing Large Vision Language Models with Self-Training on Image
Comprehension
link Yihe Deng, Pan Lu,..., Wei Wang
36 2024-06-13 OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation link Junke Wang, Yi Jiang,..., Yu-Gang Jiang
36 2024-02-24 Spec-Gaussian: Anisotropic View-Dependent Appearance for 3D Gaussian Splatting link Ziyi Yang, Xinyu Gao,..., Xiaogang Jin
35 2024-08-19 Personalizing Reinforcement Learning from Human Feedback with Variational Preference
Learning
link Sriyash Poddar, Yanming Wan,..., Natasha Jaques
35 2024-03-11 SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR
Object Detection
link Yuxuan Li, Xiang Li,..., Jian Yang
35 2024-02-16 Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE) link Usha Bhalla, Alex Oesterling,..., Himabindu Lakkaraju
35 2024-11-02 Rule Based Rewards for Language Model Safety link Tong Mu, Alec Helyar,..., Lilian Weng
35 2023-11-29 Elo Uncovered: Robustness and Best Practices in Language Model
Evaluation
link Meriem Boubdir, Edward Kim,..., Marzieh Fadaee
35 2024-04-09 MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly
Detection
link Haoyang He, Yuhu Bai,..., Lei Xie
34 2024-05-28 Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations link Alexander Hägele, Elie Bakouch,..., Martin Jaggi
34 2024-06-06 Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models link Ling Yang, Zhaochen Yu,..., Bin CUI
34 2024-06-13 Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs link Xuan Zhang, Chao Du,..., Min Lin
34 2024-02-07 InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient
Context Memory
link Chaojun Xiao, Pengle Zhang,..., Maosong Sun
34 2023-05-27 MADiff: Offline Multi-agent Learning with Diffusion Models link Zhengbang Zhu, Minghuan Liu,..., Weinan Zhang
33 2024-04-30 HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning link Chunlin Tian, Zhan Shi,..., Cheng-zhong Xu
33 2024-09-30 Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers link Lirui Wang, Xinlei Chen,..., Kaiming He
33 2024-05-23 HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models link Bernal Jimenez Gutierrez, Yiheng Shu,..., Yu Su
32 2024-02-12 Model Collapse Demystified: The Case of Regression link Elvis Dohmatob, Yunzhen Feng, Julia Kempe
32 2024-06-18 DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving link Yuxuan Tong, Xiwen Zhang,..., Junxian He
32 2024-05-27 Safe LoRA: The Silver Lining of Reducing Safety Risks
when Finetuning Large Language Models
link Chia-Yi Hsu, Yu-Lin Tsai,..., Chun-Ying Huang
32 2024-05-23 Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model link Yuheng Shi, Minjing Dong, Chang Xu
31 2024-06-14 Be like a Goldfish, Don't Memorize! Mitigating Memorization in
Generative LLMs
link Abhimanyu Hans, John Kirchenbauer,..., Tom Goldstein
31 2024-02-22 Large Language Models as Urban Residents: An LLM Agent
Framework for Personal Mobility Generation
link Jiawei Wang, Renhe Jiang,..., Chuan Xiao
31 2024-04-25 REBEL: Reinforcement Learning via Regressing Relative Rewards link Zhaolin Gao, Jonathan Daniel Chang,..., Wen Sun
31 2024-02-03 Panacea: Pareto Alignment via Preference Adaptation for LLMs link Yifan Zhong, Chengdong Ma,..., Yaodong Yang
31 2024-06-27 Decoding-Time Language Model Alignment with Multiple Objectives link Ruizhe Shi, Yifang Chen,..., Simon Shaolei Du
31 2024-01-11 A Closer Look at AUROC and AUPRC under Class
Imbalance
link Matthew B.A. McDermott, Haoran Zhang,..., Jack Gallifant
31 2023-05-15 PLIP: Language-Image Pre-training for Person Representation Learning link Jialong Zuo, Jiahao Hong,..., Jingdong Wang
30 2023-06-13 Questioning the Survey Responses of Large Language Models link Ricardo Dominguez-Olmedo, Moritz Hardt, Celestine Mendler-Dünner
30 2024-06-17 How Do Large Language Models Acquire Factual Knowledge During
Pretraining?
link Hoyeon Chang, Jinho Park,..., Minjoon Seo
30 2024-05-23 MiniCache: KV Cache Compression in Depth Dimension for Large
Language Models
link Akide Liu, Jing Liu,..., Bohan Zhuang
30 2024-05-27 Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control link Zhengfei Kuang, Shengqu Cai,..., Gordon Wetzstein
29 2024-06-03 DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized
LLMs
link Haokun Lin, Haobo Xu,..., Ying Wei
29 2024-05-23 EMR-Merging: Tuning-Free High-Performance Model Merging link Chenyu Huang, Peng Ye,..., Wanli Ouyang
29 2024-05-23 JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data
Synthesis Models
link Kun Zhou, Beichen Zhang,..., Ji-Rong Wen
29 2024-05-28 Understanding Transformer Reasoning Capabilities via Graph Algorithms link Clayton Sanford, Bahare Fatemi,..., Vahab Mirrokni
29 2024-05-29 Poseidon: Efficient Foundation Models for PDEs link Maximilian Herde, Bogdan Raonic,..., Siddhartha Mishra
29 2024-05-22 xRAG: Extreme Context Compression for Retrieval-augmented Generation with One
Token
link Xin Cheng, Xun Wang,..., Dongyan Zhao
29 2024-04-06 Aligning Diffusion Models by Optimizing Human Utility link Shufan Li, Konstantinos Kallidromitis,..., Kazuki Kozuka
29 2024-07-05 On scalable oversight with weak LLMs judging strong LLMs link Zachary Kenton, Noah Yamamoto Siegel,..., Rohin Shah
29 2024-07-02 Meta 3D AssetGen: Text-to-Mesh Generation with High-Quality Geometry, Texture,
and PBR Materials
link Yawar Siddiqui, Tom Monnier,..., David Novotny
29 2024-04-16 Self-playing Adversarial Language Game Enhances LLM Reasoning link Pengyu Cheng, Tianhao Hu,..., Xiaolong Li
29 2024-06-17 Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning link Zebang Cheng, Zhi-Qi Cheng,..., Alexander G Hauptmann
29 2024-06-03 Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and
Their Defenses
link Xiaosen Zheng, Tianyu Pang,..., Min Lin
29 2024-05-03 DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos link Wen-Hsuan Chu, Lei Ke, Katerina Fragkiadaki
29 2024-02-28 Approaching Human-Level Forecasting with Language Models link Danny Halawi, Fred Zhang,..., Jacob Steinhardt
29 2024-06-04 OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding link Yanmin Wu, Jiarui Meng,..., Jian Zhang
28 2024-02-04 Aligner: Efficient Alignment by Learning to Correct link Jiaming Ji, Boyuan Chen,..., Yaodong Yang
28 2024-06-17 Unveiling Encoder-Free Vision-Language Models link Haiwen Diao, Yufeng Cui,..., Xinlong Wang
28 2024-06-04 Chain of Agents: Large Language Models Collaborating on Long-Context
Tasks
link Yusen Zhang, Ruoxi Sun,..., Sercan O Arik
28 2024-10-26 Fast Best-of-N Decoding via Speculative Rejection link Hanshi Sun, Momin Haider,..., Andrea Zanette
28 2024-06-12 Large Language Model Unlearning via Embedding-Corrupted Prompts link Chris Yuhao Liu, Yaxuan Wang,..., Yang Liu
28 2023-12-12 Alignment for Honesty link Yuqing Yang, Ethan Chern,..., Pengfei Liu
28 2024-05-27 Transformers Can Do Arithmetic with the Right Embeddings link Sean Michael McLeish, Arpit Bansal,..., Tom Goldstein
28 2024-06-10 LLM Dataset Inference: Did you train on my dataset? link Pratyush Maini, Hengrui Jia,..., Adam Dziedzic
27 2024-06-06 Evaluating the World Model Implicit in a Generative Model link Keyon Vafa, Justin Y. Chen,..., Sendhil Mullainathan
27 2024-04-23 Aligning LLM Agents by Learning Latent Preference from User
Edits
link Ge Gao, Alexey Taymanov,..., Dipendra Misra
27 2024-07-06 LoRA-GA: Low-Rank Adaptation with Gradient Approximation link Shaowen Wang, Linxi Yu, Jian Li
27 2024-02-19 WorldCoder, a Model-Based LLM Agent: Building World Models by
Writing Code and Interacting with the Environment
link Hao Tang, Darren Yan Key, Kevin Ellis
27 2023-10-06 Why Do We Need Weight Decay in Modern Deep
Learning?
link Francesco D'Angelo, Maksym Andriushchenko,..., Nicolas Flammarion
27 2024-05-31 4Diffusion: Multi-view Video Diffusion Model for 4D Generation link Haiyu Zhang, Xinyuan Chen,..., Yu Qiao
27 2024-05-29 T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model
with Mixed Reward Feedback
link Jiachen Li, Weixi Feng,..., William Yang Wang
27 2023-12-06 OctreeOcc: Efficient and Multi-Granularity Occupancy Prediction Using Octree Queries link Yuhang Lu, Xinge ZHU,..., Yuexin Ma
27 2024-08-27 The Mamba in the Llama: Distilling and Accelerating Hybrid
Models
link Junxiong Wang, Daniele Paliotta,..., Tri Dao
27 2024-04-12 Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context
Length
link Xuezhe Ma, Xiaomeng Yang,..., Chunting Zhou
27 2024-07-01 On Statistical Rates and Provably Efficient Criteria of
Latent Diffusion Transformers (DiTs)
link Jerry Yao-Chieh Hu, Weimin Wu,..., Han Liu
27 2024-05-28 Why are Visually-Grounded Language Models Bad at Image Classification? link Yuhui Zhang, Alyssa Unell,..., Serena Yeung-Levy
27 2024-07-29 FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention link Yu Lu, Yuanzhi Liang,..., Yi Yang
26 2024-02-29 Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent
on Language Models
link Frederik Kunstner, Alan Milligan,..., Alberto Bietti
26 2024-07-31 Measuring Progress in Dictionary Learning for Language Model Interpretability
with Board Game Models
link Adam Karvonen, Benjamin Wright,..., Samuel Marks
26 2024-06-10 MATES: Model-Aware Data Selection for Efficient Pretraining with Data
Influence Models
link Zichun Yu, Spandan Das, Chenyan Xiong
26 2024-06-06 Transformers need glasses! Information over-squashing in language tasks link Federico Barbero, Andrea Banino,..., Petar Veličković
26 2024-02-18 Federated Fine-tuning of Large Language Models under Heterogeneous Tasks
and Client Resources
link Jiamu Bai, Daoyuan Chen,..., Yaliang Li
26 2024-03-05 Found in the Middle: How Language Models Use Long
Contexts Better via Plug-and-Play Positional Encoding
link Zhenyu Zhang, Runjin Chen,..., Zhangyang Wang
26 2024-05-07 KV Cache is 1 Bit Per Channel: Efficient Large
Language Model Inference with Coupled Quantization
link Tianyi Zhang, Jonah Wonkyu Yi,..., Anshumali Shrivastava
26 2024-03-01 Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models
by Exploring Refusal Loss Landscapes
link Xiaomeng Hu, Pin-Yu Chen, Tsung-Yi Ho
26 2024-05-23 Calibrated Self-Rewarding Vision Language Models link Yiyang Zhou, Zhiyuan Fan,..., Huaxiu Yao
26 2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning link Siddhant Haldar, Zhuoran Peng, Lerrel Pinto
26 None IMAGPose: A Unified Conditional Framework for Pose-Guided Person Generation link Fei Shen, Jinhui Tang
26 2024-05-09 CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts link Jiachen Li, Xinyao Wang,..., Longyin Wen
25 2023-12-06 Return of Unconditional Generation: A Self-supervised Representation Generation Method link Tianhong Li, Dina Katabi, Kaiming He
25 2024-07-08 GenArtist: Multimodal LLM as an Agent for Unified Image
Generation and Editing
link Zhenyu Wang, Aoxue Li,..., Xihui Liu
25 2024-02-09 Fight Back Against Jailbreaking via Prompt Adversarial Tuning link Yichuan Mo, Yuji Wang,..., Yisen Wang
25 2024-02-19 Query-Based Adversarial Prompt Generation link Jonathan Hayase, Ema Borevković,..., Milad Nasr
25 2024-02-29 Theoretical Foundations of Deep Selective State-Space Models link Nicola Muca Cirone, Antonio Orvieto,..., Terry Lyons
25 2024-09-09 FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank
Adaptations
link Ziyao Wang, Zheyu Shen,..., Ang Li
25 2024-03-14 MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space
Models
link Zunnan Xu, Yukang Lin,..., Xiu Li
25 2024-05-23 ZipCache: Accurate and Efficient KV Cache Quantization with Salient
Token Identification
link Yefei He, Luoming Zhang,..., Bohan Zhuang
24 2024-08-19 MeshFormer : High-Quality Mesh Generation with 3D-Guided Reconstruction Model link Minghua Liu, Chong Zeng,..., Hao Su
24 2024-05-30 Transfer Q-star : Principled Decoding for LLM Alignment link Souradip Chakraborty, Soumya Suvra Ghosal,..., Furong Huang
24 2024-06-13 On Softmax Direct Preference Optimization for Recommendation link Yuxin Chen, Junfei Tan,..., Tat-Seng Chua
24 2024-03-28 InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction link Sirui Xu, Ziyin Wang,..., Liangyan Gui
24 2024-02-15 Self-Play Fine-tuning of Diffusion Models for Text-to-image Generation link Huizhuo Yuan, Zixiang Chen,..., Quanquan Gu
24 2024-08-19 Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models link Aviv Bick, Kevin Li,..., Albert Gu
24 2024-05-27 EM Distillation for One-step Diffusion Models link Sirui Xie, Zhisheng Xiao,..., Ruiqi Gao
24 2024-05-19 FIFO-Diffusion: Generating Infinite Videos from Text without Training link Jihwan Kim, Junoh Kang,..., Bohyung Han
24 2024-05-31 Amortizing intractable inference in diffusion models for vision, language,
and control
link Siddarth Venkatraman, Moksh Jain,..., Nikolay Malkin
24 2024-07-23 Harmonizing Visual Text Comprehension and Generation link Zhen Zhao, Jingqun Tang,..., Yuan Xie
24 2023-10-26 Transformers Learn to Achieve Second-Order Convergence Rates for In-Context
Linear Regression
link Deqing Fu, Tian-qi Chen,..., Vatsal Sharan
24 2024-05-31 ContextGS : Compact 3D Gaussian Splatting with Anchor Level
Context Model
link Yufei Wang, Zhihao Li,..., Bihan Wen
23 2024-02-19 A Critical Evaluation of AI Feedback for Aligning Large
Language Models
link Archit Sharma, Sedrick Keh,..., Thomas Kollar
23 2024-06-06 Multistep Distillation of Diffusion Models via Moment Matching link Tim Salimans, Thomas Mensink,..., Emiel Hoogeboom
23 2024-05-30 Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from
Semantic Similarities
link Alexander V Nikitin, Jannik Kossen,..., Pekka Marttinen
23 2024-04-22 SOFTS: Efficient Multivariate Time Series Forecasting with Series-Core Fusion link Lu Han, Xu-Yang Chen,..., De-Chuan Zhan
23 2024-05-29 Preference Learning Algorithms Do Not Learn Preference Rankings link Angelica Chen, Sadhika Malladi,..., Kyunghyun Cho
23 2022-08-22 Efficiency of the First-Price Auction in the Autobidding World link Yuan Deng, Jieming Mao,..., Song Zuo
23 2024-06-11 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion
Models
link Heng Yu, Chaoyang Wang,..., Hsin-Ying Lee
23 2024-03-22 Can large language models explore in-context? link Akshay Krishnamurthy, Keegan Harris,..., Aleksandrs Slivkins
23 2024-05-30 CV-VAE: A Compatible Video VAE for Latent Generative Video
Models
link Sijie Zhao, Yong Zhang,..., Ying Shan
23 2024-05-30 Group Robust Preference Optimization in Reward-free RLHF link Shyam Sundhar Ramesh, Yifan Hu,..., Ilija Bogunovic
23 2024-05-24 iVideoGPT: Interactive VideoGPTs are Scalable World Models link Jialong Wu, Shaofeng Yin,..., Mingsheng Long
22 2024-04-19 Ensemble Learning for Heterogeneous Large Language Models with Deep
Parallel Collaboration
link Yichong Huang, Xiaocheng Feng,..., Bing Qin
22 2024-07-09 Scaling Retrieval-Based Language Models with a Trillion-Token Datastore link Rulin Shao, Jacqueline He,..., Pang Wei Koh
22 2024-05-20 Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem
Solving
link Aniket Rajiv Didolkar, Anirudh Goyal,..., Sanjeev Arora
22 2024-06-10 Aligning Large Language Models with Representation Editing: A Control
Perspective
link Lingkai Kong, Haorui Wang,..., Chao Zhang
22 2023-10-11 MatFormer: Nested Transformer for Elastic Inference link Fnu Devvrit, Sneha Kudugunta,..., Prateek Jain
22 2024-02-04 AutoTimes: Autoregressive Time Series Forecasters via Large Language Models link Yong Liu, Guo Qin,..., Mingsheng Long
22 2024-06-12 Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework
from Logit Difference
link Jiabao Ji, Yujian Liu,..., Shiyu Chang
21 2024-04-22 MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making link Yubin Kim, Chanwoo Park,..., Hae Won Park
21 2024-06-15 Voxel Mamba: Group-Free State Space Models for Point Cloud
based 3D Object Detection
link Guowen Zhang, Lue Fan,..., Lei Zhang
21 2024-06-14 Large language model validity via enhanced conformal prediction methods link John Cherian, Isaac Gibbs, Emmanuel Candes
21 2024-06-17 Transcoders find interpretable LLM feature circuits link Jacob Dunefsky, Philippe Chlenski, Neel Nanda
21 2023-08-04 Adaptive Proximal Gradient Method for Convex Optimization link Yura Malitsky, Konstantin Mishchenko
21 None Are More LLM Calls All You Need? Towards the
Scaling Properties of Compound AI Systems
link Lingjiao Chen, Jared Quincy Davis,..., James Zou
21 2024-06-03 Neural network learns low-dimensional polynomials with SGD near the
information-theoretic limit
link Jason D. Lee, Kazusato Oko,..., Denny Wu
21 2024-05-24 Medformer: A Multi-Granularity Patching Transformer for Medical Time-Series Classification link Yihe Wang, Nan Huang,..., Xiang Zhang
21 2024-02-15 BitDelta: Your Fine-Tune May Only Be Worth One Bit link James Liu, Guangxuan Xiao,..., Tianle Cai
21 2024-05-17 ProSST: Protein Language Modeling with Quantized Structure and Disentangled
Attention
link Mingchen Li, Yang Tan,..., Liang Hong
21 2024-06-11 Zero-shot Image Editing with Reference Imitation link Xi Chen, Yutong Feng,..., Hengshuang Zhao
21 2024-06-17 Exploring the Role of Large Language Models in Prompt
Encoding for Diffusion Models
link Bingqi Ma, Zhuofan Zong,..., Yu Liu
21 2024-05-22 ReVideo: Remake a Video with Motion and Content Control link Chong Mou, Mingdeng Cao,..., Jian Zhang
20 2024-06-27 Resolving Discrepancies in Compute-Optimal Scaling of Language Models link Tomer Porian, Mitchell Wortsman,..., Yair Carmon
20 2024-05-27 Navigating the Safety Landscape: Measuring Risks in Finetuning Large
Language Models
link ShengYun Peng, Pin-Yu Chen,..., Duen Horng Chau
20 2024-06-18 SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code Agents link Niels Mündler, Mark Niklas Mueller,..., Martin Vechev
20 2024-04-23 Gradient Guidance for Diffusion Models: An Optimization Perspective link Yingqing Guo, Hui Yuan,..., Mengdi Wang
20 2024-05-29 Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors link Zihui Wu, Yu Sun,..., Katherine Bouman
20 2024-03-25 Provably Robust Score-Based Diffusion Posterior Sampling for Plug-and-Play Image
Reconstruction
link Xingyu Xu, Yuejie Chi
20 2024-06-12 DiTFastAttn: Attention Compression for Diffusion Transformer Models link Zhihang Yuan, Hanling Zhang,..., Yu Wang
20 2024-05-29 Weak-to-Strong Search: Align Large Language Models via Searching over
Small Language Models
link Zhanhui Zhou, Zhixuan Liu,..., Yu Qiao
20 2024-09-26 From News to Forecast: Integrating Event Analysis in LLM-Based
Time Series Forecasting with Reflection
link Xinlei Wang, Maike Feng,..., Junhua Zhao
20 2024-05-23 Instruction Tuning With Loss Over Instructions link Zhengyan Shi, Adam X. Yang,..., Aldo Lipani
20 2024-05-28 Knowledge Circuits in Pretrained Transformers link Yunzhi Yao, Ningyu Zhang,..., Huajun Chen
20 2024-05-27 PromptFix: You Prompt and We Fix the Photo link Yongsheng Yu, Ziyun Zeng,..., Jiebo Luo
20 2024-04-04 CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching link Dongzhi Jiang, Guanglu Song,..., Hongsheng Li
20 2024-05-23 WISE: Rethinking the Knowledge Memory for Lifelong Model Editing
of Large Language Models
link Peng Wang, Zexi Li,..., Huajun Chen
20 2024-03-19 Pretraining Codomain Attention Neural Operators for Solving Multiphysics PDEs link Md Ashiqur Rahman, Robert Joseph George,..., Anima Anandkumar
19 2024-05-23 PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression link Vladimir Malinovskii, Denis Mazur,..., Peter Richtárik
19 2024-10-21 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D
Diffusion Priors
link Xi Liu, Chaoyi Zhou, Siyu Huang
19 2024-06-13 Understanding Hallucinations in Diffusion Models through Mode Interpolation link Sumukh K Aithal, Pratyush Maini,..., J Zico Kolter
19 2024-06-10 AutoSurvey: Large Language Models Can Automatically Write Surveys link Yidong Wang, Qi Guo,..., Yue Zhang
19 2024-01-29 Contracting with a Learning Agent link Guru Guruganesh, Yoav Kolumbus,..., S. Matthew Weinberg
19 2024-11-04 Can Language Models Learn to Skip Steps? link Tengxiao Liu, Qipeng Guo,..., Zheng Zhang
19 2024-10-10 Generalizable and Animatable Gaussian Head Avatar link Xuangeng Chu, Tatsuya Harada
19 2024-05-23 Base of RoPE Bounds Context Length link Mingyu Xu, Xin Men,..., weipeng chen
19 2024-03-25 Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization link Xiangxin Zhou, Dongyu Xue,..., Quanquan Gu
19 2024-05-28 Personalized Steering of Large Language Models: Versatile Steering Vectors
Through Bi-directional Preference Optimization
link Yuanpu Cao, Tianrong Zhang,..., Jinghui Chen
19 2024-05-30 Improving the Training of Rectified Flows link Sangyun Lee, Zinan Lin, Giulia Fanti
19 2024-03-28 Dual-Personalizing Adapter for Federated Foundation Models link yiyuan yang, Guodong Long,..., Michael Blumenstein
19 2024-01-18 Cross-Modality Perturbation Synergy Attack for Person Re-identification link Yunpeng Gong, Zhun Zhong,..., Min Jiang
19 2024-08-28 Efficient LLM Scheduling by Learning to Rank link Yichao Fu, Siqi Zhu,..., Hao Zhang
19 2024-06-13 Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models
and Time-Dependent Layer Normalization
link Qihao Liu, Zhanpeng Zeng,..., Liang-Chieh Chen
19 2024-05-23 Representation Noising: A Defence Mechanism Against Harmful Finetuning link Domenic Rosati, Jan Wehner,..., Frank Rudzicz
19 2024-04-04 Mind's Eye of LLMs: Visualization-of-Thought Elicits Spatial Reasoning in
Large Language Models
link Wenshan Wu, Shaoguang Mao,..., Furu Wei
19 2024-05-22 DOGS: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction
Via Gaussian Consensus
link Yu Chen, Gim Hee Lee
19 2024-05-25 Theoretical Analysis of Weak-to-Strong Generalization link Hunter Lang, David Sontag, Aravindan Vijayaraghavan
19 2024-05-28 FreeSplat: Generalizable 3D Gaussian Splatting Towards Free View Synthesis
of Indoor Scenes
link Yunsong Wang, Tianxin Huang,..., Gim Hee Lee
18 2024-05-29 Adaptive Image Quality Assessment via Teaching Large Multimodal Model
to Compare
link Hanwei Zhu, Haoning Wu,..., Shiqi Wang
18 2024-07-05 Better by default: Strong pre-tuned MLPs and boosted trees
on tabular data
link David Holzmüller, Leo Grinsztajn, Ingo Steinwart
18 2024-09-01 ContextCite: Attributing Model Generation to Context link Benjamin Cohen-Wang, Harshay Shah,..., Aleksander Madry
18 2024-01-11 Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents link Quentin Delfosse, Sebastian Sztwiertnia,..., Kristian Kersting
18 2024-06-20 MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in
LLMs
link Zhongshen Zeng, Yinhong Liu,..., Jiaya Jia
18 2024-06-20 Prism: A Framework for Decoupling and Assessing the Capabilities
of VLMs
link Yuxuan Qiao, Haodong Duan,..., Kai Chen
18 2024-05-16 Conformal Alignment: Knowing When to Trust Foundation Models with
Guarantees
link Yu Gui, Ying Jin, Zhimei Ren
18 2024-03-07 Online Adaptation of Language Models with a Memory of
Amortized Contexts
link Jihoon Tack, Jaehyung Kim,..., Jonathan Richard Schwarz
18 2024-08-07 Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon
Tasks
link Zaijing Li, Yuquan Xie,..., Liqiang Nie
18 2024-05-23 Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion Models link Gen Li, Yuling Yan
18 2024-06-06 BitsFusion: 1.99 bits Weight Quantization of Diffusion Model link Yang Sui, Yanyu Li,..., Jian Ren
18 2024-03-12 Visual Decoding and Reconstruction via EEG Embeddings with Guided
Diffusion
link Dongyang Li, Chen Wei,..., Quanying Liu
18 2024-03-12 Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models link Yang Jiao, Shaoxiang Chen,..., Yu-Gang Jiang
18 2024-06-18 HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors link Panwang Pan, Zhuo Su,..., Yebin Liu
18 2024-06-03 TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy link Weichao Zhao, Hao Feng,..., Can Huang
18 2024-07-02 UltraPixel: Advancing Ultra High-Resolution Image Synthesis to New Peaks link Jingjing Ren, Wenbo Li,..., Lei Zhu
18 2024-10-21 Mitigating Object Hallucination via Concentric Causal Attention link Yun Xing, Yiheng Li,..., Shijian Lu
18 2024-07-01 Evaluation of Text-to-Video Generation Models: A Dynamics Perspective link Mingxiang Liao, Hannan Lu,..., Xinyu Zhang
18 2024-05-25 PTQ4DiT: Post-training Quantization for Diffusion Transformers link Junyi Wu, Haoxuan Wang,..., Yan Yan
18 2024-06-14 Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections link Jiacong Xu, Yiqun Mei, Vishal M. Patel
17 2024-10-08 Unlocking the Capabilities of Thought: A Reasoning Boundary Framework
to Quantify and Optimize Chain-of-Thought
link Qiguang Chen, Libo Qin,..., Wanxiang Che
17 2024-06-24 Finding Transformer Circuits With Edge Pruning link Adithya Bhaskar, Alexander Wettig,..., Danqi Chen
17 2024-09-26 HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection link Xuefeng Du, Chaowei Xiao, Yixuan Li
17 2024-06-05 Dynamic 3D Gaussian Fields for Urban Areas link Tobias Fischer, Jonas Kulhanek,..., Peter Kontschieder
17 2023-10-19 AutoMix: Automatically Mixing Language Models link Pranjal Aggarwal, Aman Madaan,..., Mausam .
17 2024-06-03 What makes unlearning hard and what to do about
it
link Kairan Zhao, Meghdad Kurmanji,..., Peter Triantafillou
17 2024-01-24 Beyond Concept Bottleneck Models: How to Make Black Boxes
Intervenable?
link Sonia Laguna, Ričards Marcinkevičs,..., Julia E Vogt
17 2024-05-27 BehaviorGPT: Smart Agent Simulation for Autonomous Driving with Next-Patch
Prediction
link Zikang Zhou, Haibo HU,..., Chun Jason Xue
17 2024-09-11 Gated Slot Attention for Efficient Linear-Time Sequence Modeling link Yu Zhang, Songlin Yang,..., Guohong Fu
17 2024-05-27 GenWarp: Single Image to Novel Views with Semantic-Preserving Generative
Warping
link Junyoung Seo, Kazumi Fukuda,..., Yuki Mitsufuji
17 2024-02-09 CultureLLM: Incorporating Cultural Differences into Large Language Models link CHENG LI, Mengzhuo Chen,..., Xing Xie
17 2024-03-12 SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language
Models by Summarizing Training Trajectories of Small Models
link Yu Yang, Siddhartha Mishra,..., Baharan Mirzasoleiman
17 2024-05-24 Meteor: Mamba-based Traversal of Rationale for Large Language and
Vision Models
link Byung-Kwan Lee, Chae Won Kim,..., Yong Man Ro
17 2023-12-20 UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of
Complex Scenes with Reflections
link Fangjinhua Wang, Marie-Julie Rakotosaona,..., Federico Tombari
17 2024-06-06 ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization link Luca Eyring, Shyamgopal Karthik,..., Zeynep Akata
17 2024-01-08 Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced
Zero/Few-Shot Forecasting of Multivariate Time Series
link Vijay Ekambaram, Arindam Jati,..., Jayant Kalagnanam
17 2024-05-23 Video Diffusion Models are Training-free Motion Interpreter and Controller link Zeqi Xiao, Yifan Zhou,..., Xingang Pan
17 2024-05-24 Score Distillation via Reparametrized DDIM link Artem Lukoianov, Haitz Sáez de Ocáriz Borde,..., Justin Solomon
17 2024-02-07 Improved off-policy training of diffusion samplers link Marcin Sendera, Minsu Kim,..., Nikolay Malkin
17 2024-09-29 One Token to Seg Them All: Language Instructed Reasoning
Segmentation in Videos
link Zechen Bai, Tong He,..., Mike Zheng Shou
17 2024-06-21 GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian
Generation
link Chubin Zhang, Hongliang Song,..., Yansong Tang
17 2024-06-13 COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video
Editing
link Jiangshan Wang, Yue Ma,..., Xiu Li
16 2024-05-25 Bigger, Regularized, Optimistic: scaling for compute and sample efficient
continuous control
link Michal Nauman, Mateusz Ostaszewski,..., Marek Cygan
16 2024-02-02 AMOR: A Recipe for Building Adaptable Modular Knowledge Agents
Through Process Feedback
link Jian Guan, Wei Wu,..., Minlie Huang
16 2024-03-09 Algorithmic progress in language models link Anson Ho, Tamay Besiroglu,..., Jaime Sevilla
16 2024-05-23 PaGoDA: Progressive Growing of a One-Step Generator from a
Low-Resolution Diffusion Teacher
link Dongjun Kim, Chieh-Hsin Lai,..., Stefano Ermon
16 2024-02-05 Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models link Yuancheng Xu, Jiarui Yao,..., Furong Huang
16 2024-06-03 SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning link Yangruibo Ding, Jinjun Peng,..., Baishakhi Ray
16 2024-02-26 SelectIT: Selective Instruction Tuning for LLMs via Uncertainty-Aware Self-Reflection link Liangxin Liu, Xuebo Liu,..., Min Zhang
16 2024-05-24 Quantifying the Gain in Weak-to-Strong Generalization link Moses Charikar, Chirag Pabbaraju, Kirankumar Shiragur
16 2024-05-23 Fisher Flow Matching for Generative Modeling over Discrete Data link Oscar Davis, Samuel Kessler,..., Joey Bose
16 2024-06-12 A Concept-Based Explainability Framework for Large Multimodal Models link Jayneel Parekh, Pegah KHAYATAN,..., Matthieu Cord
16 2024-10-31 SelfCodeAlign: Self-Alignment for Code Generation link Yuxiang Wei, Federico Cassano,..., LINGMING ZHANG
16 2024-07-16 Animate3D: Animating Any 3D Model with Multi-view Video Diffusion link Yanqin Jiang, Chaohui Yu,..., Jin Gao
16 2024-07-22 QueST: Self-Supervised Skill Abstractions for Learning Continuous Control link Atharva Mete, Haotian Xue,..., Animesh Garg
16 2024-02-05 Estimating Epistemic and Aleatoric Uncertainty with a Single Model link Matthew Albert Chan, Maria J. Molina, Christopher Metzler
16 2024-02-05 FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion link Xing Han, Huy Nguyen,..., Suchi Saria
16 2024-05-23 Lorentz-Equivariant Geometric Algebra Transformers for High-Energy Physics link Jonas Spinner, Victor Breso Pla,..., Johann Brehmer
16 2024-05-22 Dense Connector for MLLMs link Huanjin Yao, Wenhao Wu,..., Jingdong Wang
16 2024-05-04 U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers link Yuchuan Tian, Zhijun Tu,..., Yunhe Wang
16 2024-11-04 DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for
Efficient Robot Execution
link Yang Yue, Yulin Wang,..., Gao Huang
16 2024-06-12 Vivid-ZOO: Multi-View Video Generation with Diffusion Model link Bing Li, Cheng Zheng,..., Bernard Ghanem
15 None Not All Tokens Are What You Need for Pretraining link Zhenghao Lin, Zhibin Gou,..., Weizhu Chen
15 2024-06-10 Get rich quick: exact solutions reveal how unbalanced initializations
promote rapid feature learning
link Daniel Kunin, Allan Raventos,..., Surya Ganguli
15 2024-05-24 Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear
Time Complexity
link Haoxuan Chen, Yinuo Ren,..., Grant M. Rotskoff
15 2024-05-23 Axioms for AI Alignment from Human Feedback link Luise Ge, Daniel Halpern,..., Junlin Wu
15 2024-06-03 MediQ: Question-Asking LLMs and a Benchmark for Reliable Interactive
Clinical Reasoning
link Shuyue Stella Li, Vidhisha Balachandran,..., Yulia Tsvetkov
15 2023-10-21 Masked Hard-Attention Transformers Recognize Exactly the Star-Free Languages link Andy Yang, David Chiang, Dana Angluin
15 2024-04-08 SpeechAlign: Aligning Speech Generation to Human Preferences link Dong Zhang, Zhaowei Li,..., Xipeng Qiu
15 2024-05-28 Getting More Juice Out of the SFT Data: Reward
Learning from Human Demonstration Improves SFT for LLM Alignment
link Jiaxiang Li, Siliang Zeng,..., Mingyi Hong
15 2024-06-12 Scaling Laws in Linear Regression: Compute, Parameters, and Data link Licong Lin, Jingfeng Wu,..., Jason D. Lee
15 2024-06-06 VideoTetris: Towards Compositional Text-to-Video Generation link Ye Tian, Ling Yang,..., Bin CUI
15 2024-06-12 Discovering Preference Optimization Algorithms with and for Large Language
Models
link Chris Lu, Samuel Holt,..., Robert Tjarko Lange
15 2024-04-22 Protecting Your LLMs with Information Bottleneck link Zichuan Liu, Zefan Wang,..., Jiang Bian
15 2024-01-27 DiffuserLite: Towards Real-time Diffusion Planning link Zibin Dong, Jianye HAO,..., YAN ZHENG
15 2024-09-05 Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene
Understanding
link Yunze Man, Shuhong Zheng,..., Yu-Xiong Wang
15 2024-05-15 Spectral Editing of Activations for Large Language Model Alignment link Yifu QIU, Zheng Zhao,..., Shay B Cohen
15 2024-06-14 UniAudio 1.5: Large Language Model-Driven Audio Codec is A
Few-Shot Audio Task Learner
link Dongchao Yang, Haohan Guo,..., Helen M. Meng
15 2024-04-05 Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models link Sangwon Jang, Jaehyeong Jo,..., Sung Ju Hwang
15 2024-05-31 R$^2$-Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction link Ruyi Zha, Tao Jun Lin,..., Hongdong Li
15 2024-07-08 Multi-Object Hallucination in Vision Language Models link Xuweiyi Chen, Ziqiao Ma,..., Joyce Chai
15 2024-05-24 HDR-GS: Efficient High Dynamic Range Novel View Synthesis at
1000x Speed via Gaussian Splatting
link Yuanhao Cai, Zihao Xiao,..., Alan Yuille
15 2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without
Guidance
link Kuan Heng Lin, Sicheng Mo,..., Bolei Zhou
15 2024-10-25 DiffGS: Functional Gaussian Splatting Diffusion link Junsheng Zhou, Weiqi Zhang, Yu-Shen Liu
15 2024-10-18 Neural Signed Distance Function Inference through Splatting 3D Gaussians
Pulled on Zero-Level Set
link Wenyuan Zhang, Yu-Shen Liu, Zhizhong Han
15 2024-02-29 UniTS: A Unified Multi-Task Time Series Model link Shanghua Gao, Teddy Koker,..., Marinka Zitnik
15 2024-06-13 Yo'LLaVA: Your Personalized Language and Vision Assistant link Thao Nguyen, Haotian Liu,..., Yong Jae Lee
15 2024-03-03 GuardT2I: Defending Text-to-Image Models from Adversarial Prompts link Yijun Yang, Ruiyuan Gao,..., Qiang Xu
15 2024-06-12 Large Language Models Must Be Taught to Know What
They Don’t Know
link Sanyam Kapoor, Nate Gruver,..., Andrew Gordon Wilson
14 2024-02-29 RL-GPT: Integrating Reinforcement Learning and Code-as-policy link Shaoteng Liu, Haoqi Yuan,..., Jiaya Jia
14 2024-09-27 CycleNet: Enhancing Time Series Forecasting through Modeling Periodic Patterns link Shengsheng Lin, Weiwei Lin,..., Haocheng Zhong
14 2024-03-25 QKFormer: Hierarchical Spiking Transformer using Q-K Attention link Chenlin Zhou, Han Zhang,..., Yonghong Tian
14 2024-06-20 CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics link Jiawei Gao, Ziqin Wang,..., Jiangmiao Pang
14 2024-06-13 4M-21: An Any-to-Any Vision Model for Tens of Tasks
and Modalities
link Roman Bachmann, Oğuzhan Fatih Kar,..., Amir Zamir
14 2024-10-30 FlowLLM: Flow Matching for Material Generation with Large Language
Models as Base Distributions
link Anuroop Sriram, Benjamin Kurt Miller,..., Brandon M Wood
14 2024-09-26 Generative Modeling of Molecular Dynamics Trajectories link Bowen Jing, Hannes Stark,..., Bonnie Berger
14 2024-01-22 Self-Labeling the Job Shop Scheduling Problem link Andrea Corsini, Angelo Porrello,..., Mauro Dell'Amico
14 2023-10-10 A General Protocol to Probe Large Vision Models for
3D Physical Understanding
link Guanqi Zhan, Chuanxia Zheng,..., Andrew Zisserman
14 2023-12-13 SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention link Róbert Csordás, Piotr Piękos,..., Jürgen Schmidhuber
14 2024-02-07 Amortized Planning with Large-Scale Transformers: A Case Study on
Chess
link Anian Ruoss, Gregoire Deletang,..., Tim Genewein
14 2024-06-05 HYDRA: Model Factorization Framework for Black-Box LLM Personalization link Yuchen Zhuang, Haotian Sun,..., Bo Dai
14 2024-07-09 FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal
Reinforcement for Enhanced Financial Decision Making
link Yangyang Yu, Zhiyuan Yao,..., Qianqian Xie
14 2024-06-27 Length Optimization in Conformal Prediction link Shayan Kiyani, George J. Pappas, Hamed Hassani
14 2024-06-13 Rethinking Score Distillation as a Bridge Between Image Distributions link David McAllister, Songwei Ge,..., Angjoo Kanazawa
14 2024-12-05 SceneDiffuser: Efficient and Controllable Driving Simulation Initialization and Rollout link Chiyu Max Jiang, Yijing Bai,..., Dragomir Anguelov
14 2024-06-25 DiffusionPDE: Generative PDE-Solving under Partial Observation link Jiahe Huang, Guandao Yang,..., Jeong Joon Park
14 2024-05-25 Breaking the False Sense of Security in Backdoor Defense
through Re-Activation Attack
link Mingli Zhu, Siyuan Liang, Baoyuan Wu
14 2024-05-23 ALI-Agent: Assessing LLMs' Alignment with Human Values via
Agent-based Evaluation
link Jingnan Zheng, Han Wang,..., Tat-Seng Chua
14 2024-06-03 The Importance of Online Data: Understanding Preference Fine-tuning via
Coverage
link Yuda Song, Gokul Swamy,..., Wen Sun
14 2024-06-17 AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive
Reasoning
link Shirley Wu, Shiyu Zhao,..., James Zou
14 2024-03-25 MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models link Kailai Yang, Zhiwei Liu,..., Sophia Ananiadou
14 2024-05-23 Agent Planning with World Knowledge Model link Shuofei Qiao, Runnan Fang,..., Huajun Chen
14 2024-05-27 Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with
Dynamic Gaussian Surfels
link Yikai Wang, Xinzhou Wang,..., Jun Zhu
14 2024-11-07 MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views link Yuedong Chen, Chuanxia Zheng,..., Jianfei Cai
14 2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion link Haian Jin, Yuan Li,..., Noah Snavely
14 2024-05-02 FLAME : Factuality-Aware Alignment for Large Language Models link Sheng-Chieh Lin, Luyu Gao,..., Xilun Chen
14 2024-04-25 Cooperate or Collapse: Emergence of Sustainable Cooperation in
a Society of LLM Agents
link Giorgio Piatti, Zhijing Jin,..., Rada Mihalcea
14 2024-02-17 TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks link Benjamin Feuer, Robin Tibor Schirrmeister,..., Colin White
14 2024-06-13 LRM-Zero: Training Large Reconstruction Models with Synthesized Data link Desai Xie, Sai Bi,..., Hao Tan
13 2024-06-09 Training Compute-Optimal Protein Language Models link Xingyi Cheng, Bo Chen,..., Le Song
13 2024-09-24 TFG: Unified Training-Free Guidance for Diffusion Models link Haotian Ye, Haowei Lin,..., Stefano Ermon
13 2024-05-23 4+3 Phases of Compute-Optimal Neural Scaling Laws link Elliot Paquette, Courtney Paquette,..., Jeffrey Pennington
13 2023-10-29 Optimal Algorithms for Online Convex Optimization with Adversarial Constraints link Abhishek Sinha, Rahul Vaze
13 2024-10-31 Understanding the Limits of Vision Language Models Through the
Lens of the Binding Problem
link Declan Iain Campbell, Sunayana Rane,..., Taylor Whittington Webb
13 2024-07-14 What Makes and Breaks Safety Fine-tuning? A Mechanistic Study link Samyak Jain, Ekdeep Singh Lubana,..., Puneet K. Dokania
13 2024-02-21 Linear Transformers are Versatile In-Context Learners link Max Vladymyrov, Johannes Von Oswald,..., Rong Ge
13 2024-06-01 RGFN: Synthesizable Molecular Generation Using GFlowNets link Michał Koziarski, Andrei Rekesh,..., Robert A. Batey
13 2024-05-29 Grasp as You Say: Language-guided Dexterous Grasp Generation link Yi-Lin Wei, Jian-Jian Jiang,..., Wei-Shi Zheng
13 2024-05-07 Towards a Theoretical Understanding of the 'Reversal Curse' via
Training Dynamics
link Hanlin Zhu, Baihe Huang,..., Stuart Russell
13 2024-06-13 Talking Heads: Understanding Inter-Layer Communication in Transformer Language Models link Jack Merullo, Carsten Eickhoff, Ellie Pavlick
13 2023-11-03 Towards Calibrated Robust Fine-Tuning of Vision-Language Models link Changdae Oh, Hyesu Lim,..., Kyungwoo Song
13 2024-06-04 Loki: Low-rank Keys for Efficient Sparse Attention link Prajwal Singhania, Siddharth Singh,..., Abhinav Bhatele
13 2024-05-30 Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning
of Diffusion Models
link Masatoshi Uehara, Yulai Zhao,..., Tommaso Biancalani
13 2024-06-03 LoFiT: Localized Fine-tuning on LLM Representations link Fangcong Yin, Xi Ye, Greg Durrett
13 2024-05-26 Code Repair with LLMs gives an Exploration-Exploitation Tradeoff link Hao Tang, Keya Hu,..., Kevin Ellis
13 2024-04-25 PhyRecon: Physically Plausible Neural Scene Reconstruction link Junfeng Ni, Yixin Chen,..., Siyuan Huang
13 2024-05-28 A Theoretical Understanding of Self-Correction through In-context Alignment link Yifei Wang, Yuyang Wu,..., Yisen Wang
13 2024-10-22 One-Step Diffusion Distillation through Score Implicit Matching link Weijian Luo, Zemin Huang,..., Guo-Jun Qi
13 2024-05-24 Transformers Represent Belief State Geometry in their Residual Stream link Adam Shai, Lucas Teixeira,..., Paul M. Riechers
13 2024-07-17 Direct Unlearning Optimization for Robust and Safe Text-to-Image Models link Yong-Hyun Park, Sangdoo Yun,..., Gayoung Lee
13 2024-06-03 D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large
Language Models
link Haoran Que, Jiaheng Liu,..., Bo Zheng
13 2024-05-29 Stress-Testing Capability Elicitation With Password-Locked Models link Ryan Greenblatt, Fabien Roger,..., David Krueger
13 2024-02-28 Implicit Optimization Bias of Next-token Prediction in Linear Models link Christos Thrampoulidis
13 2024-06-17 Large Scale Transfer Learning for Tabular Data via
Language Modeling
link Joshua P Gardner, Juan Carlos Perdomo, Ludwig Schmidt
13 2024-10-10 SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot
Object Navigation
link Hang Yin, Xiuwei Xu,..., Jiwen Lu
13 2024-02-04 Diffusion Models are Certifiably Robust Classifiers link Huanran Chen, Yinpeng Dong,..., Jun Zhu
13 2024-09-13 Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation link Qingwen Bu, Jia Zeng,..., Hongyang Li
12 2024-06-12 Self-Consuming Generative Models with Curated Data Provably Optimize Human
Preferences
link Damien Ferbach, Quentin Bertrand,..., Gauthier Gidel
12 2024-05-22 Context and Geometry Aware Voxel Transformer for Semantic Scene
Completion
link Zhu Yu, Runmin Zhang,..., Hui-liang Shen
12 2024-06-23 Trace is the Next AutoDiff: Generative Optimization with Rich
Feedback, Execution Traces, and LLMs
link Ching-An Cheng, Allen Nie, Adith Swaminathan
12 2024-06-07 The Factorization Curse: Which Tokens You Predict Underlie the
Reversal Curse and More
link Ouail Kitouni, Niklas Nolte,..., Mark Ibrahim
12 2024-05-28 Linguistic Collapse: Neural Collapse in (Large) Language Models link Robert Wu, Vardan Papyan
12 2024-06-27 OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents link Zihao Wang, Shaofei Cai,..., Yitao Liang
12 2024-10-10 Global Lyapunov functions: a long-standing open problem in mathematics,
with symbolic transformers
link Alberto Alfarano, Francois Charton, Amaury Hayat
12 2024-02-07 Universal Neural Functionals link Allan Zhou, Chelsea Finn, James Harrison
12 2024-06-20 Transferable Boltzmann Generators link Leon Klein, Frank Noe
12 2024-03-06 WaterMax: breaking the LLM watermark detectability-robustness-quality trade-off link Eva Giboulot, Teddy Furon
12 2024-02-01 Understanding the Expressive Power and Mechanisms of Transformer for
Sequence Modeling
link Mingze Wang, Weinan E
12 None EEGPT: Pretrained Transformer for Universal and Reliable Representation of
EEG Signals
link Guangyu Wang, Wenchao Liu,..., Haifeng Li
12 2024-05-24 MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and
Provable Convergence
link Ionut-Vlad Modoranu, Mher Safaryan,..., Dan Alistarh
12 2024-05-28 Exploiting LLM Quantization link Kazuki Egashira, Mark Vero,..., Martin Vechev
12 2024-07-28 SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the
Legal Domain
link Pierre Colombo, Telmo Pires,..., Michael Desa
12 2024-02-22 In-Context Learning of a Linear Transformer Block: Benefits of
the MLP Component and One-Step GD Initialization
link Ruiqi Zhang, Jingfeng Wu, Peter Bartlett
12 2023-05-22 Imprecise Label Learning: A Unified Framework for Learning with
Various Imprecise Label Configurations
link Hao Chen, Ankit Shah,..., Bhiksha Raj
12 2024-12-19 Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment link Teng Xiao, Yige Yuan,..., Vasant G Honavar
12 2024-05-27 DMPlug: A Plug-in Method for Solving Inverse Problems with
Diffusion Models
link Hengkang Wang, Xu Zhang,..., Ju Sun
12 2024-05-23 Scalable Optimization in the Modular Norm link Tim Large, Yang Liu,..., Jeremy Bernstein
12 2024-04-17 On the Scalability of GNNs for Molecular Graphs link Maciej Sypetkowski, Frederik Wenkel,..., Dominique Beaini
12 2024-06-04 SpecExec: Massively Parallel Speculative Decoding For Interactive LLM Inference
on Consumer Devices
link Ruslan Svirschevski, Avner May,..., Max Ryabinin
12 2024-02-07 QGFN: Controllable Greediness with Action Values link Elaine Lau, Stephen Zhewen Lu,..., Emmanuel Bengio
12 2024-10-03 Parameter Competition Balancing for Model Merging link Guodong DU, Junlin Lee,..., Min Zhang
12 2024-08-29 VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation link Shiwei Wu, Joya Chen,..., Mike Zheng Shou
12 2024-05-21 Single Image Unlearning: Efficient Machine Unlearning in Multimodal Large
Language Models
link Jiaqi Li, Qianshan Wei,..., Fan Liu
12 2024-05-24 Towards Understanding the Working Mechanism of Text-to-Image Diffusion Model link Mingyang Yi, Aoxue Li,..., Zhenguo Li
12 2024-02-14 InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward
Modeling
link Yuchun Miao, Sen Zhang,..., Dacheng Tao
12 2024-06-24 Confidence Regulation Neurons in Language Models link Alessandro Stolfo, Ben Peng Wu,..., Neel Nanda
12 2023-10-27 Proportional Fairness in Clustering: A Social Choice Perspective link Leon Kellerhals, Jannik Peters
12 2024-02-21 Average gradient outer product as a mechanism for deep
neural collapse
link Daniel Beaglehole, Peter Súkeník,..., Mikhail Belkin
12 2024-05-23 Metric Flow Matching for Smooth Interpolations on the Data
Manifold
link Kacper Kapusniak, Peter Potaptchik,..., Francesco Di Giovanni
12 2024-06-21 Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning link Brandon Huang, Chancharik Mitra,..., Roei Herzig
12 2024-10-24 Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse
View Synthesis
link Liang Han, Junsheng Zhou,..., Zhizhong Han
12 2024-07-13 Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers link Sukjun Hwang, Aakash Lahoti,..., Albert Gu
12 2024-02-07 The Fine-Grained Complexity of Gradient Computation for Training Large
Language Models
link Josh Alman, Zhao Song
12 2024-06-10 MVGamba: Unify 3D Content Generation as State Space Sequence
Modeling
link Xuanyu Yi, Zike Wu,..., Hanwang Zhang
12 2024-05-21 LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language link James Requeima, John F Bronskill,..., David Duvenaud
12 2024-04-23 Multi-Head Mixture-of-Experts link Xun Wu, Shaohan Huang,..., Furu Wei
12 2024-06-09 VCR-GauS: View Consistent Depth-Normal Regularizer for Gaussian Surface Reconstruction link Hanlin Chen, Fangyin Wei,..., Gim Hee Lee
12 2024-02-02 Segment Any Change link Zhuo Zheng, Yanfei Zhong,..., Stefano Ermon
12 2024-05-24 GS-Hider: Hiding Messages into 3D Gaussian Splatting link Xuanyu Zhang, Jiarui Meng,..., Jian Zhang
11 2024-05-31 LLM-ESR: Large Language Models Enhancement for Long-tailed Sequential Recommendation link Qidong Liu, Xian Wu,..., Xiangyu Zhao
11 2024-06-05 A Geometric View of Data Complexity: Efficient Local Intrinsic
Dimension Estimation with Diffusion Models
link Hamidreza Kamkari, Brendan Leigh Ross,..., Gabriel Loaiza-Ganem
11 2024-02-22 Watermarking Makes Language Models Radioactive link Tom Sander, Pierre Fernandez,..., Teddy Furon
11 2024-02-18 In-Context Learning with Transformers: Softmax Attention Adapts to Function
Lipschitzness
link Liam Collins, Advait U Parulekar,..., Sanjay Shakkottai
11 2024-02-09 Learn To be Efficient: Build Structured Sparsity in Large
Language Models
link Haizhong Zheng, Xiaoyan Bai,..., Atul Prakash
11 2024-05-24 Stacking Your Transformers: A Closer Look at Model Growth
for Efficient LLM Pre-Training
link Wenyu Du, Tongxu Luo,..., Jie Fu
11 2024-05-27 MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities link Hao Dong, Yue Zhao,..., Olga Fink
11 2024-02-06 A Phase Transition between Positional and Semantic Learning in
a Solvable Model of Dot-Product Attention
link Hugo Cui, Freya Behrens,..., Lenka Zdeborova
11 2024-02-16 Conformalized Credal Set Predictors link Alireza Javanmardi, David Stutz, Eyke Hüllermeier
11 2024-06-09 Distributional Preference Alignment of LLMs via Optimal Transport link Igor Melnyk, Youssef Mroueh,..., Jarret Ross
11 2024-09-26 DarkSAM: Fooling Segment Anything Model to Segment Nothing link Ziqi Zhou, Yufei Song,..., Hai Jin
11 2024-02-06 Scaling laws for learning with real and surrogate data link Ayush Jain, Andrea Montanari, Eren Sasoglu
11 2024-06-10 How Far Can Transformers Reason? The Globality Barrier and
Inductive Scratchpad
link Emmanuel Abbe, Samy Bengio,..., Omid Saremi
11 2024-09-30 Magnet: We Never Know How Text-to-Image Diffusion Models Work,
Until We Learn How Vision-Language Models Function
link Chenyi Zhuang, Ying Hu, Pan Gao
11 2024-03-02 Accelerating Greedy Coordinate Gradient and General Prompt Optimization via
Probe Sampling
link Yiran Zhao, Wenyue Zheng,..., Michael Shieh
11 2024-05-23 Nearly Tight Black-Box Auditing of Differentially Private Machine Learning link Meenatchi Sundaram Muthu Selva Annamalai, Emiliano De Cristofaro
11 2024-04-06 MACM: Utilizing a Multi-Agent System for Condition Mining in
Solving Complex Mathematical Problems
link Bin Lei, Yi Zhang,..., Caiwen Ding
11 2024-02-21 Full-Atom Peptide Design with Geometric Latent Diffusion link Xiangzhe Kong, Yinjun Jia,..., Yang Liu
11 2024-10-07 TableRAG: Million-Token Table Understanding with Language Models link Si-An Chen, Lesly Miculicich,..., Tomas Pfister
11 2024-02-06 AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies link Xixi Hu, qiang liu,..., Bo Liu
11 2024-06-07 Retrieval & Fine-Tuning for In-Context Tabular Models link Valentin Thomas, Junwei Ma,..., Anthony L. Caterini
11 2024-04-18 Thought of Search: Planning with Language Models Through The
Lens of Efficiency
link Michael Katz, Harsha Kokel,..., Shirin Sohrabi
11 2024-02-01 Credal Learning Theory link Michele Caprio, Maryam Sultana,..., Fabio Cuzzolin
11 2024-04-23 SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-Tuning link Yexiao He, Ziyao Wang,..., Ang Li
11 2024-10-25 Utilizing Image Transforms and Diffusion Models for Generative
Modeling of Short and Long Time Series
link Ilan Naiman, Nimrod Berman,..., Omri Azencot
11 2024-05-23 Proving Theorems Recursively link Haiming Wang, Huajian Xin,..., Xiaodan Liang
11 2024-06-01 Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching link Yongqi Wang, Wenxiang Guo,..., Zhou Zhao
11 2024-02-22 Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless
Video Pre-Training
link Haoran He, Chenjia Bai,..., Xuelong Li
11 2024-02-03 GITA: Graph to Visual and Textual Integration for Vision-Language
Graph Reasoning
link Yanbin Wei, Shuai Fu,..., Yu Zhang
11 2024-03-19 Optimal Flow Matching: Learning Straight Trajectories in Just One
Step
link Nikita Maksimovich Kornilov, Petr Mokrov,..., Alexander Korotin
11 2024-06-10 Decision-Making Behavior Evaluation Framework for LLMs under Uncertain Context link Jingru Jia, Zehua Yuan,..., Deming Chen
11 2024-05-30 Jailbreaking Large Language Models Against Moderation Guardrails via Cipher
Characters
link Haibo Jin, Andy Zhou,..., Haohan Wang
11 2024-05-25 M$^3$GPT: An Advanced Multimodal, Multitask Framework for Motion
Comprehension and Generation
link Mingshuang Luo, RuiBing Hou,..., Shiguang Shan
11 2024-09-11 NVRC: Neural Video Representation Compression link Ho Man Kwan, Ge Gao,..., David Bull
11 2023-11-01 Learning Cooperative Trajectory Representations for Motion Forecasting link Hongzhi Ruan, Haibao Yu,..., Zaiqing Nie
11 2024-03-18 Dynamic Tuning Towards Parameter and Inference Efficiency for ViT
Adaptation
link Wangbo Zhao, Jiasheng Tang,..., Yang You
11 2024-10-24 Large Spatial Model: End-to-end Unposed Images to Semantic 3D link Zhiwen Fan, Jian Zhang,..., Yue Wang
11 2024-06-04 Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion
Models
link Dominik Hintersdorf, Lukas Struppek,..., Franziska Boenisch
11 2023-05-21 DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated
Text Detection
link Xiao Yu, Yuang Qi,..., Nenghai Yu
11 2024-06-13 SimGen: Simulator-conditioned Driving Scene Generation link Yunsong Zhou, Michael Simon,..., Bolei Zhou
11 2024-10-30 Provably Optimal Memory Capacity for Modern Hopfield Models:
Transformer-Compatible Dense Associative Memories as Spherical Codes
link Jerry Yao-Chieh Hu, Dennis Wu, Han Liu
11 2024-07-25 LION: Linear Group RNN for 3D Object Detection in
Point Clouds
link Zhe Liu, Jinghua Hou,..., Xiang Bai
11 2024-05-22 RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging Radar link Fangqiang Ding, Xiangyu Wen,..., Chris Xiaoxuan Lu
10 2024-06-04 Learning to grok: Emergence of in-context learning and skill
composition in modular arithmetic tasks
link Tianyu He, Darshil Doshi,..., Andrey Gromov
10 2024-11-07 Don't Look Twice: Faster Video Transformers with Run-Length Tokenization link Rohan Choudhury, Guanglei Zhu,..., Laszlo Attila Jeni
10 2024-05-28 Exploring Context Window of Large Language Models via Decomposed
Positional Vectors
link zican Dong, Junyi Li,..., Ji-Rong Wen
10 2024-07-20 Is Behavior Cloning All You Need? Understanding Horizon in
Imitation Learning
link Dylan J Foster, Adam Block, Dipendra Misra
10 2024-06-13 Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion
Models
link Ziyi Wu, Yulia Rubanova,..., Thomas Kipf
10 2024-04-05 Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence
Metrics with T2IScoreScore (TS2)
link Michael Saxon, Fatima Jahara,..., William Yang Wang
10 2024-06-17 Transcendence: Generative Models Can Outperform The Experts That Train
Them
link Edwin Zhang, Vincent Zhu,..., eran malach
10 2024-07-15 LLM Circuit Analyses Are Consistent Across Training and Scale link Curt Tigges, Michael Hanna,..., Stella Biderman
10 2024-06-22 Teach Better or Show Smarter? On Instructions and Exemplars
in Automatic Prompt Optimization
link Xingchen Wan, Ruoxi Sun,..., Sercan O Arik
10 2024-06-12 Grounding Multimodal Large Language Models in Actions link Andrew Szot, Bogdan Mazoure,..., Alexander T Toshev
10 2024-09-09 Unveiling Induction Heads: Provable Training Dynamics and Feature Learning
in Transformers
link Siyu Chen, Heejune Sheen,..., Zhuoran Yang
10 2024-05-29 LP-3DGS: Learning to Prune 3D Gaussian Splatting link Zhaoliang Zhang, Tianchen Song,..., Deliang Fan
10 2024-06-15 A Label is Worth A Thousand Images in Dataset
Distillation
link Tian Qin, Zhiwei Deng, David Alvarez-Melis
10 2024-02-05 Constrained Synthesis with Projected Diffusion Models link Jacob K Christopher, Stephen Baek, Ferdinando Fioretto
10 2023-11-19 Large Pre-trained time series models for cross-domain Time series
analysis tasks
link Harshavardhan Kamarthi, B. Aditya Prakash
10 2024-06-07 Variational Flow Matching for Graph Generation link Floor Eijkelboom, Grigory Bartosh,..., Jan-Willem van de Meent
10 2024-05-24 Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization link Xinyu Lyu, Beitao Chen,..., Jingkuan Song
10 2024-05-29 A Full-duplex Speech Dialogue Scheme Based On Large Language
Model
link Peng Wang, Songshuo Lu,..., Yuanjun Xiong
10 2024-03-31 From Similarity to Superiority: Channel Clustering for Time Series
Forecasting
link Jialin Chen, Jan Eric Lenssen,..., Rex Ying
10 2024-05-29 Nearest Neighbor Speculative Decoding for LLM Generation and Attribution link Minghan Li, Xilun Chen,..., Xi Victoria Lin
10 2024-06-11 Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors
in Inference Trees
link Sijia Chen, Yibo Wang,..., Lijun Zhang
10 2024-07-19 Towards a "Universal Translator" for Neural Dynamics at Single-Cell,
Single-Spike Resolution
link Yizi Zhang, Yanchen Wang,..., Cole Lincoln Hurwitz
10 2024-05-31 Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent
Modeling
link Jiatao Gu, Ying Shen,..., Joshua M. Susskind
10 2024-02-06 On Convergence of Adam for Stochastic Optimization under Relaxed
Assumptions
link Yusu Hong, Junhong Lin
10 2024-07-10 Neural Localizer Fields for Continuous 3D Human Pose and
Shape Estimation
link István Sárándi, Gerard Pons-Moll
10 2024-10-31 The Importance of Being Scalable: Improving the Speed and
Accuracy of Neural Network Interatomic Potentials Across Chemical Domains
link Eric Qu, Aditi S. Krishnapriyan
10 2024-02-25 No Free Lunch in LLM Watermarking: Trade-offs in Watermarking
Design Choices
link Qi Pang, Shengyuan Hu,..., Virginia Smith
10 2024-05-27 ARC: A Generalist Graph Anomaly Detector with In-Context Learning link Yixin Liu, Shiyuan Li,..., Shirui Pan
10 2024-05-24 VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks link Yang Li, Shaobo Han, Shihao Ji
10 2024-06-29 UDC: A Unified Neural Divide-and-Conquer Framework for Large-Scale Combinatorial
Optimization Problems
link Zhi Zheng, Changliang Zhou,..., Zhenkun Wang
10 2024-06-06 Understanding Information Storage and Transfer in Multi-Modal Large Language
Models
link Samyadeep Basu, Martin Grayson,..., Daniela Massiceti
10 2024-04-05 Dynamic Conditional Optimal Transport through Simulation-Free Flows link Gavin Kerrigan, Giosue Migliorini, Padhraic Smyth
10 2024-09-14 Symbolic Regression with a Learned Concept Library link Arya Grayeli, Atharva Sehgal,..., Swarat Chaudhuri
10 2024-06-12 The Impact of Initialization on LoRA Finetuning Dynamics link Soufiane Hayou, Nikhil Ghosh, Bin Yu
10 2024-05-25 MambaLLIE: Implicit Retinex-Aware Low Light Enhancement with Global-then-Local State
Space
link Jiangwei Weng, Zhiqiang Yan,..., Jun Li
10 2024-06-13 When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided
Search
link Xuan Chen, Yuzhou Nie,..., Xiangyu Zhang
10 2024-06-02 Evidence of Learned Look-Ahead in a Chess-Playing Neural Network link Erik Jenner, Shreyas Kapur,..., Stuart Russell
10 2024-06-04 CODE: Contrasting Self-generated Description to Combat Hallucination in Large
Multi-modal Models
link Junho Kim, Hyunjun Kim,..., Yong Man Ro
10 2024-04-18 Uncovering Safety Risks of Large Language Models through Concept
Activation Vector
link Zhihao Xu, Ruixuan HUANG,..., Xiting Wang
10 2024-02-06 Discovery of the Hidden World with Large Language Models link Chenxi Liu, Yongqiang Chen,..., Kun Zhang
10 2024-03-18 A Sober Look at the Robustness of CLIPs to
Spurious Features
link Qizhou Wang, Yong Lin,..., Tong Zhang
10 2024-09-18 DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control link Zichen Jeff Cui, Hengkai Pan,..., Lerrel Pinto
10 2024-06-04 DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume Rendering link Zhongpai Gao, Benjamin Planche,..., Ziyan Wu
10 2024-06-28 Segment Anything without Supervision link Xudong Wang, Jingfeng Yang, Trevor Darrell
10 2024-08-30 Can We Leave Deepfake Data Behind in Training Deepfake
Detector?
link Jikang Cheng, Zhiyuan Yan,..., Chen Li
10 2024-02-22 A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit
Tasks in Public Health
link Nikhil Behari, Edwin Zhang,..., Milind Tambe
10 2024-05-24 ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign
Users
link Guanlin Li, Kangjie Chen,..., Tianwei Zhang
10 2024-05-26 Categorical Flow Matching on Statistical Manifolds link Chaoran Cheng, Jiahan Li,..., Ge Liu
10 2024-03-21 Few-Shot Adversarial Prompt Learning on Vision-Language Models link Yiwei Zhou, Xiaobo Xia,..., Tongliang Liu
10 2024-05-18 Automated Multi-level Preference for MLLMs link Mengxi Zhang, Wenhao Wu,..., Yifan Sun
9 None GREATS: Online Selection of High-Quality Data for LLM Training
in Every Iteration
link Jiachen T. Wang, Tong Wu,..., Ruoxi Jia
9 2024-10-10 Doob's Lagrangian: A Sample-Efficient Variational Approach to Transition Path
Sampling
link Yuanqi Du, Michael Plainer,..., Kirill Neklyudov
9 2024-05-30 Physically Compatible 3D Object Modeling from a Single Image link Minghao Guo, Bohan Wang,..., Wojciech Matusik
9 2024-09-14 Schrodinger Bridge Flow for Unpaired Data Translation link Valentin De Bortoli, Iryna Korshunova,..., Arnaud Doucet