Last updated: 2025-04-16 04:14:11. Maintained by Weisen Jiang.

citation publish date title (pdf) review authors
673 2024-05-23 YOLOv10: Real-Time End-to-End Object Detection link Ao Wang, Hui Chen,..., Guiguang Ding
528 2024-01-18 VMamba: Visual State Space Model link Yue Liu, Yunjie Tian,..., Yunfan Liu
464 2023-05-24 Gorilla: Large Language Model Connected with Massive APIs link Shishir G Patil, Tianjun Zhang,..., Joseph E. Gonzalez
426 2023-11-06 CogVLM: Visual Expert for Pretrained Language Models link Weihan Wang, Qingsong Lv,..., Jie Tang
305 2024-05-23 SimPO: Simple Preference Optimization with a Reference-Free Reward link Yu Meng, Mengzhou Xia, Danqi Chen
253 2024-06-13 Depth Anything V2 link Lihe Yang, Bingyi Kang,..., Hengshuang Zhao
246 2024-06-24 Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs link Shengbang Tong, Ellis L Brown II,..., Saining Xie
216 2024-04-03 Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction link Keyu Tian, Yi Jiang,..., Liwei Wang
203 2024-03-29 Are We on the Right Way for Evaluating Large
Vision-Language Models?
link Lin Chen, Jinsong Li,..., Feng Zhao
185 2023-12-04 Tree of Attacks: Jailbreaking Black-Box LLMs Automatically link Anay Mehrotra, Manolis Zampetakis,..., Amin Karbasi
157 2023-11-28 LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and
200+ FPS
link Zhiwen Fan, Kevin Wang,..., Zhangyang Wang
152 2024-01-31 KVQuant: Towards 10 Million Context Length LLM Inference with
KV Cache Quantization
link Coleman Richard Charles Hooper, Sehoon Kim,..., Amir Gholami
152 2024-05-06 SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering link John Yang, Carlos E Jimenez,..., Ofir Press
151 2024-06-17 Autoregressive Image Generation without Vector Quantization link Tianhong Li, Yonglong Tian,..., Kaiming He
147 2024-05-03 What matters when building vision-language models? link Hugo Laurençon, Leo Tronchon,..., Victor Sanh
135 2024-05-07 xLSTM: Extended Long Short-Term Memory link Maximilian Beck, Korbinian Pöppel,..., Sepp Hochreiter
133 2024-05-16 CAT3D: Create Anything in 3D with Multi-View Diffusion Models link Ruiqi Gao, Aleksander Holynski,..., Ben Poole
133 2024-04-22 SnapKV: LLM Knows What You are Looking for Before
Generation
link Yuhong Li, Yingbing Huang,..., Deming Chen
131 2024-04-15 LLM Evaluators Recognize and Favor Their Own Generations link Arjun Panickssery, Samuel R. Bowman, Shi Feng
123 2024-06-17 Refusal in Language Models Is Mediated by a Single
Direction
link Andy Arditi, Oscar Balcells Obeso,..., Neel Nanda
117 2024-06-06 ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search link Dan Zhang, Sining Zhoubian,..., Jie Tang
107 2024-03-30 QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs link Saleh Ashkboos, Amirkeivan Mohtashami,..., James Hensman
105 2024-04-09 InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from
336 Pixels to 4K HD
link Xiaoyi Dong, Pan Zhang,..., Jiaqi Wang
97 2023-10-14 Large Language Model Unlearning link Yuanshun Yao, Xiaojun Xu, Yang Liu
96 2024-04-30 Iterative Reasoning Preference Optimization link Richard Yuanzhe Pang, Weizhe Yuan,..., Jason E Weston
96 None Many-shot Jailbreaking link Cem Anil, Esin DURMUS,..., David Duvenaud
92 2024-07-11 FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision link Jay Shah, Ganesh Bikshandi,..., Tri Dao
90 2023-12-12 SGLang: Efficient Execution of Structured Language Model Programs link Lianmin Zheng, Liangsheng Yin,..., Ying Sheng
90 2024-02-15 Chain-of-Thought Reasoning Without Prompting link Xuezhi Wang, Denny Zhou
86 2024-04-17 Many-Shot In-Context Learning link Rishabh Agarwal, Avi Singh,..., Hugo Larochelle
84 2024-02-16 PointMamba: A Simple State Space Model for Point Cloud
Analysis
link Dingkang Liang, Xin Zhou,..., Xiang Bai
81 2024-05-23 Improved Distribution Matching Distillation for Fast Image Synthesis link Tianwei Yin, Michaël Gharbi,..., William T. Freeman
77 2024-05-02 StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation link Yupeng Zhou, Daquan Zhou,..., Qibin Hou
77 2024-05-06 MAmmoTH2: Scaling Instructions from the Web link Xiang Yue, Tianyu Zheng,..., Wenhu Chen
76 2024-05-06 AlphaMath Almost Zero: Process Supervision without Process link Guoxin Chen, Minpeng Liao,..., Kai Fan
74 2024-04-16 VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time link Sicheng Xu, Guojun Chen,..., Baining Guo
74 2024-06-11 An Image is Worth 32 Tokens for Reconstruction and
Generation
link Qihang Yu, Mark Weber,..., Liang-Chieh Chen
68 2024-07-02 MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic
Sparse Attention
link Huiqiang Jiang, YUCHENG LI,..., Lili Qiu
68 2024-01-30 Robust Prompt Optimization for Defending Language Models Against Jailbreaking
Attacks
link Andy Zhou, Bo Li, Haohan Wang
64 2024-05-27 Vista: A Generalizable Driving World Model with High Fidelity
and Versatile Controllability
link Shenyuan Gao, Jiazhi Yang,..., Hongyang Li
62 2024-06-11 Simple and Effective Masked Diffusion Language Models link Subham Sekhar Sahoo, Marianne Arriola,..., Volodymyr Kuleshov
61 2024-04-03 PiSSA: Principal Singular Values and Singular Vectors Adaptation of
Large Language Models
link Fanxu Meng, Zhaohui Wang, Muhan Zhang
61 2024-06-06 Improving Alignment and Robustness with Circuit Breakers link Andy Zou, Long Phan,..., Dan Hendrycks
61 2024-07-01 Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion link Boyuan Chen, Diego Martí Monsó,..., Vincent Sitzmann
59 2024-02-12 G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question
Answering
link Xiaoxin He, Yijun Tian,..., Bryan Hooi
58 2024-02-26 Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts link Mikayel Samvelyan, Sharath Chandra Raparthy,..., Roberta Raileanu
57 2024-02-29 Humanoid Locomotion as Next Token Prediction link Ilija Radosavovic, Bike Zhang,..., Jitendra Malik
57 2024-04-18 Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing link Ye Tian, Baolin Peng,..., Dong Yu
56 2024-04-04 ReFT: Representation Finetuning for Language Models link Zhengxuan Wu, Aryaman Arora,..., Christopher Potts
56 2024-06-06 Simplified and Generalized Masked Diffusion for Discrete Data link Jiaxin Shi, Kehang Han,..., Michalis Titsias
56 2024-04-21 Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis link Yuxi Ren, Xin Xia,..., Xuefeng Xiao
55 2024-04-11 Applying Guidance in a Limited Interval Improves Sample and
Distribution Quality in Diffusion Models
link Tuomas Kynkäänniemi, Miika Aittala,..., Jaakko Lehtinen
54 2023-12-06 Scaling transformer neural networks for skillful and reliable medium-range
weather forecasting
link Tung Nguyen, Rohan Shah,..., Aditya Grover
50 2024-06-04 Guiding a Diffusion Model with a Bad Version of
Itself
link Tero Karras, Miika Aittala,..., Samuli Laine
50 2024-02-17 Watch Out for Your Agents! Investigating Backdoor Threats to
LLM-Based Agents
link Wenkai Yang, Xiaohan Bi,..., Xu Sun
50 2024-02-07 Can Large Language Model Agents Simulate Human Trust Behavior? link Chengxing Xie, Canyu Chen,..., Guohao Li
49 2024-04-24 PuLID: Pure and Lightning ID Customization via Contrastive Alignment link Zinan Guo, Yanze Wu,..., Qian HE
48 2024-05-08 You Only Cache Once: Decoder-Decoder Architectures for Language Models link Yutao Sun, Li Dong,..., Furu Wei
48 2024-07-22 Discrete Flow Matching link Itai Gat, Tal Remez,..., Yaron Lipman
47 2024-07-25 Recursive Introspection: Teaching Language Model Agents How to Self-Improve link Yuxiao Qu, Tianjun Zhang,..., Aviral Kumar
47 2024-03-14 Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision link Zhiqing Sun, Longhui Yu,..., Chuang Gan
47 2024-04-25 Make Your LLM Fully Utilize the Context link Shengnan An, Zexiong Ma,..., Weizhu Chen
46 2024-02-29 How do Large Language Models Handle Multilingualism? link Yiran Zhao, Wenxuan Zhang,..., Lidong Bing
46 2024-06-10 Parallelizing Linear Transformers with the Delta Rule over Sequence
Length
link Songlin Yang, Bailin Wang,..., Yoon Kim
45 2024-02-06 SELF-DISCOVER: Large Language Models Self-Compose Reasoning Structures link Pei Zhou, Jay Pujara,..., Steven Zheng
45 2023-06-02 Invisible Image Watermarks Are Provably Removable Using Generative AI link Xuandong Zhao, Kexun Zhang,..., Lei Li
45 2024-06-05 Scaling Laws for Reward Model Overoptimization in Direct Alignment
Algorithms
link Rafael Rafailov, Yaswanth Chittepu,..., Scott Niekum
45 2024-06-17 Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging link Zhenyi Lu, Chenghao Fan,..., Yu Cheng
44 2024-05-16 Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement
Learning
link Yuexiang Zhai, Hao Bai,..., Sergey Levine
44 2024-05-24 Defensive Unlearning with Adversarial Training for Robust Concept Erasure
in Diffusion Models
link Yimeng Zhang, Xin Chen,..., Sijia Liu
43 2023-12-18 Cascade Speculative Drafting for Even Faster LLM Inference link Ziyi Chen, Xiaocong Yang,..., Jie Huang
42 2024-05-24 The Road Less Scheduled link Aaron Defazio, Xingyu Alice Yang,..., Ashok Cutkosky
42 2024-03-23 Understanding Emergent Abilities of Language Models from the Loss
Perspective
link Zhengxiao Du, Aohan Zeng,..., Jie Tang
42 2024-06-12 One-Step Effective Diffusion Network for Real-World Image Super-Resolution link Rongyuan Wu, Lingchen Sun,..., Lei Zhang
41 2024-05-24 Efficient Adversarial Training in LLMs with Continuous Attacks link Sophie Xhonneux, Alessandro Sordoni,..., Leo Schwinn
41 2024-02-28 Keeping LLMs Aligned After Fine-tuning: The Crucial Role of
Prompt Templates
link Kaifeng Lyu, Haoyu Zhao,..., Sanjeev Arora
41 2024-07-02 RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs link Yue Yu, Wei Ping,..., Bryan Catanzaro
41 2023-12-19 Large Language Models Play StarCraft II:Benchmarks and A Chain
of Summarization Approach
link Weiyu Ma, Qirui Mi,..., Haifeng Zhang
41 2024-05-23 Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer link Shuang Wu, Youtian Lin,..., Yao Yao
41 2024-06-27 OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding link Tao Zhang, Xiangtai Li,..., Shuicheng YAN
41 2024-05-30 Unique3D: High-Quality and Efficient 3D Mesh Generation from a
Single Image
link Kailu Wu, Fangfu Liu,..., Kaisheng Ma
41 2024-03-27 Long-form factuality in large language models link Jerry Wei, Chengrun Yang,..., Quoc V Le
40 2024-04-15 3D Gaussian Splatting as Markov Chain Monte Carlo link Shakiba Kheradmand, Daniel Rebain,..., Kwang Moo Yi
40 2024-04-04 No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines
Multimodal Model Performance
link Vishaal Udandarao, Ameya Prabhu,..., Matthias Bethge
40 2024-06-12 VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model
for Hundreds of Vision-Language Tasks
link Jiannan Wu, Muyan Zhong,..., Jifeng Dai
39 2024-05-13 PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator link Hanshu Yan, Xingchao Liu,..., Jiashi Feng
39 2024-06-05 Lumina-Next : Making Lumina-T2X Stronger and Faster with Next-DiT link Le Zhuo, Ruoyi Du,..., Peng Gao
38 2024-06-22 Are Language Models Actually Useful for Time Series Forecasting? link Mingtian Tan, Mike A Merrill,..., Thomas Hartvigsen
38 2024-06-14 Regularizing Hidden States Enables Learning Generalizable Reward Model for
LLMs
link Rui Yang, Ruomeng Ding,..., Tong Zhang
38 2024-05-26 Provably Mitigating Overoptimization in RLHF: Your SFT Loss is
Implicitly an Adversarial Regularizer
link Zhihan Liu, Miao Lu,..., Zhaoran Wang
38 2024-05-26 Demystify Mamba in Vision: A Linear Attention Perspective link Dongchen Han, Ziyi Wang,..., Gao Huang
38 2024-06-03 SpatialRGPT: Grounded Spatial Reasoning in Vision-Language Models link An-Chieh Cheng, Hongxu Yin,..., Sifei Liu
37 2024-07-17 AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge
Bases
link Zhaorun Chen, Zhen Xiang,..., Bo Li
37 2024-06-26 WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer
Language Models
link Liwei Jiang, Kavel Rao,..., Nouha Dziri
37 2024-02-16 The Evolution of Statistical Induction Heads: In-Context Learning Markov
Chains
link Ezra Edelman, Nikolaos Tsilivis,..., Surbhi Goel
37 2024-02-02 ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution link Haoran Ye, Jiarui Wang,..., Guojie Song
37 2023-05-23 Decoupled Kullback-Leibler Divergence Loss link Jiequan Cui, Zhuotao Tian,..., Hanwang Zhang
37 2024-07-18 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies link Chaofan Tao, Qian Liu,..., Ngai Wong
37 2024-07-11 WildGaussians: 3D Gaussian Splatting In the Wild link Jonas Kulhanek, Songyou Peng,..., Torsten Sattler
36 2024-05-21 Reducing Transformer Key-Value Cache Size with Cross-Layer Attention link William Brandon, Mayank Mishra,..., Jonathan Ragan-Kelley
36 2024-06-20 RL on Incorrect Synthetic Data Scales the Efficiency of
LLM Math Reasoning by Eight-Fold
link Amrith Setlur, Saurabh Garg,..., Aviral Kumar
36 2024-02-14 Soft Prompt Threats: Attacking Safety Alignment and Unlearning in
Open-Source LLMs through the Embedding Space
link Leo Schwinn, David Dobre,..., Stephan Günnemann
36 2024-02-26 Why Transformers Need Adam: A Hessian Perspective link Yushun Zhang, Congliang Chen,..., Zhi-Quan Luo
36 2024-02-29 TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous
Variables
link Yuxuan Wang, Haixu Wu,..., Mingsheng Long
36 2023-12-13 Chat-Scene: Bridging 3D Scene and Large Language Models with
Object Identifiers
link Haifeng Huang, Yilun Chen,..., Zhou Zhao
35 2024-06-25 MotionBooth: Motion-Aware Customized Text-to-Video Generation link Jianzong Wu, Xiangtai Li,..., Kai Chen
35 2023-11-22 SegVol: Universal and Interactive Volumetric Medical Image Segmentation link Yuxin Du, Fan BAI,..., Bo Zhao
35 2024-06-13 Unpacking DPO and PPO: Disentangling Best Practices for Learning
from Preference Feedback
link Hamish Ivison, Yizhong Wang,..., Hannaneh Hajishirzi
35 2024-06-14 L4GM: Large 4D Gaussian Reconstruction Model link Jiawei Ren, Kevin Xie,..., Huan Ling
34 2024-02-17 OneBit: Towards Extremely Low-bit Large Language Models link Yuzhuang Xu, Xu Han,..., Wanxiang Che
34 2023-04-26 The Closeness of In-Context Learning and Weight Shifting for
Softmax Regression
link Shuai Li, Zhao Song,..., Tianyi Zhou
34 2024-01-18 ChatQA: Surpassing GPT-4 on Conversational QA and RAG link Zihan Liu, Wei Ping,..., Bryan Catanzaro
34 2024-05-25 Streaming Long Video Understanding with Large Language Models link Rui Qian, Xiaoyi Dong,..., Jiaqi Wang
34 2024-05-19 Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention link Peng Li, Yuan Liu,..., Yike Guo
33 2024-11-02 Rule Based Rewards for Language Model Safety link Tong Mu, Alec Helyar,..., Lilian Weng
33 2024-05-08 Chain of Thoughtlessness? An Analysis of CoT in Planning link Kaya Stechly, Karthik Valmeekam, Subbarao Kambhampati
33 2024-06-21 Is A Picture Worth A Thousand Words? Delving Into
Spatial Reasoning for Vision Language Models
link Jiayu Wang, Yifei Ming,..., Neel Joshi
33 2024-02-08 Noise Contrastive Alignment of Language Models with Explicit Rewards link Huayu Chen, Guande He,..., Jun Zhu
33 2024-04-23 Rethinking LLM Memorization through the Lens of Adversarial Compression link Avi Schwarzschild, Zhili Feng,..., J Zico Kolter
33 2024-04-19 MoVA: Adapting Mixture of Vision Experts to Multimodal Context link Zhuofan Zong, Bingqi Ma,..., Yu Liu
33 2024-05-26 Diffusion4D: Fast Spatial-temporal Consistent 4D generation via Video Diffusion
Models
link HANWEN LIANG, Yuyang Yin,..., Yunchao Wei
33 2024-02-24 Spec-Gaussian: Anisotropic View-Dependent Appearance for 3D Gaussian Splatting link Ziyi Yang, Xinyu Gao,..., Xiaogang Jin
32 2024-05-28 Aligning to Thousands of Preferences via System Message Generalization link Seongyun Lee, Sue Hyun Park,..., Minjoon Seo
32 2024-06-03 Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via
Multi-Agent Collaboration
link Junyang Wang, Haiyang Xu,..., Jitao Sang
32 2024-06-03 MixEval: Deriving Wisdom of the Crowd from LLM Benchmark
Mixtures
link Jinjie Ni, Fuzhao Xue,..., Yang You
32 2024-02-12 Model Collapse Demystified: The Case of Regression link Elvis Dohmatob, Yunzhen Feng, Julia Kempe
32 2024-10-08 Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating,
Segmenting, Editing
link Hao Fei, Shengqiong Wu,..., Shuicheng YAN
31 2024-05-28 Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations link Alexander Hägele, Elie Bakouch,..., Martin Jaggi
31 2024-02-16 Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE) link Usha Bhalla, Alex Oesterling,..., Himabindu Lakkaraju
31 2024-03-26 LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model
Fine-Tuning
link Rui Pan, Xiang Liu,..., Tong Zhang
31 2024-06-13 Visual Sketchpad: Sketching as a Visual Chain of Thought
for Multimodal Language Models
link Yushi Hu, Weijia Shi,..., Ranjay Krishna
31 2024-05-30 Enhancing Large Vision Language Models with Self-Training on Image
Comprehension
link Yihe Deng, Pan Lu,..., Wei Wang
30 2024-03-11 SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR
Object Detection
link Yuxuan Li, Xiang Li,..., Jian Yang
30 2024-05-17 Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning link Dan Braun, Jordan Taylor,..., Lee Sharkey
30 2024-03-26 MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution link Wei Tao, Yucheng Zhou,..., Yu Cheng
30 2023-11-29 Elo Uncovered: Robustness and Best Practices in Language Model
Evaluation
link Meriem Boubdir, Edward Kim,..., Marzieh Fadaee
30 2024-06-14 DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning link Hao Bai, Yifei Zhou,..., Aviral Kumar
30 2024-05-27 Safe LoRA: The Silver Lining of Reducing Safety Risks
when Finetuning Large Language Models
link Chia-Yi Hsu, Yu-Lin Tsai,..., Chun-Ying Huang
30 2024-06-13 OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation link Junke Wang, Yi Jiang,..., Yu-Gang Jiang
29 2023-06-13 Questioning the Survey Responses of Large Language Models link Ricardo Dominguez-Olmedo, Moritz Hardt, Celestine Mendler-Dünner
29 2024-04-30 HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning link Chunlin Tian, Zhan Shi,..., Cheng-zhong Xu
29 2024-05-20 Diffusion for World Modeling: Visual Details Matter in Atari link Eloi Alonso, Adam Jelley,..., François Fleuret
29 2024-07-19 Compact Language Models via Pruning and Knowledge Distillation link Saurav Muralidharan, Sharath Turuvekere Sreenivas,..., Pavlo Molchanov
29 2024-06-02 BoNBoN Alignment for Large Language Models and the Sweetness
of Best-of-n Sampling
link Lin Gui, Cristina Garbacea, Victor Veitch
29 2024-04-25 REBEL: Reinforcement Learning via Regressing Relative Rewards link Zhaolin Gao, Jonathan Daniel Chang,..., Wen Sun
29 2024-06-13 Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs link Xuan Zhang, Chao Du,..., Min Lin
29 2024-02-07 InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient
Context Memory
link Chaojun Xiao, Pengle Zhang,..., Maosong Sun
29 2024-05-23 MiniCache: KV Cache Compression in Depth Dimension for Large
Language Models
link Akide Liu, Jing Liu,..., Bohan Zhuang
28 2024-05-23 EMR-Merging: Tuning-Free High-Performance Model Merging link Chenyu Huang, Peng Ye,..., Wanli Ouyang
28 2024-05-27 Transformers Can Do Arithmetic with the Right Embeddings link Sean Michael McLeish, Arpit Bansal,..., Tom Goldstein
28 2023-05-27 MADiff: Offline Multi-agent Learning with Diffusion Models link Zhengbang Zhu, Minghuan Liu,..., Weinan Zhang
28 2024-05-23 Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model link Yuheng Shi, Minjing Dong, Chang Xu
28 2024-02-28 Approaching Human-Level Forecasting with Language Models link Danny Halawi, Fred Zhang,..., Jacob Steinhardt
28 2024-05-31 MeshXL: Neural Coordinate Field for Generative 3D Foundation Models link Sijin Chen, Xin Chen,..., Tao Chen
27 2023-12-12 Alignment for Honesty link Yuqing Yang, Ethan Chern,..., Pengfei Liu
27 2024-06-17 How Do Large Language Models Acquire Factual Knowledge During
Pretraining?
link Hoyeon Chang, Jinho Park,..., Minjoon Seo
27 2024-04-09 MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly
Detection
link Haoyang He, Yuhu Bai,..., Lei Xie
27 2023-10-26 Transformers Learn to Achieve Second-Order Convergence Rates for In-Context
Linear Regression
link Deqing Fu, Tian-qi Chen,..., Vatsal Sharan
26 2024-09-30 Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers link Lirui Wang, Xinlei Chen,..., Kaiming He
26 2024-06-14 Be like a Goldfish, Don't Memorize! Mitigating Memorization in
Generative LLMs
link Abhimanyu Hans, John Kirchenbauer,..., Tom Goldstein
26 2024-03-05 Found in the Middle: How Language Models Use Long
Contexts Better via Plug-and-Play Positional Encoding
link Zhenyu Zhang, Runjin Chen,..., Zhangyang Wang
26 2024-05-29 T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model
with Mixed Reward Feedback
link Jiachen Li, Weixi Feng,..., William Yang Wang
26 2024-07-02 Meta 3D AssetGen: Text-to-Mesh Generation with High-Quality Geometry, Texture,
and PBR Materials
link Yawar Siddiqui, Tom Monnier,..., David Novotny
26 2024-06-03 Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and
Their Defenses
link Xiaosen Zheng, Tianyu Pang,..., Min Lin
26 2024-05-09 CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts link Jiachen Li, Xinyao Wang,..., Longyin Wen
25 2024-02-04 Aligner: Efficient Alignment by Learning to Correct link Jiaming Ji, Boyuan Chen,..., Yaodong Yang
25 2024-08-19 Personalizing Reinforcement Learning from Human Feedback with Variational Preference
Learning
link Sriyash Poddar, Yanming Wan,..., Natasha Jaques
25 2024-06-17 Unveiling Encoder-Free Vision-Language Models link Haiwen Diao, Yufeng Cui,..., Xinlong Wang
25 2024-06-06 Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models link Ling Yang, Zhaochen Yu,..., Bin CUI
25 2024-02-18 Federated Fine-tuning of Large Language Models under Heterogeneous Tasks
and Client Resources
link Jiamu Bai, Daoyuan Chen,..., Yaliang Li
25 2024-05-28 Understanding Transformer Reasoning Capabilities via Graph Algorithms link Clayton Sanford, Bahare Fatemi,..., Vahab Mirrokni
25 2024-05-31 4Diffusion: Multi-view Video Diffusion Model for 4D Generation link Haiyu Zhang, Xinyuan Chen,..., Yu Qiao
25 2024-02-03 Panacea: Pareto Alignment via Preference Adaptation for LLMs link Yifan Zhong, Chengdong Ma,..., Yaodong Yang
25 2024-05-22 xRAG: Extreme Context Compression for Retrieval-augmented Generation with One
Token
link Xin Cheng, Xun Wang,..., Dongyan Zhao
25 2024-06-18 DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving link Yuxuan Tong, Xiwen Zhang,..., Junxian He
25 2024-04-12 Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context
Length
link Xuezhe Ma, Xiaomeng Yang,..., Chunting Zhou
25 2024-07-01 On Statistical Rates and Provably Efficient Criteria of
Latent Diffusion Transformers (DiTs)
link Jerry Yao-Chieh Hu, Weimin Wu,..., Han Liu
25 2023-05-15 PLIP: Language-Image Pre-training for Person Representation Learning link Jialong Zuo, Jiahao Hong,..., Jingdong Wang
25 2024-05-28 Why are Visually-Grounded Language Models Bad at Image Classification? link Yuhui Zhang, Alyssa Unell,..., Serena Yeung-Levy
24 2024-06-06 Evaluating the World Model Implicit in a Generative Model link Keyon Vafa, Justin Y. Chen,..., Sendhil Mullainathan
24 2024-07-31 Measuring Progress in Dictionary Learning for Language Model Interpretability
with Board Game Models
link Adam Karvonen, Benjamin Wright,..., Samuel Marks
24 2024-02-19 Query-Based Adversarial Prompt Generation link Jonathan Hayase, Ema Borevković,..., Milad Nasr
24 2024-03-28 InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction link Sirui Xu, Ziyin Wang,..., Liangyan Gui
24 2024-02-22 Large Language Models as Urban Residents: An LLM Agent
Framework for Personal Mobility Generation
link Jiawei Wang, Renhe Jiang,..., Chuan Xiao
24 2023-10-06 Why Do We Need Weight Decay in Modern Deep
Learning?
link Francesco D'Angelo, Maksym Andriushchenko,..., Nicolas Flammarion
24 2024-04-06 Aligning Diffusion Models by Optimizing Human Utility link Shufan Li, Konstantinos Kallidromitis,..., Kazuki Kozuka
24 2024-03-01 Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models
by Exploring Refusal Loss Landscapes
link Xiaomeng Hu, Pin-Yu Chen, Tsung-Yi Ho
24 2023-12-06 OctreeOcc: Efficient and Multi-Granularity Occupancy Prediction Using Octree Queries link Yuhang Lu, Xinge ZHU,..., Yuexin Ma
24 2024-08-27 The Mamba in the Llama: Distilling and Accelerating Hybrid
Models
link Junxiong Wang, Daniele Paliotta,..., Tri Dao
24 2024-05-03 DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos link Wen-Hsuan Chu, Lei Ke, Katerina Fragkiadaki
24 2024-06-04 OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding link Yanmin Wu, Jiarui Meng,..., Jian Zhang
24 2024-05-27 Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control link Zhengfei Kuang, Shengqu Cai,..., Gordon Wetzstein
24 2024-06-10 LLM Dataset Inference: Did you train on my dataset? link Pratyush Maini, Hengrui Jia,..., Adam Dziedzic
23 2024-02-29 Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent
on Language Models
link Frederik Kunstner, Alan Milligan,..., Alberto Bietti
23 2024-06-04 Chain of Agents: Large Language Models Collaborating on Long-Context
Tasks
link Yusen Zhang, Ruoxi Sun,..., Sercan O Arik
23 2024-05-23 JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data
Synthesis Models
link Kun Zhou, Beichen Zhang,..., Ji-Rong Wen
23 2024-02-09 Fight Back Against Jailbreaking via Prompt Adversarial Tuning link Yichuan Mo, Yuji Wang,..., Yisen Wang
23 2024-06-12 Large Language Model Unlearning via Embedding-Corrupted Prompts link Chris Yuhao Liu, Yaxuan Wang,..., Yang Liu
23 2024-02-29 Theoretical Foundations of Deep Selective State-Space Models link Nicola Muca Cirone, Antonio Orvieto,..., Terry Lyons
23 2024-06-11 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion
Models
link Heng Yu, Chaoyang Wang,..., Hsin-Ying Lee
23 2024-06-27 Decoding-Time Language Model Alignment with Multiple Objectives link Ruizhe Shi, Yifang Chen,..., Simon Shaolei Du
23 2024-07-05 On scalable oversight with weak LLMs judging strong LLMs link Zachary Kenton, Noah Yamamoto Siegel,..., Rohin Shah
23 2024-03-14 MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space
Models
link Zunnan Xu, Yukang Lin,..., Xiu Li
23 2024-05-23 ZipCache: Accurate and Efficient KV Cache Quantization with Salient
Token Identification
link Yefei He, Luoming Zhang,..., Bohan Zhuang
23 2024-05-23 Calibrated Self-Rewarding Vision Language Models link Yiyang Zhou, Zhiyuan Fan,..., Huaxiu Yao
23 2024-05-31 Amortizing intractable inference in diffusion models for vision, language,
and control
link Siddarth Venkatraman, Moksh Jain,..., Nikolay Malkin
23 2024-07-29 FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention link Yu Lu, Yuanzhi Liang,..., Yi Yang
22 2024-06-03 DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized
LLMs
link Haokun Lin, Haobo Xu,..., Ying Wei
22 2024-04-23 Aligning LLM Agents by Learning Latent Preference from User
Edits
link Ge Gao, Alexey Taymanov,..., Dipendra Misra
22 2024-06-06 Transformers need glasses! Information over-squashing in language tasks link Federico Barbero, Andrea Banino,..., Petar Veličković
22 2024-07-06 LoRA-GA: Low-Rank Adaptation with Gradient Approximation link Shaowen Wang, Linxi Yu, Jian Li
22 2024-05-07 KV Cache is 1 Bit Per Channel: Efficient Large
Language Model Inference with Coupled Quantization
link Tianyi Zhang, Jonah Wonkyu Yi,..., Anshumali Shrivastava
22 2024-05-27 EM Distillation for One-step Diffusion Models link Sirui Xie, Zhisheng Xiao,..., Ruiqi Gao
22 2024-01-11 A Closer Look at AUROC and AUPRC under Class
Imbalance
link Matthew B.A. McDermott, Haoran Zhang,..., Jack Gallifant
22 2024-04-16 Self-playing Adversarial Language Game Enhances LLM Reasoning link Pengyu Cheng, Tianhao Hu,..., Xiaolong Li
22 2024-06-11 BAKU: An Efficient Transformer for Multi-Task Policy Learning link Siddhant Haldar, Zhuoran Peng, Lerrel Pinto
21 2024-02-19 A Critical Evaluation of AI Feedback for Aligning Large
Language Models
link Archit Sharma, Sedrick Keh,..., Thomas Kollar
21 2024-06-06 Multistep Distillation of Diffusion Models via Moment Matching link Tim Salimans, Thomas Mensink,..., Emiel Hoogeboom
21 2024-05-30 Transfer Q-star : Principled Decoding for LLM Alignment link Souradip Chakraborty, Soumya Suvra Ghosal,..., Furong Huang
21 2024-02-19 WorldCoder, a Model-Based LLM Agent: Building World Models by
Writing Code and Interacting with the Environment
link Hao Tang, Darren Yan Key, Kevin Ellis
21 2022-08-22 Efficiency of the First-Price Auction in the Autobidding World link Yuan Deng, Jieming Mao,..., Song Zuo
21 2024-05-23 HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models link Bernal Jimenez Gutierrez, Yiheng Shu,..., Yu Su
21 2023-10-11 MatFormer: Nested Transformer for Elastic Inference link Fnu Devvrit, Sneha Kudugunta,..., Prateek Jain
21 2024-05-31 ContextGS : Compact 3D Gaussian Splatting with Anchor Level
Context Model
link Yufei Wang, Zhihao Li,..., Bihan Wen
20 2024-02-15 Self-Play Fine-tuning of Diffusion Models for Text-to-image Generation link Huizhuo Yuan, Zixiang Chen,..., Quanquan Gu
20 2024-06-10 Aligning Large Language Models with Representation Editing: A Control
Perspective
link Lingkai Kong, Haorui Wang,..., Chao Zhang
20 2024-06-03 Neural network learns low-dimensional polynomials with SGD near the
information-theoretic limit
link Jason D. Lee, Kazusato Oko,..., Denny Wu
20 2024-05-29 Poseidon: Efficient Foundation Models for PDEs link Maximilian Herde, Bogdan Raonic,..., Siddhartha Mishra
20 2024-05-19 FIFO-Diffusion: Generating Infinite Videos from Text without Training link Jihwan Kim, Junoh Kang,..., Bohyung Han
20 None IMAGPose: A Unified Conditional Framework for Pose-Guided Person Generation link Fei Shen, Jinhui Tang
19 2023-12-06 Return of Unconditional Generation: A Self-supervised Representation Generation Method link Tianhong Li, Dina Katabi, Kaiming He
19 2024-07-08 GenArtist: Multimodal LLM as an Agent for Unified Image
Generation and Editing
link Zhenyu Wang, Aoxue Li,..., Xihui Liu
19 2024-06-10 MATES: Model-Aware Data Selection for Efficient Pretraining with Data
Influence Models
link Zichun Yu, Spandan Das, Chenyan Xiong
19 2024-05-30 Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from
Semantic Similarities
link Alexander V Nikitin, Jannik Kossen,..., Pekka Marttinen
19 2024-05-27 Navigating the Safety Landscape: Measuring Risks in Finetuning Large
Language Models
link ShengYun Peng, Pin-Yu Chen,..., Duen Horng Chau
19 2024-05-29 Weak-to-Strong Search: Align Large Language Models via Searching over
Small Language Models
link Zhanhui Zhou, Zhixuan Liu,..., Yu Qiao
19 2024-05-30 CV-VAE: A Compatible Video VAE for Latent Generative Video
Models
link Sijie Zhao, Yong Zhang,..., Ying Shan
19 2024-05-27 PromptFix: You Prompt and We Fix the Photo link Yongsheng Yu, Ziyun Zeng,..., Jiebo Luo
19 2024-05-22 ReVideo: Remake a Video with Motion and Content Control link Chong Mou, Mingdeng Cao,..., Jian Zhang
18 2024-06-14 Large language model validity via enhanced conformal prediction methods link John Cherian, Isaac Gibbs, Emmanuel Candes
18 2023-08-04 Adaptive Proximal Gradient Method for Convex Optimization link Yura Malitsky, Konstantin Mishchenko
18 2024-01-29 Contracting with a Learning Agent link Guru Guruganesh, Yoav Kolumbus,..., S. Matthew Weinberg
18 2024-06-18 SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code Agents link Niels Mündler, Mark Niklas Mueller,..., Martin Vechev
18 2024-04-23 Gradient Guidance for Diffusion Models: An Optimization Perspective link Yingqing Guo, Hui Yuan,..., Mengdi Wang
18 None Are More LLM Calls All You Need? Towards the
Scaling Properties of Compound AI Systems
link Lingjiao Chen, Jared Quincy Davis,..., James Zou
18 2024-10-26 Fast Best-of-N Decoding via Speculative Rejection link Hanshi Sun, Momin Haider,..., Andrea Zanette
18 2024-08-19 Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models link Aviv Bick, Kevin Li,..., Albert Gu
18 2024-05-29 Preference Learning Algorithms Do Not Learn Preference Rankings link Angelica Chen, Sadhika Malladi,..., Kyunghyun Cho
18 2024-09-09 FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank
Adaptations
link Ziyao Wang, Zheyu Shen,..., Ang Li
18 2024-01-11 Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents link Quentin Delfosse, Sebastian Sztwiertnia,..., Kristian Kersting
18 2024-03-25 Provably Robust Score-Based Diffusion Posterior Sampling for Plug-and-Play Image
Reconstruction
link Xingyu Xu, Yuejie Chi
18 2024-06-12 DiTFastAttn: Attention Compression for Diffusion Transformer Models link Zhihang Yuan, Hanling Zhang,..., Yu Wang
18 2024-06-17 Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning link Zebang Cheng, Zhi-Qi Cheng,..., Alexander G Hauptmann
18 2024-02-15 BitDelta: Your Fine-Tune May Only Be Worth One Bit link James Liu, Guangxuan Xiao,..., Tianle Cai
18 2024-05-17 ProSST: Protein Language Modeling with Quantized Structure and Disentangled
Attention
link Mingchen Li, Yang Tan,..., Liang Hong
18 2024-04-04 CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching link Dongzhi Jiang, Guanglu Song,..., Hongsheng Li
18 2024-05-25 Theoretical Analysis of Weak-to-Strong Generalization link Hunter Lang, David Sontag, Aravindan Vijayaraghavan
18 2024-06-12 Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework
from Logit Difference
link Jiabao Ji, Yujian Liu,..., Shiyu Chang
18 2024-05-28 FreeSplat: Generalizable 3D Gaussian Splatting Towards Free View Synthesis
of Indoor Scenes
link Yunsong Wang, Tianxin Huang,..., Gim Hee Lee
18 2024-05-24 iVideoGPT: Interactive VideoGPTs are Scalable World Models link Jialong Wu, Shaofeng Yin,..., Mingsheng Long
17 2024-08-19 MeshFormer : High-Quality Mesh Generation with 3D-Guided Reconstruction Model link Minghua Liu, Chong Zeng,..., Hao Su
17 2024-04-19 Ensemble Learning for Heterogeneous Large Language Models with Deep
Parallel Collaboration
link Yichong Huang, Xiaocheng Feng,..., Bing Qin
17 2024-10-21 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D
Diffusion Priors
link Xi Liu, Chaoyi Zhou, Siyu Huang
17 2024-06-15 Voxel Mamba: Group-Free State Space Models for Point Cloud
based 3D Object Detection
link Guowen Zhang, Lue Fan,..., Lei Zhang
17 2024-07-09 Scaling Retrieval-Based Language Models with a Trillion-Token Datastore link Rulin Shao, Jacqueline He,..., Pang Wei Koh
17 2024-06-13 On Softmax Direct Preference Optimization for Recommendation link Yuxin Chen, Junfei Tan,..., Tat-Seng Chua
17 2024-03-25 Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization link Xiangxin Zhou, Dongyu Xue,..., Quanquan Gu
17 2024-05-28 Personalized Steering of Large Language Models: Versatile Steering Vectors
Through Bi-directional Preference Optimization
link Yuanpu Cao, Tianrong Zhang,..., Jinghui Chen
17 2024-05-24 Meteor: Mamba-based Traversal of Rationale for Large Language and
Vision Models
link Byung-Kwan Lee, Chae Won Kim,..., Yong Man Ro
17 2024-06-13 Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models
and Time-Dependent Layer Normalization
link Qihao Liu, Zhanpeng Zeng,..., Liang-Chieh Chen
17 2024-03-22 Can large language models explore in-context? link Akshay Krishnamurthy, Keegan Harris,..., Aleksandrs Slivkins
17 2024-03-12 Visual Decoding and Reconstruction via EEG Embeddings with Guided
Diffusion
link Dongyang Li, Chen Wei,..., Quanying Liu
17 2024-05-28 Knowledge Circuits in Pretrained Transformers link Yunzhi Yao, Ningyu Zhang,..., Huajun Chen
17 2024-07-23 Harmonizing Visual Text Comprehension and Generation link Zhen Zhao, Jingqun Tang,..., Yuan Xie
17 2024-09-29 One Token to Seg Them All: Language Instructed Reasoning
Segmentation in Videos
link Zechen Bai, Tong He,..., Mike Zheng Shou
17 2024-05-25 PTQ4DiT: Post-training Quantization for Diffusion Transformers link Junyi Wu, Haoxuan Wang,..., Yan Yan
17 2024-02-04 AutoTimes: Autoregressive Time Series Forecasters via Large Language Models link Yong Liu, Guo Qin,..., Mingsheng Long
17 2024-06-17 Exploring the Role of Large Language Models in Prompt
Encoding for Diffusion Models
link Bingqi Ma, Zhuofan Zong,..., Yu Liu
17 2024-06-21 GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian
Generation
link Chubin Zhang, Hongliang Song,..., Yansong Tang
16 2024-06-27 Resolving Discrepancies in Compute-Optimal Scaling of Language Models link Tomer Porian, Mitchell Wortsman,..., Yair Carmon
16 2024-09-26 HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection link Xuefeng Du, Chaowei Xiao, Yixuan Li
16 2024-07-05 Better by default: Strong pre-tuned MLPs and boosted trees
on tabular data
link David Holzmüller, Leo Grinsztajn, Ingo Steinwart
16 2024-03-09 Algorithmic progress in language models link Anson Ho, Tamay Besiroglu,..., Jaime Sevilla
16 2024-06-17 Transcoders find interpretable LLM feature circuits link Jacob Dunefsky, Philippe Chlenski, Neel Nanda
16 2024-05-27 GenWarp: Single Image to Novel Views with Semantic-Preserving Generative
Warping
link Junyoung Seo, Kazumi Fukuda,..., Yuki Mitsufuji
16 2024-02-09 CultureLLM: Incorporating Cultural Differences into Large Language Models link CHENG LI, Mengzhuo Chen,..., Xing Xie
16 2024-05-23 Base of RoPE Bounds Context Length link Mingyu Xu, Xin Men,..., weipeng chen
16 2024-04-22 SOFTS: Efficient Multivariate Time Series Forecasting with Series-Core Fusion link Lu Han, Xu-Yang Chen,..., De-Chuan Zhan
16 2023-12-20 UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of
Complex Scenes with Reflections
link Fangjinhua Wang, Marie-Julie Rakotosaona,..., Federico Tombari
16 2024-06-20 MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in
LLMs
link Zhongshen Zeng, Yinhong Liu,..., Jiaya Jia
16 2024-05-16 Conformal Alignment: Knowing When to Trust Foundation Models with
Guarantees
link Yu Gui, Ying Jin, Zhimei Ren
16 2024-03-07 Online Adaptation of Language Models with a Memory of
Amortized Contexts
link Jihoon Tack, Jaehyung Kim,..., Jonathan Richard Schwarz
16 2024-08-07 Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon
Tasks
link Zaijing Li, Yuquan Xie,..., Liqiang Nie
16 2024-05-23 Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion Models link Gen Li, Yuling Yan
16 2024-02-05 FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion link Xing Han, Huy Nguyen,..., Suchi Saria
16 2024-03-12 Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models link Yang Jiao, Shaoxiang Chen,..., Yu-Gang Jiang
16 2024-06-11 Zero-shot Image Editing with Reference Imitation link Xi Chen, Yutong Feng,..., Hengshuang Zhao
16 2024-06-18 HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors link Panwang Pan, Zhuo Su,..., Yebin Liu
16 2024-07-02 UltraPixel: Advancing Ultra High-Resolution Image Synthesis to New Peaks link Jingjing Ren, Wenbo Li,..., Lei Zhu
16 2024-05-30 Group Robust Preference Optimization in Reward-free RLHF link Shyam Sundhar Ramesh, Yifan Hu,..., Ilija Bogunovic
16 2024-05-22 DOGS: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction
Via Gaussian Consensus
link Yu Chen, Gim Hee Lee
16 2024-06-14 Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections link Jiacong Xu, Yiqun Mei, Vishal M. Patel
16 2024-06-13 COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video
Editing
link Jiangshan Wang, Yue Ma,..., Xiu Li
16 2024-03-19 Pretraining Codomain Attention Neural Operators for Solving Multiphysics PDEs link Md Ashiqur Rahman, Robert Joseph George,..., Anima Anandkumar
15 2023-10-19 AutoMix: Automatically Mixing Language Models link Pranjal Aggarwal, Aman Madaan,..., Mausam .
15 2024-06-13 Understanding Hallucinations in Diffusion Models through Mode Interpolation link Sumukh K Aithal, Pratyush Maini,..., J Zico Kolter
15 2024-02-02 AMOR: A Recipe for Building Adaptable Modular Knowledge Agents
Through Process Feedback
link Jian Guan, Wei Wu,..., Minlie Huang
15 2024-06-10 AutoSurvey: Large Language Models Can Automatically Write Surveys link Yidong Wang, Qi Guo,..., Yue Zhang
15 2024-09-01 ContextCite: Attributing Model Generation to Context link Benjamin Cohen-Wang, Harshay Shah,..., Aleksander Madry
15 2024-05-27 BehaviorGPT: Smart Agent Simulation for Autonomous Driving with Next-Patch
Prediction
link Zikang Zhou, Haibo HU,..., Chun Jason Xue
15 2024-05-23 PaGoDA: Progressive Growing of a One-Step Generator from a
Low-Resolution Diffusion Teacher
link Dongjun Kim, Chieh-Hsin Lai,..., Stefano Ermon
15 2024-05-24 Quantifying the Gain in Weak-to-Strong Generalization link Moses Charikar, Chirag Pabbaraju, Kirankumar Shiragur
15 2024-05-30 Improving the Training of Rectified Flows link Sangyun Lee, Zinan Lin, Giulia Fanti
15 2024-03-28 Dual-Personalizing Adapter for Federated Foundation Models link yiyuan yang, Guodong Long,..., Michael Blumenstein
15 2024-01-18 Cross-Modality Perturbation Synergy Attack for Person Re-identification link Yunpeng Gong, Zhun Zhong,..., Min Jiang
15 2024-06-20 Prism: A Framework for Decoupling and Assessing the Capabilities
of VLMs
link Yuxuan Qiao, Haodong Duan,..., Kai Chen
15 2024-09-05 Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene
Understanding
link Yunze Man, Shuhong Zheng,..., Yu-Xiong Wang
15 2024-10-31 SelfCodeAlign: Self-Alignment for Code Generation link Yuxiang Wei, Federico Cassano,..., LINGMING ZHANG
15 2024-07-16 Animate3D: Animating Any 3D Model with Multi-view Video Diffusion link Yanqin Jiang, Chaohui Yu,..., Jin Gao
15 2024-01-08 Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced
Zero/Few-Shot Forecasting of Multivariate Time Series
link Vijay Ekambaram, Arindam Jati,..., Jayant Kalagnanam
15 2024-06-06 BitsFusion: 1.99 bits Weight Quantization of Diffusion Model link Yang Sui, Yanyu Li,..., Jian Ren
15 2024-05-15 Spectral Editing of Activations for Large Language Model Alignment link Yifu QIU, Zheng Zhao,..., Shay B Cohen
15 2024-05-23 Video Diffusion Models are Training-free Motion Interpreter and Controller link Zeqi Xiao, Yifan Zhou,..., Xingang Pan
15 2024-05-23 Instruction Tuning With Loss Over Instructions link Zhengyan Shi, Adam X. Yang,..., Aldo Lipani
15 2024-06-14 UniAudio 1.5: Large Language Model-Driven Audio Codec is A
Few-Shot Audio Task Learner
link Dongchao Yang, Haohan Guo,..., Helen M. Meng
15 2024-05-23 Representation Noising: A Defence Mechanism Against Harmful Finetuning link Domenic Rosati, Jan Wehner,..., Frank Rudzicz
15 2024-10-25 DiffGS: Functional Gaussian Splatting Diffusion link Junsheng Zhou, Weiqi Zhang, Yu-Shen Liu
15 2024-05-04 U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers link Yuchuan Tian, Zhijun Tu,..., Yunhe Wang
15 2024-05-23 WISE: Rethinking the Knowledge Memory for Lifelong Model Editing
of Large Language Models
link Peng Wang, Zexi Li,..., Huajun Chen
14 2024-05-23 PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression link Vladimir Malinovskii, Denis Mazur,..., Peter Richtárik
14 2024-05-25 Bigger, Regularized, Optimistic: scaling for compute and sample efficient
continuous control
link Michal Nauman, Mateusz Ostaszewski,..., Marek Cygan
14 2024-06-10 Get rich quick: exact solutions reveal how unbalanced initializations
promote rapid feature learning
link Daniel Kunin, Allan Raventos,..., Surya Ganguli
14 2024-05-29 Adaptive Image Quality Assessment via Teaching Large Multimodal Model
to Compare
link Hanwei Zhu, Haoning Wu,..., Shiqi Wang
14 2024-06-05 Dynamic 3D Gaussian Fields for Urban Areas link Tobias Fischer, Jonas Kulhanek,..., Peter Kontschieder
14 2024-06-03 What makes unlearning hard and what to do about
it
link Kairan Zhao, Meghdad Kurmanji,..., Peter Triantafillou
14 2023-10-10 A General Protocol to Probe Large Vision Models for
3D Physical Understanding
link Guanqi Zhan, Chuanxia Zheng,..., Andrew Zisserman
14 2024-05-20 Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem
Solving
link Aniket Rajiv Didolkar, Anirudh Goyal,..., Sanjeev Arora
14 2024-06-06 VideoTetris: Towards Compositional Text-to-Video Generation link Ye Tian, Ling Yang,..., Bin CUI
14 2024-04-22 Protecting Your LLMs with Information Bottleneck link Zichuan Liu, Zefan Wang,..., Jiang Bian
14 2024-05-29 Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors link Zihui Wu, Yu Sun,..., Katherine Bouman
14 2024-05-23 Fisher Flow Matching for Generative Modeling over Discrete Data link Oscar Davis, Samuel Kessler,..., Joey Bose
14 2024-06-12 A Concept-Based Explainability Framework for Large Multimodal Models link Jayneel Parekh, Pegah KHAYATAN,..., Matthieu Cord
14 2024-06-06 ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization link Luca Eyring, Shyamgopal Karthik,..., Zeynep Akata
14 2024-02-05 Estimating Epistemic and Aleatoric Uncertainty with a Single Model link Matthew Albert Chan, Maria J. Molina, Christopher Metzler
14 2024-04-05 Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models link Sangwon Jang, Jaehyeong Jo,..., Sung Ju Hwang
14 2024-07-08 Multi-Object Hallucination in Vision Language Models link Xuweiyi Chen, Ziqiao Ma,..., Joyce Chai
14 2024-02-07 Improved off-policy training of diffusion samplers link Marcin Sendera, Minsu Kim,..., Nikolay Malkin
14 2024-10-21 Mitigating Object Hallucination via Concentric Causal Attention link Yun Xing, Yiheng Li,..., Shijian Lu
14 2024-10-18 Neural Signed Distance Function Inference through Splatting 3D Gaussians
Pulled on Zero-Level Set
link Wenyuan Zhang, Yu-Shen Liu, Zhizhong Han
14 2024-02-29 UniTS: A Unified Multi-Task Time Series Model link Shanghua Gao, Teddy Koker,..., Marinka Zitnik
14 2024-11-04 DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for
Efficient Robot Execution
link Yang Yue, Yulin Wang,..., Gao Huang
14 2024-06-13 LRM-Zero: Training Large Reconstruction Models with Synthesized Data link Desai Xie, Sai Bi,..., Hao Tan
13 2024-02-29 RL-GPT: Integrating Reinforcement Learning and Code-as-policy link Shaoteng Liu, Haoqi Yuan,..., Jiaya Jia
13 2024-06-24 Finding Transformer Circuits With Edge Pruning link Adithya Bhaskar, Alexander Wettig,..., Danqi Chen
13 2024-05-23 Axioms for AI Alignment from Human Feedback link Luise Ge, Daniel Halpern,..., Junlin Wu
13 2024-06-13 4M-21: An Any-to-Any Vision Model for Tens of Tasks
and Modalities
link Roman Bachmann, Oğuzhan Fatih Kar,..., Amir Zamir
13 2023-10-21 Masked Hard-Attention Transformers Recognize Exactly the Star-Free Languages link Andy Yang, David Chiang, Dana Angluin
13 2024-05-29 Grasp as You Say: Language-guided Dexterous Grasp Generation link Yi-Lin Wei, Jian-Jian Jiang,..., Wei-Shi Zheng
13 2024-01-24 Beyond Concept Bottleneck Models: How to Make Black Boxes
Intervenable?
link Sonia Laguna, Ričards Marcinkevičs,..., Julia E Vogt
13 2024-02-07 Amortized Planning with Large-Scale Transformers: A Case Study on
Chess
link Anian Ruoss, Gregoire Deletang,..., Tim Genewein
13 2024-02-05 Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models link Yuancheng Xu, Jiarui Yao,..., Furong Huang
13 2024-06-03 SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning link Yangruibo Ding, Jinjun Peng,..., Baishakhi Ray
13 2024-06-12 Scaling Laws in Linear Regression: Compute, Parameters, and Data link Licong Lin, Jingfeng Wu,..., Jason D. Lee
13 2024-06-25 DiffusionPDE: Generative PDE-Solving under Partial Observation link Jiahe Huang, Guandao Yang,..., Jeong Joon Park
13 2024-06-12 Discovering Preference Optimization Algorithms with and for Large Language
Models
link Chris Lu, Samuel Holt,..., Robert Tjarko Lange
13 2024-10-22 One-Step Diffusion Distillation through Score Implicit Matching link Weijian Luo, Zemin Huang,..., Guo-Jun Qi
13 2024-01-27 DiffuserLite: Towards Real-time Diffusion Planning link Zibin Dong, Jianye HAO,..., YAN ZHENG
13 2024-05-24 Medformer: A Multi-Granularity Patching Transformer for Medical Time-Series Classification link Yihe Wang, Nan Huang,..., Xiang Zhang
13 2024-06-17 AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive
Reasoning
link Shirley Wu, Shiyu Zhao,..., James Zou
13 2024-08-28 Efficient LLM Scheduling by Learning to Rank link Yichao Fu, Siqi Zhu,..., Hao Zhang
13 2024-05-23 Lorentz-Equivariant Geometric Algebra Transformers for High-Energy Physics link Jonas Spinner, Victor Breso Pla,..., Johann Brehmer
13 2024-04-04 Mind's Eye of LLMs: Visualization-of-Thought Elicits Spatial Reasoning in
Large Language Models
link Wenshan Wu, Shaoguang Mao,..., Furu Wei
13 2024-02-28 Implicit Optimization Bias of Next-token Prediction in Linear Models link Christos Thrampoulidis
13 2024-05-27 Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with
Dynamic Gaussian Surfels
link Yikai Wang, Xinzhou Wang,..., Jun Zhu
13 2024-05-22 Dense Connector for MLLMs link Huanjin Yao, Wenhao Wu,..., Jingdong Wang
13 2024-11-07 MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views link Yuedong Chen, Chuanxia Zheng,..., Jianfei Cai
13 2024-05-02 FLAME : Factuality-Aware Alignment for Large Language Models link Sheng-Chieh Lin, Luyu Gao,..., Xilun Chen
13 2024-06-12 Vivid-ZOO: Multi-View Video Generation with Diffusion Model link Bing Li, Cheng Zheng,..., Bernard Ghanem
13 2024-03-03 GuardT2I: Defending Text-to-Image Models from Adversarial Prompts link Yijun Yang, Ruiyuan Gao,..., Qiang Xu
12 None Not All Tokens Are What You Need for Pretraining link Zhenghao Lin, Zhibin Gou,..., Weizhu Chen
12 2024-04-22 MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making link Yubin Kim, Chanwoo Park,..., Hae Won Park
12 2024-03-25 QKFormer: Hierarchical Spiking Transformer using Q-K Attention link Chenlin Zhou, Han Zhang,..., Yonghong Tian
12 2024-05-23 4+3 Phases of Compute-Optimal Neural Scaling Laws link Elliot Paquette, Courtney Paquette,..., Jeffrey Pennington
12 2024-05-22 Context and Geometry Aware Voxel Transformer for Semantic Scene
Completion
link Zhu Yu, Runmin Zhang,..., Hui-liang Shen
12 2024-02-21 Linear Transformers are Versatile In-Context Learners link Max Vladymyrov, Johannes Von Oswald,..., Rong Ge
12 2024-01-22 Self-Labeling the Job Shop Scheduling Problem link Andrea Corsini, Angelo Porrello,..., Mauro Dell'Amico
12 2024-04-08 SpeechAlign: Aligning Speech Generation to Human Preferences link Dong Zhang, Zhaowei Li,..., Xipeng Qiu
12 2024-06-05 HYDRA: Model Factorization Framework for Black-Box LLM Personalization link Yuchen Zhuang, Haotian Sun,..., Bo Dai
12 2024-10-10 Generalizable and Animatable Gaussian Head Avatar link Xuangeng Chu, Tatsuya Harada
12 2024-02-22 In-Context Learning of a Linear Transformer Block: Benefits of
the MLP Component and One-Step GD Initialization
link Ruiqi Zhang, Jingfeng Wu, Peter Bartlett
12 2024-02-26 SelectIT: Selective Instruction Tuning for LLMs via Uncertainty-Aware Self-Reflection link Liangxin Liu, Xuebo Liu,..., Min Zhang
12 2024-06-27 Length Optimization in Conformal Prediction link Shayan Kiyani, George J. Pappas, Hamed Hassani
12 2024-05-30 Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning
of Diffusion Models
link Masatoshi Uehara, Yulai Zhao,..., Tommaso Biancalani
12 2024-05-26 Code Repair with LLMs gives an Exploration-Exploitation Tradeoff link Hao Tang, Keya Hu,..., Kevin Ellis
12 2024-05-28 A Theoretical Understanding of Self-Correction through In-context Alignment link Yifei Wang, Yuyang Wu,..., Yisen Wang
12 2024-05-25 Breaking the False Sense of Security in Backdoor Defense
through Re-Activation Attack
link Mingli Zhu, Siyuan Liang, Baoyuan Wu
12 2024-05-29 Stress-Testing Capability Elicitation With Password-Locked Models link Ryan Greenblatt, Fabien Roger,..., David Krueger
12 2024-06-03 The Importance of Online Data: Understanding Preference Fine-tuning via
Coverage
link Yuda Song, Gokul Swamy,..., Wen Sun
12 2024-09-26 From News to Forecast: Integrating Event Analysis in LLM-Based
Time Series Forecasting with Reflection
link Xinlei Wang, Maike Feng,..., Junhua Zhao
12 2024-07-22 QueST: Self-Supervised Skill Abstractions for Learning Continuous Control link Atharva Mete, Haotian Xue,..., Animesh Garg
12 2024-05-24 Score Distillation via Reparametrized DDIM link Artem Lukoianov, Haitz Sáez de Ocáriz Borde,..., Justin Solomon
12 2024-06-17 Large Scale Transfer Learning for Tabular Data via
Language Modeling
link Joshua P Gardner, Juan Carlos Perdomo, Ludwig Schmidt
12 2024-06-21 Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning link Brandon Huang, Chancharik Mitra,..., Roei Herzig
12 2024-10-24 Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse
View Synthesis
link Liang Han, Junsheng Zhou,..., Zhizhong Han
12 2024-06-03 TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy link Weichao Zhao, Hao Feng,..., Can Huang
12 2024-06-10 MVGamba: Unify 3D Content Generation as State Space Sequence
Modeling
link Xuanyu Yi, Zike Wu,..., Hanwang Zhang
12 2024-07-01 Evaluation of Text-to-Video Generation Models: A Dynamics Perspective link Mingxiang Liao, Hannan Lu,..., Xinyu Zhang
12 2024-06-13 Yo'LLaVA: Your Personalized Language and Vision Assistant link Thao Nguyen, Haotian Liu,..., Yong Jae Lee
12 2024-05-24 GS-Hider: Hiding Messages into 3D Gaussian Splatting link Xuanyu Zhang, Jiarui Meng,..., Jian Zhang
12 2024-02-04 Diffusion Models are Certifiably Robust Classifiers link Huanran Chen, Yinpeng Dong,..., Jun Zhu
11 2024-10-08 Unlocking the Capabilities of Thought: A Reasoning Boundary Framework
to Quantify and Optimize Chain-of-Thought
link Qiguang Chen, Libo Qin,..., Wanxiang Che
11 2024-06-12 Self-Consuming Generative Models with Curated Data Provably Optimize Human
Preferences
link Damien Ferbach, Quentin Bertrand,..., Gauthier Gidel
11 2024-06-09 Training Compute-Optimal Protein Language Models link Xingyi Cheng, Bo Chen,..., Le Song
11 2024-06-20 CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics link Jiawei Gao, Ziqin Wang,..., Jiangmiao Pang
11 2024-02-09 Learn To be Efficient: Build Structured Sparsity in Large
Language Models
link Haizhong Zheng, Xiaoyan Bai,..., Atul Prakash
11 2023-10-29 Optimal Algorithms for Online Convex Optimization with Adversarial Constraints link Abhishek Sinha, Rahul Vaze
11 2024-02-06 A Phase Transition between Positional and Semantic Learning in
a Solvable Model of Dot-Product Attention
link Hugo Cui, Freya Behrens,..., Lenka Zdeborova
11 2024-06-23 Trace is the Next AutoDiff: Generative Optimization with Rich
Feedback, Execution Traces, and LLMs
link Ching-An Cheng, Allen Nie, Adith Swaminathan
11 2024-05-28 Linguistic Collapse: Neural Collapse in (Large) Language Models link Robert Wu, Vardan Papyan
11 2024-09-26 Generative Modeling of Molecular Dynamics Trajectories link Bowen Jing, Hannes Stark,..., Bonnie Berger
11 2024-10-10 Global Lyapunov functions: a long-standing open problem in mathematics,
with symbolic transformers
link Alberto Alfarano, Francois Charton, Amaury Hayat
11 2024-06-01 RGFN: Synthesizable Molecular Generation Using GFlowNets link Michał Koziarski, Andrei Rekesh,..., Robert A. Batey
11 2024-06-09 Distributional Preference Alignment of LLMs via Optimal Transport link Igor Melnyk, Youssef Mroueh,..., Jarret Ross
11 2024-02-06 Scaling laws for learning with real and surrogate data link Ayush Jain, Andrea Montanari, Eren Sasoglu
11 2023-12-13 SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention link Róbert Csordás, Piotr Piękos,..., Jürgen Schmidhuber
11 2024-05-07 Towards a Theoretical Understanding of the 'Reversal Curse' via
Training Dynamics
link Hanlin Zhu, Baihe Huang,..., Stuart Russell
11 2024-11-04 Can Language Models Learn to Skip Steps? link Tengxiao Liu, Qipeng Guo,..., Zheng Zhang
11 2023-11-03 Towards Calibrated Robust Fine-Tuning of Vision-Language Models link Changdae Oh, Hyesu Lim,..., Kyungwoo Song
11 2024-09-11 Gated Slot Attention for Efficient Linear-Time Sequence Modeling link Yu Zhang, Songlin Yang,..., Guohong Fu
11 2024-05-28 Getting More Juice Out of the SFT Data: Reward
Learning from Human Demonstration Improves SFT for LLM Alignment
link Jiaxiang Li, Siliang Zeng,..., Mingyi Hong
11 2024-06-04 Loki: Low-rank Keys for Efficient Sparse Attention link Prajwal Singhania, Siddharth Singh,..., Abhinav Bhatele
11 2023-05-22 Imprecise Label Learning: A Unified Framework for Learning with
Various Imprecise Label Configurations
link Hao Chen, Ankit Shah,..., Bhiksha Raj
11 2024-03-02 Accelerating Greedy Coordinate Gradient and General Prompt Optimization via
Probe Sampling
link Yiran Zhao, Wenyue Zheng,..., Michael Shieh
11 2024-12-19 Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment link Teng Xiao, Yige Yuan,..., Vasant G Honavar
11 2024-05-23 Nearly Tight Black-Box Auditing of Differentially Private Machine Learning link Meenatchi Sundaram Muthu Selva Annamalai, Emiliano De Cristofaro
11 2024-06-03 LoFiT: Localized Fine-tuning on LLM Representations link Fangcong Yin, Xi Ye, Greg Durrett
11 2024-03-12 SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language
Models by Summarizing Training Trajectories of Small Models
link Yu Yang, Siddhartha Mishra,..., Baharan Mirzasoleiman
11 2024-02-07 QGFN: Controllable Greediness with Action Values link Elaine Lau, Stephen Zhewen Lu,..., Emmanuel Bengio
11 2024-05-21 Single Image Unlearning: Efficient Machine Unlearning in Multimodal Large
Language Models
link Jiaqi Li, Qianshan Wei,..., Fan Liu
11 2024-06-24 Confidence Regulation Neurons in Language Models link Alessandro Stolfo, Ben Peng Wu,..., Neel Nanda
11 2024-02-21 Average gradient outer product as a mechanism for deep
neural collapse
link Daniel Beaglehole, Peter Súkeník,..., Mikhail Belkin
11 2024-05-23 Metric Flow Matching for Smooth Interpolations on the Data
Manifold
link Kacper Kapusniak, Peter Potaptchik,..., Francesco Di Giovanni
11 2024-07-17 Direct Unlearning Optimization for Robust and Safe Text-to-Image Models link Yong-Hyun Park, Sangdoo Yun,..., Gayoung Lee
11 2024-03-25 MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models link Kailai Yang, Zhiwei Liu,..., Sophia Ananiadou
11 2024-03-18 Dynamic Tuning Towards Parameter and Inference Efficiency for ViT
Adaptation
link Wangbo Zhao, Jiasheng Tang,..., Yang You
11 2024-06-11 Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without
Guidance
link Kuan Heng Lin, Sicheng Mo,..., Bolei Zhou
11 2023-05-21 DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated
Text Detection
link Xiao Yu, Yuang Qi,..., Nenghai Yu
11 2024-05-23 Agent Planning with World Knowledge Model link Shuofei Qiao, Runnan Fang,..., Huajun Chen
11 2024-02-07 The Fine-Grained Complexity of Gradient Computation for Training Large
Language Models
link Josh Alman, Zhao Song
11 2024-06-11 Neural Gaffer: Relighting Any Object via Diffusion link Haian Jin, Yuan Li,..., Noah Snavely
11 2024-04-23 Multi-Head Mixture-of-Experts link Xun Wu, Shaohan Huang,..., Furu Wei
11 2024-06-09 VCR-GauS: View Consistent Depth-Normal Regularizer for Gaussian Surface Reconstruction link Hanlin Chen, Fangyin Wei,..., Gim Hee Lee
11 2024-07-25 LION: Linear Group RNN for 3D Object Detection in
Point Clouds
link Zhe Liu, Jinghua Hou,..., Xiang Bai
11 2024-06-12 Large Language Models Must Be Taught to Know What
They Don’t Know
link Sanyam Kapoor, Nate Gruver,..., Andrew Gordon Wilson
11 2024-02-17 TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks link Benjamin Feuer, Robin Tibor Schirrmeister,..., Colin White
10 2024-02-18 In-Context Learning with Transformers: Softmax Attention Adapts to Function
Lipschitzness
link Liam Collins, Advait U Parulekar,..., Sanjay Shakkottai
10 2024-06-17 Transcendence: Generative Models Can Outperform The Experts That Train
Them
link Edwin Zhang, Vincent Zhu,..., eran malach
10 2024-02-16 Conformalized Credal Set Predictors link Alireza Javanmardi, David Stutz, Eyke Hüllermeier
10 2024-10-30 FlowLLM: Flow Matching for Material Generation with Large Language
Models as Base Distributions
link Anuroop Sriram, Benjamin Kurt Miller,..., Brandon M Wood
10 2024-06-27 OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents link Zihao Wang, Shaofei Cai,..., Yitao Liang
10 2024-02-07 Universal Neural Functionals link Allan Zhou, Chelsea Finn, James Harrison
10 2024-06-03 MediQ: Question-Asking LLMs and a Benchmark for Reliable Interactive
Clinical Reasoning
link Shuyue Stella Li, Vidhisha Balachandran,..., Yulia Tsvetkov
10 2024-06-15 A Label is Worth A Thousand Images in Dataset
Distillation
link Tian Qin, Zhiwei Deng, David Alvarez-Melis
10 2024-03-06 WaterMax: breaking the LLM watermark detectability-robustness-quality trade-off link Eva Giboulot, Teddy Furon
10 2024-06-10 How Far Can Transformers Reason? The Globality Barrier and
Inductive Scratchpad
link Emmanuel Abbe, Samy Bengio,..., Omid Saremi
10 2024-05-24 Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization link Xinyu Lyu, Beitao Chen,..., Jingkuan Song
10 2024-06-13 Talking Heads: Understanding Inter-Layer Communication in Transformer Language Models link Jack Merullo, Carsten Eickhoff, Ellie Pavlick
10 2024-05-29 Nearest Neighbor Speculative Decoding for LLM Generation and Attribution link Minghan Li, Xilun Chen,..., Xi Victoria Lin
10 2024-09-30 Magnet: We Never Know How Text-to-Image Diffusion Models Work,
Until We Learn How Vision-Language Models Function
link Chenyi Zhuang, Ying Hu, Pan Gao
10 2024-05-31 Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent
Modeling
link Jiatao Gu, Ying Shen,..., Joshua M. Susskind
10 2024-02-21 Full-Atom Peptide Design with Geometric Latent Diffusion link Xiangzhe Kong, Yinjun Jia,..., Yang Liu
10 2024-06-13 Rethinking Score Distillation as a Bridge Between Image Distributions link David McAllister, Songwei Ge,..., Angjoo Kanazawa
10 2024-05-24 VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks link Yang Li, Shaobo Han, Shihao Ji
10 2024-06-04 SpecExec: Massively Parallel Speculative Decoding For Interactive LLM Inference
on Consumer Devices
link Ruslan Svirschevski, Avner May,..., Max Ryabinin
10 2024-10-03 Parameter Competition Balancing for Model Merging link Guodong DU, Junlin Lee,..., Min Zhang
10 2024-02-22 Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless
Video Pre-Training
link Haoran He, Chenjia Bai,..., Xuelong Li
10 2024-03-19 Optimal Flow Matching: Learning Straight Trajectories in Just One
Step
link Nikita Maksimovich Kornilov, Petr Mokrov,..., Alexander Korotin
10 2024-05-23 ALI-Agent: Assessing LLMs' Alignment with Human Values via
Agent-based Evaluation
link Jingnan Zheng, Han Wang,..., Tat-Seng Chua
10 2024-06-02 Evidence of Learned Look-Ahead in a Chess-Playing Neural Network link Erik Jenner, Shreyas Kapur,..., Stuart Russell
10 2024-06-03 D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large
Language Models
link Haoran Que, Jiaheng Liu,..., Bo Zheng
10 2024-05-30 Jailbreaking Large Language Models Against Moderation Guardrails via Cipher
Characters
link Haibo Jin, Andy Zhou,..., Haohan Wang
10 2024-03-18 A Sober Look at the Robustness of CLIPs to
Spurious Features
link Qizhou Wang, Yong Lin,..., Tong Zhang
10 2024-05-25 M$^3$GPT: An Advanced Multimodal, Multitask Framework for Motion
Comprehension and Generation
link Mingshuang Luo, RuiBing Hou,..., Shiguang Shan
10 2024-09-11 NVRC: Neural Video Representation Compression link Ho Man Kwan, Ge Gao,..., David Bull
10 2023-11-01 Learning Cooperative Trajectory Representations for Motion Forecasting link Hongzhi Ruan, Haibao Yu,..., Zaiqing Nie
10 2024-05-31 R$^2$-Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction link Ruyi Zha, Tao Jun Lin,..., Hongdong Li
10 2024-05-24 HDR-GS: Efficient High Dynamic Range Novel View Synthesis at
1000x Speed via Gaussian Splatting
link Yuanhao Cai, Zihao Xiao,..., Alan Yuille
10 2024-06-04 Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion
Models
link Dominik Hintersdorf, Lukas Struppek,..., Franziska Boenisch
10 2024-07-13 Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers link Sukjun Hwang, Aakash Lahoti,..., Albert Gu
10 2024-05-28 Phased Consistency Models link Fu-Yun Wang, Zhaoyang Huang,..., Hongsheng Li
10 2024-06-13 SimGen: Simulator-conditioned Driving Scene Generation link Yunsong Zhou, Michael Simon,..., Bolei Zhou
10 2024-05-24 ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign
Users
link Guanlin Li, Kangjie Chen,..., Tianwei Zhang
10 2024-02-02 Segment Any Change link Zhuo Zheng, Yanfei Zhong,..., Stefano Ermon
10 2024-04-25 Cooperate or Collapse: Emergence of Sustainable Cooperation in
a Society of LLM Agents
link Giorgio Piatti, Zhijing Jin,..., Rada Mihalcea
10 2024-05-22 RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging Radar link Fangqiang Ding, Xiangyu Wen,..., Chris Xiaoxuan Lu
9 2024-06-04 Learning to grok: Emergence of in-context learning and skill
composition in modular arithmetic tasks
link Tianyu He, Darshil Doshi,..., Andrey Gromov
9 2024-05-31 LLM-ESR: Large Language Models Enhancement for Long-tailed Sequential Recommendation link Qidong Liu, Xian Wu,..., Xiangyu Zhao
9 2024-09-14 Schrodinger Bridge Flow for Unpaired Data Translation link Valentin De Bortoli, Iryna Korshunova,..., Arnaud Doucet
9 2024-06-05 A Geometric View of Data Complexity: Efficient Local Intrinsic
Dimension Estimation with Diffusion Models
link Hamidreza Kamkari, Brendan Leigh Ross,..., Gabriel Loaiza-Ganem
9 2024-09-24 TFG: Unified Training-Free Guidance for Diffusion Models link Haotian Ye, Haowei Lin,..., Stefano Ermon
9 2024-05-24 Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear
Time Complexity
link Haoxuan Chen, Yinuo Ren,..., Grant M. Rotskoff
9 2024-05-27 MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities link Hao Dong, Yue Zhao,..., Olga Fink
9 2024-10-31 Understanding the Limits of Vision Language Models Through the
Lens of the Binding Problem
link Declan Iain Campbell, Sunayana Rane,..., Taylor Whittington Webb
9 2024-07-14 What Makes and Breaks Safety Fine-tuning? A Mechanistic Study link Samyak Jain, Ekdeep Singh Lubana,..., Puneet K. Dokania
9 2024-06-07 The Factorization Curse: Which Tokens You Predict Underlie the
Reversal Curse and More
link Ouail Kitouni, Niklas Nolte,..., Mark Ibrahim
9 2024-07-15 LLM Circuit Analyses Are Consistent Across Training and Scale link Curt Tigges, Michael Hanna,..., Stella Biderman
9 2024-10-16 Stabilize the Latent Space for Image Autoregressive Modeling: A
Unified Perspective
link Yongxin Zhu, Bocheng Li,..., Lidong Bing
9 2024-02-05 Constrained Synthesis with Projected Diffusion Models link Jacob K Christopher, Stephen Baek, Ferdinando Fioretto
9 2024-05-24 MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and
Provable Convergence
link Ionut-Vlad Modoranu, Mher Safaryan,..., Dan Alistarh
9 2024-10-24 Schedule Your Edit: A Simple yet Effective Diffusion Noise
Schedule for Image Editing
link Haonan Lin, Yan Chen,..., QianYing Wang
9 2024-07-19 Towards a "Universal Translator" for Neural Dynamics at Single-Cell,
Single-Spike Resolution
link Yizi Zhang, Yanchen Wang,..., Cole Lincoln Hurwitz
9 2024-02-06 On Convergence of Adam for Stochastic Optimization under Relaxed
Assumptions
link Yusu Hong, Junhong Lin
9 2024-10-31 The Importance of Being Scalable: Improving the Speed and
Accuracy of Neural Network Interatomic Potentials Across Chemical Domains
link Eric Qu, Aditi S. Krishnapriyan
9 2024-12-05 SceneDiffuser: Efficient and Controllable Driving Simulation Initialization and Rollout link Chiyu Max Jiang, Yijing Bai,..., Dragomir Anguelov
9 2024-02-25 No Free Lunch in LLM Watermarking: Trade-offs in Watermarking
Design Choices
link Qi Pang, Shengyuan Hu,..., Virginia Smith
9 2024-05-27 ARC: A Generalist Graph Anomaly Detector with In-Context Learning link Yixin Liu, Shiyuan Li,..., Shirui Pan
9 2024-05-27 DMPlug: A Plug-in Method for Solving Inverse Problems with
Diffusion Models
link Hengkang Wang, Xu Zhang,..., Ju Sun
9 2024-05-29 On the Role of Attention Masks and LayerNorm in
Transformers
link Xinyi Wu, Amir Ajorlou,..., Ali Jadbabaie
9 2024-05-31 Grammar-Aligned Decoding link Kanghee Park, Jiayu Wang,..., Loris D'Antoni
9 2024-06-29 UDC: A Unified Neural Divide-and-Conquer Framework for Large-Scale Combinatorial
Optimization Problems
link Zhi Zheng, Changliang Zhou,..., Zhenkun Wang
9 2024-04-23 SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-Tuning link Yexiao He, Ziyao Wang,..., Ang Li
9 2024-06-06 Understanding Information Storage and Transfer in Multi-Modal Large Language
Models
link Samyadeep Basu, Martin Grayson,..., Daniela Massiceti
9 2024-04-05 Dynamic Conditional Optimal Transport through Simulation-Free Flows link Gavin Kerrigan, Giosue Migliorini, Padhraic Smyth
9 2024-02-24 Data-Efficient Operator Learning via Unsupervised Pretraining and In-Context Learning link Wuyang Chen, Jialin Song,..., Michael W. Mahoney
9 2024-04-25 PhyRecon: Physically Plausible Neural Scene Reconstruction link Junfeng Ni, Yixin Chen,..., Siyuan Huang
9 2024-05-28 FASTopic: Pretrained Transformer is a Fast, Adaptive, Stable, and
Transferable Topic Model
link Xiaobao Wu, Thong Thanh Nguyen,..., Anh Tuan Luu
9 2024-06-12 The Impact of Initialization on LoRA Finetuning Dynamics link Soufiane Hayou, Nikhil Ghosh, Bin Yu
9 2024-02-14 InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward
Modeling
link Yuchun Miao, Sen Zhang,..., Dacheng Tao
9 2023-10-27 Proportional Fairness in Clustering: A Social Choice Perspective link Leon Kellerhals, Jannik Peters
9 2024-05-27 Entity Alignment with Noisy Annotations from Large Language Models link Shengyuan Chen, Qinggang Zhang,..., Xiao Huang
9 2024-04-01 Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained Models link Yuxin Wen, Leo Marchyok,..., Nicholas Carlini
9 2024-02-06 Discovery of the Hidden World with Large Language Models link Chenxi Liu, Yongqiang Chen,..., Kun Zhang
9 2024-04-22 Self-Supervised Alignment with Mutual Information: Learning to Follow Principles
without Preference Labels
link Jan-Philipp Fränken, Eric Zelikman,..., Noah Goodman
9 2024-10-24 Large Spatial Model: End-to-end Unposed Images to Semantic 3D link Zhiwen Fan, Jian Zhang,..., Yue Wang
9 2024-06-11 Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance
Distillation
link Yuanhao Zhai, Kevin Lin,..., Lijuan Wang
9 2024-05-23 Unchosen Experts Can Contribute Too: Unleashing MoE Models’ Power
by Self-Contrast
link Chufan Shi, Cheng Yang,..., Yu Meng
9 2024-02-22 A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit
Tasks in Public Health
link Nikhil Behari, Edwin Zhang,..., Milind Tambe
9 2024-05-21 LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language link James Requeima, John F Bronskill,..., David Duvenaud
9 2024-05-26 Categorical Flow Matching on Statistical Manifolds link Chaoran Cheng, Jiahan Li,..., Ge Liu
9 2024-10-23 How to Continually Adapt Text-to-Image Diffusion Models for Flexible
Customization?
link Jiahua Dong, Wenqi Liang,..., Fahad Khan
9 2024-10-30 Provably Optimal Memory Capacity for Modern Hopfield Models:
Transformer-Compatible Dense Associative Memories as Spherical Codes
link Jerry Yao-Chieh Hu, Dennis Wu, Han Liu
9 2024-09-13 Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation link Qingwen Bu, Jia Zeng,..., Hongyang Li
9 2024-03-21 SyncTweedies: A General Generative Framework Based on Synchronized Diffusions link Jaihoon Kim, Juil Koo,..., Minhyuk Sung
9 2024-05-20 Images that Sound: Composing Images and Sounds on a
Single Canvas
link Ziyang Chen, Daniel Geng, Andrew Owens
8 2024-10-25 NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction link Zixuan Gong, Guangyin Bao,..., Yu Zhang
8 2024-06-05 Reparameterization invariance in approximate Bayesian inference link Hrittik Roy, Marco Miani,..., Søren Hauberg
8 2024-07-20 Is Behavior Cloning All You Need? Understanding Horizon in
Imitation Learning
link Dylan J Foster, Adam Block, Dipendra Misra
8 2024-02-22 Watermarking Makes Language Models Radioactive link Tom Sander, Pierre Fernandez,..., Teddy Furon
8 2024-06-07 Probabilistic Weather Forecasting with Hierarchical Graph Neural Networks link Joel Oskarsson, Tomas Landelius,..., Fredrik Lindsten
8 2024-09-27 CycleNet: Enhancing Time Series Forecasting through Modeling Periodic Patterns link Shengsheng Lin, Weiwei Lin,..., Haocheng Zhong
8 2024-04-05 Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence
Metrics with T2IScoreScore (TS2)
link Michael Saxon, Fatima Jahara,..., William Yang Wang
8 2024-03-25 Is Your LiDAR Placement Optimized for 3D Scene Understanding? link Ye Li, Lingdong Kong,..., Xiaonan Huang
8 2024-06-13 Interpreting the Weight Space of Customized Diffusion Models link Amil Dravid, Yossi Gandelsman,..., Kfir Aberman
8 2024-09-16 Causal language modeling can elicit search and reasoning capabilities
on logic puzzles
link Kulin Shah, Nishanth Dikkala,..., Rina Panigrahy
8 2024-05-27 Mixed Dynamics In Linear Networks: Unifying the Lazy and
Active Regimes
link Zhenfeng Tu, Santiago Aranguri, Arthur Jacot
8 2024-07-26 SLIM: Style-Linguistics Mismatch Model for Generalized Audio Deepfake Detection link Yi Zhu, Surya Koppisetti,..., Gaurav Bharaj
8 2024-06-22 Teach Better or Show Smarter? On Instructions and Exemplars
in Automatic Prompt Optimization
link Xingchen Wan, Ruoxi Sun,..., Sercan O Arik
8 2024-06-12 Grounding Multimodal Large Language Models in Actions link Andrew Szot, Bogdan Mazoure,..., Alexander T Toshev
8 2024-06-13 Separations in the Representational Capabilities of Transformers and Recurrent
Architectures
link Satwik Bhattamishra, Michael Hahn,..., Varun Kanade
8 2023-12-09 Consistency Models for Scalable and Fast Simulation-Based Inference link Marvin Schmitt, Valentin Pratz,..., Stefan T. Radev
8 2024-09-26 DarkSAM: Fooling Segment Anything Model to Segment Nothing link Ziqi Zhou, Yufei Song,..., Hai Jin
8 2024-06-07 Variational Flow Matching for Graph Generation link Floor Eijkelboom, Grigory Bartosh,..., Jan-Willem van de Meent
8 2024-06-20 Transferable Boltzmann Generators link Leon Klein, Frank Noe
8 2024-02-12 Policy Improvement using Language Feedback Models link Victor Zhong, Dipendra Misra,..., Marc-Alexandre Côté
8 2024-02-01 Understanding the Expressive Power and Mechanisms of Transformer for
Sequence Modeling
link Mingze Wang, Weinan E
8 2024-07-08 B'MOJO: Hybrid State Space Realizations of Foundation Models with
Eidetic and Fading Memory
link Luca Zancato, Arjun Seshadri,..., Stefano Soatto
8 2024-05-28 Exploiting LLM Quantization link Kazuki Egashira, Mark Vero,..., Martin Vechev
8 2024-06-12 Optimized Feature Generation for Tabular Data via LLMs with
Decision Tree Reasoning
link Jaehyun Nam, Kyuyoung Kim,..., Jinwoo Shin
8 2024-05-29 A Full-duplex Speech Dialogue Scheme Based On Large Language
Model
link Peng Wang, Songshuo Lu,..., Yuanjun Xiong
8 2024-05-22 Spectral Adapter: Fine-Tuning in Spectral Space link Fangzhao Zhang, Mert Pilanci
8 2024-03-31 From Similarity to Superiority: Channel Clustering for Time Series
Forecasting
link Jialin Chen, Jan Eric Lenssen,..., Rex Ying
8 2023-11-26 A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning
with General Function Approximation
link Heyang Zhao, Jiafan He, Quanquan Gu
8 2024-04-06 MACM: Utilizing a Multi-Agent System for Condition Mining in
Solving Complex Mathematical Problems
link Bin Lei, Yi Zhang,..., Caiwen Ding
8 2024-10-07 TableRAG: Million-Token Table Understanding with Language Models link Si-An Chen, Lesly Miculicich,..., Tomas Pfister
8 2024-05-22 A Versatile Diffusion Transformer with Mixture of Noise Levels
for Audiovisual Generation
link Gwanghyun Kim, Alonso Martinez,..., Krishna Somandepalli
8 2024-02-06 AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies link Xixi Hu, qiang liu,..., Bo Liu
8 2024-04-18 Thought of Search: Planning with Language Models Through The
Lens of Efficiency
link Michael Katz, Harsha Kokel,..., Shirin Sohrabi
8 2024-05-23 Scalable Optimization in the Modular Norm link Tim Large, Yang Liu,..., Jeremy Bernstein
8 2024-04-17 On the Scalability of GNNs for Molecular Graphs link Maciej Sypetkowski, Frederik Wenkel,..., Dominique Beaini
8 2024-06-12 Is Programming by Example Solved by LLMs? link Wen-Ding Li, Kevin Ellis
8 2024-05-29 Matryoshka Query Transformer for Large Vision-Language Models link Wenbo Hu, Zi-Yi Dou,..., Kai-Wei Chang
8 2024-06-26 IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying
and Reweighting Context-Aware Neurons
link Dan Shi, Renren Jin,..., Deyi Xiong
8 2024-09-14 Symbolic Regression with a Learned Concept Library link Arya Grayeli, Atharva Sehgal,..., Swarat Chaudhuri
8 2024-06-01 Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching link Yongqi Wang, Wenxiang Guo,..., Zhou Zhao
8 2024-02-03 GITA: Graph to Visual and Textual Integration for Vision-Language
Graph Reasoning
link Yanbin Wei, Shuai Fu,..., Yu Zhang
8 2024-02-11 Online Iterative Reinforcement Learning from Human Feedback with General
Preference Model
link Chenlu Ye, Wei Xiong,..., Tong Zhang
8 2024-08-29 VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation link Shiwei Wu, Joya Chen,..., Mike Zheng Shou
8 2024-08-22 Transformers are Minimax Optimal Nonparametric In-Context Learners link Juno Kim, Tai Nakamaki, Taiji Suzuki