Last updated: 2024-12-09 08:39:55. Maintained by Weisen Jiang.

citation date review title (pdf) authors
374 2023-05-24 link Gorilla: Large Language Model Connected with Massive APIs Shishir G Patil, Tianjun Zhang,..., Joseph E. Gonzalez
332 2024-01-18 link VMamba: Visual State Space Model Yue Liu, Yunjie Tian,..., Yunfan Liu
325 2023-11-06 link CogVLM: Visual Expert for Pretrained Language Models Weihan Wang, Qingsong Lv,..., Jie Tang
238 2024-05-23 link YOLOv10: Real-Time End-to-End Object Detection Ao Wang, Hui Chen,..., Guiguang Ding
163 2024-05-23 link SimPO: Simple Preference Optimization with a Reference-Free Reward Yu Meng, Mengzhou Xia, Danqi Chen
135 2023-12-04 link Tree of Attacks: Jailbreaking Black-Box LLMs Automatically Anay Mehrotra, Manolis Zampetakis,..., Amin Karbasi
102 2024-03-29 link Are We on the Right Way for Evaluating Large Vision-Language Models? Lin Chen, Jinsong Li,..., Feng Zhao
96 2024-06-24 link Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs Shengbang Tong, Ellis L Brown II,..., Saining Xie
91 2024-01-31 link KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization Coleman Richard Charles Hooper, Sehoon Kim,..., Amir Gholami
88 2023-11-28 link LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS Zhiwen Fan, Kevin Wang,..., Zhangyang Wang
83 2024-05-03 link What matters when building vision-language models? Hugo Laurençon, Leo Tronchon,..., Victor Sanh
82 2024-04-03 link Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction Keyu Tian, Yi Jiang,..., Liwei Wang
79 2024-04-09 link InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD Xiaoyi Dong, Pan Zhang,..., Jiaqi Wang
77 2024-05-06 link SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering John Yang, Carlos E Jimenez,..., Ofir Press
77 2023-10-14 link Large Language Model Unlearning Yuanshun Yao, Xiaojun Xu, Yang Liu
77 2024-04-15 link LLM Evaluators Recognize and Favor Their Own Generations Arjun Panickssery, Samuel R. Bowman, Shi Feng
70 None link Many-shot Jailbreaking Cem Anil, Esin DURMUS,..., David Duvenaud
68 2024-06-13 link Depth Anything V2 Lihe Yang, Bingyi Kang,..., Hengshuang Zhao
63 2024-05-07 link xLSTM: Extended Long Short-Term Memory Maximilian Beck, Korbinian Pöppel,..., Sepp Hochreiter
58 2024-05-16 link CAT3D: Create Anything in 3D with Multi-View Diffusion Models Ruiqi Gao, Aleksander Holynski,..., Ben Poole
56 2024-02-15 link Chain-of-Thought Reasoning Without Prompting Xuezhi Wang, Denny Zhou
56 2024-03-30 link QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs Saleh Ashkboos, Amirkeivan Mohtashami,..., James Hensman
55 2024-04-17 link Many-Shot In-Context Learning Rishabh Agarwal, Avi Singh,..., Hugo Larochelle
55 2024-04-22 link SnapKV: LLM Knows What You are Looking for Before Generation Yuhong Li, Yingbing Huang,..., Deming Chen
54 2024-01-30 link Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks Andy Zhou, Bo Li, Haohan Wang
52 2024-02-16 link PointMamba: A Simple State Space Model for Point Cloud Analysis Dingkang Liang, Xin Zhou,..., Xiang Bai
50 2024-05-06 link MAmmoTH2: Scaling Instructions from the Web Xiang Yue, Tianyu Zheng,..., Wenhu Chen
48 2024-04-30 link Iterative Reasoning Preference Optimization Richard Yuanzhe Pang, Weizhe Yuan,..., Jason E Weston
47 2024-06-17 link Autoregressive Image Generation without Vector Quantization Tianhong Li, Yonglong Tian,..., Kaiming He
46 2024-06-17 link Refusal in Language Models Is Mediated by a Single Direction Andy Arditi, Oscar Balcells Obeso,..., Neel Nanda
40 2023-12-06 link Scaling transformer neural networks for skillful and reliable medium-range weather forecasting Tung Nguyen, Rohan Shah,..., Aditya Grover
38 2023-12-12 link SGLang: Efficient Execution of Structured Language Model Programs Lianmin Zheng, Liangsheng Yin,..., Ying Sheng
38 2023-10-26 link Transformers Learn to Achieve Second-Order Convergence Rates for In-Context Linear Regression Deqing Fu, Tian-qi Chen,..., Vatsal Sharan
37 2024-04-16 link VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time Sicheng Xu, Guojun Chen,..., Baining Guo
37 2024-07-11 link FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision Jay Shah, Ganesh Bikshandi,..., Tri Dao
35 2024-02-06 link Self-Discover: Large Language Models Self-Compose Reasoning Structures Pei Zhou, Jay Pujara,..., Steven Zheng
35 2024-05-02 link StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation Yupeng Zhou, Daquan Zhou,..., Qibin Hou
35 2024-04-21 link Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis Yuxi Ren, Xin Xia,..., Xuefeng Xiao
34 2024-04-03 link PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models Fanxu Meng, Zhaohui Wang, Muhan Zhang
33 2024-02-07 link Can Large Language Model Agents Simulate Human Trust Behaviors? Chengxing Xie, Canyu Chen,..., Guohao Li
33 2023-12-18 link Cascade Speculative Drafting for Even Faster LLM Inference Ziyi Chen, Xiaocong Yang,..., Jie Huang
33 2023-06-02 link Invisible Image Watermarks Are Provably Removable Using Generative AI Xuandong Zhao, Kexun Zhang,..., Lei Li
33 2024-04-04 link No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance Vishaal Udandarao, Ameya Prabhu,..., Matthias Bethge
33 2023-04-26 link The Closeness of In-Context Learning and Weight Shifting for Softmax Regression Shuai Li, Zhao Song,..., Tianyi Zhou
31 2024-02-12 link G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering Xiaoxin He, Yijun Tian,..., Bryan Hooi
31 2024-05-23 link Improved Distribution Matching Distillation for Fast Image Synthesis Tianwei Yin, Michaël Gharbi,..., William T. Freeman
31 2024-02-26 link Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts Mikayel Samvelyan, Sharath Chandra Raparthy,..., Roberta Raileanu
30 2023-12-19 link Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach Weiyu Ma, Qirui Mi,..., Haifeng Zhang
30 2024-04-04 link ReFT: Representation Finetuning for Language Models Zhengxuan Wu, Aryaman Arora,..., Christopher Potts
30 2024-02-17 link Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents Wenkai Yang, Xiaohan Bi,..., Xu Sun
29 2023-05-23 link Decoupled Kullback-Leibler Divergence Loss Jiequan Cui, Zhuotao Tian,..., Hanwang Zhang
29 2024-04-18 link Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing Ye Tian, Baolin Peng,..., Dong Yu
29 2024-02-29 link Humanoid Locomotion as Next Token Prediction Ilija Radosavovic, Jathushan Rajasegaran,..., Jitendra Malik
28 2024-03-27 link Long-form factuality in large language models Jerry Wei, Chengrun Yang,..., Quoc V Le
28 2024-05-21 link Reducing Transformer Key-Value Cache Size with Cross-Layer Attention William Brandon, Mayank Mishra,..., Jonathan Ragan-Kelley
28 2024-06-11 link An Image is Worth 32 Tokens for Reconstruction and Generation Qihang Yu, Mark Weber,..., Liang-Chieh Chen
28 2024-07-02 link MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention Huiqiang Jiang, YUCHENG LI,..., Lili Qiu
28 2024-03-23 link Understanding Emergent Abilities of Language Models from the Loss Perspective Zhengxiao Du, Aohan Zeng,..., Jie Tang
27 2024-03-14 link Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision Zhiqing Sun, Longhui Yu,..., Chuang Gan
27 2024-06-06 link ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search Dan Zhang, Sining Zhoubian,..., Jie Tang
27 2024-05-08 link You Only Cache Once: Decoder-Decoder Architectures for Language Models Yutao Sun, Li Dong,..., Furu Wei
26 2024-06-05 link Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms Rafael Rafailov, Yaswanth Chittepu,..., Scott Niekum
25 2024-05-06 link AlphaMath Almost Zero: Process Supervision without Process Guoxin Chen, Minpeng Liao,..., Kai Fan
25 2024-06-06 link Improving Alignment and Robustness with Circuit Breakers Andy Zou, Long Phan,..., Dan Hendrycks
25 2024-04-25 link Make Your LLM Fully Utilize the Context Shengnan An, Zexiong Ma,..., Weizhu Chen
25 2023-11-22 link SegVol: Universal and Interactive Volumetric Medical Image Segmentation Yuxin Du, Fan BAI,..., Bo Zhao
25 2024-02-16 link The Evolution of Statistical Induction Heads: In-Context Learning Markov Chains Ezra Edelman, Nikolaos Tsilivis,..., Surbhi Goel
25 2024-02-29 link How do Large Language Models Handle Multilingualism? Yiran Zhao, Wenxuan Zhang,..., Lidong Bing
23 2023-11-29 link Elo Uncovered: Robustness and Best Practices in Language Model Evaluation Meriem Boubdir, Edward Kim,..., Marzieh Fadaee
23 2024-05-27 link Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability Shenyuan Gao, Jiazhi Yang,..., Hongyang Li
22 2024-01-18 link ChatQA: Surpassing GPT-4 on Conversational QA and RAG Zihan Liu, Wei Ping,..., Bryan Catanzaro
22 2024-06-11 link Simple and Effective Masked Diffusion Language Models Subham Sekhar Sahoo, Marianne Arriola,..., Volodymyr Kuleshov
22 2024-03-26 link LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning Rui Pan, Xiang Liu,..., Tong Zhang
22 2023-05-27 link MADiff: Offline Multi-agent Learning with Diffusion Models Zhengbang Zhu, Minghuan Liu,..., Weinan Zhang
22 2024-02-14 link Soft Prompt Threats: Attacking Safety Alignment and Unlearning in Open-Source LLMs through the Embedding Space Leo Schwinn, David Dobre,..., Stephan Günnemann
21 2024-02-12 link Model Collapse Demystified: The Case of Regression Elvis Dohmatob, Yunzhen Feng, Julia Kempe
21 2024-02-24 link Spec-Gaussian: Anisotropic View-Dependent Appearance for 3D Gaussian Splatting Ziyi Yang, Xinyu Gao,..., Xiaogang Jin
21 2024-02-28 link Approaching Human-Level Forecasting with Language Models Danny Halawi, Fred Zhang,..., Jacob Steinhardt
21 2024-04-19 link MoVA: Adapting Mixture of Vision Experts to Multimodal Context Zhuofan Zong, Bingqi Ma,..., Yu Liu
21 2024-06-20 link RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold Amrith Setlur, Saurabh Garg,..., Aviral Kumar
21 2022-08-22 link Efficiency of the First-Price Auction in the Autobidding World Yuan Deng, Jieming Mao,..., Song Zuo
21 2023-12-12 link Alignment for Honesty Yuqing Yang, Ethan Chern,..., Pengfei Liu
21 2024-02-28 link Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates Kaifeng Lyu, Haoyu Zhao,..., Sanjeev Arora
20 2024-02-02 link ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution Haoran Ye, Jiarui Wang,..., Guojie Song
20 2024-06-06 link Simplified and Generalized Masked Diffusion for Discrete Data Jiaxin Shi, Kehang Han,..., Michalis Titsias
20 2024-02-19 link Query-Based Adversarial Prompt Generation Jonathan Hayase, Ema Borevković,..., Milad Nasr
20 2024-07-02 link RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs Yue Yu, Wei Ping,..., Bryan Catanzaro
20 2024-05-26 link Demystify Mamba in Vision: A Linear Attention Perspective Dongchen Han, Ziyi Wang,..., Gao Huang
20 2024-06-03 link MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures Jinjie Ni, Fuzhao Xue,..., Yang You
20 2024-04-11 link Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models Tuomas Kynkäänniemi, Miika Aittala,..., Jaakko Lehtinen
20 2023-05-15 link PLIP: Language-Image Pre-training for Person Representation Learning Jialong Zuo, Jiahao Hong,..., Jingdong Wang
20 2024-07-18 link Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies Chaofan Tao, Qian Liu,..., Ngai Wong
19 2024-05-13 link PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator Hanshu Yan, Xingchao Liu,..., Jiashi Feng
19 2024-05-30 link Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image Kailu Wu, Fangfu Liu,..., Kaisheng Ma
19 2024-06-14 link Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs Rui Yang, Ruomeng Ding,..., Tong Zhang
19 2024-05-27 link Transformers Can Do Arithmetic with the Right Embeddings Sean Michael McLeish, Arpit Bansal,..., Tom Goldstein
18 2023-12-13 link Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers Haifeng Huang, Yilun Chen,..., Zhou Zhao
18 2024-04-15 link 3D Gaussian Splatting as Markov Chain Monte Carlo Shakiba Kheradmand, Daniel Rebain,..., Kwang Moo Yi
18 2024-02-26 link Why Transformers Need Adam: A Hessian Perspective Yushun Zhang, Congliang Chen,..., Zhi-Quan Luo
18 2024-06-26 link WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models Liwei Jiang, Kavel Rao,..., Nouha Dziri
18 2024-03-26 link MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution Wei Tao, Yucheng Zhou,..., Yu Cheng
18 2024-04-12 link Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length Xuezhe Ma, Xiaomeng Yang,..., Chunting Zhou
18 2024-07-01 link Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion Boyuan Chen, Diego Martí Monsó,..., Vincent Sitzmann
18 2024-05-28 link Aligning to Thousands of Preferences via System Message Generalization Seongyun Lee, Sue Hyun Park,..., Minjoon Seo
18 2024-05-26 link Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer Zhihan Liu, Miao Lu,..., Zhaoran Wang
17 2024-05-24 link Efficient Adversarial Training in LLMs with Continuous Attacks Sophie Xhonneux, Alessandro Sordoni,..., Leo Schwinn
17 2024-02-08 link Noise Contrastive Alignment of Language Models with Explicit Rewards Huayu Chen, Guande He,..., Jun Zhu
17 2024-02-17 link OneBit: Towards Extremely Low-bit Large Language Models Yuzhuang Xu, Xu Han,..., Wanxiang Che
17 2024-06-17 link Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging Zhenyi Lu, Chenghao Fan,..., Yu Cheng
17 2024-04-23 link Rethinking LLM Memorization through the Lens of Adversarial Compression Avi Schwarzschild, Zhili Feng,..., J Zico Kolter
17 2023-12-06 link OctreeOcc: Efficient and Multi-Granularity Occupancy Prediction Using Octree Queries Yuhang Lu, Xinge ZHU,..., Yuexin Ma
17 2024-06-12 link VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks Jiannan Wu, Muyan Zhong,..., Jifeng Dai
17 2024-04-25 link REBEL: Reinforcement Learning via Regressing Relative Rewards Zhaolin Gao, Jonathan Daniel Chang,..., Wen Sun
17 2024-05-24 link Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models Yimeng Zhang, Xin Chen,..., Sijia Liu
17 2024-06-03 link SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model An-Chieh Cheng, Hongxu Yin,..., Sifei Liu
17 2024-05-24 link The Road Less Scheduled Aaron Defazio, Xingyu Alice Yang,..., Ashok Cutkosky
17 2024-03-05 link Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding Zhenyu Zhang, Runjin Chen,..., Zhangyang Wang
17 2024-05-09 link CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts Jiachen Li, Xinyao Wang,..., Longyin Wen
16 2023-10-06 link Why Do We Need Weight Decay in Modern Deep Learning? Francesco D'Angelo, Maksym Andriushchenko,..., Nicolas Flammarion
16 2024-06-04 link Guiding a Diffusion Model with a Bad Version of Itself Tero Karras, Miika Aittala,..., Samuli Laine
16 2024-05-23 link JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models Kun Zhou, Beichen Zhang,..., Ji-Rong Wen
16 2024-06-22 link Are Language Models Actually Useful for Time Series Forecasting? Mingtian Tan, Mike A Merrill,..., Thomas Hartvigsen
16 2024-07-22 link Discrete Flow Matching Itai Gat, Tal Remez,..., Yaron Lipman
16 2024-02-09 link Fight Back Against Jailbreaking via Prompt Adversarial Tuning Yichuan Mo, Yuji Wang,..., Yisen Wang
16 2024-04-09 link MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection Haoyang He, Yuhu Bai,..., Lei Xie
16 2024-02-18 link Federated Fine-tuning of Large Language Models under Heterogeneous Language Tasks and Client Resources Jiamu Bai, Daoyuan Chen,..., Yaliang Li
16 2024-02-19 link A Critical Evaluation of AI Feedback for Aligning Large Language Models Archit Sharma, Sedrick Keh,..., Thomas Kollar
16 2023-10-12 link MatFormer: Nested Transformer for Elastic Inference Fnu Devvrit, Sneha Kudugunta,..., Prateek Jain
16 2024-06-03 link Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses Xiaosen Zheng, Tianyu Pang,..., Min Lin
16 2023-06-13 link Questioning the Survey Responses of Large Language Models Ricardo Dominguez-Olmedo, Moritz Hardt, Celestine Mendler-Dünner
16 None link AutoGuide: Automated Generation and Selection of State-Aware Guidelines for Large Language Model Agents Yao Fu, Dong-Ki Kim,..., Honglak Lee
16 2023-08-04 link Adaptive Proximal Gradient Method for Convex Optimization Yura Malitsky, Konstantin Mishchenko
15 2024-06-17 link How Do Large Language Models Acquire Factual Knowledge During Pretraining? Hoyeon Chang, Jinho Park,..., Minjoon Seo
15 2024-05-16 link Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning Yuexiang Zhai, Hao Bai,..., Sergey Levine
15 2024-05-28 link Understanding Transformer Reasoning Capabilities via Graph Algorithms Clayton Sanford, Bahare Fatemi,..., Vahab Mirrokni
15 2024-05-17 link Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning Dan Braun, Jordan Taylor,..., Lee Sharkey
15 2024-02-16 link Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE) Usha Bhalla, Alex Oesterling,..., Himabindu Lakkaraju
15 2024-03-01 link Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes Xiaomeng Hu, Pin-Yu Chen, Tsung-Yi Ho
15 2024-02-29 link TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous Variables Yuxuan Wang, Haixu Wu,..., Mingsheng Long
15 2024-07-19 link Compact Language Models via Pruning and Knowledge Distillation Saurav Muralidharan, Sharath Turuvekere Sreenivas,..., Pavlo Molchanov
15 2024-05-19 link Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention Peng Li, Yuan Liu,..., Yike Guo
15 2024-05-29 link Poseidon: Efficient Foundation Models for PDEs Maximilian Herde, Bogdan Raonic,..., Siddhartha Mishra
15 2024-02-03 link Panacea: Pareto Alignment via Preference Adaptation for LLMs Yifan Zhong, Chengdong Ma,..., Yaodong Yang
14 2024-05-24 link Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models Byung-Kwan Lee, Chae Won Kim,..., Yong Man Ro
14 2024-06-03 link Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration Junyang Wang, Haiyang Xu,..., Jitao Sang
14 2024-05-28 link Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations Alexander Hägele, Elie Bakouch,..., Martin Jaggi
14 2024-06-02 link BoNBoN Alignment for Large Language Models and the Sweetness of Best-of-n Sampling Lin Gui, Cristina Garbacea, Victor Veitch
14 2024-05-26 link Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models HANWEN LIANG, Yuyang Yin,..., Yunchao Wei
14 2024-05-23 link Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model Yuheng Shi, Minjing Dong, Chang Xu
14 2024-05-03 link DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos Wen-Hsuan Chu, Lei Ke, Katerina Fragkiadaki
14 2024-05-23 link MiniCache: KV Cache Compression in Depth Dimension for Large Language Models Akide Liu, Jing Liu,..., Bohan Zhuang
14 2024-01-11 link A Closer Look at AUROC and AUPRC under Class Imbalance Matthew B.A. McDermott, Haoran Zhang,..., Jack Gallifant
14 2024-05-22 link xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token Xin Cheng, Xun Wang,..., Dongyan Zhao
14 2024-05-08 link Chain of Thoughtlessness? An Analysis of CoT in Planning Kaya Stechly, Karthik Valmeekam, Subbarao Kambhampati
14 2024-02-15 link BitDelta: Your Fine-Tune May Only Be Worth One Bit James Liu, Guangxuan Xiao,..., Tianle Cai
13 2024-05-23 link Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion Models Gen Li, Yuling Yan
13 2024-03-25 link Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization Xiangxin Zhou, Dongyu Xue,..., Quanquan Gu
13 2024-03-22 link Can large language models explore in-context? Akshay Krishnamurthy, Keegan Harris,..., Aleksandrs Slivkins
13 2024-04-24 link PuLID: Pure and Lightning ID Customization via Contrastive Alignment Zinan Guo, Yanze Wu,..., Qian HE
13 2024-06-10 link Parallelizing Linear Transformers with the Delta Rule over Sequence Length Songlin Yang, Bailin Wang,..., Yoon Kim
13 2024-05-29 link Preference Learning Algorithms Do Not Learn Preference Rankings Angelica Chen, Sadhika Malladi,..., Kyunghyun Cho
13 2024-02-22 link Large Language Models as Urban Residents: An LLM Agent Framework for Personal Mobility Generation Jiawei Wang, Renhe Jiang,..., Chuan Xiao
13 2024-05-07 link KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization Tianyi Zhang, Jonah Wonkyu Yi,..., Anshumali Shrivastava
13 2024-03-19 link Pretraining Codomain Attention Neural Operators for Solving Multiphysics PDEs Md Ashiqur Rahman, Robert Joseph George,..., Anima Anandkumar
13 2024-06-14 link DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning Hao Bai, Yifei Zhou,..., Aviral Kumar
13 2024-06-27 link OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding Tao Zhang, Xiangtai Li,..., Shuicheng YAN
13 2024-06-13 link Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback Hamish Ivison, Yizhong Wang,..., Hannaneh Hajishirzi
13 2024-05-27 link Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models Chia-Yi Hsu, Yu-Lin Tsai,..., Chun-Ying Huang
13 2024-06-06 link Transformers need glasses! Information over-squashing in language tasks Federico Barbero, Andrea Banino,..., Petar Veličković
13 2024-05-31 link Amortizing intractable inference in diffusion models for vision, language, and control Siddarth Venkatraman, Moksh Jain,..., Nikolay Malkin
12 2024-05-23 link HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models Bernal Jimenez Gutierrez, Yiheng Shu,..., Yu Su
12 2024-02-04 link Aligner: Efficient Alignment by Learning to Correct Jiaming Ji, Boyuan Chen,..., Yaodong Yang
12 2024-04-23 link Aligning LLM Agents by Learning Latent Preference from User Edits Ge Gao, Alexey Taymanov,..., Dipendra Misra
12 2024-05-30 link Enhancing Large Vision Language Models with Self-Training on Image Comprehension Yihe Deng, Pan Lu,..., Wei Wang
12 2024-06-11 link BAKU: An Efficient Transformer for Multi-Task Policy Learning Siddhant Haldar, Zhuoran Peng, Lerrel Pinto
12 2024-06-03 link Neural network learns low-dimensional polynomials with SGD near the information-theoretic limit Jason D. Lee, Kazusato Oko,..., Denny Wu
12 2024-05-29 link Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models Zhanhui Zhou, Zhixuan Liu,..., Yu Qiao
12 2024-02-28 link Implicit Optimization Bias of Next-Token Prediction in Linear Models Christos Thrampoulidis
12 2024-06-10 link LLM Dataset Inference: Did you train on my dataset? Pratyush Maini, Hengrui Jia,..., Adam Dziedzic
12 2024-05-23 link Representation Noising: A Defence Mechanism Against Harmful Finetuning Domenic Rosati, Jan Wehner,..., Frank Rudzicz
12 2024-02-29 link Theoretical Foundations of Deep Selective State-Space Models Nicola Muca Cirone, Antonio Orvieto,..., Terry Lyons
12 2024-05-27 link EM Distillation for One-step Diffusion Models Sirui Xie, Zhisheng Xiao,..., Ruiqi Gao
12 2024-11-02 link Rule Based Rewards for Language Model Safety Tong Mu, Alec Helyar,..., Lilian Weng
12 2024-01-18 link Cross-Modality Perturbation Synergy Attack for Person Re-identification Yunpeng Gong, Zhun Zhong,..., Min Jiang
11 2024-06-27 link Resolving Discrepancies in Compute-Optimal Scaling of Language Models Tomer Porian, Mitchell Wortsman,..., Yair Carmon
11 2024-02-05 link FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion Xing Han, Huy Nguyen,..., Suchi Saria
11 2024-06-12 link Large Language Model Unlearning via Embedding-Corrupted Prompts Chris Yuhao Liu, Yaxuan Wang,..., Yang Liu
11 2024-05-25 link Theoretical Analysis of Weak-to-Strong Generalization Hunter Lang, David Sontag, Aravindan Vijayaraghavan
11 2024-01-29 link Contracting with a Learning Agent Guru Guruganesh, Yoav Kolumbus,..., S. Matthew Weinberg
11 2024-03-14 link MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models Zunnan Xu, Yukang Lin,..., Xiu Li
11 2023-10-19 link AutoMix: Automatically Mixing Language Models Pranjal Aggarwal, Aman Madaan,..., Mausam .
11 2024-05-23 link ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification Yefei He, Luoming Zhang,..., Bohan Zhuang
11 2024-06-10 link MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models Zichun Yu, Spandan Das, Chenyan Xiong
11 2024-07-11 link WildGaussians: 3D Gaussian Splatting in the Wild Jonas Kulhanek, Songyou Peng,..., Torsten Sattler
11 2024-07-31 link Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models Adam Karvonen, Benjamin Wright,..., Samuel Marks
11 2023-05-21 link DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated Text Detection Xiao Yu, Yuang Qi,..., Nenghai Yu
11 2024-03-07 link Online Adaptation of Language Models with a Memory of Amortized Contexts Jihoon Tack, Jaehyung Kim,..., Jonathan Richard Schwarz
11 2024-06-06 link Multistep Distillation of Diffusion Models via Moment Matching Tim Salimans, Thomas Mensink,..., Emiel Hoogeboom
11 2024-05-27 link Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control Zhengfei Kuang, Shengqu Cai,..., Gordon Wetzstein
11 2024-06-18 link DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving Yuxuan Tong, Xiwen Zhang,..., Junxian He
11 2024-02-07 link InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory Chaojun Xiao, Pengle Zhang,..., Maosong Sun
11 2024-05-25 link Streaming Long Video Understanding with Large Language Models Rui Qian, Xiaoyi Dong,..., Jiaqi Wang
11 2024-05-23 link Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer Shuang Wu, Youtian Lin,..., Yao Yao
10 2024-05-23 link WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models Peng Wang, Zexi Li,..., Huajun Chen
10 2024-03-06 link WaterMax: breaking the LLM watermark detectability-robustness-quality trade-off Eva Giboulot, Teddy Furon
10 2024-05-23 link Instruction Tuning With Loss Over Instructions Zhengyan Shi, Adam X. Yang,..., Aldo Lipani
10 2024-04-06 link Aligning Diffusion Models by Optimizing Human Utility Shufan Li, Konstantinos Kallidromitis,..., Kazuki Kozuka
10 2024-05-23 link EMR-Merging: Tuning-Free High-Performance Model Merging Chenyu Huang, Peng Ye,..., Wanli Ouyang
10 2024-01-11 link Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents Quentin Delfosse, Sebastian Sztwiertnia,..., Kristian Kersting
10 2024-02-21 link Average gradient outer product as a mechanism for deep neural collapse Daniel Beaglehole, Peter Súkeník,..., Mikhail Belkin
10 2024-06-11 link 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models Heng Yu, Chaoyang Wang,..., Hsin-Ying Lee
10 2024-05-27 link Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language Models ShengYun Peng, Pin-Yu Chen,..., Duen Horng Chau
10 2024-06-27 link Decoding-Time Language Model Alignment with Multiple Objectives Ruizhe Shi, Yifang Chen,..., Simon Shaolei Du
10 2024-07-02 link Meta 3D AssetGen: Text-to-Mesh Generation with High-Quality Geometry, Texture, and PBR Materials Yawar Siddiqui, Tom Monnier,..., David Novotny
10 2024-06-21 link Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models Jiayu Wang, Yifei Ming,..., Neel Joshi
10 2024-04-16 link Self-playing Adversarial Language Game Enhances LLM Reasoning Pengyu Cheng, Tianhao Hu,..., Xiaolong Li
10 2024-02-07 link Amortized Planning with Large-Scale Transformers: A Case Study on Chess Anian Ruoss, Gregoire Deletang,..., Tim Genewein
10 2024-05-28 link Why are Visually-Grounded Language Models Bad at Image Classification? Yuhui Zhang, Alyssa Unell,..., Serena Yeung-Levy
10 2024-07-17 link AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases Zhaorun Chen, Zhen Xiang,..., Bo Li
10 2023-10-10 link A General Protocol to Probe Large Vision Models for 3D Physical Understanding Guanqi Zhan, Chuanxia Zheng,..., Andrew Zisserman
10 2024-04-08 link SpeechAlign: Aligning Speech Generation to Human Preferences Dong Zhang, Zhaowei Li,..., Xipeng Qiu
10 2024-02-22 link In-Context Learning of a Linear Transformer Block: Benefits of the MLP Component and One-Step GD Initialization Ruiqi Zhang, Jingfeng Wu, Peter Bartlett
10 2024-02-09 link CultureLLM: Incorporating Cultural Differences into Large Language Models CHENG LI, Mengzhuo Chen,..., Xing Xie
10 2024-03-09 link Algorithmic progress in language models Anson Ho, Tamay Besiroglu,..., Jaime Sevilla
10 2024-02-15 link Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Huizhuo Yuan, Zixiang Chen,..., Quanquan Gu
10 2024-02-04 link Diffusion Models are Certifiably Robust Classifiers Huanran Chen, Yinpeng Dong,..., Jun Zhu
10 2024-02-19 link WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the Environment Hao Tang, Darren Yan Key, Kevin Ellis
10 2024-05-23 link Calibrated Self-Rewarding Vision Language Models Yiyang Zhou, Zhiyuan Fan,..., Huaxiu Yao
10 2024-05-17 link ProSST: Protein Language Modeling with Quantized Structure and Disentangled Attention Mingchen Li, Yang Tan,..., Liang Hong
10 2024-03-12 link Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models Yang Jiao, Shaoxiang Chen,..., Yu-Gang Jiang
10 2024-05-27 link Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels Yikai Wang, Xinzhou Wang,..., Jun Zhu
10 2024-02-29 link Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models Frederik Kunstner, Robin Yadav,..., Alberto Bietti
10 2024-06-14 link L4GM: Large 4D Gaussian Reconstruction Model Jiawei Ren, Kevin Xie,..., Huan Ling
10 2024-04-04 link CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching Dongzhi Jiang, Guanglu Song,..., Hongsheng Li
9 2023-12-20 link UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections Fangjinhua Wang, Marie-Julie Rakotosaona,..., Federico Tombari
9 2024-05-27 link PromptFix: You Prompt and We Fix the Photo Yongsheng Yu, Ziyun Zeng,..., Jiebo Luo
9 2024-03-28 link InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction Sirui Xu, Ziyin Wang,..., Liangyan Gui
9 2024-06-03 link D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models Haoran Que, Jiaheng Liu,..., Bo Zheng
9 2024-05-31 link 4Diffusion: Multi-view Video Diffusion Model for 4D Generation Haiyu Zhang, Xinyuan Chen,..., Yu Qiao
9 2024-07-06 link LoRA-GA: Low-Rank Adaptation with Gradient Approximation Shaowen Wang, Linxi Yu, Jian Li
9 2024-06-17 link Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models Bingqi Ma, Zhuofan Zong,..., Yu Liu
9 2024-04-22 link Protecting Your LLMs with Information Bottleneck Zichuan Liu, Zefan Wang,..., Jiang Bian
9 2023-12-06 link Return of Unconditional Generation: A Self-supervised Representation Generation Method Tianhong Li, Dina Katabi, Kaiming He
9 2024-05-31 link MeshXL: Neural Coordinate Field for Generative 3D Foundation Models Sijin Chen, Xin Chen,..., Tao Chen
9 2023-10-20 link Towards Understanding How Transformers Learn In-context Through a Representation Learning Lens Ruifeng Ren, Yong Liu
9 2024-05-28 link Linguistic Collapse: Neural Collapse in (Large) Language Models Robert Wu, Vardan Papyan
9 2024-05-30 link Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation Guillaume Huguet, James Vuckovic,..., Joey Bose
9 2024-03-12 link Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion Dongyang Li, Chen Wei,..., Quanying Liu
9 2024-06-25 link MotionBooth: Motion-Aware Customized Text-to-Video Generation Jianzong Wu, Xiangtai Li,..., Kai Chen
9 2024-05-24 link Quantifying the Gain in Weak-to-Strong Generalization Moses Charikar, Chirag Pabbaraju, Kirankumar Shiragur
9 2024-06-13 link Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models Yushi Hu, Weijia Shi,..., Ranjay Krishna
9 2024-07-25 link Recursive Introspection: Teaching Language Model Agents How to Self-Improve Yuxiao Qu, Tianjun Zhang,..., Aviral Kumar
9 2024-06-06 link Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models Ling Yang, Zhaochen Yu,..., Bin CUI
9 2024-02-06 link Scaling laws for learning with real and surrogate data Ayush Jain, Andrea Montanari, Eren Sasoglu
9 2024-05-23 link PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher Dongjun Kim, Chieh-Hsin Lai,..., Stefano Ermon
8 2024-09-30 link Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers Lirui Wang, Xinlei Chen,..., Kaiming He
8 2024-04-19 link Ensemble Learning for Heterogeneous Large Language Models with Deep Parallel Collaboration Yichong Huang, Xiaocheng Feng,..., Bing Qin
8 2023-10-27 link Proportional Fairness in Clustering: A Social Choice Perspective Leon Kellerhals, Jannik Peters
8 2024-06-12 link One-Step Effective Diffusion Network for Real-World Image Super-Resolution Rongyuan Wu, Lingchen Sun,..., Lei Zhang
8 2024-05-28 link A Theoretical Understanding of Self-Correction through In-context Alignment Yifei Wang, Yuyang Wu,..., Yisen Wang
8 2024-02-06 link A phase transition between positional and semantic learning in a solvable model of dot-product attention Hugo Cui, Freya Behrens,..., Lenka Zdeborova
8 2024-05-30 link Jailbreaking Large Language Models Against Moderation Guardrails via Cipher Characters Haibo Jin, Andy Zhou,..., Haohan Wang
8 2024-02-07 link The Fine-Grained Complexity of Gradient Computation for Training Large Language Models Josh Alman, Zhao Song
8 2024-05-20 link Diffusion for World Modeling: Visual Details Matter in Atari Eloi Alonso, Adam Jelley,..., François Fleuret
8 2024-06-06 link Evaluating the World Model Implicit in a Generative Model Keyon Vafa, Justin Y. Chen,..., Sendhil Mullainathan
8 2024-03-25 link Provably Robust Score-Based Diffusion Posterior Sampling for Plug-and-Play Image Reconstruction Xingyu Xu, Yuejie Chi
8 2024-05-24 link iVideoGPT: Interactive VideoGPTs are Scalable World Models Jialong Wu, Shaofeng Yin,..., Mingsheng Long
8 2024-02-07 link QGFN: Controllable Greediness with Action Values Elaine Lau, Stephen Zhewen Lu,..., Emmanuel Bengio
8 2024-05-20 link Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving Aniket Rajiv Didolkar, Anirudh Goyal,..., Sanjeev Arora
8 2024-02-02 link AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback Jian Guan, Wei Wu,..., Minlie Huang
8 2024-04-22 link Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels Jan-Philipp Fränken, Eric Zelikman,..., Noah Goodman
8 2024-06-17 link Transcendence: Generative Models Can Outperform The Experts That Train Them Edwin Zhang, Vincent Zhu,..., Eran Malach
8 2024-07-29 link FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention Yu Lu, Yuanzhi Liang,..., Yi Yang
8 2024-05-23 link Base of RoPE Bounds Context Length Mingyu Xu, Xin Men,..., Weipeng Chen
8 2024-03-25 link Is Your LiDAR Placement Optimized for 3D Scene Understanding? Ye Li, Lingdong Kong,..., Xiaonan Huang
8 2024-06-13 link OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation Junke Wang, Yi Jiang,..., Yu-Gang Jiang
8 2024-06-13 link COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing Jiangshan Wang, Yue Ma,..., Xiu Li
8 2024-02-05 link Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models Yuancheng Xu, Jiarui Yao,..., Furong Huang
8 2024-02-07 link Universal Neural Functionals Allan Zhou, Chelsea Finn, James Harrison
8 2024-05-16 link Conformal Alignment: Knowing When to Trust Foundation Models with Guarantees Yu Gui, Ying Jin, Zhimei Ren
8 2024-04-01 link Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained Models Yuxin Wen, Leo Marchyok,..., Nicholas Carlini
8 2024-04-23 link Gradient Guidance for Diffusion Models: An Optimization Perspective Yingqing Guo, Hui Yuan,..., Mengdi Wang
8 2024-05-23 link 4+3 Phases of Compute-Optimal Neural Scaling Laws Elliot Paquette, Courtney Paquette,..., Jeffrey Pennington
8 2024-06-13 link Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs Xuan Zhang, Chao Du,..., Min Lin
8 2024-06-01 link RGFN: Synthesizable Molecular Generation Using GFlowNets Michał Koziarski, Andrei Rekesh,..., Robert A. Batey
8 2024-05-22 link DOGS: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus Yu Chen, Gim Hee Lee
8 2024-07-05 link On scalable oversight with weak LLMs judging strong LLMs Zachary Kenton, Noah Yamamoto Siegel,..., Rohin Shah
8 2024-06-13 link LRM-Zero: Training Large Reconstruction Models with Synthesized Data Desai Xie, Sai Bi,..., Hao Tan
8 2024-05-15 link Spectral Editing of Activations for Large Language Model Alignment Yifu QIU, Zheng Zhao,..., Shay B Cohen
8 2024-05-25 link Breaking the False Sense of Security in Backdoor Defense through Re-Activation Attack Mingli Zhu, Siyuan Liang, Baoyuan Wu
8 2023-05-22 link Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations Hao Chen, Ankit Shah,..., Bhiksha Raj
8 2024-05-30 link Improving the Training of Rectified Flows Sangyun Lee, Zinan Lin, Giulia Fanti
8 2024-01-24 link Beyond Concept Bottleneck Models: How to Make Black Boxes Intervenable? Sonia Laguna, Ričards Marcinkevičs,..., Julia E Vogt
8 2024-03-03 link GuardT2I: Defending Text-to-Image Models from Adversarial Prompts Yijun Yang, Ruiyuan Gao,..., Qiang Xu
8 2024-05-30 link Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models Masatoshi Uehara, Yulai Zhao,..., Tommaso Biancalani
7 2024-06-14 link Large language model validity via enhanced conformal prediction methods John Cherian, Isaac Gibbs, Emmanuel Candes
7 2024-06-23 link Trace is the Next AutoDiff: Generative Optimization with Rich Feedback, Execution Traces, and LLMs Ching-An Cheng, Allen Nie, Adith Swaminathan
7 2024-06-20 link Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs Yuxuan Qiao, Haodong Duan,..., Kai Chen
7 2024-05-29 link T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback Jiachen Li, Weixi Feng,..., William Yang Wang
7 2024-09-09 link FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank Adaptations Ziyao Wang, Zheyu Shen,..., Ang Li
7 2024-05-23 link Lorentz-Equivariant Geometric Algebra Transformers for High-Energy Physics Jonas Spinner, Victor Breso Pla,..., Johann Brehmer
7 2024-05-26 link Code Repair with LLMs gives an Exploration-Exploitation Tradeoff Hao Tang, Keya Hu,..., Kevin Ellis
7 2024-05-23 link Unchosen Experts Can Contribute Too: Unleashing MoE Models' Power by Self-Contrast Chufan Shi, Cheng Yang,..., Yu Meng
7 2024-04-23 link Multi-Head Mixture-of-Experts Xun Wu, Shaohan Huang,..., Furu Wei
7 2024-02-05 link Estimating Epistemic and Aleatoric Uncertainty with a Single Model Matthew Albert Chan, Maria J. Molina, Christopher Metzler
7 2024-07-08 link Multi-Object Hallucination in Vision-Language Models Xuweiyi Chen, Ziqiao Ma,..., Joyce Chai
7 2024-01-22 link Self-Labeling the Job Shop Scheduling Problem Andrea Corsini, Angelo Porrello,..., Mauro Dell'Amico
7 2024-06-03 link DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs Haokun Lin, Haobo Xu,..., Ying Wei
7 2024-06-10 link Aligning Large Language Models with Representation Editing: A Control Perspective Lingkai Kong, Haorui Wang,..., Chao Zhang
7 2024-02-24 link Data-Efficient Operator Learning via Unsupervised Pretraining and In-Context Learning Wuyang Chen, Jialin Song,..., Michael W. Mahoney
7 2024-06-12 link Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference Jiabao Ji, Yujian Liu,..., Shiyu Chang
7 2023-11-03 link Towards Calibrated Robust Fine-Tuning of Vision-Language Models Changdae Oh, Hyesu Lim,..., Kyungwoo Song
7 2024-05-23 link Metric Flow Matching for Smooth Interpolations on the Data Manifold Kacper Kapusniak, Peter Potaptchik,..., Francesco Di Giovanni
7 2024-06-10 link MVGamba: Unify 3D Content Generation as State Space Sequence Modeling Xuanyu Yi, Zike Wu,..., Hanwang Zhang
7 2024-06-06 link ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization Luca Eyring, Shyamgopal Karthik,..., Zeynep Akata
7 2024-03-12 link SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models Yu Yang, Siddhartha Mishra,..., Baharan Mirzasoleiman
7 2024-02-21 link Linear Transformers are Versatile In-Context Learners Max Vladymyrov, Johannes Von Oswald,..., Rong Ge
7 2024-06-15 link Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection Guowen Zhang, Lue Fan,..., Lei Zhang
7 2024-02-17 link TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks Benjamin Feuer, Robin Tibor Schirrmeister,..., Colin White
7 2024-06-14 link Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs Abhimanyu Hans, John Kirchenbauer,..., Tom Goldstein
7 2024-06-21 link GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation Chubin Zhang, Hongliang Song,..., Yansong Tang
7 2024-06-17 link Transcoders Find Interpretable LLM Feature Circuits Jacob Dunefsky, Philippe Chlenski, Neel Nanda
7 2024-07-17 link Direct Unlearning Optimization for Robust and Safe Text-to-Image Models Yong-Hyun Park, Sangdoo Yun,..., Gayoung Lee
7 2024-02-29 link RL-GPT: Integrating Reinforcement Learning and Code-as-policy Shaoteng Liu, Haoqi Yuan,..., Jiaya Jia
7 2024-05-30 link Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities Alexander V Nikitin, Jannik Kossen,..., Pekka Marttinen
7 2024-02-22 link Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless Video Pre-Training Haoran He, Chenjia Bai,..., Xuelong Li
7 2024-02-18 link In-Context Learning with Transformers: Softmax Attention Adapts to Function Lipschitzness Liam Collins, Advait U Parulekar,..., Sanjay Shakkottai
7 2024-05-28 link A Canonicalization Perspective on Invariant and Equivariant Learning George Ma, Yifei Wang,..., Yisen Wang
7 2024-03-11 link SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection Yuxuan Li, Xiang Li,..., Jian Yang
6 2023-12-13 link SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention Róbert Csordás, Piotr Piękos,..., Jürgen Schmidhuber
6 2024-06-04 link Chain of Agents: Large Language Models Collaborating on Long-Context Tasks Yusen Zhang, Ruoxi Sun,..., Sercan O Arik
6 2024-06-03 link DEFT: Efficient Fine-Tuning of Diffusion Models by Learning the Generalised $h$-transform Alexander Denker, Francisco Vargas,..., Pietro Lio
6 2024-03-31 link From Similarity to Superiority: Channel Clustering for Time Series Forecasting Jialin Chen, Jan Eric Lenssen,..., Rex Ying
6 2024-06-06 link VideoTetris: Towards Compositional Text-to-Video Generation Ye Tian, Ling Yang,..., Bin CUI
6 2024-05-29 link PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications Dingkang Yang, Jinjie Wei,..., Lihua Zhang
6 2024-10-25 link DiffGS: Functional Gaussian Splatting Diffusion Junsheng Zhou, Weiqi Zhang, Yu-Shen Liu
6 2024-06-28 link Mixture of In-Context Experts Enhance LLMs' Long Context Awareness Hongzhan Lin, Ang Lv,..., Rui Yan
6 2024-06-13 link 4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities Roman Bachmann, Oğuzhan Fatih Kar,..., Amir Zamir
6 2024-02-05 link Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models Zhiyuan Hu, Chumin Liu,..., Bryan Hooi
6 2024-05-30 link Transfer Q Star: Principled Decoding for LLM Alignment Souradip Chakraborty, Soumya Suvra Ghosal,..., Furong Huang
6 2024-05-24 link Understanding the differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks Jerome Sieber, Carmen Amo Alonso,..., Antonio Orvieto
6 2024-09-01 link ContextCite: Attributing Model Generation to Context Benjamin Cohen-Wang, Harshay Shah,..., Aleksander Madry
6 2024-08-07 link Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks Zaijing Li, Yuquan Xie,..., Liqiang Nie
6 2024-05-24 link ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users Guanlin Li, Kangjie Chen,..., Tianwei Zhang
6 2024-06-14 link Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections Jiacong Xu, Yiqun Mei, Vishal M. Patel
6 2024-04-30 link HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning Chunlin Tian, Zhan Shi,..., Cheng-zhong Xu
6 2024-05-07 link Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics Hanlin Zhu, Baihe Huang,..., Stuart Russell
6 2024-06-04 link OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding Yanmin Wu, Jiarui Meng,..., Jian Zhang
6 2024-06-10 link How Far Can Transformers Reason? The Globality Barrier and Inductive Scratchpad Emmanuel Abbe, Samy Bengio,..., Omid Saremi
6 2024-06-12 link The Impact of Initialization on LoRA Finetuning Dynamics Soufiane Hayou, Nikhil Ghosh, Bin Yu
6 2024-08-27 link The Mamba in the Llama: Distilling and Accelerating Hybrid Models Junxiong Wang, Daniele Paliotta,..., Tri Dao
6 2024-05-27 link AutoPSV: Automated Process-Supervised Verifier Jianqiao Lu, Zhiyang Dou,..., Zhijiang Guo
6 2024-05-31 link Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling Jiatao Gu, Ying Shen,..., Joshua M. Susskind
6 2024-05-25 link PTQ4DiT: Post-training Quantization for Diffusion Transformers Junyi Wu, Haoxuan Wang,..., Yan Yan
6 2024-01-27 link DiffuserLite: Towards Real-time Diffusion Planning Zibin Dong, Jianye HAO,..., YAN ZHENG
6 2024-04-22 link SOFTS: Efficient Multivariate Time Series Forecasting with Series-Core Fusion Lu Han, Xu-Yang Chen,..., De-Chuan Zhan
6 2024-06-13 link Understanding Hallucinations in Diffusion Models through Mode Interpolation Sumukh K Aithal, Pratyush Maini,..., J Zico Kolter
6 2024-02-09 link Learn To be Efficient: Build Structured Sparsity in Large Language Models Haizhong Zheng, Xiaoyan Bai,..., Atul Prakash
6 2024-07-14 link What Makes and Breaks Safety Fine-tuning? A Mechanistic Study Samyak Jain, Ekdeep Singh Lubana,..., Puneet K. Dokania
6 2024-05-30 link CV-VAE: A Compatible Video VAE for Latent Generative Video Models Sijie Zhao, Yong Zhang,..., Ying Shan
6 2024-05-29 link Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors Zihui Wu, Yu Sun,..., Katherine Bouman
6 2024-05-24 link Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time Complexity Haoxuan Chen, Yinuo Ren,..., Grant M. Rotskoff
6 2024-06-05 link Reparameterization invariance in approximate Bayesian inference Hrittik Roy, Marco Miani,..., Søren Hauberg
6 2024-06-12 link Large Language Models Must Be Taught to Know What They Don't Know Sanyam Kapoor, Nate Gruver,..., Andrew Gordon Wilson
6 2024-06-02 link Evidence of Learned Look-Ahead in a Chess-Playing Neural Network Erik Jenner, Shreyas Kapur,..., Stuart Russell
6 2024-05-28 link Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization Yuanpu Cao, Tianrong Zhang,..., Jinghui Chen
6 2023-05-30 link Geometry-aware training of factorized layers in tensor Tucker format Emanuele Zangrando, Steffen Schotthöfer,..., Francesco Tudisco
6 2024-01-08 link Attack-Resilient Image Watermarking Using Stable Diffusion Lijun Zhang, Xiao Liu,..., Hui Guan
6 2024-05-24 link VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks Yang Li, Shaobo Han, Shihao Ji
6 2024-06-14 link UniAudio 1.5: Large Language Model-driven Audio Codec is A Few-shot Audio Task Learner Dongchao Yang, Haohan Guo,..., Helen M. Meng
6 2024-06-12 link Self-Consuming Generative Models with Curated Data Provably Optimize Human Preferences Damien Ferbach, Quentin Bertrand,..., Gauthier Gidel
6 2023-05-29 link Approximation Rate of the Transformer Architecture for Sequence Modeling Haotian Jiang, Qianxiao Li
6 2023-07-15 link RegExplainer: Generating Explanations for Graph Neural Networks in Regression Task Jiaxing Zhang, Zhuomin Chen,..., Hua Wei
6 2024-03-18 link Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation Wangbo Zhao, Jiasheng Tang,..., Yang You
6 2024-06-05 link Dynamic 3D Gaussian Fields for Urban Areas Tobias Fischer, Jonas Kulhanek,..., Peter Kontschieder
6 2024-05-21 link Single Image Unlearning: Efficient Machine Unlearning in Multimodal Large Language Models Jiaqi Li, Qianshan Wei,..., Fan Liu
6 2024-04-05 link Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models Sangwon Jang, Jaehyeong Jo,..., Sung Ju Hwang
6 2024-05-28 link Knowledge Circuits in Pretrained Transformers Yunzhi Yao, Ningyu Zhang,..., Huajun Chen
6 2024-05-27 link LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling Yaohua Zha, Naiqi Li,..., Shu-Tao Xia
6 2024-04-17 link On the Scalability of GNNs for Molecular Graphs Maciej Sypetkowski, Frederik Wenkel,..., Dominique Beaini
6 2024-05-27 link DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion Models Hengkang Wang, Xu Zhang,..., Ju Sun
6 2024-06-12 link Vivid-ZOO: Multi-View Video Generation with Diffusion Model Bing Li, Cheng Zheng,..., Bernard Ghanem
6 2024-09-04 link Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing Siyi Chen, Huijie Zhang,..., Qing Qu
6 2023-11-01 link Learning Cooperative Trajectory Representations for Motion Forecasting Hongzhi Ruan, Haibao Yu,..., Zaiqing Nie
6 2024-05-23 link Fisher Flow Matching for Generative Modeling over Discrete Data Oscar Davis, Samuel Kessler,..., Joey Bose
6 2024-04-05 link Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2) Michael Saxon, Fatima Jahara,..., William Yang Wang
6 2023-05-26 link Set-based Neural Network Encoding Without Weight Tying Bruno Andreis, Bedionita Soro,..., Sung Ju Hwang
6 2024-08-22 link Transformers are Minimax Optimal Nonparametric In-Context Learners Juno Kim, Tai Nakamaki, Taiji Suzuki
6 2024-01-30 link Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language Models Lai Wei, Zhiquan Tan,..., Weiran Huang
6 2024-02-18 link Attractor Memory for Long-Term Time Series Forecasting: A Chaos Perspective Jiaxi Hu, Yuehong HU,..., Yuxuan Liang
6 2023-12-09 link Consistency Models for Scalable and Fast Simulation-Based Inference Marvin Schmitt, Valentin Pratz,..., Stefan T. Radev
6 2024-02-22 link A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health Nikhil Behari, Edwin Zhang,..., Milind Tambe
6 2024-02-02 link Segment Any Change Zhuo Zheng, Yanfei Zhong,..., Stefano Ermon
6 2024-06-10 link AutoSurvey: Large Language Models Can Automatically Write Surveys Yidong Wang, Qi Guo,..., Yue Zhang
5 2024-01-02 link PAC-Bayes-Chernoff bounds for unbounded losses Ioar Casado, Luis A. Ortega,..., Andres R Masegosa
5 2024-06-03 link What makes unlearning hard and what to do about it Kairan Zhao, Meghdad Kurmanji,..., Peter Triantafillou
5 2024-06-06 link DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs Lingchen Meng, Jianwei Yang,..., Yu-Gang Jiang
5 2024-06-13 link On Softmax Direct Preference Optimization for Recommendation Yuxin Chen, Junfei Tan,..., Tat-Seng Chua
5 2024-07-09 link End-To-End Causal Effect Estimation from Unstructured Natural Language Data Nikita Dhawan, Leonardo Cotta,..., Chris J. Maddison
5 2024-05-23 link Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy Shengfang Zhai, Huanran Chen,..., Yang Liu
5 2024-05-27 link MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities Hao Dong, Yue Zhao,..., Olga Fink
5 2024-05-29 link Stress-Testing Capability Elicitation With Password-Locked Models Ryan Greenblatt, Fabien Roger,..., David Krueger
5 2024-06-11 link Zero-shot Image Editing with Reference Imitation Xi Chen, Yutong Feng,..., Hengshuang Zhao
5 2024-05-26 link Categorical Flow Matching on Statistical Manifolds Chaoran Cheng, Jiahan Li,..., Ge Liu
5 2024-05-14 link Energy-based Hopfield Boosting for Out-of-Distribution Detection Claus Hofmann, Simon Lucas Schmid,..., Sepp Hochreiter
5 2024-06-27 link Length Optimization in Conformal Prediction Shayan Kiyani, George J. Pappas, Hamed Hassani
5 2024-05-27 link ARC: A Generalist Graph Anomaly Detector with In-Context Learning Yixin Liu, Shiyuan Li,..., Shirui Pan
5 2024-05-31 link LLM-ESR: Large Language Models Enhancement for Long-tailed Sequential Recommendation Qidong Liu, Xian Wu,..., Xiangyu Zhao
5 2024-02-27 link Zeroth-Order Sampling Methods for Non-Log-Concave Distributions: Alleviating Metastability by Denoising Diffusion Ye He, Kevin Rojas, Molei Tao
5 2023-07-03 link Understanding the Transferability of Representations via Task-Relatedness Akshay Mehra, Yunbei Zhang, Jihun Hamm
5 2023-10-21 link Masked Hard-Attention Transformers Recognize Exactly the Star-Free Languages Andy Yang, David Chiang, Dana Angluin
5 2024-05-21 link LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language James Requeima, John F Bronskill,..., David Duvenaud
5 2024-06-11 link Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance Kuan Heng Lin, Sicheng Mo,..., Bolei Zhou
5 2023-07-05 link Convolutions and More as Einsum: A Tensor Network Perspective with Advances for Second-Order Methods Felix Dangel
5 2024-05-04 link U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers Yuchuan Tian, Zhijun Tu,..., Yunhe Wang
5 2024-06-21 link Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning Brandon Huang, Chancharik Mitra,..., Roei Herzig
5 2024-05-22 link Context and Geometry Aware Voxel Transformer for Semantic Scene Completion Zhu Yu, Runmin Zhang,..., Hui-liang Shen
5 2024-04-21 link Adversarial Representation Engineering: A General Model Editing Framework for Large Language Models Yihao Zhang, Zeming Wei,..., Meng Sun
5 2024-10-22 link One-Step Diffusion Distillation through Score Implicit Matching Weijian Luo, Zemin Huang,..., Guo-Jun Qi
5 2024-06-17 link Large Scale Transfer Learning for Tabular Data via Language Modeling Joshua P Gardner, Juan Carlos Perdomo, Ludwig Schmidt
5 2024-03-28 link Dual-Personalizing Adapter for Federated Foundation Models Yiyuan Yang, Guodong Long,..., Michael Blumenstein
5 2024-05-22 link Spectral Adapter: Fine-Tuning in Spectral Space Fangzhao Zhang, Mert Pilanci
5 2024-05-22 link Dense Connector for MLLMs Huanjin Yao, Wenhao Wu,..., Jingdong Wang
5 2024-10-18 link Neural Signed Distance Function Inference through Splatting 3D Gaussians Pulled on Zero-Level Set Wenyuan Zhang, Yu-Shen Liu, Zhizhong Han
5 2024-10-24 link Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View Synthesis Liang Han, Junsheng Zhou,..., Zhizhong Han
5 2024-05-24 link MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence Ionut-Vlad Modoranu, Mher Safaryan,..., Dan Alistarh
5 2024-02-04 link AutoTimes: Autoregressive Time Series Forecasters via Large Language Models Yong Liu, Guo Qin,..., Mingsheng Long
5 2024-02-04 link DenseFormer: Enhancing Information Flow in Transformers via Depth Weighted Averaging Matteo Pagliardini, Amirkeivan Mohtashami,..., Martin Jaggi
5 2024-06-06 link BitsFusion: 1.99 bits Weight Quantization of Diffusion Model Yang Sui, Yanyu Li,..., Jian Ren
5 2023-11-26 link A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation Heyang Zhao, Jiafan He, Quanquan Gu
5 2024-05-27 link DC-Gaussian: Improving 3D Gaussian Splatting for Reflective Dash Cam Videos Linhan Wang, Kai Cheng,..., Chang-Tien Lu
5 2024-06-17 link Unveiling Encoder-Free Vision-Language Models Haiwen Diao, Yufeng Cui,..., Xinlong Wang
5 2024-05-22 link RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging Radar Fangqiang Ding, Xiangyu Wen,..., Chris Xiaoxuan Lu
5 2024-05-24 link Medformer: A Multi-Granularity Patching Transformer for Medical Time-Series Classification Yihe Wang, Nan Huang,..., Xiang Zhang
5 2024-05-29 link Matryoshka Query Transformer for Large Vision-Language Models Wenbo Hu, Zi-Yi Dou,..., Kai-Wei Chang
5 2024-06-13 link Talking Heads: Understanding Inter-layer Communication in Transformer Language Models Jack Merullo, Carsten Eickhoff, Ellie Pavlick
5 2023-08-22 link Enhancing Graph Transformers with Hierarchical Distance Structural Encoding Yuankai Luo, Hongkang Li,..., Xiao-Ming Wu
5 2024-02-21 link Full-Atom Peptide Design with Geometric Latent Diffusion Xiangzhe Kong, Yinjun Jia,..., Yang Liu
5 2024-06-07 link Probabilistic Weather Forecasting with Hierarchical Graph Neural Networks Joel Oskarsson, Tomas Landelius,..., Fredrik Lindsten
5 2024-03-06 link Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference Benjamin Eysenbach, Vivek Myers,..., Sergey Levine
5 2024-02-16 link Conformalized Credal Set Predictors Alireza Javanmardi, David Stutz, Eyke Hüllermeier
5 2024-06-20 link Transferable Boltzmann Generators Leon Klein, Frank Noe
5 2024-05-18 link Automated Multi-level Preference for MLLMs Mengxi Zhang, Wenhao Wu,..., Yifan Sun
5 2024-02-16 link Provably Safe Neural Network Controllers via Differential Dynamic Logic Samuel Teuber, Stefan Mitsch, Andre Platzer
5 2024-05-28 link Towards a theory of how the structure of language is acquired by deep neural networks Francesco Cagnetta, Matthieu Wyart
5 2024-05-23 link Video Diffusion Models are Training-free Motion Interpreter and Controller Zeqi Xiao, Yifan Zhou,..., Xingang Pan
5 2024-05-28 link FASTopic: Pretrained Transformer is a Fast, Adaptive, Stable, and Transferable Topic Model Xiaobao Wu, Thong Thanh Nguyen,..., Anh Tuan Luu
5 2024-07-09 link Scaling Retrieval-Based Language Models with a Trillion-Token Datastore Rulin Shao, Jacqueline He,..., Pang Wei Koh
5 2024-05-24 link GS-Hider: Hiding Messages into 3D Gaussian Splatting Xuanyu Zhang, Jiarui Meng,..., Jian Zhang
5 2024-06-01 link Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching Yongqi Wang, Wenxiang Guo,..., Zhou Zhao
5 2024-06-09 link Training Compute-Optimal Protein Language Models Xingyi Cheng, Bo Chen,..., Le Song
5 2024-05-23 link PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression Vladimir Malinovskii, Denis Mazur,..., Peter Richtárik
5 2024-05-23 link Scalable Optimization in the Modular Norm Tim Large, Yang Liu,..., Jeremy Bernstein
5 2024-06-10 link Get rich quick: exact solutions reveal how unbalanced initializations promote rapid feature learning Daniel Kunin, Allan Raventos,..., Surya Ganguli
5 2024-05-27 link BehaviorGPT: Smart Agent Simulation for Autonomous Driving with Next-Patch Prediction Zikang Zhou, Haibo HU,..., Chun Jason Xue
5 2023-10-11 link Non-asymptotic Approximation Error Bounds of Parameterized Quantum Circuits Zhan Yu, Qiuhao Chen,..., Jerry Zhijian Yang
5 2024-05-29 link Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare Hanwei Zhu, Haoning Wu,..., Shiqi Wang
5 2024-06-12 link A Concept-Based Explainability Framework for Large Multimodal Models Jayneel Parekh, Pegah KHAYATAN,..., Matthieu Cord
5 2024-06-12 link Scaling Laws in Linear Regression: Compute, Parameters, and Data Licong Lin, Jingfeng Wu,..., Jason D. Lee
5 2024-06-17 link Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning Zebang Cheng, Zhi-Qi Cheng,..., Alexander G Hauptmann
5 2024-09-14 link Schrödinger Bridge Flow for Unpaired Data Translation Valentin De Bortoli, Iryna Korshunova,..., Arnaud Doucet
5 2024-06-25 link DiffusionPDE: Generative PDE-Solving Under Partial Observation Jiahe Huang, Guandao Yang,..., Jeong Joon Park
5 2024-05-28 link Exploiting LLM Quantization Kazuki Egashira, Mark Vero,..., Martin Vechev
5 2024-05-23 link Axioms for AI Alignment from Human Feedback Luise Ge, Daniel Halpern,..., Junlin Wu
5 2024-06-05 link A Geometric View of Data Complexity: Efficient Local Intrinsic Dimension Estimation with Diffusion Models Hamidreza Kamkari, Brendan Leigh Ross,..., Gabriel Loaiza-Ganem
5 2024-05-29 link On the Role of Attention Masks and LayerNorm in Transformers Xinyi Wu, Amir Ajorlou,..., Ali Jadbabaie
5 2024-02-01 link Understanding the Expressive Power and Mechanisms of Transformer for Sequence Modeling Mingze Wang, Weinan E
5 2024-09-29 link One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos Zechen Bai, Tong He,..., Mike Zheng Shou
5 2023-10-07 link Test-Time Adaptation Induces Stronger Accuracy and Agreement-on-the-Line Eungyeup Kim, Mingjie Sun,..., J Zico Kolter
5 2024-05-27 link GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping Junyoung Seo, Kazumi Fukuda,..., Yuki Mitsufuji
5 2024-06-24 link Confidence Regulation Neurons in Language Models Alessandro Stolfo, Ben Peng Wu,..., Neel Nanda
5 2024-05-09 link A Universal Growth Rate for Learning with Smooth Surrogate Losses Anqi Mao, Mehryar Mohri, Yutao Zhong
5 2024-06-17 link AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning Shirley Wu, Shiyu Zhao,..., James Zou
4 2024-06-11 link Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees Sijia Chen, Yibo Wang,..., Lijun Zhang
4 2024-05-20 link Images that Sound: Composing Images and Sounds on a Single Canvas Ziyang Chen, Daniel Geng, Andrew Owens
4 2024-03-28 link CLAP4CLIP: Continual Learning with Probabilistic Finetuning for Vision-Language Models Saurav Jha, Dong Gong, Lina Yao
4 2024-03-19 link Optimal Flow Matching: Learning Straight Trajectories in Just One Step Nikita Maksimovich Kornilov, Petr Mokrov,..., Alexander Korotin
4 2024-06-14 link Neural Concept Binder Wolfgang Stammer, Antonia Wüst,..., Kristian Kersting
4 2024-06-04 link Bileve: Securing Text Provenance in Large Language Models Against Spoofing with Bi-level Signature Tong Zhou, Xuandong Zhao,..., Shaolei Ren
4 2024-07-25 link RestoreAgent: Autonomous Image Restoration Agent via Multimodal Large Language Models Haoyu Chen, Wenbo Li,..., Lei Zhu
4 2024-02-12 link PANORAMIA: Privacy Auditing of Machine Learning Models without Retraining Mishaal Kazmi, Hadrien Lautraite,..., Mathias Lécuyer
4 2024-05-21 link Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum Hadi Pouransari, Chun-Liang Li,..., Oncel Tuzel
4 2024-06-24 link Inferring stochastic low-rank recurrent neural networks from neural data Matthijs Pals, A Erdem Sağtekin,..., Jakob H. Macke
4 2024-05-22 link ReVideo: Remake a Video with Motion and Content Control Chong Mou, Mingdeng Cao,..., Jian Zhang
4 2024-06-11 link Neural Gaffer: Relighting Any Object via Diffusion Haian Jin, Yuan Li,..., Noah Snavely
4 2024-08-02 link Mission Impossible: A Statistical Perspective on Jailbreaking LLMs Jingtong Su, Julia Kempe, Karen Ullrich
4 2024-06-09 link Distributional Preference Alignment of LLMs via Optimal Transport Igor Melnyk, Youssef Mroueh,..., Jarret Ross
4 2024-05-28 link Improved Generation of Adversarial Examples Against Safety-aligned LLMs Qizhang Li, Yiwen Guo,..., Hao Chen
4 2024-06-11 link MambaLRP: Explaining Selective State Space Sequence Models Farnoush Rezaei Jafari, Grégoire Montavon,..., Oliver Eberle
4 2024-06-03 link SemCoder: Training Code Language Models with Comprehensive Semantics Yangruibo Ding, Jinjun Peng,..., Baishakhi Ray
4 2024-06-12 link Discovering Preference Optimization Algorithms with and for Large Language Models Chris Lu, Samuel Holt,..., Robert Tjarko Lange
4 2024-06-05 link HYDRA: Model Factorization Framework for Black-Box LLM Personalization Yuchen Zhuang, Haotian Sun,..., Bo Dai
4 2024-04-23 link SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-Tuning Yexiao He, Ziyao Wang,..., Ang Li
4 2024-05-25 link Pessimistic Backward Policy for GFlowNets Hyosoon Jang, Yunhui Jang,..., Sungsoo Ahn
4 2024-01-21 link Language Models as Hierarchy Encoders Yuan He, Moy Yuan,..., Ian Horrocks
4 2024-03-18 link Divide-and-Conquer Posterior Sampling for Denoising Diffusion Priors Yazid Janati, Badr MOUFAD,..., Jimmy Olsson
4 2024-08-30 link Can We Leave Deepfake Data Behind in Training Deepfake Detector? Jikang Cheng, Zhiyuan Yan,..., Chen Li
4 2024-05-13 link Zero-Shot Tokenizer Transfer Benjamin Minixhofer, Edoardo Ponti, Ivan Vulić
4 2024-05-23 link Large Language Models-guided Dynamic Adaptation for Temporal Knowledge Graph Reasoning Jiapu Wang, Kai Sun,..., Baocai Yin
4 2024-06-29 link UDC: A Unified Neural Divide-and-Conquer Framework for Large-Scale Combinatorial Optimization Problems Zhi Zheng, Changliang Zhou,..., Zhenkun Wang
4 2024-03-14 link Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning Zhishuai Liu, Pan Xu
4 2024-05-22 link The Power of Extrapolation in Federated Learning Hanmin Li, Kirill Acharya, Peter Richtárik
4 2024-02-29 link UniTS: A Unified Multi-Task Time Series Model Shanghua Gao, Teddy Koker,..., Marinka Zitnik
4 2024-05-23 link Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling Shuaipeng Li, Penghao Zhao,..., Di Wang
4 2024-02-01 link Credal Learning Theory Michele Caprio, Maryam Sultana,..., Fabio Cuzzolin
4 2024-03-30 link Communication Efficient Distributed Training with Distributed Lion Bo Liu, Lemeng Wu,..., qiang liu
4 2024-05-08 link Initialization is Critical to Whether Transformers Fit Composite Functions by Inference or Memorizing Zhongwang Zhang, Pengxiao Lin,..., Zhi-Qin John Xu
4 2024-06-12 link Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning Jaehyun Nam, Kyuyoung Kim,..., Jinwoo Shin
4 2024-02-29 link Learning Commonality, Divergence and Variety for Unsupervised Visible-Infrared Person Re-identification Jiangming Shi, Xiangbo Yin,..., Yanyun Qu
4 2024-09-11 link NVRC: Neural Video Representation Compression Ho Man Kwan, Ge Gao,..., David Bull
4 2024-02-26 link Graph Diffusion Policy Optimization Yijing Liu, Chao Du,..., Wei Chen
4 2024-07-05 link Better by Default: Strong Pre-Tuned MLPs and Boosted Trees on Tabular Data David Holzmüller, Leo Grinsztajn, Ingo Steinwart
4 2024-07-25 link Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models Sanae Lotfi, Yilun Kuang,..., Andrew Gordon Wilson
4 2024-05-24 link Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization Beitao Chen, Xinyu Lyu,..., Jingkuan Song
4 2024-03-06 link Directional Smoothness and Gradient Methods: Convergence and Adaptivity Aaron Mishkin, Ahmed Khaled,..., Robert M. Gower
4 2024-03-20 link Bridge the Modality and Capability Gaps in Vision-Language Model Selection Chao Yi, Yuhang He,..., Han-Jia Ye
4 2024-06-17 link Probing the Decision Boundaries of In-context Learning in Large Language Models Siyan Zhao, Tung Nguyen, Aditya Grover
4 2024-02-07 link Improved off-policy training of diffusion samplers Marcin Sendera, Minsu Kim,..., Nikolay Malkin
4 2024-06-13 link Is Value Learning Really the Main Bottleneck in Offline RL? Seohong Park, Kevin Frans,..., Aviral Kumar
4 2023-12-08 link HuRef: HUman-REadable Fingerprint for Large Language Models Boyi Zeng, Lizheng Wang,..., Zhouhan Lin
4 2024-06-04 link Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models Dominik Hintersdorf, Lukas Struppek,..., Franziska Boenisch
4 2024-02-04 link Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning Lanqing Li, Hai Zhang,..., Pheng-Ann Heng
4 2024-02-25 link No Free Lunch in LLM Watermarking: Trade-offs in Watermarking Design Choices Qi Pang, Shengyuan Hu,..., Virginia Smith
4 2023-03-16 link Addressing bias in online selection with limited budget of comparisons Ziyad Benomar, Evgenii Chzhen,..., Vianney Perchet
4 2023-11-19 link Large Pre-trained time series models for cross-domain Time series analysis tasks Harshavardhan Kamarthi, B. Aditya Prakash
4 2024-05-24 link Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training Wenyu Du, Tongxu Luo,..., Jie Fu
4 2024-02-06 link AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies Xixi Hu, qiang liu,..., Bo Liu
4 2023-12-03 link G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training Che Liu, Cheng Ouyang,..., Rossella Arcucci
4 2024-08-19 link NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction Yifan Wang, Di Huang,..., Tong He
4 2024-05-02 link In-and-Out: Algorithmic Diffusion for Sampling Convex Bodies Yunbum Kook, Santosh Vempala, Matthew Shunshi Zhang
4 2024-07-01 link Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization Siyi Gu, Minkai Xu,..., Stefano Ermon
4 2024-05-25 link Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control Michal Nauman, Mateusz Ostaszewski,..., Marek Cygan
4 2024-06-04 link Loki: Low-Rank Keys for Efficient Sparse Attention Prajwal Singhania, Siddharth Singh,..., Abhinav Bhatele
4 2024-05-23 link Unveiling the Tapestry of Consistency in Large Vision-Language Models Yuan Zhang, Fei xiao,..., Haoyuan Guo
4 2024-01-19 link Neglected Hessian component explains mysteries in Sharpness regularization Yann Dauphin, Atish Agarwala, Hossein Mobahi
4 2024-06-03 link The Importance of Online Data: Understanding Preference Fine-tuning via Coverage Yuda Song, Gokul Swamy,..., Wen Sun
4 2024-06-13 link Interpreting the Weight Space of Customized Diffusion Models Amil Dravid, Yossi Gandelsman,..., Kfir Aberman
4 2024-02-12 link Policy Improvement using Language Feedback Models Victor Zhong, Dipendra Misra,..., Marc-Alexandre Côté
4 2024-06-06 link PaCE: Parsimonious Concept Engineering for Large Language Models Jinqi Luo, Tianjiao Ding,..., Rene Vidal
4 2024-06-27 link OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents Zihao Wang, Shaofei Cai,..., Yitao Liang
4 2024-07-26 link SLIM: Style-Linguistics Mismatch Model for Generalized Audio Deepfake Detection Yi Zhu, Surya Koppisetti,..., Gaurav Bharaj
4 2024-06-24 link Finding Transformer Circuits with Edge Pruning Adithya Bhaskar, Alexander Wettig,..., Danqi Chen
4 2024-09-26 link Generative Modeling of Molecular Dynamics Trajectories Bowen Jing, Hannes Stark,..., Bonnie Berger
4 2024-04-25 link PhyRecon: Physically Plausible Neural Scene Reconstruction Junfeng Ni, Yixin Chen,..., Siyuan Huang
4 2024-06-22 link Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model Min Zhao, Hongzhou Zhu,..., Jun Zhu
4 2024-07-12 link GAVEL: Generating Games Via Evolution and Language Models Graham Todd, Alexander George Padula,..., Julian Togelius
4 2024-04-20 link GRANOLA: Adaptive Normalization for Graph Neural Networks Moshe Eliasof, Beatrice Bevilacqua,..., Haggai Maron
4 2024-05-19 link FIFO-Diffusion: Generating Infinite Videos from Text without Training Jihwan Kim, Junoh Kang,..., Bohyung Han
4 2024-02-06 link Discovery of the Hidden World with Large Language Models Chenxi Liu, Yongqiang Chen,..., Kun Zhang
4 2024-05-29 link A Full-duplex Speech Dialogue Scheme Based On Large Language Models Peng Wang, Songshuo Lu,..., Yuanjun Xiong
4 2024-10-10 link Global Lyapunov functions: a long-standing open problem in mathematics, with symbolic transformers Alberto Alfarano, Francois Charton, Amaury Hayat
4 2024-07-01 link Evaluation of Text-to-Video Generation Models: A Dynamics Perspective Mingxiang Liao, Hannan Lu,..., Xinyu Zhang
4 2024-10-03 link Parameter Competition Balancing for Model Merging Guodong DU, Junlin Lee,..., Min Zhang
4 2024-03-02 link NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention Tianyi Zhang, Jonah Wonkyu Yi,..., Anshumali Shrivastava
4 2024-06-12 link DiTFastAttn: Attention Compression for Diffusion Transformer Models Zhihang Yuan, Hanling Zhang,..., Yu Wang
4 2024-02-06 link On Convergence of Adam for Stochastic Optimization under Relaxed Assumptions Yusu Hong, Junhong Lin
4 2024-05-27 link Entity Alignment with Noisy Annotations from Large Language Models Shengyuan Chen, Qinggang Zhang,..., Xiao Huang
4 2024-04-05 link Dynamic Conditional Optimal Transport through Simulation-Free Flows Gavin Kerrigan, Giosue Migliorini, Padhraic Smyth
4 2024-05-23 link D-MiSo: Editing Dynamic 3D Scenes using Multi-Gaussians Soup Joanna Waczynska, Piotr Borycki,..., Przemysław Spurek
4 2024-06-06 link Understanding Information Storage and Transfer in Multi-modal Large Language Models Samyadeep Basu, Martin Grayson,..., Daniela Massiceti
4 2024-08-28 link Efficient LLM Scheduling by Learning to Rank Yichao Fu, Siqi Zhu,..., Hao Zhang
4 2024-06-10 link IllumiNeRF: 3D Relighting without Inverse Rendering Xiaoming Zhao, Pratul P. Srinivasan,..., Philipp Henzler
4 2024-03-25 link MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models Kailai Yang, Zhiwei Liu,..., Sophia Ananiadou
4 2024-07-08 link On the Complexity of Learning Sparse Functions with Statistical and Gradient Queries Nirmit Joshi, Theodor Misiakiewicz, Nathan Srebro
4 2024-06-23 link LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control Delin Qu, Qizhi Chen,..., Xuelong Li
4 2024-05-28 link Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment Xin Xiao, Bohong Wu,..., Haoyuan Guo
4 2024-02-22 link Watermarking Makes Language Models Radioactive Tom Sander, Pierre Fernandez,..., Teddy Furon
4 2024-07-29 link Mixture of Nested Experts: Adaptive Processing of Visual Tokens Gagan Jain, Nidhi Hegde,..., Sujoy Paul