Last updated: 2025-04-16 04:12:00. Maintained by Weisen Jiang.

citation publish date title (pdf) review authors
1942 2023-07-04 SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis link Dustin Podell, Zion English,..., Robin Rombach
1820 2023-04-20 MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models link Deyao Zhu, Jun Chen,..., Mohamed Elhoseiny
1032 2023-07-17 FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning link Tri Dao
771 2023-05-31 Let's Verify Step by Step link Hunter Lightman, Vineet Kosaraju,..., Karl Cobbe
726 2023-07-10 AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific
Tuning
link Yuwei Guo, Ceyuan Yang,..., Bo Dai
602 2023-04-11 Teaching Large Language Models to Self-Debug link Xinyun Chen, Maxwell Lin,..., Denny Zhou
588 2023-06-14 WizardCoder: Empowering Code Large Language Models with Evol-Instruct link Ziyang Luo, Can Xu,..., Daxin Jiang
564 2023-07-31 ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world
APIs
link Yujia Qin, Shihao Liang,..., Maosong Sun
559 2023-10-17 Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection link Akari Asai, Zeqiu Wu,..., Hannaneh Hajishirzi
553 2023-09-28 DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation link Jiaxiang Tang, Jiawei Ren,..., Gang Zeng
553 2023-08-31 MVDream: Multi-view Diffusion for 3D Generation link Yichun Shi, Peng Wang,..., Xiao Yang
489 2023-10-05 Fine-tuning Aligned Language Models Compromises Safety, Even When Users
Do Not Intend To!
link Xiangyu Qi, Yi Zeng,..., Peter Henderson
458 2023-10-03 MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual
Contexts
link Pan Lu, Hritik Bansal,..., Jianfeng Gao
398 2023-08-14 ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate link Chi-Min Chan, Weize Chen,..., Zhiyuan Liu
388 2023-10-10 SWE-bench: Can Language Models Resolve Real-world Github Issues? link Carlos E Jimenez, John Yang,..., Karthik R Narasimhan
385 2023-09-07 SyncDreamer: Generating Multiview-consistent Images from a Single-view Image link Yuan Liu, Cheng Lin,..., Wenping Wang
379 2023-10-03 Large Language Models Cannot Self-Correct Reasoning Yet link Jie Huang, Xinyun Chen,..., Denny Zhou
372 2023-11-08 LRM: Large Reconstruction Model for Single Image to 3D link Yicong Hong, Kai Zhang,..., Hao Tan
370 2023-10-10 iTransformer: Inverted Transformers Are Effective for Time Series Forecasting link Yong Liu, Tengge Hu,..., Mingsheng Long
347 2023-07-25 WebArena: A Realistic Web Environment for Building Autonomous Agents link Shuyan Zhou, Frank F. Xu,..., Graham Neubig
343 2023-09-07 Large Language Models as Optimizers link Chengrun Yang, Xuezhi Wang,..., Xinyun Chen
342 2023-09-11 MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning link Xiang Yue, Xingwei Qu,..., Wenhu Chen
337 2023-05-19 CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing link Zhibin Gou, Zhihong Shao,..., Weizhu Chen
336 2023-06-30 Magic123: One Image to High-Quality 3D Object Generation Using
Both 2D and 3D Diffusion Priors
link Guocheng Qian, Jinjie Mai,..., Bernard Ghanem
332 2023-06-22 Can LLMs Express Their Uncertainty? An Empirical Evaluation of
Confidence Elicitation in LLMs
link Miao Xiong, Zhiyuan Hu,..., Bryan Hooi
317 2023-06-20 A Simple and Effective Pruning Approach for Large Language
Models
link Mingjie Sun, Zhuang Liu,..., J Zico Kolter
313 2023-10-03 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models link Ming Jin, Shiyu Wang,..., Qingsong Wen
300 2023-09-21 MetaMath: Bootstrap Your Own Mathematical Questions for Large Language
Models
link Longhui Yu, Weisen Jiang,..., Weiyang Liu
293 2023-09-15 Sparse Autoencoders Find Highly Interpretable Features in Language Models link Robert Huben, Hoagy Cunningham,..., Lee Sharkey
284 2023-10-12 Ferret: Refer and Ground Anything Anywhere at Any Granularity link Haoxuan You, Haotian Zhang,..., Yinfei Yang
283 2023-09-28 Vision Transformers Need Registers link Timothée Darcet, Maxime Oquab,..., Piotr Bojanowski
282 2023-05-22 Training Diffusion Models with Reinforcement Learning link Kevin Black, Michael Janner,..., Sergey Levine
274 2023-10-17 Quantifying Language Models' Sensitivity to Spurious Features in Prompt
Design or: How I learned to start worrying about prompt formatting
link Melanie Sclar, Yejin Choi,..., Alane Suhr
268 2023-10-19 Safe RLHF: Safe Reinforcement Learning from Human Feedback link Josef Dai, Xuehai Pan,..., Yaodong Yang
260 2023-10-09 Language Model Beats Diffusion - Tokenizer is key to
visual generation
link Lijun Yu, Jose Lezama,..., Lu Jiang
258 2023-10-19 Eureka: Human-Level Reward Design via Coding Large Language Models link Yecheng Jason Ma, William Liang,..., Anima Anandkumar
256 2023-10-11 Catastrophic Jailbreak of Open-source LLMs via Exploiting Generation link Yangsibo Huang, Samyak Gupta,..., Danqi Chen
252 2023-10-16 Llemma: An Open Language Model for Mathematics link Zhangir Azerbayev, Hailey Schoelkopf,..., Sean Welleck
242 2023-08-07 AgentBench: Evaluating LLMs as Agents link Xiao Liu, Hao Yu,..., Jie Tang
241 2023-10-10 Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning link Mengzhou Xia, Tianyu Gao,..., Danqi Chen
234 2023-10-03 AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language
Models
link Xiaogeng Liu, Nan Xu,..., Chaowei Xiao
233 2023-11-10 Instant3D: Fast Text-to-3D with Sparse-view Generation and Large Reconstruction
Model
link Jiahao Li, Hao Tan,..., Sai Bi
231 2023-07-19 TokenFlow: Consistent Diffusion Features for Consistent Video Editing link Michal Geyer, Omer Bar-Tal,..., Tali Dekel
230 2023-10-16 Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D
Gaussian Splatting
link Zeyu Yang, Hongye Yang,..., Li Zhang
229 2023-07-13 InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and
Generation
link Yi Wang, Yinan He,..., Yu Qiao
224 2023-06-26 Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction
Tuning
link Fuxiao Liu, Kevin Lin,..., Lijuan Wang
219 2023-09-21 The Reversal Curse: LLMs trained on “A is B”
fail to learn “B is A”
link Lukas Berglund, Meg Tong,..., Owain Evans
219 2023-02-14 Universal Guidance for Diffusion Models link Arpit Bansal, Hong-Min Chu,..., Tom Goldstein
217 2023-05-22 ControlVideo: Training-free Controllable Text-to-video Generation link Yabo Zhang, Yuxiang Wei,..., Qi Tian
217 2023-08-12 GPT-4 Is Too Smart To Be Safe: Stealthy Chat
with LLMs via Cipher
link Youliang Yuan, Wenxiang Jiao,..., Zhaopeng Tu
215 2023-09-20 OpenChat: Advancing Open-source Language Models with Mixed-Quality Data link Guan Wang, Sijie Cheng,..., Yang Liu
209 2023-04-18 NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot
Speech and Singing Synthesizers
link Kai Shen, Zeqian Ju,..., Jiang Bian
209 2023-08-31 YaRN: Efficient Context Window Extension of Large Language Models link Bowen Peng, Jeffrey Quesnelle,..., Enrico Shippole
209 2023-06-05 SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression link Tim Dettmers, Ruslan A. Svirschevski,..., Dan Alistarh
207 2023-02-07 Effective Data Augmentation With Diffusion Models link Brandon Trabucco, Kyle Doherty,..., Ruslan Salakhutdinov
203 2023-06-08 PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning
Optimization
link Yidong Wang, Zhuohao Yu,..., Yue Zhang
198 2023-03-02 Human Motion Diffusion as a Generative Prior link Yoni Shafir, Guy Tevet,..., Amit Haim Bermano
197 2023-10-03 Model Tells You What to Discard: Adaptive KV Cache
Compression for LLMs
link Suyu Ge, Yunan Zhang,..., Jianfeng Gao
196 2023-12-25 What Makes Good Data for Alignment? A Comprehensive Study
of Automatic Data Selection in Instruction Tuning
link Wei Liu, Weihao Zeng,..., Junxian He
193 2023-09-13 Statistical Rejection Sampling Improves Preference Optimization link Tianqi Liu, Yao Zhao,..., Jialu Liu
190 2023-09-07 Large Language Models Are Not Robust Multiple Choice Selectors link Chujie Zheng, Hao Zhou,..., Minlie Huang
189 2023-05-04 Personalize Segment Anything Model with One Shot link Renrui Zhang, Zhengkai Jiang,..., Hongsheng Li
187 2023-08-16 Stochastic Controlled Averaging for Federated Learning with Communication Compression link Xinmeng Huang, Ping Li, Xiaoyun Li
185 2023-10-20 SALMONN: Towards Generic Hearing Abilities for Large Language Models link Changli Tang, Wenyi Yu,..., Chao Zhang
185 2023-10-03 LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic
Alignment
link Bin Zhu, Bin Lin,..., Li Yuan
184 2023-10-12 Prometheus: Inducing Fine-Grained Evaluation Capability in Language Models link Seungone Kim, Jamin Shin,..., Minjoon Seo
180 2023-09-12 InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image
Generation
link Xingchao Liu, Xiwen Zhang,..., qiang liu
178 2023-09-15 Connecting Large Language Models with Evolutionary Algorithms Yields Powerful
Prompt Optimizers
link Qingyan Guo, Rui Wang,..., Yujiu Yang
176 2023-07-24 A Real-World WebAgent with Planning, Long Context Understanding, and
Program Synthesis
link Izzeddin Gur, Hiroki Furuta,..., Aleksandra Faust
176 2023-05-26 Large Language Models as Tool Makers link Tianle Cai, Xuezhi Wang,..., Denny Zhou
171 2023-10-02 Reasoning on Graphs: Faithful and Interpretable Large Language Model
Reasoning
link LINHAO LUO, Yuan-Fang Li,..., Shirui Pan
170 2022-08-30 The Alignment Problem from a Deep Learning Perspective link Richard Ngo, Lawrence Chan, Sören Mindermann
168 2023-10-20 Towards Understanding Sycophancy in Language Models link Mrinank Sharma, Meg Tong,..., Ethan Perez
165 2023-09-14 Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large
Language Models that Follow Instructions
link Federico Bianchi, Mirac Suzgun,..., James Zou
163 None WizardLM: Empowering Large Pre-Trained Language Models to Follow Complex
Instructions
link Can Xu, Qingfeng Sun,..., Daxin Jiang
162 2023-08-25 OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models link Wenqi Shao, Mengzhao Chen,..., Ping Luo
161 2023-10-02 Making Retrieval-Augmented Language Models Robust to Irrelevant Context link Ori Yoran, Tomer Wolfson,..., Jonathan Berant
159 2023-09-20 DreamLLM: Synergistic Multimodal Comprehension and Creation link Runpei Dong, Chunrui Han,..., Li Yi
157 2023-11-14 Fine-Tuning Language Models for Factuality link Katherine Tian, Eric Mitchell,..., Chelsea Finn
156 2023-09-21 LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset link Lianmin Zheng, Wei-Lin Chiang,..., Hao Zhang
155 2023-12-04 The Unlocking Spell on Base LLMs: Rethinking Alignment
via In-Context Learning
link Bill Yuchen Lin, Abhilasha Ravichander,..., Yejin Choi
154 2023-10-01 Analyzing and Mitigating Object Hallucination in Large Vision-Language Models link Yiyang Zhou, Chenhang Cui,..., Huaxiu Yao
152 2024-05-02 WildChat: 1M ChatGPT Interaction Logs in the Wild link Wenting Zhao, Xiang Ren,..., Yuntian Deng
150 2023-10-25 Detecting Pretraining Data from Large Language Models link Weijia Shi, Anirudh Ajith,..., Luke Zettlemoyer
147 2023-10-11 Evaluating Large Language Models at Evaluating Instruction Following link Zhiyuan Zeng, Jiatong Yu,..., Danqi Chen
141 2023-09-21 LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models link Yukang Chen, Shengju Qian,..., Jiaya Jia
141 2023-11-15 DMV3D: Denoising Multi-view Diffusion Using 3D Large Reconstruction Model link Yinghao Xu, Hao Tan,..., Kai Zhang
141 2023-06-30 Provable Robust Watermarking for AI-Generated Text link Xuandong Zhao, Prabhanjan Vijendra Ananth,..., Yu-Xiang Wang
139 2023-10-22 Improved Techniques for Training Consistency Models link Yang Song, Prafulla Dhariwal
139 2023-09-25 Can LLM-Generated Misinformation Be Detected? link Canyu Chen, Kai Shu
139 2023-07-05 Building Cooperative Embodied Agents Modularly with Large Language Models link Hongxin Zhang, Weihua Du,..., Chuang Gan
139 2023-09-27 Finite Scalar Quantization: VQ-VAE Made Simple link Fabian Mentzer, David Minnen,..., Michael Tschannen
139 2023-09-07 DoLa: Decoding by Contrasting Layers Improves Factuality in Large
Language Models
link Yung-Sung Chuang, Yujia Xie,..., Pengcheng He
138 2023-08-15 Solving Challenging Math Word Problems Using GPT-4 Code Interpreter
with Code-based Self-Verification
link Aojun Zhou, Ke Wang,..., Hongsheng Li
136 2023-05-22 Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior
of Large Language Models in Knowledge Conflicts
link Jian Xie, Kai Zhang,..., Yu Su
135 2023-07-05 DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models link Chong Mou, Xintao Wang,..., Jian Zhang
135 2024-01-26 SliceGPT: Compress Large Language Models by Deleting Rows and
Columns
link Saleh Ashkboos, Maximilian L. Croci,..., James Hensman
134 2023-06-05 RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems link Tianyang Liu, Canwen Xu, Julian McAuley
134 2023-09-29 Directly Fine-Tuning Diffusion Models on Differentiable Rewards link Kevin Clark, Paul Vicol,..., David J. Fleet
134 2023-07-04 Self-Consuming Generative Models Go MAD link Sina Alemohammad, Josue Casco-Rodriguez,..., Richard Baraniuk
134 2023-09-28 DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large
Language Models
link Licheng Wen, Daocheng Fu,..., Yu Qiao
132 2023-10-03 Language Models Represent Space and Time link Wes Gurnee, Max Tegmark
131 2023-09-29 ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving link Zhibin Gou, Zhihong Shao,..., Weizhu Chen
129 2023-05-18 Listen, Think, and Understand link Yuan Gong, Hongyin Luo,..., James R. Glass
129 2023-09-19 MINT: Evaluating LLMs in Multi-turn Interaction with Tools and
Language Feedback
link Xingyao Wang, Zihan Wang,..., Heng Ji
128 2023-03-14 Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D
Generation
link Junyoung Seo, Wooseok Jang,..., Seungryong Kim
126 2023-09-14 MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning link Haozhe Zhao, Zefan Cai,..., Baobao Chang
125 2023-09-25 Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level
Vision
link Haoning Wu, Zicheng Zhang,..., Weisi Lin
125 2023-10-25 DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior link Jingxiang Sun, Bo Zhang,..., Yebin Liu
124 2023-09-19 Language Modeling Is Compression link Gregoire Deletang, Anian Ruoss,..., Joel Veness
123 2023-05-23 Sophia: A Scalable Stochastic Second-order Optimizer for Language Model
Pre-training
link Hong Liu, Zhiyuan Li,..., Tengyu Ma
120 2023-10-02 RA-DIT: Retrieval-Augmented Dual Instruction Tuning link Xi Victoria Lin, Xilun Chen,..., Wen-tau Yih
119 2022-08-04 Conformal Risk Control link Anastasios Nikolas Angelopoulos, Stephen Bates,..., Tal Schuster
119 2023-10-31 SEINE: Short-to-Long Video Diffusion Model for Generative Transition and
Prediction
link Xinyuan Chen, Yaohui Wang,..., Ziwei Liu
119 2023-10-02 Making LLaMA SEE and Draw with SEED Tokenizer link Yuying Ge, Sijie Zhao,..., Ying Shan
118 2023-10-26 Proving Test Set Contamination in Black-Box Language Models link Yonatan Oren, Nicole Meister,..., Tatsunori Hashimoto
118 2023-10-17 VeRA: Vector-based Random Matrix Adaptation link Dawid Jan Kopiczko, Tijmen Blankevoort, Yuki M Asano
118 2023-11-21 GAIA: a benchmark for General AI Assistants link Grégoire Mialon, Clémentine Fourrier,..., Thomas Scialom
117 2023-06-21 EquiformerV2: Improved Equivariant Transformer for Scaling to Higher-Degree Representations link Yi-Lun Liao, Brandon M Wood,..., Tess Smidt
116 2023-07-26 Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language
Models
link Erfan Shayegani, Yue Dong, Nael Abu-Ghazaleh
116 2023-10-08 Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional
Probability Curvature
link Guangsheng Bao, Yanbin Zhao,..., Yue Zhang
114 2023-11-02 Vision-Language Foundation Models as Effective Robot Imitators link Xinghang Li, Minghuan Liu,..., Tao Kong
114 2023-02-06 Chain of Hindsight aligns Language Models with Feedback link Hao Liu, Carmelo Sferrazza, Pieter Abbeel
114 2023-10-16 Zero-Shot Robotic Manipulation with Pre-Trained Image-Editing Diffusion Models link Kevin Black, Mitsuhiko Nakamoto,..., Sergey Levine
113 2023-10-12 LoftQ: LoRA-Fine-Tuning-aware Quantization for Large Language Models link Yixiao Li, Yifan Yu,..., Tuo Zhao
113 2023-10-19 SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in
Both Image Classification and Generation
link Chongyu Fan, Jiancheng Liu,..., Sijia Liu
112 2023-08-11 Self-Alignment with Instruction Backtranslation link Xian Li, Ping Yu,..., Mike Lewis
111 2023-10-10 Understanding the Effects of RLHF on LLM Generalisation and
Diversity
link Robert Kirk, Ishita Mediratta,..., Roberta Raileanu
110 2023-09-29 Data Filtering Networks link Alex Fang, Albin Madappally Jose,..., Vaishaal Shankar
109 2023-08-02 From Sparse to Soft Mixtures of Experts link Joan Puigcerver, Carlos Riquelme Ruiz,..., Neil Houlsby
109 2023-06-07 On the Reliability of Watermarks for Large Language Models link John Kirchenbauer, Jonas Geiping,..., Tom Goldstein
108 2023-10-29 AnomalyCLIP: Object-agnostic Prompt Learning for Zero-shot Anomaly Detection link Qihang Zhou, Guansong Pang,..., Jiming Chen
108 2023-08-21 AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors link Weize Chen, Yusheng Su,..., Jie Zhou
107 2023-05-07 A Variational Perspective on Solving Inverse Problems with Diffusion
Models
link Morteza Mardani, Jiaming Song,..., Arash Vahdat
106 2023-08-14 OctoPack: Instruction Tuning Code Large Language Models link Niklas Muennighoff, Qian Liu,..., Shayne Longpre
106 2023-10-10 Multilingual Jailbreak Challenges in Large Language Models link Yue Deng, Wenxuan Zhang,..., Lidong Bing
106 2024-01-31 RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval link Parth Sarthi, Salman Abdullah,..., Christopher D Manning
106 2023-11-03 EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision link Jiawei Yang, Boris Ivanovic,..., Yue Wang
106 2023-10-04 SweetDreamer: Aligning Geometric Priors in 2D diffusion for Consistent
Text-to-3D
link Weiyu Li, Rui Chen,..., Ping Tan
105 2023-10-04 Reward Model Ensembles Help Mitigate Overoptimization link Thomas Coste, Usman Anwar,..., David Krueger
105 2023-10-24 What Algorithms can Transformers Learn? A Study in Length
Generalization
link Hattie Zhou, Arwen Bradley,..., Preetum Nakkiran
104 2023-10-08 TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting link Defu Cao, Furong Jia,..., Yan Liu
103 2024-04-19 SaProt: Protein Language Modeling with Structure-aware Vocabulary link Jin Su, Chenchen Han,..., Fajie Yuan
103 2023-06-09 Can Large Language Models Infer Causation from Correlation? link Zhijing Jin, Jiarui Liu,..., Bernhard Schölkopf
103 2023-10-19 Habitat 3.0: A Co-Habitat for Humans, Avatars, and Robots link Xavier Puig, Eric Undersander,..., Roozbeh Mottaghi
103 2023-05-04 ZipIt! Merging Models from Different Tasks without Training link George Stoica, Daniel Bolya,..., Judy Hoffman
102 2023-10-25 TD-MPC2: Scalable, Robust World Models for Continuous Control link Nicklas Hansen, Hao Su, Xiaolong Wang
102 2024-02-27 When Scaling Meets LLM Finetuning: The Effect of Data,
Model and Finetuning Method
link Biao Zhang, Zhongtao Liu,..., Orhan Firat
102 2023-05-31 MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training link Yizhi LI, Ruibin Yuan,..., Jie Fu
100 2023-09-13 RAIN: Your Language Models Can Align Themselves without Finetuning link Yuhui Li, Fangyun Wei,..., Hongyang Zhang
99 2023-09-28 Demystifying CLIP Data link Hu Xu, Saining Xie,..., Christoph Feichtenhofer
99 2023-10-18 SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents link Xuhui Zhou, Hao Zhu,..., Maarten Sap
99 2023-05-25 Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and
Mitigation
link Niels Mündler, Jingxuan He,..., Martin Vechev
98 2023-06-16 Is Self-Repair a Silver Bullet for Code Generation? link Theo X. Olausson, Jeevana Priya Inala,..., Armando Solar-Lezama
98 2023-10-09 Take a Step Back: Evoking Reasoning via Abstraction in
Large Language Models
link Huaixiu Steven Zheng, Swaroop Mishra,..., Denny Zhou
98 2023-08-01 SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step
Reasoning
link Ning Miao, Yee Whye Teh, Tom Rainforth
97 2023-10-01 BooookScore: A systematic exploration of book-length summarization in the
era of LLMs
link Yapei Chang, Kyle Lo,..., Mohit Iyyer
97 2023-08-16 TEST: Text Prototype Aligned Embedding to Activate LLM's Ability
for Time Series
link Chenxi Sun, Hongyan Li,..., Shenda Hong
96 None The Expressive Power of Transformers with Chain of Thought link William Merrill, Ashish Sabharwal
95 2023-11-20 PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and
Shape Prediction
link Peng Wang, Hao Tan,..., Kai Zhang
95 2023-10-25 PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt
Optimization
link Xinyuan Wang, Chenxi Li,..., Zhiting Hu
95 2023-08-25 Nougat: Neural Optical Understanding for Academic Documents link Lukas Blecher, Guillem Cucurull,..., Robert Stojnic
95 2023-10-12 OmniControl: Control Any Joint at Any Time for Human
Motion Generation
link Yiming Xie, Varun Jampani,..., Huaizu Jiang
94 2023-09-29 One For All: Towards Training One Graph Model For
All Classification Tasks
link Hao Liu, Jiarui Feng,..., Muhan Zhang
94 2023-11-08 Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs link Shashank Gupta, Vaishnavi Shrivastava,..., Tushar Khot
92 2023-10-04 MagicDrive: Street View Generation with Diverse 3D Geometry Control link Ruiyuan Gao, Kai Chen,..., Qiang Xu
91 2023-10-23 Function Vectors in Large Language Models link Eric Todd, Millicent Li,..., David Bau
90 2023-09-29 Can Sensitive Information Be Deleted From LLMs? Objectives for
Defending Against Extraction Attacks
link Vaidehi Patil, Peter Hase, Mohit Bansal
90 2023-07-16 Solving Inverse Problems with Latent Diffusion Models via Hard
Data Consistency
link Bowen Song, Soo Min Kwon,..., Liyue Shen
90 2023-07-20 FLASK: Fine-grained Language Model Evaluation based on Alignment Skill
Sets
link Seonghyeon Ye, Doyoung Kim,..., Minjoon Seo
88 2023-09-25 Identifying the Risks of LM Agents with an LM-Emulated
Sandbox
link Yangjun Ruan, Honghua Dong,..., Tatsunori Hashimoto
88 2023-08-07 Nearly $d$-Linear Convergence Bounds for Diffusion Models via Stochastic
Localization
link Joe Benton, Valentin De Bortoli,..., George Deligiannidis
88 2023-09-27 Towards Best Practices of Activation Patching in Language Models:
Metrics and Methods
link Fred Zhang, Neel Nanda
87 2024-02-20 Chain of Thought Empowers Transformers to Solve Inherently Serial
Problems
link Zhiyuan Li, Hong Liu,..., Tengyu Ma
87 2023-10-06 Talk like a Graph: Encoding Graphs for Large Language
Models
link Bahare Fatemi, Jonathan Halcrow, Bryan Perozzi
85 2023-08-16 Time Travel in LLMs: Tracing Data Contamination in Large
Language Models
link Shahriar Golchin, Mihai Surdeanu
85 2023-07-11 ReLoRA: High-Rank Training Through Low-Rank Updates link Vladislav Lialin, Sherin Muckatira,..., Anna Rumshisky
85 2023-10-05 MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical
Reasoning
link Ke Wang, Houxing Ren,..., Hongsheng Li
85 2023-05-19 Multimodal Web Navigation with Instruction-Finetuned Foundation Models link Hiroki Furuta, Kuang-Huei Lee,..., Izzeddin Gur
84 2023-09-11 Hypothesis Search: Inductive Reasoning with Language Models link Ruocheng Wang, Eric Zelikman,..., Noah Goodman
84 2023-10-04 AdaMerging: Adaptive Model Merging for Multi-Task Learning link Enneng Yang, Zhenyi Wang,..., Dacheng Tao
82 2023-09-26 QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models link Yuhui Xu, Lingxi Xie,..., Qi Tian
81 2023-10-03 Think before you speak: Training Language Models With Pause
Tokens
link Sachin Goyal, Ziwei Ji,..., Vaishnavh Nagarajan
81 2023-09-11 Pushing Mixture of Experts to the Limit: Extremely Parameter
Efficient MoE for Instruction Tuning
link Ted Zadouri, Ahmet Üstün,..., Sara Hooker
81 2023-10-04 MetaTool Benchmark for Large Language Models: Deciding Whether to
Use Tools and Which to Use
link Yue Huang, Jiawen Shi,..., Lichao Sun
81 2024-01-09 Chain-of-Table: Evolving Tables in the Reasoning Chain for Table
Understanding
link Zilong Wang, Hao Zhang,..., Tomas Pfister
81 2023-08-03 The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding
of the Open World
link Weiyun Wang, Min Shi,..., Yu Qiao
81 2023-10-23 FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling link Haonan Qiu, Menghan Xia,..., Ziwei Liu
80 2023-10-31 What's In My Big Data? link Yanai Elazar, Akshita Bhagia,..., Jesse Dodge
79 2023-10-10 Uni3D: Exploring Unified 3D Representation at Scale link Junsheng Zhou, Jinsheng Wang,..., Xinlong Wang
79 2023-05-22 Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting
over Heterogeneous Sources
link Xingxuan Li, Ruochen Zhao,..., Lidong Bing
79 2023-12-20 Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation link Hongtao Wu, Ya Jing,..., Tao Kong
78 2023-07-07 One Step of Gradient Descent is Provably the Optimal
In-Context Learner with One Layer of Linear Self-Attention
link Arvind V. Mahankali, Tatsunori Hashimoto, Tengyu Ma
78 2023-05-27 DNA-GPT: Divergent N-Gram Analysis for Training-Free Detection of GPT-Generated
Text
link Xianjun Yang, Wei Cheng,..., Haifeng Chen
78 2023-10-30 Text-to-3D with Classifier Score Distillation link Xin Yu, Yuan-Chen Guo,..., XIAOJUAN QI
77 2023-10-04 Retrieval meets Long Context Large Language Models link Peng Xu, Wei Ping,..., Bryan Catanzaro
77 2023-10-16 Video Language Planning link Yilun Du, Sherry Yang,..., Jonathan Tompson
77 2023-10-08 Scaling Laws of RoPE-based Extrapolation link Xiaoran Liu, Hang Yan,..., Dahua Lin
77 2023-05-22 Matcher: Segment Anything with One Shot Using All-Purpose Feature
Matching
link Yang Liu, Muzhi Zhu,..., Chunhua Shen
76 2023-09-25 Small-scale proxies for large-scale Transformer training instabilities link Mitchell Wortsman, Peter J Liu,..., Simon Kornblith
76 2023-02-15 Learning Performance-Improving Code Edits link Alexander G Shypula, Aman Madaan,..., Amir Yazdanbakhsh
76 2023-09-29 Guiding Instruction-based Image Editing via Multimodal Large Language Models link Tsu-Jui Fu, Wenze Hu,..., Zhe Gan
76 2023-10-12 DistillSpec: Improving Speculative Decoding via Knowledge Distillation link Yongchao Zhou, Kaifeng Lyu,..., Rishabh Agarwal
76 None LLaMA-Adapter: Efficient Fine-tuning of Large Language Models with Zero-initialized
Attention
link Renrui Zhang, Jiaming Han,..., Peng Gao
75 2023-07-06 FITS: Modeling Time Series with $10k$ Parameters link Zhijian Xu, Ailing Zeng, Qiang Xu
75 2023-07-07 Teaching Arithmetic to Small Transformers link Nayoung Lee, Kartik Sreenivasan,..., Dimitris Papailiopoulos
75 2024-05-23 TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting link Shiyu Wang, Haixu Wu,..., JUN ZHOU
74 2024-02-06 INSIDE: LLMs' Internal States Retain the Power of Hallucination
Detection
link Chao Chen, Kai Liu,..., Jieping Ye
73 2023-08-17 Linearity of Relation Decoding in Transformer Language Models link Evan Hernandez, Arnab Sen Sharma,..., David Bau
73 2023-09-28 Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse
Divergence Constraints
link Chaoqi Wang, Yibo Jiang,..., Yuxin Chen
73 2023-10-10 OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text link Keiran Paster, Marco Dos Santos,..., Jimmy Ba
73 2023-09-19 PoSE: Efficient Context Window Extension of LLMs via Positional
Skip-wise Training
link Dawei Zhu, Nan Yang,..., Sujian Li
72 None ModernTCN: A Modern Pure Convolution Structure for General Time
Series Analysis
link Luo donghao, wang xue
72 2023-10-16 Ring-A-Bell! How Reliable are Concept Removal Methods For Diffusion
Models?
link Yu-Lin Tsai, Chia-Yi Hsu,..., Chun-Ying Huang
72 2023-12-21 The Truth is in There: Improving Reasoning in Language
Models with Layer-Selective Rank Reduction
link Pratyusha Sharma, Jordan T. Ash, Dipendra Misra
71 2023-06-01 Vocos: Closing the gap between time-domain and Fourier-based neural
vocoders for high-quality audio synthesis
link Hubert Siuzdak
71 2023-03-10 Tag2Text: Guiding Vision-Language Model via Image Tagging link Xinyu Huang, Youcai Zhang,..., Lei Zhang
70 2023-10-09 Interpreting CLIP's Image Representation via Text-Based Decomposition link Yossi Gandelsman, Alexei A Efros, Jacob Steinhardt
70 2023-10-11 Beyond Memorization: Violating Privacy via Inference with Large Language
Models
link Robin Staab, Mark Vero,..., Martin Vechev
70 2023-10-02 GenSim: Generating Robotic Simulation Tasks via Large Language Models link Lirui Wang, Yiyang Ling,..., Xiaolong Wang
70 2023-10-09 NEFTune: Noisy Embeddings Improve Instruction Finetuning link Neel Jain, Ping-yeh Chiang,..., Tom Goldstein
70 2023-06-13 Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language
Models
link Yin Fang, Xiaozhuan Liang,..., Huajun Chen
70 2023-06-23 On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes link Rishabh Agarwal, Nino Vieillard,..., Olivier Bachem
69 2023-10-12 Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language
Models with Hypothesis Refinement
link Linlu Qiu, Liwei Jiang,..., Xiang Ren
69 2023-10-05 GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction link Oscar Sainz, Iker García-Ferrero,..., Eneko Agirre
68 2023-10-14 Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent
Space
link Hengrui Zhang, Jiani Zhang,..., George Karypis
68 2023-10-03 Large Language Models as Analogical Reasoners link Michihiro Yasunaga, Xinyun Chen,..., Denny Zhou
67 2023-10-27 Can LLMs Keep a Secret? Testing Privacy
Implications of Language Models via Contextual Integrity Theory
link Niloofar Mireshghallah, Hyunwoo Kim,..., Yejin Choi
67 2023-11-02 Tensor Trust: Interpretable Prompt Injection Attacks from an Online
Game
link Sam Toyer, Olivia Watkins,..., Stuart Russell
67 2023-10-09 Generative Judge for Evaluating Alignment link Junlong Li, Shichao Sun,..., Pengfei Liu
67 2023-10-26 Noise-free Score Distillation link Oren Katzir, Or Patashnik,..., Dani Lischinski
67 None Adapting Large Language Models via Reading Comprehension link Daixuan Cheng, Shaohan Huang, Furu Wei
66 2023-05-31 Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation
Learning
link Xiaoxin He, Xavier Bresson,..., Bryan Hooi
66 2024-04-22 Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing link Dujian Ding, Ankur Mallick,..., Ahmed Hassan Awadallah
66 2023-10-19 Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning link Juan Rocamonde, Victoriano Montesinos,..., David Lindner
66 2023-10-09 FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing link Yuren Cong, Mengmeng Xu,..., Sen He
65 2023-10-03 SE(3)-Stochastic Flow Matching for Protein Backbone Generation link Joey Bose, Tara Akhound-Sadegh,..., Alexander Tong
65 2023-10-12 ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models link Yingqing He, Shaoshu Yang,..., Ying Shan
65 2023-09-11 Does Writing with Language Models Reduce Content Diversity? link Vishakh Padmakumar, He He
65 2023-05-30 HIFA: High-fidelity Text-to-3D Generation with Advanced Diffusion Guidance link Junzhe Zhu, Peiye Zhuang, Sanmi Koyejo
64 2023-10-10 Lemur: Harmonizing Natural Language and Code for Language Agents link Yiheng Xu, Hongjin SU,..., Tao Yu
64 2023-10-07 Label-free Node Classification on Graphs with Large Language Models
(LLMs)
link Zhikai Chen, Haitao Mao,..., Jiliang Tang
64 2022-06-20 LUT-GEMM: Quantized Matrix Multiplication based on LUTs for Efficient
Inference in Large-Scale Generative Language Models
link Gunho Park, Baeseong park,..., Dongsoo Lee
63 2023-11-06 AnyText: Multilingual Visual Text Generation and Editing link Yuxiang Tuo, Wangmeng Xiang,..., Xuansong Xie
63 2023-08-04 Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization link Weiran Yao, Shelby Heinecke,..., Silvio Savarese
63 None RECOMP: Improving Retrieval-Augmented LMs with Context Compression and Selective
Augmentation
link Fangyuan Xu, Weijia Shi, Eunsol Choi
63 2023-10-20 An LLM can Fool Itself: A Prompt-Based Adversarial Attack link Xilie Xu, Keyi Kong,..., Mohan Kankanhalli
62 2023-06-15 KoLA: Carefully Benchmarking World Knowledge of Large Language Models link Jifan Yu, Xiaozhi Wang,..., Juanzi Li
62 2023-07-13 In-context Autoencoder for Context Compression in a Large Language
Model
link Tao Ge, Hu Jing,..., Furu Wei
62 2023-06-21 DreamTime: An Improved Optimization Strategy for Diffusion-Guided 3D Generation link Yukun Huang, Jianan Wang,..., Lei Zhang
61 2023-10-17 Zipformer: A faster and better encoder for automatic speech
recognition
link Zengwei Yao, Liyong Guo,..., Daniel Povey
61 2023-09-14 Unified Human-Scene Interaction via Prompted Chain-of-Contacts link Zeqi Xiao, Tai Wang,..., Jiangmiao Pang
61 2023-10-27 Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for
Text-to-Image Generation
link Jaemin Cho, Yushi Hu,..., Su Wang
61 2023-06-09 FasterViT: Fast Vision Transformers with Hierarchical Attention link Ali Hatamizadeh, Greg Heinrich,..., Pavlo Molchanov
60 2023-10-02 CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction link Size Wu, Wenwei Zhang,..., Chen Change Loy
60 2023-08-08 SILO Language Models: Isolating Legal Risk In a Nonparametric
Datastore
link Sewon Min, Suchin Gururangan,..., Luke Zettlemoyer
60 2023-09-13 Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions,
and Simplicity Bias in MLMs
link Angelica Chen, Ravid Shwartz-Ziv,..., Naomi Saphra
60 2023-11-21 Mechanistically analyzing the effects of fine-tuning on procedurally defined
tasks
link Samyak Jain, Robert Kirk,..., David Krueger
60 2023-10-31 The Generative AI Paradox: “What It Can Create, It
May Not Understand”
link Peter West, Ximing Lu,..., Yejin Choi
59 2023-03-10 Decomposed Diffusion Sampler for Accelerating Large-Scale Inverse Problems link Hyungjin Chung, Suhyeon Lee, Jong Chul Ye
59 2023-10-09 HyperAttention: Long-context Attention in Near-Linear Time link Insu Han, Rajesh Jayaram,..., Amir Zandieh
58 2023-02-07 Flow Matching on General Geometries link Ricky T. Q. Chen, Yaron Lipman
58 2023-08-31 SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models link Xin Zhang, Dong Zhang,..., Xipeng Qiu
57 2023-10-04 Generalization in diffusion models arises from geometry-adaptive harmonic representations link Zahra Kadkhodaie, Florentin Guth,..., Stéphane Mallat
57 2023-10-12 Circuit Component Reuse Across Tasks in Transformer Language Models link Jack Merullo, Carsten Eickhoff, Ellie Pavlick
57 2023-08-08 Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions link Juncheng Li, Kaihang Pan,..., Yueting Zhuang
57 2023-12-08 Zoology: Measuring and Improving Recall in Efficient Language
Models
link Simran Arora, Sabri Eyuboglu,..., Christopher Re
57 2023-09-29 Denoising Diffusion Bridge Models link Linqi Zhou, Aaron Lou,..., Stefano Ermon
57 2023-10-09 Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching link Ziyao Guo, Kai Wang,..., Yang You
56 2023-10-06 ReLU Strikes Back: Exploiting Activation Sparsity in Large Language
Models
link Seyed Iman Mirzadeh, Keivan Alizadeh-Vahid,..., Mehrdad Farajtabar
56 2023-08-07 UniversalNER: Targeted Distillation from Large Language Models for Open
Named Entity Recognition
link Wenxuan Zhou, Sheng Zhang,..., Hoifung Poon
56 2023-11-06 Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video link Yanqin Jiang, Li Zhang,..., Yao Yao
55 2023-10-02 SmartPlay : A Benchmark for LLMs as Intelligent Agents link Yue Wu, Xuan Tang,..., Yuanzhi Li
54 2023-11-28 Manifold Preserving Guided Diffusion link Yutong He, Naoki Murata,..., Stefano Ermon
54 2024-01-19 Knowledge Fusion of Large Language Models link Fanqi Wan, Xinting Huang,..., Shuming Shi
54 2023-05-23 VDT: General-purpose Video Diffusion Transformers via Mask Modeling link Haoyu Lu, Guoxing Yang,..., Mingyu Ding
54 2023-11-10 Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization link Weiyang Liu, Zeju Qiu,..., Bernhard Schölkopf
53 2023-10-01 LEGO-Prover: Neural Theorem Proving with Growing Libraries link Haiming Wang, Huajian Xin,..., Xiaodan Liang
53 2023-09-28 At Which Training Stage Does Code Data Help LLMs
Reasoning?
link YINGWEI MA, Yue Liu,..., Shanshan Li
53 2023-02-12 Single Motion Diffusion link Sigal Raab, Inbal Leibovitch,..., Daniel Cohen-Or
53 2023-07-15 Think-on-Graph: Deep and Responsible Reasoning of Large Language Model
on Knowledge Graph
link Jiashuo Sun, Chengjin Xu,..., Jian Guo
53 2023-09-29 LLM-grounded Video Diffusion Models link Long Lian, Baifeng Shi,..., Boyi Li
53 2023-11-24 Universal Jailbreak Backdoors from Poisoned Human Feedback link Javier Rando, Florian Tramèr
52 2024-05-29 Large Brain Model for Learning Generic Representations with Tremendous
EEG Data in BCI
link Weibang Jiang, Liming Zhao, Bao-liang Lu
52 2023-11-03 RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches link Jiayuan Gu, Sean Kirmani,..., Ted Xiao
52 2023-10-04 Kosmos-G: Generating Images in Context with Multimodal Large Language
Models
link Xichen Pan, Li Dong,..., Furu Wei
52 2023-09-18 Understanding Catastrophic Forgetting in Language Models via Implicit Inference link Suhas Kotha, Jacob Mitchell Springer, Aditi Raghunathan
52 2023-06-16 Conformal Language Modeling link Victor Quach, Adam Fisch,..., Regina Barzilay
52 2023-09-29 CRAFT: Customizing LLMs by Creating and Retrieving from Specialized
Toolsets
link Lifan Yuan, Yangyi Chen,..., Heng Ji
52 None Diffusion Posterior Sampling for Linear Inverse Problem Solving: A
Filtering Perspective
link Zehao Dou, Yang Song
52 2023-10-12 HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion link Xian Liu, Jian Ren,..., Sergey Tulyakov
51 2023-10-26 Large Language Models as Generalizable Policies for Embodied Tasks link Andrew Szot, Max Schwarzer,..., Alexander T Toshev
51 2023-10-10 A Semantic Invariant Robust Watermark for Large Language Models link Aiwei Liu, Leyi Pan,..., Lijie Wen
51 2024-02-15 Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring
the Design of Next-generation Neuromorphic Chips
link Man Yao, JiaKui Hu,..., Guoqi Li
51 2022-11-07 MogaNet: Multi-order Gated Aggregation Network link Siyuan Li, Zedong Wang,..., Stan Z. Li
50 2023-10-02 DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and
Diffusion Models
link Yongchan Kwon, Eric Wu,..., James Zou
50 2024-03-18 Improving LoRA in Privacy-preserving Federated Learning link Youbang Sun, Zitao Li,..., Bolin Ding
50 2024-02-06 Fine-Tuned Language Models Generate Stable Inorganic Materials as Text link Nate Gruver, Anuroop Sriram,..., Zachary Ward Ulissi
50 2023-06-06 Turning large language models into cognitive models link Marcel Binz, Eric Schulz
50 2023-09-29 Motif: Intrinsic Motivation from Artificial Intelligence Feedback link Martin Klissarov, Pierluca D'Oro,..., Mikael Henaff
50 2023-09-21 Privacy-Preserving In-Context Learning with Differentially Private Few-Shot Generation link Xinyu Tang, Richard Shin,..., Robert Sim
49 2023-07-18 Overthinking the Truth: Understanding how Language Models Process False
Demonstrations
link Danny Halawi, Jean-Stanislas Denain, Jacob Steinhardt
49 2023-10-10 Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models link Fei Shen, Hu Ye,..., Yang Wei
49 2023-10-04 Large Language Model Cascades with Mixture of Thought Representations
for Cost-Efficient Reasoning
link Murong Yue, Jie Zhao,..., Ziyu Yao
49 2024-02-22 Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity
Tracking
link Nikhil Prakash, Tamar Rott Shaham,..., David Bau
49 2022-09-08 Exploring Target Representations for Masked Autoencoders link xingbin liu, Jinghao Zhou,..., Rongrong Ji
48 2024-02-22 Cameras as Rays: Pose Estimation via Ray Diffusion link Jason Y. Zhang, Amy Lin,..., Shubham Tulsiani
48 2023-10-12 How Many Pretraining Tasks Are Needed for In-Context Learning
of Linear Regression?
link Jingfeng Wu, Difan Zou,..., Peter Bartlett
48 2023-12-13 Distributional Preference Learning: Understanding and Accounting for Hidden Context
in RLHF
link Anand Siththaranjan, Cassidy Laidlaw, Dylan Hadfield-Menell
48 2023-05-24 Unpaired Image-to-Image Translation via Neural Schrödinger Bridge link Beomsu Kim, Gihyun Kwon,..., Jong Chul Ye
48 2023-12-06 DiffusionSat: A Generative Foundation Model for Satellite Imagery link Samar Khanna, Patrick Liu,..., Stefano Ermon
48 2023-06-13 Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control link Longtao Zheng, Rundong Wang,..., Bo An
48 2023-10-12 QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language
Models
link Jing Liu, Ruihao Gong,..., Bohan Zhuang
47 2023-09-25 LLMCarbon: Modeling the End-to-End Carbon Footprint of Large Language
Models
link Ahmad Faiz, Sotaro Kaneda,..., Lei Jiang
47 2023-10-24 MuSR: Testing the Limits of Chain-of-thought with Multistep Soft
Reasoning
link Zayne Rea Sprague, Xi Ye,..., Greg Durrett
47 2023-09-28 A Benchmark for Learning to Translate a New Language
from One Grammar Book
link Garrett Tanzer, Mirac Suzgun,..., Luke Melas-Kyriazi
47 2023-07-03 Improved sampling via learned diffusions link Lorenz Richter, Julius Berner
47 2023-10-13 Vision-by-Language for Training-Free Compositional Image Retrieval link Shyamgopal Karthik, Karsten Roth,..., Zeynep Akata
47 2023-09-28 Human Feedback is not Gold Standard link Tom Hosking, Phil Blunsom, Max Bartolo
47 2023-09-20 A Paradigm Shift in Machine Translation: Boosting Translation Performance
of Large Language Models
link Haoran Xu, Young Jin Kim,..., Hany Hassan Awadalla
47 2023-05-24 Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large
Language Models
link Sheng Shen, Le Hou,..., Denny Zhou
47 2023-11-02 Copilot4D: Learning Unsupervised World Models for Autonomous Driving via
Discrete Diffusion
link Lunjun Zhang, Yuwen Xiong,..., Raquel Urtasun
47 2023-03-16 Rethinking Model Ensemble in Transfer-based Adversarial Attacks link Huanran Chen, Yichi Zhang,..., Jun Zhu
46 2023-12-03 The mechanistic basis of data dependence and abrupt learning
in an in-context classification task
link Gautam Reddy
46 2024-07-31 Detecting, Explaining, and Mitigating Memorization in Diffusion Models link Yuxin Wen, Yuchen Liu,..., Lingjuan Lyu
46 2023-08-23 Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages link Jinyi Hu, Yuan Yao,..., Maosong Sun
46 2023-10-19 An Emulator for Fine-tuning Large Language Models using Small
Language Models
link Eric Mitchell, Rafael Rafailov,..., Christopher D Manning
46 2023-09-29 Batch Calibration: Rethinking Calibration for In-Context Learning and Prompt
Engineering
link Han Zhou, Xingchen Wan,..., Subhrajit Roy
46 2023-09-26 How to Catch an AI Liar: Lie Detection in
Black-Box LLMs by Asking Unrelated Questions
link Lorenzo Pacchiardi, Alex James Chan,..., Jan M. Brauner
46 2023-10-16 How Do Transformers Learn In-Context Beyond Simple Functions? A
Case Study on Learning with Representations
link Tianyu Guo, Wei Hu,..., Yu Bai
46 2024-03-04 Diffusion-TS: Interpretable Diffusion for General Time Series Generation link Xinyu Yuan, Yan Qiao
45 2023-10-16 In-Context Pretraining: Language Modeling Beyond Document Boundaries link Weijia Shi, Sewon Min,..., Mike Lewis
45 None DSPy: Compiling Declarative Language Model Calls into State-of-the-Art Pipelines link Omar Khattab, Arnav Singhvi,..., Christopher Potts
45 2023-06-20 Evaluating the Zero-shot Robustness of Instruction-tuned Language Models link Jiuding Sun, Chantal Shaib, Byron C Wallace
45 2024-02-06 The Hedgehog & the Porcupine: Expressive Linear Attentions with
Softmax Mimicry
link Michael Zhang, Kush Bhatia,..., Christopher Re
45 2023-06-30 Learning Delays in Spiking Neural Networks using Dilated Convolutions
with Learnable Spacings
link Ilyass Hammouamri, Ismail Khalfaoui-Hassani, Timothée Masquelier
45 2023-10-20 ToolChain: Efficient Action Space Navigation in Large Language Models
with A
Search
link Yuchen Zhuang, Xiang Chen,..., Chao Zhang
44 2023-05-18 Deep Temporal Graph Clustering link Meng Liu, Yue Liu,..., Xinwang Liu
44 2023-05-05 DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation link Hong Chen, Yipeng Zhang,..., Wenwu Zhu
44 2023-02-02 Neural Common Neighbor with Completion for Link Prediction link Xiyuan Wang, Haotong Yang, Muhan Zhang
44 2023-10-26 The Expressive Power of Low-Rank Adaptation link Yuchen Zeng, Kangwook Lee
44 2023-08-06 Strategic Preys Make Acute Predators: Enhancing Camouflaged Object Detectors
by Generating Camouflaged Objects
link Chunming He, Kai Li,..., Fisher Yu
43 2023-11-11 Finetuning Text-to-Image Diffusion Models for Fairness link Xudong Shen, Chao Du,..., Mohan Kankanhalli
43 2023-03-08 InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data
Pruning
link Ziheng Qin, Kai Wang,..., Yang You
43 2023-10-06 Confronting Reward Model Overoptimization with Constrained RLHF link Ted Moskovitz, Aaditya K Singh,..., Stephen Marcus McAleer
43 2023-10-23 Matryoshka Diffusion Models link Jiatao Gu, Shuangfei Zhai,..., Navdeep Jaitly
42 2023-10-06 Amortizing intractable inference in large language models link Edward J Hu, Moksh Jain,..., Nikolay Malkin
42 2023-09-22 Unbiased Watermark for Large Language Models link Zhengmian Hu, Lichang Chen,..., Heng Huang
42 2024-02-06 Large Language Models to Enhance Bayesian Optimization link Tennison Liu, Nicolás Astorga,..., Mihaela van der Schaar
42 2023-05-19 LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation link Suhyeon Lee, Won Jun Kim,..., Jong Chul Ye
42 None Functional Interpolation for Relative Positions improves Long Context Transformers link Shanda Li, Chong You,..., Srinadh Bhojanapalli
42 2024-03-15 FeatUp: A Model-Agnostic Framework for Features at Any Resolution link Stephanie Fu, Mark Hamilton,..., William T. Freeman
42 2023-05-26 Training Socially Aligned Language Models on Simulated Social Interactions link Ruibo Liu, Ruixin Yang,..., Soroush Vosoughi
42 2023-06-01 Consistency-guided Prompt Learning for Vision-Language Models link Shuvendu Roy, Ali Etemad
41 2023-09-30 On the Stability of Iterative Retraining of Generative Models
on their own Data
link Quentin Bertrand, Joey Bose,..., Gauthier Gidel
41 2024-01-16 Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis link Zhenhui Ye, Tianyun Zhong,..., Zhou Zhao
41 2023-11-20 LQ-LoRA: Low-rank plus Quantized Matrix Decomposition for Efficient Language
Model Finetuning
link Han Guo, Philip Greengard,..., Yoon Kim
41 2023-10-18 Brain decoding: toward real-time reconstruction of visual perception link Yohann Benchetrit, Hubert Banville, Jean-Remi King
41 2024-03-20 BadEdit: Backdooring Large Language Models by Model Editing link Yanzhou Li, Tianlin Li,..., Yang Liu
41 2023-10-12 Transformers as Decision Makers: Provable In-Context Reinforcement Learning via
Supervised Pretraining
link Licong Lin, Yu Bai, Song Mei
41 2023-10-02 Linear attention is (maybe) all you need (to understand
Transformer optimization)
link Kwangjun Ahn, Xiang Cheng,..., Suvrit Sra
41 2023-10-16 Towards image compression with perfect realism at ultra-low bitrates link Marlene Careil, Matthew J. Muckley,..., Stéphane Lathuilière
41 None PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of
Code
link Xuan Ju, Ailing Zeng,..., Qiang Xu
40 2019-02-14 CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater
Sample Efficiency and Simplicity
link Aditya Bhatt, Daniel Palenicek,..., Jan Peters
40 2023-09-20 Text2Reward: Reward Shaping with Language Models for Reinforcement Learning link Tianbao Xie, Siheng Zhao,..., Tao Yu
40 2023-10-13 CodeChain: Towards Modular Code Generation Through Chain of Self-revisions
with Representative Sub-modules
link Hung Le, Hailin Chen,..., Shafiq Joty
40 2023-10-02 ImagenHub: Standardizing the evaluation of conditional image generation models link Max Ku, Tianle Li,..., Wenhu Chen
39 2023-09-29 DyVal: Dynamic Evaluation of Large Language Models for Reasoning
Tasks
link Kaijie Zhu, Jiaao Chen,..., Xing Xie
39 2023-10-06 Universal Humanoid Motion Representations for Physics-Based Control link Zhengyi Luo, Jinkun Cao,..., Weipeng Xu
39 2023-07-31 AntGPT: Can Large Language Models Help Long-term Action Anticipation
from Videos?
link Qi Zhao, Shijie Wang,..., Chen Sun
39 2023-03-08 Magnushammer: A Transformer-Based Approach to Premise Selection link Maciej Mikuła, Szymon Tworkowski,..., Yuhuai Wu
39 2023-09-29 PB-LLM: Partially Binarized Large Language Models link Zhihang Yuan, Yuzhang Shang, Zhen Dong
39 2023-12-14 Successor Heads: Recurring, Interpretable Attention Heads In The Wild link Rhys Gould, Euan Ong,..., Arthur Conmy
39 2023-08-24 Bayesian Low-rank Adaptation for Large Language Models link Adam X. Yang, Maxime Robeyns,..., Laurence Aitchison
39 2023-10-02 Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models link Hyeonho Jeong, Jong Chul Ye
39 2023-10-10 GeoLLM: Extracting Geospatial Knowledge from Large Language Models link Rohin Manvi, Samar Khanna,..., Stefano Ermon
39 2023-05-23 Language Model Self-improvement by Reinforcement Learning Contemplation link Jing-Cheng Pang, Pengyuan Wang,..., Yang Yu
38 2023-09-04 Relay Diffusion: Unifying diffusion process across resolutions for image
synthesis
link Jiayan Teng, Wendi Zheng,..., Jie Tang
38 2024-01-25 An Extensible Framework for Open Heterogeneous Collaborative Perception link Yifan Lu, Yue Hu,..., Siheng Chen
38 2024-01-25 Towards 3D Molecule-Text Interpretation in Language Models link Sihang Li, Zhiyuan Liu,..., Qi Tian
38 2023-09-17 OWL: A Large Language Model for IT Operations link Hongcheng Guo, Jian Yang,..., Zhoujun Li
38 2024-01-23 ARGS: Alignment as Reward-Guided Search link Maxim Khanov, Jirayu Burapacheep, Yixuan Li
38 2023-10-18 Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with
Complex Semantic Prompts
link Xinhua Cheng, Tianyu Yang,..., Li Yuan
38 2023-09-26 Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of
Language Models
link Mert Yuksekgonul, Varun Chandrasekaran,..., Besmira Nushi
38 2023-09-05 PromptTTS 2: Describing and Generating Voices with Text Prompt link Yichong Leng, Zhifang Guo,..., Jiang Bian
38 2024-01-20 Inducing High Energy-Latency of Large Vision-Language Models with Verbose
Images
link Kuofeng Gao, Yang Bai,..., Wei Liu
37 2023-05-29 Multiscale Positive-Unlabeled Detection of AI-Generated Texts link Yuchuan Tian, Hanting Chen,..., Yunhe Wang
37 2023-09-29 Robustness of AI-Image Detectors: Fundamental Limits and Practical Attacks link Mehrdad Saberi, Vinu Sankar Sadasivan,..., Soheil Feizi
37 2023-10-19 Model Merging by Uncertainty-Based Gradient Matching link Nico Daheim, Thomas Möllenhoff,..., Mohammad Emtiyaz Khan
37 2023-10-02 Compressing LLMs: The Truth is Rarely Pure and Never
Simple
link AJAY KUMAR JAISWAL, Zhe Gan,..., Yinfei Yang
37 2023-09-09 Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual
Tokenization
link Yang Jin, Kun Xu,..., Yadong MU
37 2023-03-27 Seer: Language Instructed Video Prediction with Latent Diffusion Models link Xianfan Gu, Chuan Wen,..., Yang Gao
37 2023-10-13 Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse
LLMs
link Yuxin Zhang, Lirui Zhao,..., Rongrong Ji
37 2024-01-31 Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything
Model
link Zihan Zhong, Zhiqiang Tang,..., Chun Yuan
37 2023-09-18 DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation link Bowen Yin, Xuying Zhang,..., Qibin Hou
36 None On the Humanity of Conversational AI: Evaluating the Psychological
Portrayal of LLMs
link Jen-tse Huang, Wenxuan Wang,..., Michael Lyu
36 2023-06-01 TorchRL: A data-driven decision-making library for PyTorch link Albert Bou, Matteo Bettini,..., Vincent Moens
36 2023-10-05 EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models link Yefei He, Jing Liu,..., Bohan Zhuang
36 2024-01-13 BrainLM: A foundation model for brain activity recordings link Josue Ortega Caro, Antonio Henrique de Oliveira Fonseca,..., David van Dijk
36 2024-02-29 Curiosity-driven Red-teaming for Large Language Models link Zhang-Wei Hong, Idan Shenfeld,..., Pulkit Agrawal
36 2023-10-04 Diffusion Generative Flow Samplers: Improving learning signals through partial
trajectory optimization
link Dinghuai Zhang, Ricky T. Q. Chen,..., Yoshua Bengio
36 2023-10-03 DeepZero: Scaling Up Zeroth-Order Optimization for Deep Model Training link Aochuan Chen, Yimeng Zhang,..., Sijia Liu
36 2023-10-02 LEAP: Liberate Sparse-View 3D Modeling from Camera Poses link Hanwen Jiang, Zhenyu Jiang,..., Qixing Huang
36 2023-07-14 Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis link Ziyue Jiang, Jinglin Liu,..., Zhou Zhao
35 2023-10-04 Understanding In-Context Learning in Transformers and LLMs by Learning
to Learn Discrete Functions
link Satwik Bhattamishra, Arkil Patel,..., Varun Kanade
35 2023-02-04 Multi-Source Diffusion Models for Simultaneous Music Generation and Separation link Giorgio Mariani, Irene Tallini,..., Emanuele Rodolà
35 2023-06-08 In-Context Learning through the Bayesian Prism link Madhur Panwar, Kabir Ahuja, Navin Goyal
35 2024-04-03 CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech link Jaehyeon Kim, Keon Lee,..., Jaewoong Cho
35 2023-05-24 Alleviating Exposure Bias in Diffusion Models through Sampling with
Shifted Time Steps
link Mingxiao Li, Tingyu Qu,..., Marie-Francine Moens
35 2024-04-15 Language Model Cascades: Token-Level Uncertainty And Beyond link Neha Gupta, Harikrishna Narasimhan,..., Sanjiv Kumar
35 2023-10-18 Scalable Diffusion for Materials Generation link Sherry Yang, KwangHwan Cho,..., Ekin Dogus Cubuk
35 2024-01-09 Masked Audio Generation using a Single Non-Autoregressive Transformer link Alon Ziv, Itai Gat,..., Yossi Adi
35 2022-11-14 Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty
Regularization
link Yiyang Chen, Zhedong Zheng,..., Tat-Seng Chua
35 2023-06-07 ViDA: Homeostatic Visual Domain Adapter for Continual Test Time
Adaptation
link Jiaming Liu, Senqiao Yang,..., Shanghang Zhang
34 2023-10-26 CADS: Unleashing the Diversity of Diffusion Models through Condition-Annealed
Sampling
link Seyedmorteza Sadat, Jakob Buhmann,..., Romann M. Weber
34 None Chain-of-Experts: When LLMs Meet Complex Operations Research Problems link Ziyang Xiao, Dongxiang Zhang,..., Gang Chen
34 2023-10-16 LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed
Prompts
link Hanan Gani, Shariq Farooq Bhat,..., Peter Wonka
34 2023-11-02 The Blessing of Randomness: SDE Beats ODE in General
Diffusion-based Image Editing
link Shen Nie, Hanzhong Allan Guo,..., Chongxuan Li
34 2023-10-01 JoMA: Demystifying Multilayer Transformers via Joint Dynamics of MLP
and Attention
link Yuandong Tian, Yiping Wang,..., Simon Shaolei Du
34 2023-09-14 Large-Vocabulary 3D Diffusion Model with Transformer link Ziang Cao, Fangzhou Hong,..., Ziwei Liu
33 2023-11-24 Controlled Text Generation via Language Model Arithmetic link Jasper Dekoninck, Marc Fischer,..., Martin Vechev
33 2024-03-04 Making Pre-trained Language Models Great on Tabular Prediction link Jiahuan Yan, Bo Zheng,..., Jintai Chen
33 2024-04-17 SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA
of LLMs
link Jaehyung Kim, Jaehyun Nam,..., Jinwoo Shin
33 2024-01-20 BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models link Zhen Xiang, Fengqing Jiang,..., Bo Li
33 2023-05-24 Differentially Private Synthetic Data via Foundation Model APIs 1:
Images
link Zinan Lin, Sivakanth Gopi,..., Sergey Yekhanin
33 2023-05-20 CARD: Channel Aligned Robust Blend Transformer for Time Series
Forecasting
link Xue Wang, Tian Zhou,..., Rong Jin
33 2024-05-02 Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon
Robotics Tasks
link Murtaza Dalal, Tarun Chiruvolu,..., Ruslan Salakhutdinov
33 2023-11-03 Tell Your Model Where to Attend: Post-hoc Attention Steering
for LLMs
link Qingru Zhang, Chandan Singh,..., Tuo Zhao
33 2023-12-26 LaneSegNet: Map Learning with Lane Segment Perception for Autonomous
Driving
link Tianyu Li, Peijin Jia,..., Hongyang Li
32 2024-02-16 Robust agents learn causal world models link Jonathan Richens, Tom Everitt
32 2023-10-26 SKILL-MIX: a Flexible and Expandable Family of Evaluations for
AI Models
link Dingli Yu, Simran Kaur,..., Sanjeev Arora
32 2023-10-26 How do Language Models Bind Entities in Context? link Jiahai Feng, Jacob Steinhardt
32 2023-12-12 Remote Sensing Vision-Language Foundation Models without Annotations via Ground
Remote Alignment
link Utkarsh Mall, Cheng Perng Phoo,..., Kavita Bala
32 2023-11-21 BEND: Benchmarking DNA Language Models on Biologically Meaningful Tasks link Frederikke Isa Marin, Felix Teufel,..., Wouter Boomsma
32 2024-01-31 Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators link Daniel Geng, Andrew Owens
32 2023-10-16 Gaining Wisdom from Setbacks: Aligning Large Language Models via
Mistake Analysis
link Kai Chen, Chunwei Wang,..., Lifeng Shang
31 2023-09-28 Intriguing Properties of Generative Classifiers link Priyank Jaini, Kevin Clark, Robert Geirhos
31 2023-09-15 Scaling Laws for Sparsely-Connected Foundation Models link Elias Frantar, Carlos Riquelme Ruiz,..., Utku Evci
31 2023-10-04 Local Search GFlowNets link Minsu Kim, Taeyoung Yun,..., Jinkyoo Park
31 2023-10-06 Towards Foundation Models for Knowledge Graph Reasoning link Mikhail Galkin, Xinyu Yuan,..., Zhaocheng Zhu
31 2023-10-09 SALMON: Self-Alignment with Instructable Reward Models link Zhiqing Sun, Yikang Shen,..., Chuang Gan
31 2023-10-12 Tree-Planner: Efficient Close-loop Task Planning with Large Language Models link Mengkang Hu, Yao Mu,..., Ping Luo
31 2023-08-29 Elucidating the Exposure Bias in Diffusion Models link Mang Ning, Mingxiao Li,..., Itir Onal Ertugrul
31 2023-10-03 Unveiling the Pitfalls of Knowledge Editing for Large Language
Models
link Zhoubo Li, Ningyu Zhang,..., Huajun Chen
31 2023-08-03 Circumventing Concept Erasure Methods For Text-To-Image Generative Models link Minh Pham, Kelly O. Marshall,..., Chinmay Hegde
30 2023-06-08 Protein Discovery with Discrete Walk-Jump Sampling link Nathan C. Frey, Dan Berenberg,..., Saeed Saremi
30 None Würstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models link Pablo Pernias, Dominic Rampas,..., Marc Aubreville
30 2024-02-14 MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data link Yinya Huang, Xiaohan Lin,..., Xiaodan Liang
30 2023-10-06 How to Capture Higher-order Correlations? Generalizing Matrix Softmax Attention
to Kronecker Computation
link Josh Alman, Zhao Song
30 2023-11-30 Dichotomy of Early and Late Phase Implicit Biases Can
Provably Induce Grokking
link Kaifeng Lyu, Jikai Jin,..., Wei Hu
30 2023-07-06 T-MARS: Improving Visual Representations by Circumventing Text Feature Learning link Pratyush Maini, Sachin Goyal,..., Aditi Raghunathan
30 2022-05-30 Neural Optimal Transport with General Cost Functionals link Arip Asadulaev, Alexander Korotin,..., Evgeny Burnaev
30 2023-10-02 Toward effective protection against diffusion-based mimicry through score distillation link Haotian Xue, Chumeng Liang,..., Yongxin Chen
30 2024-02-04 Pathformer: Multi-scale Transformers with Adaptive Pathways for Time Series
Forecasting
link Peng Chen, Yingying ZHANG,..., Chenjuan Guo
30 2023-08-02 Patched Denoising Diffusion Models For High-Resolution Image Synthesis link Zheng Ding, Mengqi Zhang,..., Zhuowen Tu
30 2023-12-28 STanHop: Sparse Tandem Hopfield Model for Memory-Enhanced Time Series
Prediction
link Dennis Wu, Jerry Yao-Chieh Hu,..., Han Liu
29 2024-01-16 Beyond Weisfeiler-Lehman: A Quantitative Framework for GNN Expressiveness link Bohang Zhang, Jingchu Gai,..., Liwei Wang
29 2023-02-06 One-shot Empirical Privacy Estimation for Federated Learning link Galen Andrew, Peter Kairouz,..., Vinith Menon Suriyakumar
29 None An Image Is Worth 1000 Lies: Transferability of Adversarial
Images across Prompts on Vision-Language Models
link Haochen Luo, Jindong Gu,..., Philip Torr
29 2023-12-07 On the Learnability of Watermarks for Language Models link Chenchen Gu, Xiang Lisa Li,..., Tatsunori Hashimoto
29 2023-07-17 COLLIE: Systematic Construction of Constrained Text Generation Tasks link Shunyu Yao, Howard Chen,..., Karthik R Narasimhan
29 2024-01-04 LLM Augmented LLMs: Expanding Capabilities through Composition link Rachit Bansal, Bidisha Samanta,..., Partha Talukdar
29 None Faithful Vision-Language Interpretation via Concept Bottleneck Models link Songning Lai, Lijie Hu,..., Di Wang
29 2023-11-22 Language Model Inversion link John Xavier Morris, Wenting Zhao,..., Alexander M Rush
29 2024-04-04 OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise
Features and Rendered Novel Views
link Francis Engelmann, Fabian Manhardt,..., Federico Tombari
28 2023-10-13 METRA: Scalable Unsupervised RL with Metric-Aware Abstraction link Seohong Park, Oleh Rybkin, Sergey Levine
28 2023-10-02 Merge, Then Compress: Demystify Efficient SMoE with Hints from
Its Routing Policy
link Pingzhi Li, Zhenyu Zhang,..., Tianlong Chen
28 2023-10-10 Geographic Location Encoding with Spherical Harmonics and Sinusoidal Representation
Networks
link Marc Rußwurm, Konstantin Klemmer,..., Devis Tuia
28 2024-05-03 What does the Knowledge Neuron Thesis Have to do
with Knowledge?
link Jingcheng Niu, Andrew Liu,..., Gerald Penn
28 2023-11-01 Plug-and-Play Policy Planner for Large Language Model Powered Dialogue
Agents
link Yang Deng, Wenxuan Zhang,..., Tat-Seng Chua
28 2022-11-17 How to Fine-Tune Vision Models with SGD link Ananya Kumar, Ruoqi Shen,..., Suriya Gunasekar
28 2023-10-20 Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in
Open Worlds
link Sipeng Zheng, jiazheng liu,..., Zongqing Lu
28 2023-06-16 Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments
in JAX
link Clément Bonnet, Daniel Luo,..., Alexandre Laterre
28 2023-10-09 Grokking as the transition from lazy to rich training
dynamics
link Tanishq Kumar, Blake Bordelon,..., Cengiz Pehlevan
28 2023-10-03 AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model link Zibin Dong, Yifu Yuan,..., Zhipeng Hu
28 2023-02-13 UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling link Haoyu Lu, Yuqi Huo,..., Mingyu Ding
28 2023-05-02 Privacy-Preserving In-Context Learning for Large Language Models link Tong Wu, Ashwinee Panda,..., Prateek Mittal
28 2023-10-25 Generative Pre-training for Speech with Flow Matching link Alexander H. Liu, Matthew Le,..., Wei-Ning Hsu
28 2023-09-27 Jointly Training Large Autoregressive Multimodal Models link Emanuele Aiello, LILI YU,..., Barlas Oguz
27 2024-03-18 Graph Neural Networks for Learning Equivariant Representations of Neural
Networks
link Miltiadis Kofinas, Boris Knyazev,..., David W. Zhang
27 None Monte Carlo guided Denoising Diffusion models for Bayesian linear
inverse problems.
link Gabriel Cardoso, Yazid Janati el idrissi,..., Eric Moulines
27 2024-02-07 InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph
Prior
link Chenguo Lin, Yadong MU
27 2023-12-18 Tuning LayerNorm in Attention: Towards Efficient Multi-Modal LLM Finetuning link Bingchen Zhao, Haoqin Tu,..., Cihang Xie
27 2023-11-10 FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores link Daniel Y Fu, Hermann Kumbong,..., Christopher Re
27 2023-10-03 Tensor Programs VI: Feature Learning in Infinite Depth Neural
Networks
link Greg Yang, Dingli Yu,..., Soufiane Hayou
27 2023-10-25 CLEX: Continuous Length Extrapolation for Large Language Models link Guanzheng Chen, Xin Li,..., Lidong Bing
27 2023-11-08 Massive Editing for Large Language Models via Meta Learning link Chenmien Tan, Ge Zhang, Jie Fu
27 2023-12-08 Large-scale Training of Foundation Models for Wearable Biosignals link Salar Abbaspourazad, Oussama Elachqar,..., Ian Shapiro
27 2023-07-16 EasyTPP: Towards Open Benchmarking Temporal Point Processes link Siqiao Xue, Xiaoming Shi,..., Hongyuan Mei
27 2023-09-06 SLiMe: Segment Like Me link Aliasghar Khani, Saeid Asgari,..., Ghassan Hamarneh
27 2023-05-24 Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM link Eliya Nachmani, Alon Levkovitch,..., Michelle Tadmor Ramanovich
27 2023-11-03 Simplifying Transformer Blocks link Bobby He, Thomas Hofmann
27 2023-09-30 Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement link Kai Xu, Rongyu Chen,..., Angela Yao
27 2024-01-18 Divide and not forget: Ensemble of selectively trained
experts in Continual Learning
link Grzegorz Rypeść, Sebastian Cygert,..., Bartłomiej Twardowski
26 2023-05-17 Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized
Language Models
link Shangbin Feng, Weijia Shi,..., Yulia Tsvetkov
26 2023-11-07 Multi-View Causal Representation Learning with Partial Observability link Dingling Yao, Danru Xu,..., Francesco Locatello
26 2023-12-07 Graph Metanetworks for Processing Diverse Neural Architectures link Derek Lim, Haggai Maron,..., James Lucas
26 2023-11-27 DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer link Junyuan Hong, Jiachen T. Wang,..., Zhangyang Wang
26 2023-10-12 GROOT: Learning to Follow Instructions by Watching Gameplay Videos link Shaofei Cai, Bowei Zhang,..., Yitao Liang
26 2023-10-07 Lemur: Integrating Large Language Models in Automated Program Verification link Haoze Wu, Clark Barrett, Nina Narodytska
26 2023-10-02 Controlling Vision-Language Models for Multi-Task Image Restoration link Ziwei Luo, Fredrik K. Gustafsson,..., Thomas B. Schön
26 2023-10-02 Locality-Aware Graph Rewiring in GNNs link Federico Barbero, Ameya Velingker,..., Francesco Di Giovanni
26 2023-05-26 An Efficient Membership Inference Attack for the Diffusion Model
by Proximal Initialization
link Fei Kong, Jinhao Duan,..., Kaidi Xu
26 2023-10-02 Fusing Models with Complementary Expertise link Hongyi Wang, Felipe Maia Polo,..., Mikhail Yurochkin
26 2023-02-21 Low Rank Matrix Completion via Robust Alternating Minimization in
Nearly Linear Time
link Yuzhou Gu, Zhao Song,..., Lichen Zhang
26 2023-10-01 Faithful Explanations of Black-box NLP Models Using LLM-generated Counterfactuals link Yair Ori Gat, Nitay Calderon,..., Roi Reichart
26 2023-11-25 LLM-Assisted Code Cleaning For Training Accurate Code Generators link Naman Jain, Tianjun Zhang,..., Ion Stoica
26 2023-03-11 Xformer: Hybrid X-Shaped Transformer for Image Denoising link Jiale Zhang, Yulun Zhang,..., Xiaokang Yang
26 2023-09-11 DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning link Zhengxiang Shi, Aldo Lipani
25 2023-12-17 Learning to Act without Actions link Dominik Schmidt, Minqi Jiang
25 2024-01-03 Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival
Prediction
link Yilan Zhang, Yingxue Xu,..., Hao Chen
25 2023-10-17 Group Preference Optimization: Few-Shot Alignment of Large Language Models link Siyan Zhao, John Dang, Aditya Grover
25 2022-11-01 Two-stage LLM Fine-tuning with Less Specialization and More Generalization link Yihan Wang, Si Si,..., Sanjiv Kumar
25 2023-08-03 PARL: A Unified Framework for Policy Alignment in Reinforcement
Learning from Human Feedback
link Souradip Chakraborty, Amrit Bedi,..., Furong Huang
25 2023-12-05 MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction Following link Renze Lou, Kai Zhang,..., Wenpeng Yin
25 2022-08-10 A Sublinear Adversarial Training Algorithm link Yeqi Gao, Lianke Qin,..., Yitan Wang
25 2023-10-24 TiC-CLIP: Continual Training of CLIP Models link Saurabh Garg, Mehrdad Farajtabar,..., Fartash Faghri
25 2023-10-19 Quality-Diversity through AI Feedback link Herbie Bradley, Andrew Dai,..., Joel Lehman
25 2023-11-26 GAIA: Zero-shot Talking Avatar Generation link Tianyu He, Junliang Guo,..., Jiang Bian
25 2023-07-30 An Unforgeable Publicly Verifiable Watermark for Large Language Models link Aiwei Liu, Leyi Pan,..., Philip S. Yu
25 2023-10-03 Benchmarking and Improving Generator-Validator Consistency of Language Models link Xiang Lisa Li, Vaishnavi Shrivastava,..., Percy Liang
25 2023-05-31 A Study of Bayesian Neural Network Surrogates for Bayesian
Optimization
link Yucen Lily Li, Tim G. J. Rudner, Andrew Gordon Wilson
25 2023-06-01 The Hidden Language of Diffusion Models link Hila Chefer, Oran Lang,..., Lior Wolf
25 2023-06-02 OMNI: Open-endedness via Models of human Notions of Interestingness link Jenny Zhang, Joel Lehman,..., Jeff Clune
25 2024-02-22 Towards Seamless Adaptation of Pre-trained Models for Visual Place
Recognition
link Feng Lu, Lijun Zhang,..., Chun Yuan
25 2024-02-08 Get What You Want, Not What You Don't: Image
Content Suppression for Text-to-Image Diffusion Models
link Senmao Li, Joost van de Weijer,..., jian Yang
25 2023-03-11 Recursive Generalization Transformer for Image Super-Resolution link Zheng Chen, Yulun Zhang,..., Xiaokang Yang
24 2023-10-24 On the Foundations of Shortcut Learning link Katherine Hermann, Hossein Mobahi,..., Michael Curtis Mozer
24 2024-03-29 Negative Label Guided OOD Detection with Pretrained Vision-Language Models link Xue Jiang, Feng Liu,..., Bo Han
24 2023-10-30 DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization link Guowei Xu, Ruijie Zheng,..., Huazhe Xu
24 2023-10-04 Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised Learning with Masked Unit
Prediction
link Jiatong Shi, Hirofumi Inaguma,..., Anna Sun
24 2023-10-25 From Molecules to Materials: Pre-training Large Generalizable Models for
Atomic Property Prediction
link Nima Shoghi, Adeesh Kolluru,..., Brandon M Wood
24 2024-03-26 Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with
Autoformalization
link Jin Peng Zhou, Charles E Staats,..., Yuhuai Wu
24 2023-07-28 Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation link Xuefei Ning, Zinan Lin,..., Yu Wang
24 2023-06-01 AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation link Yuanwen Yue, Sabarinath Mahadevan,..., Theodora Kontogianni
24 2023-03-07 Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking
Oracles
link Zhiwei Tang, Dmitry Rybin, Tsung-Hui Chang
24 2023-05-24 Sin3DM: Learning a Diffusion Model from a Single 3D
Textured Shape
link Rundi Wu, Ruoshi Liu,..., Changxi Zheng
24 2023-09-29 Spurious Feature Diversification Improves Out-of-distribution Generalization link LIN Yong, Lu Tan,..., Tong Zhang
24 2023-09-13 Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL link Hao Sun, Alihan Hüyük, Mihaela van der Schaar
23 2023-10-04 Never Train from Scratch: Fair Comparison of Long-Sequence Models
Requires Data-Driven Priors
link Ido Amos, Jonathan Berant, Ankit Gupta
23 2023-09-29 Understanding and Mitigating the Label Noise in Pre-training on
Downstream Tasks
link Hao Chen, Jindong Wang,..., Bhiksha Raj
23 2024-04-17 Variational Bayesian Last Layers link James Harrison, John Willes, Jasper Snoek
23 2023-10-19 Frozen Transformers in Language Models Are Effective Visual Encoder
Layers
link Ziqi Pang, Ziyang Xie,..., Yu-Xiong Wang
23 None Towards Reliable and Efficient Backdoor Trigger Inversion via Decoupling
Benign Features
link Xiong Xu, Kunzhe Huang,..., Kui Ren
23 2023-10-09 Sentence-level Prompts Benefit Composed Image Retrieval link Yang bai, Xinxing Xu,..., Chun-Mei Feng
23 2024-03-12 Entropy is not Enough for Test-Time Adaptation: From the
Perspective of Disentangled Factors
link Jonghyun Lee, Dahuin Jung,..., Sungroh Yoon
23 2022-01-07 Fair and Efficient Contribution Valuation for Vertical Federated Learning link Zhenan Fan, Huang Fang,..., Yong Zhang
23 2023-11-21 Looped Transformers are Better at Learning Learning Algorithms link Liu Yang, Kangwook Lee,..., Dimitris Papailiopoulos
23 2023-06-12 Retrieval-Enhanced Contrastive Vision-Text Models link Ahmet Iscen, Mathilde Caron,..., Cordelia Schmid
23 2023-09-19 MBR and QE Finetuning: Training-time Distillation of the Best
and Most Expensive Decoding Methods
link Mara Finkelstein, Markus Freitag
23 2023-09-26 SEPT: Towards Efficient Scene Representation Learning for Motion Prediction link Zhiqian Lan, Yuxuan Jiang,..., Shengbo Eben Li
23 2024-02-06 Space Group Constrained Crystal Generation link Rui Jiao, Wenbing Huang,..., Yang Liu
23 2024-02-01 Machine Unlearning for Image-to-Image Generative Models link Guihong Li, Hsiang Hsu,..., Radu Marculescu
23 2023-09-29 Leveraging Optimization for Adaptive Attacks on Image Watermarks link Nils Lukas, Abdulrahman Diaa,..., Florian Kerschbaum
23 2023-10-07 Parameter-Efficient Multi-Task Model Fusion with Partial Linearization link Anke Tang, Li Shen,..., Dacheng Tao
23 2024-02-18 BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity
Allocation
link Peng Xu, Wenqi Shao,..., Ping Luo
23 2023-07-23 In-Context Learning Learns Label Relationships but Is Not Conventional
Learning
link Jannik Kossen, Yarin Gal, Tom Rainforth
23 2023-10-10 TopoMLP: A Simple yet Strong Pipeline for Driving Topology
Reasoning
link Dongming Wu, Jiahao Chang,..., Jianbing Shen
23 2023-12-27 Learning to Embed Time Series Patches Independently link Seunghan Lee, Taeyoung Park, Kibok Lee
22 2023-05-24 Provable Offline Preference-Based Reinforcement Learning link Wenhao Zhan, Masatoshi Uehara,..., Wen Sun
22 2023-09-04 On Penalty Methods for Nonconvex Bilevel Optimization and First-Order
Stochastic Approximation
link Jeongyeol Kwon, Dohyun Kwon,..., Robert D Nowak
22 2023-07-05 Reverse Diffusion Monte Carlo link Xunpeng Huang, Hanze Dong,..., Tong Zhang
22 2023-11-07 Beyond Imitation: Leveraging Fine-grained Quality Signals for Alignment link Geyang Guo, Ranchi Zhao,..., Ji-Rong Wen
22 2024-01-19 Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning link Yiwei Li, Peiwen Yuan,..., Kan Li
22 2023-06-05 PolyVoice: Language Models for Speech to Speech Translation link Qian qian Dong, Zhiying Huang,..., Yuxuan Wang
22 2024-03-19 Do Generated Data Always Help Contrastive Learning? link Yifei Wang, Jizhe Zhang, Yisen Wang
22 2023-12-18 Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated
Robot Response
link Junfeng Long, ZiRui Wang,..., Jiangmiao Pang
22 2024-02-02 Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of
Electrocardiogram
link Yeongyeon Na, Minje Park,..., Sunghoon Joo
22 2023-10-04 Posterior Sampling Based on Gradient Flows of the MMD
with Negative Distance Kernel
link Paul Hagemann, Johannes Hertrich,..., Gabriele Steidl
22 2023-06-15 FFB: A Fair Fairness Benchmark for In-Processing Group Fairness
Methods
link Xiaotian Han, Jianfeng Chi,..., Xia Hu
22 2023-10-09 TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained
Models
link Zuxin Liu, Jesse Zhang,..., Rasool Fakoor
22 2023-10-10 Teaching Language Models to Hallucinate Less with Synthetic Tasks link Erik Jones, Hamid Palangi,..., Ece Kamar
22 2024-03-02 Polynormer: Polynomial-Expressive Graph Transformer in Linear Time link Chenhui Deng, Zichao Yue, Zhiru Zhang
22 2023-08-08 V-DETR: DETR with Vertex Relative Position Encoding for 3D
Object Detection
link Yichao Shen, Zigang Geng,..., Baining Guo
22 None Periodicity Decoupling Framework for Long-term Series Forecasting link Tao Dai, Beiliang Wu,..., Shu-Tao Xia
22 2023-10-01 Revisiting Link Prediction: a data perspective link Haitao Mao, Juanhui Li,..., Jiliang Tang
22 None Plug-and-Play: An Efficient Post-training Pruning Method for Large Language
Models
link Yingtao Zhang, Haoli Bai,..., Carlo Vittorio Cannistraci
22 2023-08-14 CausalLM is not optimal for in-context learning link Nan Ding, Tomer Levinboim,..., Radu Soricut
21 2024-04-15 ClimODE: Climate and Weather Forecasting with Physics-informed Neural ODEs link Yogesh Verma, Markus Heinonen, Vikas Garg
21 2023-01-22 Learning to Reject with a Fixed Predictor: Application to
Decontextualization
link Christopher Mohri, Daniel Andor,..., Yutao Zhong
21 2023-09-30 AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ link Jonas Belouadi, Anne Lauscher, Steffen Eger
21 2023-09-30 InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists link Yulu Gan, Sungwoo Park,..., Ahmed Alaa
21 2024-01-19 Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model link Yinan Zheng, Jianxiong Li,..., Jingjing Liu
21 2024-02-28 Deep Confident Steps to New Pockets: Strategies for Docking
Generalization
link Gabriele Corso, Arthur Deng,..., Tommi S. Jaakkola
21 2023-09-29 Consistency Models as a Rich and Efficient Policy Class
for Reinforcement Learning
link Zihan Ding, Chi Jin
21 2024-01-22 Fourier Transporter: Bi-Equivariant Robotic Manipulation in 3D link Haojie Huang, Owen Lewis Howell,..., Robin Walters
21 2023-04-12 Energy-guided Entropic Neural Optimal Transport link Petr Mokrov, Alexander Korotin,..., Evgeny Burnaev
21 2024-03-21 C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via
Text Feature Dispersion
link Hee Suk Yoon, Eunseop Yoon,..., Chang D. Yoo
21 2023-01-05 Skip-Attention: Improving Vision Transformers by Paying Less Attention link Shashanka Venkataramanan, Amir Ghodrati,..., Amir Habibian
21 2023-09-12 Reasoning with Latent Diffusion in Offline Reinforcement Learning link Siddarth Venkatraman, Shivesh Khaitan,..., Glen Berseth