Last updated: 2025-05-19 23:35:37. Maintained by Weisen Jiang.

citation publish date title (pdf) review authors
2127 2023-07-04 SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis link Dustin Podell, Zion English,..., Robin Rombach
1899 2023-04-20 MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models link Deyao Zhu, Jun Chen,..., Mohamed Elhoseiny
1131 2023-07-17 FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning link Tri Dao
865 2023-05-31 Let's Verify Step by Step link Hunter Lightman, Vineet Kosaraju,..., Karl Cobbe
781 2023-07-10 AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning link Yuwei Guo, Ceyuan Yang,..., Bo Dai
639 2023-04-11 Teaching Large Language Models to Self-Debug link Xinyun Chen, Maxwell Lin,..., Denny Zhou
633 2023-06-14 WizardCoder: Empowering Code Large Language Models with Evol-Instruct link Ziyang Luo, Can Xu,..., Daxin Jiang
630 2023-10-17 Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection link Akari Asai, Zeqiu Wu,..., Hannaneh Hajishirzi
617 2023-07-31 ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs link Yujia Qin, Shihao Liang,..., Maosong Sun
591 2023-09-28 DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation link Jiaxiang Tang, Jiawei Ren,..., Gang Zeng
589 2023-08-31 MVDream: Multi-view Diffusion for 3D Generation link Yichun Shi, Peng Wang,..., Xiao Yang
524 2023-10-05 Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! link Xiangyu Qi, Yi Zeng,..., Peter Henderson
496 2023-10-03 MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts link Pan Lu, Hritik Bansal,..., Jianfeng Gao
463 2023-10-10 SWE-bench: Can Language Models Resolve Real-world Github Issues? link Carlos E Jimenez, John Yang,..., Karthik R Narasimhan
445 2023-08-14 ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate link Chi-Min Chan, Weize Chen,..., Zhiyuan Liu
437 2023-10-10 iTransformer: Inverted Transformers Are Effective for Time Series Forecasting link Yong Liu, Tengge Hu,..., Mingsheng Long
421 2023-10-03 Large Language Models Cannot Self-Correct Reasoning Yet link Jie Huang, Xinyun Chen,..., Denny Zhou
416 2023-09-07 SyncDreamer: Generating Multiview-consistent Images from a Single-view Image link Yuan Liu, Cheng Lin,..., Wenping Wang
412 2023-11-08 LRM: Large Reconstruction Model for Single Image to 3D link Yicong Hong, Kai Zhang,..., Hao Tan
381 2023-07-25 WebArena: A Realistic Web Environment for Building Autonomous Agents link Shuyan Zhou, Frank F. Xu,..., Graham Neubig
375 2023-09-07 Large Language Models as Optimizers link Chengrun Yang, Xuezhi Wang,..., Xinyun Chen
370 2023-06-22 Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs link Miao Xiong, Zhiyuan Hu,..., Bryan Hooi
361 2023-09-11 MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning link Xiang Yue, Xingwei Qu,..., Wenhu Chen
357 2023-05-19 CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing link Zhibin Gou, Zhihong Shao,..., Weizhu Chen
355 2023-06-30 Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors link Guocheng Qian, Jinjie Mai,..., Bernard Ghanem
355 2023-06-20 A Simple and Effective Pruning Approach for Large Language Models link Mingjie Sun, Zhuang Liu,..., J Zico Kolter
354 2023-10-03 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models link Ming Jin, Shiyu Wang,..., Qingsong Wen
333 2023-09-15 Sparse Autoencoders Find Highly Interpretable Features in Language Models link Robert Huben, Hoagy Cunningham,..., Lee Sharkey
329 2023-09-21 MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models link Longhui Yu, Weisen Jiang,..., Weiyang Liu
316 2023-05-22 Training Diffusion Models with Reinforcement Learning link Kevin Black, Michael Janner,..., Sergey Levine
312 2023-09-28 Vision Transformers Need Registers link Timothée Darcet, Maxime Oquab,..., Piotr Bojanowski
301 2023-10-12 Ferret: Refer and Ground Anything Anywhere at Any Granularity link Haoxuan You, Haotian Zhang,..., Yinfei Yang
296 2023-10-17 Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting link Melanie Sclar, Yejin Choi,..., Alane Suhr
291 2023-10-19 Eureka: Human-Level Reward Design via Coding Large Language Models link Yecheng Jason Ma, William Liang,..., Anima Anandkumar
289 2023-10-19 Safe RLHF: Safe Reinforcement Learning from Human Feedback link Josef Dai, Xuehai Pan,..., Yaodong Yang
278 2023-10-09 Language Model Beats Diffusion - Tokenizer is key to visual generation link Lijun Yu, Jose Lezama,..., Lu Jiang
271 2023-10-16 Llemma: An Open Language Model for Mathematics link Zhangir Azerbayev, Hailey Schoelkopf,..., Sean Welleck
268 2023-10-11 Catastrophic Jailbreak of Open-source LLMs via Exploiting Generation link Yangsibo Huang, Samyak Gupta,..., Danqi Chen
264 2023-10-10 Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning link Mengzhou Xia, Tianyu Gao,..., Danqi Chen
261 2023-08-07 AgentBench: Evaluating LLMs as Agents link Xiao Liu, Hao Yu,..., Jie Tang
258 2023-10-03 AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models link Xiaogeng Liu, Nan Xu,..., Chaowei Xiao
254 2023-11-10 Instant3D: Fast Text-to-3D with Sparse-view Generation and Large Reconstruction Model link Jiahao Li, Hao Tan,..., Sai Bi
250 2023-07-19 TokenFlow: Consistent Diffusion Features for Consistent Video Editing link Michal Geyer, Omer Bar-Tal,..., Tali Dekel
245 2023-10-16 Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting link Zeyu Yang, Hongye Yang,..., Li Zhang
244 2023-07-13 InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation link Yi Wang, Yinan He,..., Yu Qiao
242 2023-02-14 Universal Guidance for Diffusion Models link Arpit Bansal, Hong-Min Chu,..., Tom Goldstein
241 2023-06-26 Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning link Fuxiao Liu, Kevin Lin,..., Lijuan Wang
240 2023-09-21 The Reversal Curse: LLMs trained on “A is B” fail to learn “B is A” link Lukas Berglund, Meg Tong,..., Owain Evans
236 2023-05-22 ControlVideo: Training-free Controllable Text-to-video Generation link Yabo Zhang, Yuxiang Wei,..., Qi Tian
231 2023-02-07 Effective Data Augmentation With Diffusion Models link Brandon Trabucco, Kyle Doherty,..., Ruslan Salakhutdinov
231 2023-08-12 GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher link Youliang Yuan, Wenxiang Jiao,..., Zhaopeng Tu
230 2023-06-05 SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression link Tim Dettmers, Ruslan A. Svirschevski,..., Dan Alistarh
228 2023-09-20 OpenChat: Advancing Open-source Language Models with Mixed-Quality Data link Guan Wang, Sijie Cheng,..., Yang Liu
226 2023-06-08 PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization link Yidong Wang, Zhuohao Yu,..., Yue Zhang
224 2023-08-31 YaRN: Efficient Context Window Extension of Large Language Models link Bowen Peng, Jeffrey Quesnelle,..., Enrico Shippole
221 2023-04-18 NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers link Kai Shen, Zeqian Ju,..., Jiang Bian
217 2023-09-07 Large Language Models Are Not Robust Multiple Choice Selectors link Chujie Zheng, Hao Zhou,..., Minlie Huang
216 2023-10-03 Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs link Suyu Ge, Yunan Zhang,..., Jianfeng Gao
214 2023-12-25 What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning link Wei Liu, Weihao Zeng,..., Junxian He
213 2023-03-02 Human Motion Diffusion as a Generative Prior link Yoni Shafir, Guy Tevet,..., Amit Haim Bermano
210 2023-09-13 Statistical Rejection Sampling Improves Preference Optimization link Tianqi Liu, Yao Zhao,..., Jialu Liu
208 2023-10-12 Prometheus: Inducing Fine-Grained Evaluation Capability in Language Models link Seungone Kim, Jamin Shin,..., Minjoon Seo
207 2023-05-04 Personalize Segment Anything Model with One Shot link Renrui Zhang, Zhengkai Jiang,..., Hongsheng Li
202 2023-10-03 LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment link Bin Zhu, Bin Lin,..., Li Yuan
200 2023-10-20 SALMONN: Towards Generic Hearing Abilities for Large Language Models link Changli Tang, Wenyi Yu,..., Chao Zhang
198 2023-07-24 A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis link Izzeddin Gur, Hiroki Furuta,..., Aleksandra Faust
195 2023-08-16 Stochastic Controlled Averaging for Federated Learning with Communication Compression link Xinmeng Huang, Ping Li, Xiaoyun Li
193 2023-09-12 InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation link Xingchao Liu, Xiwen Zhang,..., Qiang Liu
192 2023-10-20 Towards Understanding Sycophancy in Language Models link Mrinank Sharma, Meg Tong,..., Ethan Perez
186 None WizardLM: Empowering Large Pre-Trained Language Models to Follow Complex Instructions link Can Xu, Qingfeng Sun,..., Daxin Jiang
186 2023-10-02 Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning link Linhao Luo, Yuan-Fang Li,..., Shirui Pan
186 2023-05-26 Large Language Models as Tool Makers link Tianle Cai, Xuezhi Wang,..., Denny Zhou
182 2022-08-30 The Alignment Problem from a Deep Learning Perspective link Richard Ngo, Lawrence Chan, Sören Mindermann
180 2023-10-02 Making Retrieval-Augmented Language Models Robust to Irrelevant Context link Ori Yoran, Tomer Wolfson,..., Jonathan Berant
176 2023-09-21 LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset link Lianmin Zheng, Wei-Lin Chiang,..., Hao Zhang
176 2023-08-25 OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models link Wenqi Shao, Mengzhao Chen,..., Ping Luo
176 2023-09-14 Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions link Federico Bianchi, Mirac Suzgun,..., James Zou
174 2024-05-02 WildChat: 1M ChatGPT Interaction Logs in the Wild link Wenting Zhao, Xiang Ren,..., Yuntian Deng
170 2023-09-20 DreamLLM: Synergistic Multimodal Comprehension and Creation link Runpei Dong, Chunrui Han,..., Li Yi
166 2023-11-14 Fine-Tuning Language Models for Factuality link Katherine Tian, Eric Mitchell,..., Chelsea Finn
166 2023-10-11 Evaluating Large Language Models at Evaluating Instruction Following link Zhiyuan Zeng, Jiatong Yu,..., Danqi Chen
164 2023-12-04 The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning link Bill Yuchen Lin, Abhilasha Ravichander,..., Yejin Choi
162 2023-10-01 Analyzing and Mitigating Object Hallucination in Large Vision-Language Models link Yiyang Zhou, Chenhang Cui,..., Huaxiu Yao
161 2023-10-25 Detecting Pretraining Data from Large Language Models link Weijia Shi, Anirudh Ajith,..., Luke Zettlemoyer
158 2023-09-25 Can LLM-Generated Misinformation Be Detected? link Canyu Chen, Kai Shu
157 2023-07-05 Building Cooperative Embodied Agents Modularly with Large Language Models link Hongxin Zhang, Weihua Du,..., Chuang Gan
155 2023-05-22 Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts link Jian Xie, Kai Zhang,..., Yu Su
154 2023-06-30 Provable Robust Watermarking for AI-Generated Text link Xuandong Zhao, Prabhanjan Vijendra Ananth,..., Yu-Xiang Wang
153 2023-11-15 DMV3D: Denoising Multi-view Diffusion Using 3D Large Reconstruction Model link Yinghao Xu, Hao Tan,..., Kai Zhang
152 2023-09-21 LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models link Yukang Chen, Shengju Qian,..., Jiaya Jia
150 2023-09-27 Finite Scalar Quantization: VQ-VAE Made Simple link Fabian Mentzer, David Minnen,..., Michael Tschannen
149 2023-09-29 Directly Fine-Tuning Diffusion Models on Differentiable Rewards link Kevin Clark, Paul Vicol,..., David J. Fleet
148 2023-09-07 DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models link Yung-Sung Chuang, Yujia Xie,..., Pengcheng He
147 2023-10-22 Improved Techniques for Training Consistency Models link Yang Song, Prafulla Dhariwal
147 2023-06-05 RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems link Tianyang Liu, Canwen Xu, Julian McAuley
146 2023-09-28 DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models link Licheng Wen, Daocheng Fu,..., Yu Qiao
145 2024-01-26 SliceGPT: Compress Large Language Models by Deleting Rows and Columns link Saleh Ashkboos, Maximilian L. Croci,..., James Hensman
145 2023-08-15 Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification link Aojun Zhou, Ke Wang,..., Hongsheng Li
142 2023-09-29 ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving link Zhibin Gou, Zhihong Shao,..., Weizhu Chen
141 2023-07-05 DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models link Chong Mou, Xintao Wang,..., Jian Zhang
141 2023-10-03 Language Models Represent Space and Time link Wes Gurnee, Max Tegmark
141 2023-11-21 GAIA: a benchmark for General AI Assistants link Grégoire Mialon, Clémentine Fourrier,..., Thomas Scialom
141 2023-09-19 MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback link Xingyao Wang, Zihan Wang,..., Heng Ji
140 2023-07-04 Self-Consuming Generative Models Go MAD link Sina Alemohammad, Josue Casco-Rodriguez,..., Richard Baraniuk
137 2023-11-02 Vision-Language Foundation Models as Effective Robot Imitators link Xinghang Li, Minghuan Liu,..., Tao Kong
136 2023-05-18 Listen, Think, and Understand link Yuan Gong, Hongyin Luo,..., James R. Glass
135 2023-09-25 Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision link Haoning Wu, Zicheng Zhang,..., Weisi Lin
135 2023-03-14 Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation link Junyoung Seo, Wooseok Jang,..., Seungryong Kim
133 2023-09-14 MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning link Haozhe Zhao, Zefan Cai,..., Baobao Chang
132 2023-10-16 Zero-Shot Robotic Manipulation with Pre-Trained Image-Editing Diffusion Models link Kevin Black, Mitsuhiko Nakamoto,..., Sergey Levine
132 2023-10-25 DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior link Jingxiang Sun, Bo Zhang,..., Yebin Liu
131 2023-06-21 EquiformerV2: Improved Equivariant Transformer for Scaling to Higher-Degree Representations link Yi-Lun Liao, Brandon M Wood,..., Tess Smidt
131 2023-08-21 AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors link Weize Chen, Yusheng Su,..., Jie Zhou
131 2023-09-19 Language Modeling Is Compression link Gregoire Deletang, Anian Ruoss,..., Joel Veness
130 2023-10-17 VeRA: Vector-based Random Matrix Adaptation link Dawid Jan Kopiczko, Tijmen Blankevoort, Yuki M Asano
130 2023-10-08 Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature link Guangsheng Bao, Yanbin Zhao,..., Yue Zhang
129 2023-10-26 Proving Test Set Contamination in Black-Box Language Models link Yonatan Oren, Nicole Meister,..., Tatsunori Hashimoto
129 2023-10-02 RA-DIT: Retrieval-Augmented Dual Instruction Tuning link Xi Victoria Lin, Xilun Chen,..., Wen-tau Yih
128 2023-05-23 Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training link Hong Liu, Zhiyuan Li,..., Tengyu Ma
128 2023-10-02 Making LLaMA SEE and Draw with SEED Tokenizer link Yuying Ge, Sijie Zhao,..., Ying Shan
127 2023-07-26 Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models link Erfan Shayegani, Yue Dong, Nael Abu-Ghazaleh
126 2023-10-31 SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction link Xinyuan Chen, Yaohui Wang,..., Ziwei Liu
124 2024-02-27 When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method link Biao Zhang, Zhongtao Liu,..., Orhan Firat
124 2023-05-07 A Variational Perspective on Solving Inverse Problems with Diffusion Models link Morteza Mardani, Jiaming Song,..., Arash Vahdat
124 2023-09-29 Data Filtering Networks link Alex Fang, Albin Madappally Jose,..., Vaishaal Shankar
123 2023-10-12 LoftQ: LoRA-Fine-Tuning-aware Quantization for Large Language Models link Yixiao Li, Yifan Yu,..., Tuo Zhao
123 2023-08-11 Self-Alignment with Instruction Backtranslation link Xian Li, Ping Yu,..., Mike Lewis
123 2022-08-04 Conformal Risk Control link Anastasios Nikolas Angelopoulos, Stephen Bates,..., Tal Schuster
123 2023-10-19 SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation link Chongyu Fan, Jiancheng Liu,..., Sijia Liu
123 2023-10-08 TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting link Defu Cao, Furong Jia,..., Yan Liu
122 2023-10-29 AnomalyCLIP: Object-agnostic Prompt Learning for Zero-shot Anomaly Detection link Qihang Zhou, Guansong Pang,..., Jiming Chen
122 2024-01-31 RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval link Parth Sarthi, Salman Abdullah,..., Christopher D Manning
121 2023-10-25 TD-MPC2: Scalable, Robust World Models for Continuous Control link Nicklas Hansen, Hao Su, Xiaolong Wang
121 2023-10-10 Understanding the Effects of RLHF on LLM Generalisation and Diversity link Robert Kirk, Ishita Mediratta,..., Roberta Raileanu
119 2024-04-19 SaProt: Protein Language Modeling with Structure-aware Vocabulary link Jin Su, Chenchen Han,..., Fajie Yuan
117 2023-08-14 OctoPack: Instruction Tuning Code Large Language Models link Niklas Muennighoff, Qian Liu,..., Shayne Longpre
116 2023-10-18 SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents link Xuhui Zhou, Hao Zhu,..., Maarten Sap
116 2023-10-04 Reward Model Ensembles Help Mitigate Overoptimization link Thomas Coste, Usman Anwar,..., David Krueger
115 2023-02-06 Chain of Hindsight aligns Language Models with Feedback link Hao Liu, Carmelo Sferrazza, Pieter Abbeel
115 2023-06-09 Can Large Language Models Infer Causation from Correlation? link Zhijing Jin, Jiarui Liu,..., Bernhard Schölkopf
115 2023-10-04 SweetDreamer: Aligning Geometric Priors in 2D diffusion for Consistent Text-to-3D link Weiyu Li, Rui Chen,..., Ping Tan
115 2023-10-19 Habitat 3.0: A Co-Habitat for Humans, Avatars, and Robots link Xavier Puig, Eric Undersander,..., Roozbeh Mottaghi
114 2023-08-02 From Sparse to Soft Mixtures of Experts link Joan Puigcerver, Carlos Riquelme Ruiz,..., Neil Houlsby
113 2023-10-10 Multilingual Jailbreak Challenges in Large Language Models link Yue Deng, Wenxuan Zhang,..., Lidong Bing
113 2023-06-07 On the Reliability of Watermarks for Large Language Models link John Kirchenbauer, Jonas Geiping,..., Tom Goldstein
113 2023-11-03 EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision link Jiawei Yang, Boris Ivanovic,..., Yue Wang
112 2023-08-16 TEST: Text Prototype Aligned Embedding to Activate LLM's Ability for Time Series link Chenxi Sun, Hongyan Li,..., Shenda Hong
110 2023-09-29 One For All: Towards Training One Graph Model For All Classification Tasks link Hao Liu, Jiarui Feng,..., Muhan Zhang
110 2023-10-24 What Algorithms can Transformers Learn? A Study in Length Generalization link Hattie Zhou, Arwen Bradley,..., Preetum Nakkiran
110 2023-05-04 ZipIt! Merging Models from Different Tasks without Training link George Stoica, Daniel Bolya,..., Judy Hoffman
110 2023-08-01 SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning link Ning Miao, Yee Whye Teh, Tom Rainforth
108 2023-09-28 Demystifying CLIP Data link Hu Xu, Saining Xie,..., Christoph Feichtenhofer
108 2023-05-25 Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation link Niels Mündler, Jingxuan He,..., Martin Vechev
108 2023-06-16 Is Self-Repair a Silver Bullet for Code Generation? link Theo X. Olausson, Jeevana Priya Inala,..., Armando Solar-Lezama
108 2023-10-12 OmniControl: Control Any Joint at Any Time for Human Motion Generation link Yiming Xie, Varun Jampani,..., Huaizu Jiang
107 2023-05-31 MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training link Yizhi Li, Ruibin Yuan,..., Jie Fu
107 2023-09-13 RAIN: Your Language Models Can Align Themselves without Finetuning link Yuhui Li, Fangyun Wei,..., Hongyang Zhang
106 2023-10-01 BooookScore: A systematic exploration of book-length summarization in the era of LLMs link Yapei Chang, Kyle Lo,..., Mohit Iyyer
106 2023-08-25 Nougat: Neural Optical Understanding for Academic Documents link Lukas Blecher, Guillem Cucurull,..., Robert Stojnic
106 2023-10-09 Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models link Huaixiu Steven Zheng, Swaroop Mishra,..., Denny Zhou
105 None The Expressive Power of Transformers with Chain of Thought link William Merrill, Ashish Sabharwal
103 2023-10-04 MagicDrive: Street View Generation with Diverse 3D Geometry Control link Ruiyuan Gao, Kai Chen,..., Qiang Xu
101 2023-08-07 Nearly $d$-Linear Convergence Bounds for Diffusion Models via Stochastic Localization link Joe Benton, Valentin De Bortoli,..., George Deligiannidis
101 2023-10-25 PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization link Xinyuan Wang, Chenxi Li,..., Zhiting Hu
101 2023-11-08 Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs link Shashank Gupta, Vaishnavi Shrivastava,..., Tushar Khot
101 2024-05-23 TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting link Shiyu Wang, Haixu Wu,..., Jun Zhou
100 2023-07-16 Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency link Bowen Song, Soo Min Kwon,..., Liyue Shen
100 2023-11-20 PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction link Peng Wang, Hao Tan,..., Kai Zhang
100 2023-10-23 Function Vectors in Large Language Models link Eric Todd, Millicent Li,..., David Bau
97 2023-09-25 Identifying the Risks of LM Agents with an LM-Emulated Sandbox link Yangjun Ruan, Honghua Dong,..., Tatsunori Hashimoto
97 2023-07-20 FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets link Seonghyeon Ye, Doyoung Kim,..., Minjoon Seo
97 2023-09-11 Hypothesis Search: Inductive Reasoning with Language Models link Ruocheng Wang, Eric Zelikman,..., Noah Goodman
97 2023-10-04 AdaMerging: Adaptive Model Merging for Multi-Task Learning link Enneng Yang, Zhenyi Wang,..., Dacheng Tao
97 2023-09-27 Towards Best Practices of Activation Patching in Language Models: Metrics and Methods link Fred Zhang, Neel Nanda
96 2024-02-20 Chain of Thought Empowers Transformers to Solve Inherently Serial Problems link Zhiyuan Li, Hong Liu,..., Tengyu Ma
95 2023-10-03 Think before you speak: Training Language Models With Pause Tokens link Sachin Goyal, Ziwei Ji,..., Vaishnavh Nagarajan
94 2023-09-29 Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks link Vaidehi Patil, Peter Hase, Mohit Bansal
94 2023-12-20 Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation link Hongtao Wu, Ya Jing,..., Tao Kong
94 2023-10-05 MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning link Ke Wang, Houxing Ren,..., Hongsheng Li
93 2023-10-06 Talk like a Graph: Encoding Graphs for Large Language Models link Bahare Fatemi, Jonathan Halcrow, Bryan Perozzi
93 2023-07-11 ReLoRA: High-Rank Training Through Low-Rank Updates link Vladislav Lialin, Sherin Muckatira,..., Anna Rumshisky
92 None ModernTCN: A Modern Pure Convolution Structure for General Time Series Analysis link Luo Donghao, Wang Xue
92 2023-08-16 Time Travel in LLMs: Tracing Data Contamination in Large Language Models link Shahriar Golchin, Mihai Surdeanu
91 2024-01-09 Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding link Zilong Wang, Hao Zhang,..., Tomas Pfister
91 2023-05-19 Multimodal Web Navigation with Instruction-Finetuned Foundation Models link Hiroki Furuta, Kuang-Huei Lee,..., Izzeddin Gur
89 2023-09-26 QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models link Yuhui Xu, Lingxi Xie,..., Qi Tian
88 2023-09-29 Guiding Instruction-based Image Editing via Multimodal Large Language Models link Tsu-Jui Fu, Wenze Hu,..., Zhe Gan
88 2023-09-11 Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning link Ted Zadouri, Ahmet Üstün,..., Sara Hooker
88 2023-10-23 FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling link Haonan Qiu, Menghan Xia,..., Ziwei Liu
87 2023-10-10 Uni3D: Exploring Unified 3D Representation at Scale link Junsheng Zhou, Jinsheng Wang,..., Xinlong Wang
87 2023-05-22 Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources link Xingxuan Li, Ruochen Zhao,..., Lidong Bing
85 2023-10-16 Video Language Planning link Yilun Du, Sherry Yang,..., Jonathan Tompson
85 None LLaMA-Adapter: Efficient Fine-tuning of Large Language Models with Zero-initialized Attention link Renrui Zhang, Jiaming Han,..., Peng Gao
84 2023-10-31 What's In My Big Data? link Yanai Elazar, Akshita Bhagia,..., Jesse Dodge
84 2023-08-17 Linearity of Relation Decoding in Transformer Language Models link Evan Hernandez, Arnab Sen Sharma,..., David Bau
84 2023-05-27 DNA-GPT: Divergent N-Gram Analysis for Training-Free Detection of GPT-Generated Text link Xianjun Yang, Wei Cheng,..., Haifeng Chen
84 2023-08-03 The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World link Weiyun Wang, Min Shi,..., Yu Qiao
84 2023-06-23 On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes link Rishabh Agarwal, Nino Vieillard,..., Olivier Bachem
83 2024-02-06 INSIDE: LLMs' Internal States Retain the Power of Hallucination Detection link Chao Chen, Kai Liu,..., Jieping Ye
83 2023-10-08 Scaling Laws of RoPE-based Extrapolation link Xiaoran Liu, Hang Yan,..., Dahua Lin
83 2023-10-30 Text-to-3D with Classifier Score Distillation link Xin Yu, Yuan-Chen Guo,..., Xiaojuan Qi
83 2023-05-22 Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching link Yang Liu, Muzhi Zhu,..., Chunhua Shen
82 2023-10-14 Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space link Hengrui Zhang, Jiani Zhang,..., George Karypis
82 2023-07-06 FITS: Modeling Time Series with $10k$ Parameters link Zhijian Xu, Ailing Zeng, Qiang Xu
82 2023-09-28 Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints link Chaoqi Wang, Yibo Jiang,..., Yuxin Chen
81 2023-09-25 Small-scale proxies for large-scale Transformer training instabilities link Mitchell Wortsman, Peter J Liu,..., Simon Kornblith
81 2023-10-09 Interpreting CLIP's Image Representation via Text-Based Decomposition link Yossi Gandelsman, Alexei A Efros, Jacob Steinhardt
81 2023-02-15 Learning Performance-Improving Code Edits link Alexander G Shypula, Aman Madaan,..., Amir Yazdanbakhsh
81 2023-07-07 Teaching Arithmetic to Small Transformers link Nayoung Lee, Kartik Sreenivasan,..., Dimitris Papailiopoulos
81 2023-10-04 MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use link Yue Huang, Jiawen Shi,..., Lichao Sun
80 2023-07-07 One Step of Gradient Descent is Provably the Optimal In-Context Learner with One Layer of Linear Self-Attention link Arvind V. Mahankali, Tatsunori Hashimoto, Tengyu Ma
80 2023-09-11 Does Writing with Language Models Reduce Content Diversity? link Vishakh Padmakumar, He He
79 2023-10-04 Retrieval meets Long Context Large Language Models link Peng Xu, Wei Ping,..., Bryan Catanzaro
78 2023-10-10 OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text link Keiran Paster, Marco Dos Santos,..., Jimmy Ba
78 2023-10-12 DistillSpec: Improving Speculative Decoding via Knowledge Distillation link Yongchao Zhou, Kaifeng Lyu,..., Rishabh Agarwal
78 2023-12-21 The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction link Pratyusha Sharma, Jordan T. Ash, Dipendra Misra
78 2023-09-19 PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training link Dawei Zhu, Nan Yang,..., Sujian Li
77 2024-04-22 Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing link Dujian Ding, Ankur Mallick,..., Ahmed Hassan Awadallah
77 2023-10-03 Large Language Models as Analogical Reasoners link Michihiro Yasunaga, Xinyun Chen,..., Denny Zhou
76 2023-10-02 GenSim: Generating Robotic Simulation Tasks via Large Language Models link Lirui Wang, Yiyang Ling,..., Xiaolong Wang
76 2023-10-09 Generative Judge for Evaluating Alignment link Junlong Li, Shichao Sun,..., Pengfei Liu
76 2023-10-16 Ring-A-Bell! How Reliable are Concept Removal Methods For Diffusion Models? link Yu-Lin Tsai, Chia-Yi Hsu,..., Chun-Ying Huang
76 2023-06-13 Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models link Yin Fang, Xiaozhuan Liang,..., Huajun Chen
76 2023-10-19 Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning link Juan Rocamonde, Victoriano Montesinos,..., David Lindner
75 2023-10-12 Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement link Linlu Qiu, Liwei Jiang,..., Xiang Ren
75 2023-11-06 AnyText: Multilingual Visual Text Generation and Editing link Yuxiang Tuo, Wangmeng Xiang,..., Xuansong Xie
75 2023-10-09 NEFTune: Noisy Embeddings Improve Instruction Finetuning link Neel Jain, Ping-yeh Chiang,..., Tom Goldstein
75 2023-06-01 Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis link Hubert Siuzdak
75 2023-10-05 GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction link Oscar Sainz, Iker García-Ferrero,..., Eneko Agirre
75 None Adapting Large Language Models via Reading Comprehension link Daixuan Cheng, Shaohan Huang, Furu Wei
74 2023-10-11 Beyond Memorization: Violating Privacy via Inference with Large Language Models link Robin Staab, Mark Vero,..., Martin Vechev
74 2023-10-09 FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing link Yuren Cong, Mengmeng Xu,..., Sen He
74 2023-03-10 Tag2Text: Guiding Vision-Language Model via Image Tagging link Xinyu Huang, Youcai Zhang,..., Lei Zhang
73 2023-10-27 Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory link Niloofar Mireshghallah, Hyunwoo Kim,..., Yejin Choi
73 2023-10-03 SE(3)-Stochastic Flow Matching for Protein Backbone Generation link Joey Bose, Tara Akhound-Sadegh,..., Alexander Tong
73 2022-06-20 LUT-GEMM: Quantized Matrix Multiplication based on LUTs for Efficient Inference in Large-Scale Generative Language Models link Gunho Park, Baeseong Park,..., Dongsoo Lee
73 2023-05-30 HIFA: High-fidelity Text-to-3D Generation with Advanced Diffusion Guidance link Junzhe Zhu, Peiye Zhuang, Sanmi Koyejo
72 2023-08-04 Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization link Weiran Yao, Shelby Heinecke,..., Silvio Savarese
72 None RECOMP: Improving Retrieval-Augmented LMs with Context Compression and Selective Augmentation link Fangyuan Xu, Weijia Shi, Eunsol Choi
72 2023-10-07 Label-free Node Classification on Graphs with Large Language Models (LLMs) link Zhikai Chen, Haitao Mao,..., Jiliang Tang
71 2023-10-12 ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models link Yingqing He, Shaoshu Yang,..., Ying Shan
71 2023-11-02 Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game link Sam Toyer, Olivia Watkins,..., Stuart Russell
71 2023-05-31 Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning link Xiaoxin He, Xavier Bresson,..., Bryan Hooi
70 2023-10-26 Noise-free Score Distillation link Oren Katzir, Or Patashnik,..., Dani Lischinski
69 2023-10-02 CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction link Size Wu, Wenwei Zhang,..., Chen Change Loy
68 2024-05-29 Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI link Weibang Jiang, Liming Zhao, Bao-liang Lu
68 2023-08-08 Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions link Juncheng Li, Kaihang Pan,..., Yueting Zhuang
68 2023-10-31 The Generative AI Paradox: “What It Can Create, It May Not Understand” link Peter West, Ximing Lu,..., Yejin Choi
68 2023-10-20 An LLM can Fool Itself: A Prompt-Based Adversarial Attack link Xilie Xu, Keyi Kong,..., Mohan Kankanhalli
67 2023-02-07 Flow Matching on General Geometries link Ricky T. Q. Chen, Yaron Lipman
67 2023-10-10 Lemur: Harmonizing Natural Language and Code for Language Agents link Yiheng Xu, Hongjin SU,..., Tao Yu
67 2023-09-14 Unified Human-Scene Interaction via Prompted Chain-of-Contacts link Zeqi Xiao, Tai Wang,..., Jiangmiao Pang
66 2023-06-15 KoLA: Carefully Benchmarking World Knowledge of Large Language Models link Jifan Yu, Xiaozhi Wang,..., Juanzi Li
66 2023-07-13 In-context Autoencoder for Context Compression in a Large Language Model link Tao Ge, Hu Jing,..., Furu Wei
66 2023-10-27 Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation link Jaemin Cho, Yushi Hu,..., Su Wang
66 2023-12-08 Zoology: Measuring and Improving Recall in Efficient Language Models link Simran Arora, Sabri Eyuboglu,..., Christopher Re
66 2023-06-09 FasterViT: Fast Vision Transformers with Hierarchical Attention link Ali Hatamizadeh, Greg Heinrich,..., Pavlo Molchanov
65 2023-10-17 Zipformer: A faster and better encoder for automatic speech recognition link Zengwei Yao, Liyong Guo,..., Daniel Povey
64 2023-10-04 Generalization in diffusion models arises from geometry-adaptive harmonic representations link Zahra Kadkhodaie, Florentin Guth,..., Stéphane Mallat
64 2023-11-28 Manifold Preserving Guided Diffusion link Yutong He, Naoki Murata,..., Stefano Ermon
64 2023-10-02 SmartPlay: A Benchmark for LLMs as Intelligent Agents link Yue Wu, Xuan Tang,..., Yuanzhi Li
63 2023-08-08 SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore link Sewon Min, Suchin Gururangan,..., Luke Zettlemoyer
63 2023-06-21 DreamTime: An Improved Optimization Strategy for Diffusion-Guided 3D Generation link Yukun Huang, Jianan Wang,..., Lei Zhang
63 2023-03-10 Decomposed Diffusion Sampler for Accelerating Large-Scale Inverse Problems link Hyungjin Chung, Suhyeon Lee, Jong Chul Ye
63 2023-09-29 Denoising Diffusion Bridge Models link Linqi Zhou, Aaron Lou,..., Stefano Ermon
62 2023-10-01 LEGO-Prover: Neural Theorem Proving with Growing Libraries link Haiming Wang, Huajian Xin,..., Xiaodan Liang
62 2023-10-12 Circuit Component Reuse Across Tasks in Transformer Language Models link Jack Merullo, Carsten Eickhoff, Ellie Pavlick
62 2023-11-03 RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches link Jiayuan Gu, Sean Kirmani,..., Ted Xiao
62 2023-09-13 Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs link Angelica Chen, Ravid Shwartz-Ziv,..., Naomi Saphra
62 2023-10-04 Kosmos-G: Generating Images in Context with Multimodal Large Language Models link Xichen Pan, Li Dong,..., Furu Wei
61 2023-11-21 Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks link Samyak Jain, Robert Kirk,..., David Krueger
61 2024-01-19 Knowledge Fusion of Large Language Models link Fanqi Wan, Xinting Huang,..., Shuming Shi
61 2023-10-09 HyperAttention: Long-context Attention in Near-Linear Time link Insu Han, Rajesh Jayaram,..., Amir Zandieh
61 2023-08-31 SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models link Xin Zhang, Dong Zhang,..., Xipeng Qiu
61 2023-10-09 Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching link Ziyao Guo, Kai Wang,..., Yang You
60 2023-10-06 ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models link Seyed Iman Mirzadeh, Keivan Alizadeh-Vahid,..., Mehrdad Farajtabar
60 2023-11-24 Universal Jailbreak Backdoors from Poisoned Human Feedback link Javier Rando, Florian Tramèr
60 None Diffusion Posterior Sampling for Linear Inverse Problem Solving: A Filtering Perspective link Zehao Dou, Yang Song
59 2023-08-07 UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition link Wenxuan Zhou, Sheng Zhang,..., Hoifung Poon
59 2023-07-15 Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph link Jiashuo Sun, Chengjin Xu,..., Jian Guo
58 2023-10-26 Large Language Models as Generalizable Policies for Embodied Tasks link Andrew Szot, Max Schwarzer,..., Alexander T Toshev
58 2023-09-29 CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets link Lifan Yuan, Yangyi Chen,..., Heng Ji
58 2024-03-04 Diffusion-TS: Interpretable Diffusion for General Time Series Generation link Xinyu Yuan, Yan Qiao
57 2023-09-28 At Which Training Stage Does Code Data Help LLMs Reasoning? link YINGWEI MA, Yue Liu,..., Shanshan Li
57 2023-02-12 Single Motion Diffusion link Sigal Raab, Inbal Leibovitch,..., Daniel Cohen-Or
57 2023-09-18 Understanding Catastrophic Forgetting in Language Models via Implicit Inference link Suhas Kotha, Jacob Mitchell Springer, Aditi Raghunathan
57 2023-11-06 Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video link Yanqin Jiang, Li Zhang,..., Yao Yao
57 2023-09-20 A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models link Haoran Xu, Young Jin Kim,..., Hany Hassan Awadalla
57 2023-12-06 DiffusionSat: A Generative Foundation Model for Satellite Imagery link Samar Khanna, Patrick Liu,..., Stefano Ermon
57 2023-06-13 Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control link Longtao Zheng, Rundong Wang,..., Bo An
57 2023-10-12 HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion link Xian Liu, Jian Ren,..., Sergey Tulyakov
56 2023-09-25 LLMCarbon: Modeling the End-to-End Carbon Footprint of Large Language Models link Ahmad Faiz, Sotaro Kaneda,..., Lei Jiang
56 2023-10-10 Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models link Fei Shen, Hu Ye,..., Yang Wei
56 2024-03-18 Improving LoRA in Privacy-preserving Federated Learning link Youbang Sun, Zitao Li,..., Bolin Ding
56 2024-02-06 Fine-Tuned Language Models Generate Stable Inorganic Materials as Text link Nate Gruver, Anuroop Sriram,..., Zachary Ward Ulissi
56 2023-05-23 VDT: General-purpose Video Diffusion Transformers via Mask Modeling link Haoyu Lu, Guoxing Yang,..., Mingyu Ding
56 2023-11-10 Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization link Weiyang Liu, Zeju Qiu,..., Bernhard Schölkopf
56 2024-02-15 Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips link Man Yao, JiaKui Hu,..., Guoqi Li
56 2022-11-07 MogaNet: Multi-order Gated Aggregation Network link Siyuan Li, Zedong Wang,..., Stan Z. Li
55 2023-10-24 MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning link Zayne Rea Sprague, Xi Ye,..., Greg Durrett
55 2023-12-13 Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF link Anand Siththaranjan, Cassidy Laidlaw, Dylan Hadfield-Menell
55 2023-10-04 Large Language Model Cascades with Mixture of Thought Representations for Cost-Efficient Reasoning link Murong Yue, Jie Zhao,..., Ziyu Yao
55 2023-05-24 Unpaired Image-to-Image Translation via Neural Schrödinger Bridge link Beomsu Kim, Gihyun Kwon,..., Jong Chul Ye
55 2023-06-16 Conformal Language Modeling link Victor Quach, Adam Fisch,..., Regina Barzilay
55 2023-03-16 Rethinking Model Ensemble in Transfer-based Adversarial Attacks link Huanran Chen, Yichi Zhang,..., Jun Zhu
54 2024-02-22 Cameras as Rays: Pose Estimation via Ray Diffusion link Jason Y. Zhang, Amy Lin,..., Shubham Tulsiani
54 2023-09-29 Motif: Intrinsic Motivation from Artificial Intelligence Feedback link Martin Klissarov, Pierluca D'Oro,..., Mikael Henaff
54 2023-05-24 Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large Language Models link Sheng Shen, Le Hou,..., Denny Zhou
54 2023-09-29 LLM-grounded Video Diffusion Models link Long Lian, Baifeng Shi,..., Boyi Li
54 2023-10-10 A Semantic Invariant Robust Watermark for Large Language Models link Aiwei Liu, Leyi Pan,..., Lijie Wen
53 2023-10-20 ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search link Yuchen Zhuang, Xiang Chen,..., Chao Zhang
53 2023-10-02 DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models link Yongchan Kwon, Eric Wu,..., James Zou
53 2023-09-29 Batch Calibration: Rethinking Calibration for In-Context Learning and Prompt Engineering link Han Zhou, Xingchen Wan,..., Subhrajit Roy
52 2023-07-18 Overthinking the Truth: Understanding how Language Models Process False Demonstrations link Danny Halawi, Jean-Stanislas Denain, Jacob Steinhardt
52 None DSPy: Compiling Declarative Language Model Calls into State-of-the-Art Pipelines link Omar Khattab, Arnav Singhvi,..., Christopher Potts
52 2023-07-03 Improved sampling via learned diffusions link Lorenz Richter, Julius Berner
52 2023-10-13 Vision-by-Language for Training-Free Compositional Image Retrieval link Shyamgopal Karthik, Karsten Roth,..., Zeynep Akata
52 2023-06-06 Turning large language models into cognitive models link Marcel Binz, Eric Schulz
52 2023-09-21 Privacy-Preserving In-Context Learning with Differentially Private Few-Shot Generation link Xinyu Tang, Richard Shin,..., Robert Sim
52 2024-02-22 Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking link Nikhil Prakash, Tamar Rott Shaham,..., David Bau
52 2023-06-01 Consistency-guided Prompt Learning for Vision-Language Models link Shuvendu Roy, Ali Etemad
51 2024-07-31 Detecting, Explaining, and Mitigating Memorization in Diffusion Models link Yuxin Wen, Yuchen Liu,..., Lingjuan Lyu
51 2023-09-28 A Benchmark for Learning to Translate a New Language from One Grammar Book link Garrett Tanzer, Mirac Suzgun,..., Luke Melas-Kyriazi
51 2023-10-19 An Emulator for Fine-tuning Large Language Models using Small Language Models link Eric Mitchell, Rafael Rafailov,..., Christopher D Manning
51 None PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code link Xuan Ju, Ailing Zeng,..., Qiang Xu
51 2023-10-12 QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models link Jing Liu, Ruihao Gong,..., Bohan Zhuang
50 2023-12-03 The mechanistic basis of data dependence and abrupt learning in an in-context classification task link Gautam Reddy
50 2024-03-20 BadEdit: Backdooring Large Language Models by Model Editing link Yanzhou Li, Tianlin Li,..., Yang Liu
50 2023-11-02 Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion link Lunjun Zhang, Yuwen Xiong,..., Raquel Urtasun
50 2022-09-08 Exploring Target Representations for Masked Autoencoders link Xingbin Liu, Jinghao Zhou,..., Rongrong Ji
49 2023-11-11 Finetuning Text-to-Image Diffusion Models for Fairness link Xudong Shen, Chao Du,..., Mohan Kankanhalli
49 2023-10-12 How Many Pretraining Tasks Are Needed for In-Context Learning of Linear Regression? link Jingfeng Wu, Difan Zou,..., Peter Bartlett
49 2024-03-15 FeatUp: A Model-Agnostic Framework for Features at Any Resolution link Stephanie Fu, Mark Hamilton,..., William T. Freeman
49 2023-07-31 AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos? link Qi Zhao, Shijie Wang,..., Chen Sun
49 2023-09-28 Human Feedback is not Gold Standard link Tom Hosking, Phil Blunsom, Max Bartolo
49 2023-10-26 The Expressive Power of Low-Rank Adaptation link Yuchen Zeng, Kangwook Lee
48 2023-10-16 In-Context Pretraining: Language Modeling Beyond Document Boundaries link Weijia Shi, Sewon Min,..., Mike Lewis
48 2023-09-20 Text2Reward: Reward Shaping with Language Models for Reinforcement Learning link Tianbao Xie, Siheng Zhao,..., Tao Yu
48 2023-08-23 Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages link Jinyi Hu, Yuan Yao,..., Maosong Sun
48 2023-10-06 Universal Humanoid Motion Representations for Physics-Based Control link Zhengyi Luo, Jinkun Cao,..., Weipeng Xu
48 2023-09-26 How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions link Lorenzo Pacchiardi, Alex James Chan,..., Jan M. Brauner
47 2023-03-08 InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning link Ziheng Qin, Kai Wang,..., Yang You
47 2023-10-06 Confronting Reward Model Overoptimization with Constrained RLHF link Ted Moskovitz, Aaditya K Singh,..., Stephen Marcus McAleer
47 2023-06-20 Evaluating the Zero-shot Robustness of Instruction-tuned Language Models link Jiuding Sun, Chantal Shaib, Byron C Wallace
47 2024-02-06 The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry link Michael Zhang, Kush Bhatia,..., Christopher Re
47 2023-06-30 Learning Delays in Spiking Neural Networks using Dilated Convolutions with Learnable Spacings link Ilyass Hammouamri, Ismail Khalfaoui-Hassani, Timothée Masquelier
47 2023-05-19 LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation link Suhyeon Lee, Won Jun Kim,..., Jong Chul Ye
47 2023-05-18 Deep Temporal Graph Clustering link Meng Liu, Yue Liu,..., Xinwang Liu
47 2023-05-05 DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation link Hong Chen, Yipeng Zhang,..., Wenwu Zhu
47 2023-10-16 How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations link Tianyu Guo, Wei Hu,..., Yu Bai
47 2023-10-02 ImagenHub: Standardizing the evaluation of conditional image generation models link Max Ku, Tianle Li,..., Wenhu Chen
46 2023-10-06 Amortizing intractable inference in large language models link Edward J Hu, Moksh Jain,..., Nikolay Malkin
46 2023-10-05 EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models link Yefei He, Jing Liu,..., Bohan Zhuang
46 2024-02-06 Large Language Models to Enhance Bayesian Optimization link Tennison Liu, Nicolás Astorga,..., Mihaela van der Schaar
46 2023-02-02 Neural Common Neighbor with Completion for Link Prediction link Xiyuan Wang, Haotong Yang, Muhan Zhang
45 2023-09-22 Unbiased Watermark for Large Language Models link Zhengmian Hu, Lichang Chen,..., Heng Huang
45 2024-01-16 Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis link Zhenhui Ye, Tianyun Zhong,..., Zhou Zhao
45 2023-05-26 Training Socially Aligned Language Models on Simulated Social Interactions link Ruibo Liu, Ruixin Yang,..., Soroush Vosoughi
45 2023-10-02 Compressing LLMs: The Truth is Rarely Pure and Never Simple link AJAY KUMAR JAISWAL, Zhe Gan,..., Yinfei Yang
45 2023-10-16 Towards image compression with perfect realism at ultra-low bitrates link Marlene Careil, Matthew J. Muckley,..., Stéphane Lathuilière
44 None Functional Interpolation for Relative Positions improves Long Context Transformers link Shanda Li, Chong You,..., Srinadh Bhojanapalli
44 2023-10-18 Brain decoding: toward real-time reconstruction of visual perception link Yohann Benchetrit, Hubert Banville, Jean-Remi King
44 2023-12-14 Successor Heads: Recurring, Interpretable Attention Heads In The Wild link Rhys Gould, Euan Ong,..., Arthur Conmy
44 2024-01-23 ARGS: Alignment as Reward-Guided Search link Maxim Khanov, Jirayu Burapacheep, Yixuan Li
44 2023-08-24 Bayesian Low-rank Adaptation for Large Language Models link Adam X. Yang, Maxime Robeyns,..., Laurence Aitchison
44 2023-08-06 Strategic Preys Make Acute Predators: Enhancing Camouflaged Object Detectors by Generating Camouflaged Objects link Chunming He, Kai Li,..., Fisher Yu
44 2023-07-14 Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis link Ziyue Jiang, Jinglin Liu,..., Zhou Zhao
43 2019-02-14 CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity link Aditya Bhatt, Daniel Palenicek,..., Jan Peters
43 2023-09-30 On the Stability of Iterative Retraining of Generative Models on their own Data link Quentin Bertrand, Joey Bose,..., Gauthier Gidel
43 2023-11-20 LQ-LoRA: Low-rank plus Quantized Matrix Decomposition for Efficient Language Model Finetuning link Han Guo, Philip Greengard,..., Yoon Kim
43 2023-10-12 Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining link Licong Lin, Yu Bai, Song Mei
43 2023-10-03 DeepZero: Scaling Up Zeroth-Order Optimization for Deep Model Training link Aochuan Chen, Yimeng Zhang,..., Sijia Liu
43 2023-10-23 Matryoshka Diffusion Models link Jiatao Gu, Shuangfei Zhai,..., Navdeep Jaitly
43 2023-10-19 Model Merging by Uncertainty-Based Gradient Matching link Nico Daheim, Thomas Möllenhoff,..., Mohammad Emtiyaz Khan
43 2023-09-29 PB-LLM: Partially Binarized Large Language Models link Zhihang Yuan, Yuzhang Shang, Zhen Dong
43 2023-10-02 Linear attention is (maybe) all you need (to understand Transformer optimization) link Kwangjun Ahn, Xiang Cheng,..., Suvrit Sra
43 2023-10-18 Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts link Xinhua Cheng, Tianyu Yang,..., Li Yuan
43 2023-10-10 GeoLLM: Extracting Geospatial Knowledge from Large Language Models link Rohin Manvi, Samar Khanna,..., Stefano Ermon
43 2023-09-18 DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation link Bowen Yin, Xuying Zhang,..., Qibin Hou
43 2023-05-23 Language Model Self-improvement by Reinforcement Learning Contemplation link Jing-Cheng Pang, Pengyuan Wang,..., Yang Yu
42 None On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs link Jen-tse Huang, Wenxuan Wang,..., Michael Lyu
42 2023-05-29 Multiscale Positive-Unlabeled Detection of AI-Generated Texts link Yuchuan Tian, Hanting Chen,..., Yunhe Wang
42 2023-09-29 DyVal: Dynamic Evaluation of Large Language Models for Reasoning Tasks link Kaijie Zhu, Jiaao Chen,..., Xing Xie
42 2024-04-15 Language Model Cascades: Token-Level Uncertainty And Beyond link Neha Gupta, Harikrishna Narasimhan,..., Sanjiv Kumar
42 2024-01-25 An Extensible Framework for Open Heterogeneous Collaborative Perception link Yifan Lu, Yue Hu,..., Siheng Chen
42 2023-03-08 Magnushammer: A Transformer-Based Approach to Premise Selection link Maciej Mikuła, Szymon Tworkowski,..., Yuhuai Wu
42 2024-01-25 Towards 3D Molecule-Text Interpretation in Language Models link Sihang Li, Zhiyuan Liu,..., Qi Tian
42 2023-10-13 CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules link Hung Le, Hailin Chen,..., Shafiq Joty
41 2023-10-04 Understanding In-Context Learning in Transformers and LLMs by Learning to Learn Discrete Functions link Satwik Bhattamishra, Arkil Patel,..., Varun Kanade
41 2023-09-04 Relay Diffusion: Unifying diffusion process across resolutions for image synthesis link Jiayan Teng, Wendi Zheng,..., Jie Tang
41 2024-01-13 BrainLM: A foundation model for brain activity recordings link Josue Ortega Caro, Antonio Henrique de Oliveira Fonseca,..., David van Dijk
41 2023-09-09 Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization link Yang Jin, Kun Xu,..., Yadong MU
41 2023-09-17 OWL: A Large Language Model for IT Operations link Hongcheng Guo, Jian Yang,..., Zhoujun Li
41 2024-02-04 Pathformer: Multi-scale Transformers with Adaptive Pathways for Time Series Forecasting link Peng Chen, Yingying ZHANG,..., Chenjuan Guo
41 2023-10-02 Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models link Hyeonho Jeong, Jong Chul Ye
41 2024-01-31 Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model link Zihan Zhong, Zhiqiang Tang,..., Chun Yuan
40 2023-05-24 Alleviating Exposure Bias in Diffusion Models through Sampling with Shifted Time Steps link Mingxiao Li, Tingyu Qu,..., Marie-Francine Moens
40 2023-09-29 Robustness of AI-Image Detectors: Fundamental Limits and Practical Attacks link Mehrdad Saberi, Vinu Sankar Sadasivan,..., Soheil Feizi
40 2023-10-04 Diffusion Generative Flow Samplers: Improving learning signals through partial trajectory optimization link Dinghuai Zhang, Ricky T. Q. Chen,..., Yoshua Bengio
40 2023-03-27 Seer: Language Instructed Video Prediction with Latent Diffusion Models link Xianfan Gu, Chuan Wen,..., Yang Gao
40 2023-10-13 Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs link Yuxin Zhang, Lirui Zhao,..., Rongrong Ji
40 2023-09-26 Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models link Mert Yuksekgonul, Varun Chandrasekaran,..., Besmira Nushi
40 2023-09-05 PromptTTS 2: Describing and Generating Voices with Text Prompt link Yichong Leng, Zhifang Guo,..., Jiang Bian
39 2024-04-03 CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech link Jaehyeon Kim, Keon Lee,..., Jaewoong Cho
39 2023-12-12 Remote Sensing Vision-Language Foundation Models without Annotations via Ground Remote Alignment link Utkarsh Mall, Cheng Perng Phoo,..., Kavita Bala
39 2024-02-29 Curiosity-driven Red-teaming for Large Language Models link Zhang-Wei Hong, Idan Shenfeld,..., Pulkit Agrawal
39 None Chain-of-Experts: When LLMs Meet Complex Operations Research Problems link Ziyang Xiao, Dongxiang Zhang,..., Gang Chen
39 2023-10-18 Scalable Diffusion for Materials Generation link Sherry Yang, KwangHwan Cho,..., Ekin Dogus Cubuk
39 2024-05-02 Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks link Murtaza Dalal, Tarun Chiruvolu,..., Ruslan Salakhutdinov
39 2023-08-03 Circumventing Concept Erasure Methods For Text-To-Image Generative Models link Minh Pham, Kelly O. Marshall,..., Chinmay Hegde
39 2024-01-20 Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images link Kuofeng Gao, Yang Bai,..., Wei Liu
38 2023-10-26 CADS: Unleashing the Diversity of Diffusion Models through Condition-Annealed Sampling link Seyedmorteza Sadat, Jakob Buhmann,..., Romann M. Weber
38 2023-06-08 In-Context Learning through the Bayesian Prism link Madhur Panwar, Kabir Ahuja, Navin Goyal
38 2024-01-20 BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models link Zhen Xiang, Fengqing Jiang,..., Bo Li
38 2023-10-16 LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts link Hanan Gani, Shariq Farooq Bhat,..., Peter Wonka
38 2023-05-20 CARD: Channel Aligned Robust Blend Transformer for Time Series Forecasting link Xue Wang, Tian Zhou,..., Rong Jin
37 2023-02-04 Multi-Source Diffusion Models for Simultaneous Music Generation and Separation link Giorgio Mariani, Irene Tallini,..., Emanuele Rodolà
37 2023-06-01 TorchRL: A data-driven decision-making library for PyTorch link Albert Bou, Matteo Bettini,..., Vincent Moens
37 2024-03-04 Making Pre-trained Language Models Great on Tabular Prediction link Jiahuan Yan, Bo Zheng,..., Jintai Chen
37 2023-11-02 The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing link Shen Nie, Hanzhong Allan Guo,..., Chongxuan Li
37 2022-11-14 Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization link Yiyang Chen, Zhedong Zheng,..., Tat-Seng Chua
37 2023-10-02 LEAP: Liberate Sparse-View 3D Modeling from Camera Poses link Hanwen Jiang, Zhenyu Jiang,..., Qixing Huang
36 2024-02-16 Robust agents learn causal world models link Jonathan Richens, Tom Everitt
36 2023-11-24 Controlled Text Generation via Language Model Arithmetic link Jasper Dekoninck, Marc Fischer,..., Martin Vechev
36 2023-05-24 Differentially Private Synthetic Data via Foundation Model APIs 1: Images link Zinan Lin, Sivakanth Gopi,..., Sergey Yekhanin
36 2023-10-06 Towards Foundation Models for Knowledge Graph Reasoning link Mikhail Galkin, Xinyu Yuan,..., Zhaocheng Zhu
36 2024-01-09 Masked Audio Generation using a Single Non-Autoregressive Transformer link Alon Ziv, Itai Gat,..., Yossi Adi
36 2023-11-03 Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs link Qingru Zhang, Chandan Singh,..., Tuo Zhao
36 2023-06-07 ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation link Jiaming Liu, Senqiao Yang,..., Shanghang Zhang
36 2023-09-14 Large-Vocabulary 3D Diffusion Model with Transformer link Ziang Cao, Fangzhou Hong,..., Ziwei Liu
35 2023-10-10 Geographic Location Encoding with Spherical Harmonics and Sinusoidal Representation Networks link Marc Rußwurm, Konstantin Klemmer,..., Devis Tuia
35 2024-02-14 MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data link Yinya Huang, Xiaohan Lin,..., Xiaodan Liang
35 2023-09-15 Scaling Laws for Sparsely-Connected Foundation Models link Elias Frantar, Carlos Riquelme Ruiz,..., Utku Evci
35 2023-10-09 SALMON: Self-Alignment with Instructable Reward Models link Zhiqing Sun, Yikang Shen,..., Chuang Gan
35 2023-10-01 JoMA: Demystifying Multilayer Transformers via Joint Dynamics of MLP and Attention link Yuandong Tian, Yiping Wang,..., Simon Shaolei Du
35 2023-08-29 Elucidating the Exposure Bias in Diffusion Models link Mang Ning, Mingxiao Li,..., Itir Onal Ertugrul
34 2024-01-16 Beyond Weisfeiler-Lehman: A Quantitative Framework for GNN Expressiveness link Bohang Zhang, Jingchu Gai,..., Liwei Wang
34 2023-10-13 METRA: Scalable Unsupervised RL with Metric-Aware Abstraction link Seohong Park, Oleh Rybkin, Sergey Levine
34 2024-04-17 SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs link Jaehyung Kim, Jaehyun Nam,..., Jinwoo Shin
34 2023-10-26 SKILL-MIX: a Flexible and Expandable Family of Evaluations for AI Models link Dingli Yu, Simran Kaur,..., Sanjeev Arora
34 2023-10-26 How do Language Models Bind Entities in Context? link Jiahai Feng, Jacob Steinhardt
34 2023-11-01 Plug-and-Play Policy Planner for Large Language Model Powered Dialogue Agents link Yang Deng, Wenxuan Zhang,..., Tat-Seng Chua
34 2024-01-04 LLM Augmented LLMs: Expanding Capabilities through Composition link Rachit Bansal, Bidisha Samanta,..., Partha Talukdar
34 2024-01-31 Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators link Daniel Geng, Andrew Owens
34 2023-12-26 LaneSegNet: Map Learning with Lane Segment Perception for Autonomous Driving link Tianyu Li, Peijin Jia,..., Hongyang Li
33 2023-06-08 Protein Discovery with Discrete Walk-Jump Sampling link Nathan C. Frey, Dan Berenberg,..., Saeed Saremi
33 2023-10-02 Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy link Pingzhi Li, Zhenyu Zhang,..., Tianlong Chen
33 2023-09-28 Intriguing Properties of Generative Classifiers link Priyank Jaini, Kevin Clark, Robert Geirhos
33 2023-10-04 Local Search GFlowNets link Minsu Kim, Taeyoung Yun,..., Jinkyoo Park
33 2023-07-06 T-MARS: Improving Visual Representations by Circumventing Text Feature Learning link Pratyush Maini, Sachin Goyal,..., Aditi Raghunathan
33 2023-11-21 BEND: Benchmarking DNA Language Models on Biologically Meaningful Tasks link Frederikke Isa Marin, Felix Teufel,..., Wouter Boomsma
33 2023-10-12 Tree-Planner: Efficient Close-loop Task Planning with Large Language Models link Mengkang Hu, Yao Mu,..., Ping Luo
33 2023-08-02 Patched Denoising Diffusion Models For High-Resolution Image Synthesis link Zheng Ding, Mengqi Zhang,..., Zhuowen Tu
33 2023-10-16 Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis link Kai Chen, Chunwei Wang,..., Lifeng Shang
32 2023-02-06 One-shot Empirical Privacy Estimation for Federated Learning link Galen Andrew, Peter Kairouz,..., Vinith Menon Suriyakumar
32 None Würstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models link Pablo Pernias, Dominic Rampas,..., Marc Aubreville
32 None An Image Is Worth 1000 Lies: Transferability of Adversarial Images across Prompts on Vision-Language Models link Haochen Luo, Jindong Gu,..., Philip Torr
32 2024-02-07 InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior link Chenguo Lin, Yadong MU
32 2023-12-18 Tuning LayerNorm in Attention: Towards Efficient Multi-Modal LLM Finetuning link Bingchen Zhao, Haoqin Tu,..., Cihang Xie
32 2023-11-30 Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking link Kaifeng Lyu, Jikai Jin,..., Wei Hu
32 2023-10-25 From Molecules to Materials: Pre-training Large Generalizable Models for Atomic Property Prediction link Nima Shoghi, Adeesh Kolluru,..., Brandon M Wood
32 2023-07-17 COLLIE: Systematic Construction of Constrained Text Generation Tasks link Shunyu Yao, Howard Chen,..., Karthik R Narasimhan
32 2022-05-30 Neural Optimal Transport with General Cost Functionals link Arip Asadulaev, Alexander Korotin,..., Evgeny Burnaev
32 2023-10-02 Toward effective protection against diffusion-based mimicry through score distillation link Haotian Xue, Chumeng Liang,..., Yongxin Chen
32 2023-10-03 Unveiling the Pitfalls of Knowledge Editing for Large Language Models link Zhoubo Li, Ningyu Zhang,..., Huajun Chen
32 2023-09-30 Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement link Kai Xu, Rongyu Chen,..., Angela Yao
32 2024-04-04 OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views link Francis Engelmann, Fabian Manhardt,..., Federico Tombari
31 2023-10-06 How to Capture Higher-order Correlations? Generalizing Matrix Softmax Attention to Kronecker Computation link Josh Alman, Zhao Song
31 2024-03-12 Entropy is not Enough for Test-Time Adaptation: From the Perspective of Disentangled Factors link Jonghyun Lee, Dahuin Jung,..., Sungroh Yoon
31 2023-12-07 On the Learnability of Watermarks for Language Models link Chenchen Gu, Xiang Lisa Li,..., Tatsunori Hashimoto
31 2023-10-02 Controlling Vision-Language Models for Multi-Task Image Restoration link Ziwei Luo, Fredrik K. Gustafsson,..., Thomas B. Schön
31 2023-10-09 Grokking as the transition from lazy to rich training dynamics link Tanishq Kumar, Blake Bordelon,..., Cengiz Pehlevan
31 2023-12-08 Large-scale Training of Foundation Models for Wearable Biosignals link Salar Abbaspourazad, Oussama Elachqar,..., Ian Shapiro
31 2023-07-30 An Unforgeable Publicly Verifiable Watermark for Large Language Models link Aiwei Liu, Leyi Pan,..., Philip S. Yu
31 2023-02-13 UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling link Haoyu Lu, Yuqi Huo,..., Mingyu Ding
31 2023-10-25 Generative Pre-training for Speech with Flow Matching link Alexander H. Liu, Matthew Le,..., Wei-Ning Hsu
31 2023-11-22 Language Model Inversion link John Xavier Morris, Wenting Zhao,..., Alexander M Rush
31 2023-12-28 STanHop: Sparse Tandem Hopfield Model for Memory-Enhanced Time Series Prediction link Dennis Wu, Jerry Yao-Chieh Hu,..., Han Liu
30 None Monte Carlo guided Denoising Diffusion models for Bayesian linear inverse problems. link Gabriel Cardoso, Yazid Janati el idrissi,..., Eric Moulines
30 2023-11-07 Multi-View Causal Representation Learning with Partial Observability link Dingling Yao, Danru Xu,..., Francesco Locatello
30 2023-12-17 Learning to Act without Actions link Dominik Schmidt, Minqi Jiang
30 2023-12-07 Graph Metanetworks for Processing Diverse Neural Architectures link Derek Lim, Haggai Maron,..., James Lucas
30 2024-05-03 What does the Knowledge Neuron Thesis Have to do with Knowledge? link Jingcheng Niu, Andrew Liu,..., Gerald Penn
30 2023-07-28 Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation link Xuefei Ning, Zinan Lin,..., Yu Wang
30 2023-05-26 An Efficient Membership Inference Attack for the Diffusion Model by Proximal Initialization link Fei Kong, Jinhao Duan,..., Kaidi Xu
30 2023-09-06 SLiMe: Segment Like Me link Aliasghar Khani, Saeid Asgari,..., Ghassan Hamarneh
30 2023-10-07 Parameter-Efficient Multi-Task Model Fusion with Partial Linearization link Anke Tang, Li Shen,..., Dacheng Tao
30 2023-05-24 Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM link Eliya Nachmani, Alon Levkovitch,..., Michelle Tadmor Ramanovich
30 2023-11-03 Simplifying Transformer Blocks link Bobby He, Thomas Hofmann
30 2023-09-11 DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning link Zhengxiang Shi, Aldo Lipani
29 2024-03-18 Graph Neural Networks for Learning Equivariant Representations of Neural Networks link Miltiadis Kofinas, Boris Knyazev,..., David W. Zhang
29 2023-11-27 DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer link Junyuan Hong, Jiachen T. Wang,..., Zhangyang Wang
29 2023-11-10 FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores link Daniel Y Fu, Hermann Kumbong,..., Christopher Ré
29 2023-10-17 Group Preference Optimization: Few-Shot Alignment of Large Language Models link Siyan Zhao, John Dang, Aditya Grover
29 2022-11-01 Two-stage LLM Fine-tuning with Less Specialization and More Generalization link Yihan Wang, Si Si,..., Sanjiv Kumar
29 2024-01-19 Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning link Yiwei Li, Peiwen Yuan,..., Kan Li
29 2023-10-25 CLEX: Continuous Length Extrapolation for Large Language Models link Guanzheng Chen, Xin Li,..., Lidong Bing
29 2022-11-17 How to Fine-Tune Vision Models with SGD link Ananya Kumar, Ruoqi Shen,..., Suriya Gunasekar
29 2023-10-20 Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds link Sipeng Zheng, Jiazheng Liu,..., Zongqing Lu
29 2023-11-08 Massive Editing for Large Language Models via Meta Learning link Chenmien Tan, Ge Zhang, Jie Fu
29 2023-06-16 Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX link Clément Bonnet, Daniel Luo,..., Alexandre Laterre
29 2023-07-16 EasyTPP: Towards Open Benchmarking Temporal Point Processes link Siqiao Xue, Xiaoming Shi,..., Hongyuan Mei
29 2023-10-03 AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model link Zibin Dong, Yifu Yuan,..., Zhipeng Hu
29 2023-05-02 Privacy-Preserving In-Context Learning for Large Language Models link Tong Wu, Ashwinee Panda,..., Prateek Mittal
29 2024-02-06 Space Group Constrained Crystal Generation link Rui Jiao, Wenbing Huang,..., Yang Liu
29 None Faithful Vision-Language Interpretation via Concept Bottleneck Models link Songning Lai, Lijie Hu,..., Di Wang
29 2023-09-27 Jointly Training Large Autoregressive Multimodal Models link Emanuele Aiello, Lili Yu,..., Barlas Oguz
28 2023-05-17 Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models link Shangbin Feng, Weijia Shi,..., Yulia Tsvetkov
28 2024-03-29 Negative Label Guided OOD Detection with Pretrained Vision-Language Models link Xue Jiang, Feng Liu,..., Bo Han
28 2023-10-30 DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization link Guowei Xu, Ruijie Zheng,..., Huazhe Xu
28 2023-10-03 Tensor Programs VI: Feature Learning in Infinite Depth Neural Networks link Greg Yang, Dingli Yu,..., Soufiane Hayou
28 2023-05-31 A Study of Bayesian Neural Network Surrogates for Bayesian Optimization link Yucen Lily Li, Tim G. J. Rudner, Andrew Gordon Wilson
28 2023-06-02 OMNI: Open-endedness via Models of human Notions of Interestingness link Jenny Zhang, Joel Lehman,..., Jeff Clune
28 2023-10-01 Faithful Explanations of Black-box NLP Models Using LLM-generated Counterfactuals link Yair Ori Gat, Nitay Calderon,..., Roi Reichart
28 2024-02-22 Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition link Feng Lu, Lijun Zhang,..., Chun Yuan
28 2023-03-11 Xformer: Hybrid X-Shaped Transformer for Image Denoising link Jiale Zhang, Yulun Zhang,..., Xiaokang Yang
28 2024-01-18 Divide and not forget: Ensemble of selectively trained experts in Continual Learning link Grzegorz Rypeść, Sebastian Cygert,..., Bartłomiej Twardowski
27 2024-03-26 Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with Autoformalization link Jin Peng Zhou, Charles E Staats,..., Yuhuai Wu
27 2023-09-30 AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ link Jonas Belouadi, Anne Lauscher, Steffen Eger
27 2023-10-07 Lemur: Integrating Large Language Models in Automated Program Verification link Haoze Wu, Clark Barrett, Nina Narodytska
27 2023-11-07 Beyond Imitation: Leveraging Fine-grained Quality Signals for Alignment link Geyang Guo, Ranchi Zhao,..., Ji-Rong Wen
27 2023-10-25 Frequency-Aware Transformer for Learned Image Compression link Han Li, Shaohui Li,..., Hongkai Xiong
27 2024-02-28 Deep Confident Steps to New Pockets: Strategies for Docking Generalization link Gabriele Corso, Arthur Deng,..., Tommi S. Jaakkola
27 2023-10-02 Locality-Aware Graph Rewiring in GNNs link Federico Barbero, Ameya Velingker,..., Francesco Di Giovanni
27 2023-06-01 AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation link Yuanwen Yue, Sabarinath Mahadevan,..., Theodora Kontogianni
27 2023-10-03 Benchmarking and Improving Generator-Validator Consistency of Language Models link Xiang Lisa Li, Vaishnavi Shrivastava,..., Percy Liang
27 2024-02-18 BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation link Peng Xu, Wenqi Shao,..., Ping Luo
27 2023-07-23 In-Context Learning Learns Label Relationships but Is Not Conventional Learning link Jannik Kossen, Yarin Gal, Tom Rainforth
27 2023-11-25 LLM-Assisted Code Cleaning For Training Accurate Code Generators link Naman Jain, Tianjun Zhang,..., Ion Stoica
27 2024-02-05 How Does Unlabeled Data Provably Help Out-of-Distribution Detection? link Xuefeng Du, Zhen Fang,..., Yixuan Li
27 None Plug-and-Play: An Efficient Post-training Pruning Method for Large Language Models link Yingtao Zhang, Haoli Bai,..., Carlo Vittorio Cannistraci
27 2023-09-13 Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL link Hao Sun, Alihan Hüyük, Mihaela van der Schaar
27 2023-12-27 Learning to Embed Time Series Patches Independently link Seunghan Lee, Taeyoung Park, Kibok Lee
27 2024-02-08 Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models link Senmao Li, Joost van de Weijer,..., Jian Yang
27 2023-03-11 Recursive Generalization Transformer for Image Super-Resolution link Zheng Chen, Yulun Zhang,..., Xiaokang Yang
26 2024-04-15 ClimODE: Climate and Weather Forecasting with Physics-informed Neural ODEs link Yogesh Verma, Markus Heinonen, Vikas Garg
26 2023-05-29 On Diffusion Modeling for Anomaly Detection link Victor Livernoche, Vineet Jain,..., Siamak Ravanbakhsh
26 2023-09-29 Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks link Hao Chen, Jindong Wang,..., Bhiksha Raj
26 2023-10-24 On the Foundations of Shortcut Learning link Katherine Hermann, Hossein Mobahi,..., Michael Curtis Mozer
26 2023-10-09 Sentence-level Prompts Benefit Composed Image Retrieval link Yang Bai, Xinxing Xu,..., Chun-Mei Feng
26 2023-10-12 GROOT: Learning to Follow Instructions by Watching Gameplay Videos link Shaofei Cai, Bowei Zhang,..., Yitao Liang
26 2023-12-05 MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction Following link Renze Lou, Kai Zhang,..., Wenpeng Yin
26 2023-10-24 TiC-CLIP: Continual Training of CLIP Models link Saurabh Garg, Mehrdad Farajtabar,..., Fartash Faghri
26 2023-12-18 Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response link Junfeng Long, ZiRui Wang,..., Jiangmiao Pang
26 2023-06-12 Retrieval-Enhanced Contrastive Vision-Text Models link Ahmet Iscen, Mathilde Caron,..., Cordelia Schmid
26 2023-10-02 Fusing Models with Complementary Expertise link Hongyi Wang, Felipe Maia Polo,..., Mikhail Yurochkin
26 2023-03-07 Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles link Zhiwei Tang, Dmitry Rybin, Tsung-Hui Chang
26 2023-09-19 MBR and QE Finetuning: Training-time Distillation of the Best and Most Expensive Decoding Methods link Mara Finkelstein, Markus Freitag
26 2023-06-01 The Hidden Language of Diffusion Models link Hila Chefer, Oran Lang,..., Lior Wolf
26 2023-02-21 Low Rank Matrix Completion via Robust Alternating Minimization in Nearly Linear Time link Yuzhou Gu, Zhao Song,..., Lichen Zhang
26 2023-05-29 Improved Probabilistic Image-Text Representations link Sanghyuk Chun
25 2023-10-19 Frozen Transformers in Language Models Are Effective Visual Encoder Layers link Ziqi Pang, Ziyang Xie,..., Yu-Xiong Wang
25 2024-01-03 Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction link Yilan Zhang, Yingxue Xu,..., Hao Chen
25 2023-09-04 On Penalty Methods for Nonconvex Bilevel Optimization and First-Order Stochastic Approximation link Jeongyeol Kwon, Dohyun Kwon,..., Robert D Nowak
25 2022-01-07 Fair and Efficient Contribution Valuation for Vertical Federated Learning link Zhenan Fan, Huang Fang,..., Yong Zhang
25 2023-09-29 Navigating the Design Space of Equivariant Diffusion-Based Generative Models for De Novo 3D Molecule Generation link Tuan Le, Julian Cremer,..., Kristof T Schütt
25 2023-08-03 PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback link Souradip Chakraborty, Amrit Bedi,..., Furong Huang
25 2022-08-10 A Sublinear Adversarial Training Algorithm link Yeqi Gao, Lianke Qin,..., Yitan Wang
25 2023-09-29 Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning link Zihan Ding, Chi Jin
25 2023-10-19 Quality-Diversity through AI Feedback link Herbie Bradley, Andrew Dai,..., Joel Lehman
25 2023-11-26 GAIA: Zero-shot Talking Avatar Generation link Tianyu He, Junliang Guo,..., Jiang Bian
25 2023-01-05 Skip-Attention: Improving Vision Transformers by Paying Less Attention link Shashanka Venkataramanan, Amir Ghodrati,..., Amir Habibian
25 2023-06-15 FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods link Xiaotian Han, Jianfeng Chi,..., Xia Hu
25 2023-10-09 TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models link Zuxin Liu, Jesse Zhang,..., Rasool Fakoor
25 2023-09-26 SEPT: Towards Efficient Scene Representation Learning for Motion Prediction link Zhiqian Lan, Yuxuan Jiang,..., Shengbo Eben Li
25 2024-02-01 Machine Unlearning for Image-to-Image Generative Models link Guihong Li, Hsiang Hsu,..., Radu Marculescu
25 None Periodicity Decoupling Framework for Long-term Series Forecasting link Tao Dai, Beiliang Wu,..., Shu-Tao Xia
25 2023-10-10 TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning link Dongming Wu, Jiahao Chang,..., Jianbing Shen
25 2024-01-22 MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo link Chenjie Cao, Xinlin Ren, Yanwei Fu
24 2023-10-04 Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors link Ido Amos, Jonathan Berant, Ankit Gupta
24 2023-05-24 Provable Offline Preference-Based Reinforcement Learning link Wenhao Zhan, Masatoshi Uehara,..., Wen Sun
24 2023-10-04 Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised Learning with Masked Unit Prediction link Jiatong Shi, Hirofumi Inaguma,..., Anna Sun
24 None Towards Reliable and Efficient Backdoor Trigger Inversion via Decoupling Benign Features link Xiong Xu, Kunzhe Huang,..., Kui Ren
24 2023-07-05 Reverse Diffusion Monte Carlo link Xunpeng Huang, Hanze Dong,..., Tong Zhang
24 2024-01-05 Simple Hierarchical Planning with Diffusion link Chang Chen, Fei Deng,..., Sungjin Ahn
24 2024-01-19 Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model link Yinan Zheng, Jianxiong Li,..., Jingjing Liu
24 2023-11-21 Looped Transformers are Better at Learning Learning Algorithms link Liu Yang, Kangwook Lee,..., Dimitris Papailiopoulos
24 2024-02-02 Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram link Yeongyeon Na, Minje Park,..., Sunghoon Joo
24 2023-10-04 Posterior Sampling Based on Gradient Flows of the MMD with Negative Distance Kernel link Paul Hagemann, Johannes Hertrich,..., Gabriele Steidl
24 2023-10-10 Teaching Language Models to Hallucinate Less with Synthetic Tasks link Erik Jones, Hamid Palangi,..., Ece Kamar
24 2023-06-05 Str2Str: A Score-based Framework for Zero-shot Protein Conformation Sampling link Jiarui Lu, Bozitao Zhong,..., Jian Tang
24 2023-10-04 Fast, Expressive $\mathrm{SE}(n)$ Equivariant Networks through Weight-Sharing in Position-Orientation Space link Erik J Bekkers, Sharvaree Vadgama,..., David W. Romero
24 2023-09-29 Leveraging Optimization for Adaptive Attacks on Image Watermarks link Nils Lukas, Abdulrahman Diaa,..., Florian Kerschbaum
24 2023-08-08 V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection link Yichao Shen, Zigang Geng,..., Baining Guo
24 2023-10-01 Revisiting Link Prediction: a data perspective link Haitao Mao, Juanhui Li,..., Jiliang Tang
24 2023-10-06 Thought Propagation: An Analogical Approach to Complex Reasoning with Large Language Models link Junchi Yu, Ran He, Zhitao Ying
24 2023-05-24 Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape link Rundi Wu, Ruoshi Liu,..., Changxi Zheng
24 2023-09-29 Spurious Feature Diversification Improves Out-of-distribution Generalization link LIN Yong, Lu Tan,..., Tong Zhang
24 2023-07-21 PINNsFormer: A Transformer-Based Framework For Physics-Informed Neural Networks link Zhiyuan Zhao, Xueying Ding, B. Aditya Prakash