Last updated: 2025-04-16 04:16:43. Maintained by Weisen Jiang.

citation publish date title (pdf) review authors
3952 2023-04-17 Visual Instruction Tuning link Haotian Liu, Chunyuan Li,..., Yong Jae Lee
2993 2023-05-29 Direct Preference Optimization: Your Language Model is Secretly a
Reward Model
link Rafael Rafailov, Archit Sharma,..., Chelsea Finn
2122 2023-05-23 QLoRA: Efficient Finetuning of Quantized LLMs link Tim Dettmers, Artidoro Pagnoni,..., Luke Zettlemoyer
1808 2023-05-11 InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning link Wenliang Dai, Junnan Li,..., Steven Hoi
1564 2023-05-17 Tree of Thoughts: Deliberate Problem Solving with Large Language
Models
link Shunyu Yao, Dian Yu,..., Karthik R Narasimhan
1493 2023-02-09 Toolformer: Language Models Can Teach Themselves to Use Tools link Timo Schick, Jane Dwivedi-Yu,..., Thomas Scialom
1306 2023-03-30 Self-Refine: Iterative Refinement with Self-Feedback link Aman Madaan, Niket Tandon,..., Peter Clark
986 2023-03-20 Reflexion: language agents with verbal reinforcement learning link Noah Shinn, Federico Cassano,..., Shunyu Yao
789 None HuggingGPT: Solving AI Tasks with ChatGPT and its Friends
in Hugging Face
link Yongliang Shen, Kaitao Song,..., Yueting Zhuang
775 2023-07-05 Jailbroken: How Does LLM Safety Training Fail? link Alexander Wei, Nika Haghtalab, Jacob Steinhardt
764 2023-05-25 ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score
Distillation
link Zhengyi Wang, Cheng Lu,..., Jun Zhu
724 2023-05-18 LIMA: Less Is More for Alignment link Chunting Zhou, Pengfei Liu,..., Omer Levy
691 2023-05-02 Is Your Code Generated by ChatGPT Really Correct? Rigorous
Evaluation of Large Language Models for Code Generation
link Jiawei Liu, Chunqiu Steven Xia,..., LINGMING ZHANG
512 2023-02-27 Language Is Not All You Need: Aligning Perception with
Language Models
link Shaohan Huang, Li Dong,..., Furu Wei
507 2023-05-22 AlpacaFarm: A Simulation Framework for Methods that Learn from
Human Feedback
link Yann Dubois, Xuechen Li,..., Tatsunori Hashimoto
447 2023-06-06 Inference-Time Intervention: Eliciting Truthful Answers from a Language Model link Kenneth Li, Oam Patel,..., Martin Wattenberg
429 2023-05-18 VisionLLM: Large Language Model is also an Open-Ended Decoder
for Vision-Centric Tasks
link Wenhai Wang, Zhe Chen,..., Jifeng Dai
425 2023-04-13 Segment Everything Everywhere All at Once link Xueyan Zou, Jianwei Yang,..., Yong Jae Lee
410 2023-06-29 One-2-3-45: Any Single Image to 3D Mesh in 45
Seconds without Per-Shape Optimization
link Minghua Liu, Chao Xu,..., Hao Su
369 None Are Emergent Abilities of Large Language Models a Mirage? link Rylan Schaeffer, Brando Miranda, Sanmi Koyejo
346 2023-05-07 Language Models Don't Always Say What They Think: Unfaithful
Explanations in Chain-of-Thought Prompting
link Miles Turpin, Julian Michael,..., Samuel R. Bowman
345 2023-03-31 CAMEL: Communicative Agents for "Mind" Exploration of Large Language
Model Society
link Guohao Li, Hasan Abed Al Kader Hammoud,..., Bernard Ghanem
332 2023-02-23 One Fits All: Power General Time Series Analysis by
Pretrained LM
link Tian Zhou, Peisong Niu,..., Rong Jin
332 2023-05-19 LLM-Pruner: On the Structural Pruning of Large Language Models link Xinyin Ma, Gongfan Fang, Xinchao Wang
328 2023-04-11 RRHF: Rank Responses to Align Language Models with Human
Feedback
link Hongyi Yuan, Zheng Yuan,..., Fei Huang
325 2023-05-02 Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image
Generation
link Yuval Kirstain, Adam Polyak,..., Omer Levy
323 2023-06-08 Simple and Controllable Music Generation link Jade Copet, Felix Kreuk,..., Alexandre Défossez
322 2023-02-13 Symbolic Discovery of Optimization Algorithms link Xiangning Chen, Chen Liang,..., Quoc V Le
317 2023-03-30 Language Models can Solve Computer Tasks link Geunwoo Kim, Pierre Baldi, Stephen Marcus McAleer
313 2023-05-29 Faith and Fate: Limits of Transformers on Compositionality link Nouha Dziri, Ximing Lu,..., Yejin Choi
304 2023-06-03 VideoComposer: Compositional Video Synthesis with Motion Controllability link Xiang Wang, Hangjie Yuan,..., Jingren Zhou
296 2023-05-04 Principle-Driven Self-Alignment of Language Models from Scratch with Minimal
Human Supervision
link Zhiqing Sun, Yikang Shen,..., Chuang Gan
284 2023-04-19 Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models link Pan Lu, Baolin Peng,..., Jianfeng Gao
284 2023-06-02 Segment Anything in High Quality link Lei Ke, Mingqiao Ye,..., Fisher Yu
282 2023-06-02 Fine-Grained Human Feedback Gives Better Rewards for Language Model
Training
link Zeqiu Wu, Yushi Hu,..., Hannaneh Hajishirzi
282 2023-10-11 Large Language Models Are Zero-Shot Time Series Forecasters link Nate Gruver, Marc Anton Finzi,..., Andrew Gordon Wilson
280 2023-04-21 DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction link Mohammadreza Pourreza, Davood Rafiei
280 2023-05-24 BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and
Editing
link Dongxu Li, Junnan Li, Steven Hoi
274 2023-04-12 ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation link Jiazheng Xu, Xiao Liu,..., Yuxiao Dong
268 2023-03-23 Paraphrasing evades detectors of AI-generated text, but retrieval is
an effective defense
link Kalpesh Krishna, Yixiao Song,..., Mohit Iyyer
266 2023-06-11 High-Fidelity Audio Compression with Improved RVQGAN link Rithesh Kumar, Prem Seetharaman,..., Kundan Kumar
258 2023-04-28 Towards Automated Circuit Discovery for Mechanistic Interpretability link Arthur Conmy, Augustine N. Mavor-Parker,..., Adrià Garriga-Alonso
250 2022-08-19 Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise link Arpit Bansal, Eitan Borgnia,..., Tom Goldstein
248 2023-06-26 MotionGPT: Human Motion as a Foreign Language link Biao Jiang, Xin Chen,..., Tao Chen
244 2023-06-23 Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale link Matthew Le, Apoorv Vyas,..., Wei-Ning Hsu
230 2023-06-06 Emergent Correspondence from Image Diffusion link Luming Tang, Menglin Jia,..., Bharath Hariharan
228 2023-02-07 Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt
Tuning and Discovery
link Yuxin Wen, Neel Jain,..., Tom Goldstein
225 2023-06-01 Diffusion Self-Guidance for Controllable Image Generation link Dave Epstein, Allan Jabri,..., Aleksander Holynski
224 None 3D-LLM: Injecting the 3D World into Large Language Models link Yining Hong, Haoyu Zhen,..., Chuang Gan
224 2023-06-02 TIES-Merging: Resolving Interference When Merging Models link Prateek Yadav, Derek Tam,..., Mohit Bansal
220 2023-06-24 H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large
Language Models
link Zhenyu Zhang, Ying Sheng,..., Beidi Chen
219 2023-05-26 Generating Images with Multimodal Language Models link Jing Yu Koh, Daniel Fried, Ruslan Salakhutdinov
217 2023-05-25 Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models link Shihao Zhao, Dongdong Chen,..., Kwan-Yee K. Wong
214 2023-06-26 Are aligned neural networks adversarially aligned? link Nicholas Carlini, Milad Nasr,..., Ludwig Schmidt
203 2023-05-25 On the Planning Abilities of Large Language Models -
A Critical Investigation
link Karthik Valmeekam, Matthew Marquez,..., Subbarao Kambhampati
201 2023-05-24 EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought link Yao Mu, Qinglong Zhang,..., Ping Luo
200 2023-06-27 HyenaDNA: Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution link Eric Nguyen, Michael Poli,..., Stephen Baccus
199 2023-05-24 Towards Revealing the Mystery behind Chain of Thought: A
Theoretical Perspective
link Guhao Feng, Bohang Zhang,..., Liwei Wang
197 2023-05-30 GPT4Tools: Teaching Large Language Model to Use Tools via
Self-instruction
link Rui Yang, Lin Song,..., Ying Shan
188 2023-05-26 Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM
KV Cache Compression at Test Time
link Zichang Liu, Aditya Desai,..., Anshumali Shrivastava
188 2023-02-09 UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of
Diffusion Models
link Wenliang Zhao, Lujia Bai,..., Jiwen Lu
181 2023-07-23 ResShift: Efficient Diffusion Model for Image Super-resolution by Residual
Shifting
link Zongsheng Yue, Jianyi Wang, Chen Change Loy
177 2023-04-01 Subject-driven Text-to-Image Generation via Apprenticeship Learning link Wenhu Chen, Hexiang Hu,..., William W. Cohen
176 2023-05-25 Scaling Data-Constrained Language Models link Niklas Muennighoff, Alexander M Rush,..., Colin Raffel
171 2023-05-23 Large Language Models as Commonsense Knowledge for Large-Scale Task
Planning
link Zirui Zhao, Wee Sun Lee, David Hsu
167 2023-07-25 QuIP: 2-Bit Quantization of Large Language Models With Guarantees link Jerry Chee, Yaohui Cai,..., Christopher De Sa
167 2023-09-20 Gold-YOLO: Efficient Object Detector via Gather-and-Distribute Mechanism link Chengcheng Wang, Wei He,..., Kai Han
164 2023-05-27 Fine-Tuning Language Models with Just Forward Passes link Sadhika Malladi, Tianyu Gao,..., Sanjeev Arora
164 2023-05-31 The Impact of Positional Encoding on Length Generalization in
Transformers
link Amirhossein Kazemnejad, Inkit Padhi,..., Siva Reddy
162 2023-05-17 Can Language Models Solve Graph Problems in Natural Language? link Heng Wang, Shangbin Feng,..., Yulia Tsvetkov
160 2023-06-16 Scaling Open-Vocabulary Object Detection link Matthias Minderer, Alexey A. Gritsenko, Neil Houlsby
160 2023-05-19 Any-to-Any Generation via Composable Diffusion link Zineng Tang, Ziyi Yang,..., Mohit Bansal
160 2023-03-31 Where are we in the search for an Artificial
Visual Cortex for Embodied Intelligence?
link Arjun Majumdar, Karmesh Yadav,..., Franziska Meier
159 2023-05-19 ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via
Tool Embeddings
link Shibo Hao, Tianyang Liu,..., Zhiting Hu
159 2023-01-10 Does Localization Inform Editing? Surprising Differences in Causality-Based Localization
vs. Knowledge Editing in Language Models
link Peter Hase, Mohit Bansal,..., Asma Ghandeharioun
158 2023-05-17 DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining link Sang Michael Xie, Hieu Pham,..., Adams Wei Yu
158 2023-05-29 Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion
Models
link Yuchao Gu, Xintao Wang,..., Mike Zheng Shou
157 2023-05-24 A Tale of Two Features: Stable Diffusion Complements DINO
for Zero-Shot Semantic Correspondence
link Junyi Zhang, Charles Herrmann,..., Ming-Hsuan Yang
157 2023-02-06 Data Selection for Language Models via Importance Resampling link Sang Michael Xie, Shibani Santurkar,..., Percy Liang
156 2023-05-26 On Evaluating Adversarial Robustness of Large Vision-Language Models link Yunqing Zhao, Tianyu Pang,..., Min Lin
153 2023-06-07 Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm
Selection
link Yu Bai, Fan Chen,..., Song Mei
150 2023-06-23 OpenMask3D: Open-Vocabulary 3D Instance Segmentation link Ayça Takmaz, Elisabetta Fedele,..., Francis Engelmann
146 2023-05-19 Pengi: An Audio Language Model for Audio Tasks link Soham Deshmukh, Benjamin Elizalde,..., Huaming Wang
144 2023-05-24 Leveraging Pre-trained Large Language Models to Construct and Utilize
World Models for Model-based Task Planning
link Lin Guan, Karthik Valmeekam,..., Subbarao Kambhampati
143 2023-07-03 MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion link Shitao Tang, Fuyang Zhang,..., Yasutaka Furukawa
143 2023-05-24 LayoutGPT: Compositional Visual Planning and Generation with Large Language
Models
link Weixi Feng, Wanrong Zhu,..., William Yang Wang
141 2023-05-31 Improving CLIP Training with Language Rewrites link Lijie Fan, Dilip Krishnan,..., Yonglong Tian
140 2023-05-24 In-Context Impersonation Reveals Large Language Models' Strengths and Biases link Leonard Salewski, Stephan Alaniz,..., Zeynep Akata
140 2023-06-01 SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two
Seconds
link Yanyu Li, Huan Wang,..., Jian Ren
139 2023-06-01 Transformers learn to implement preconditioned gradient descent for in-context
learning
link Kwangjun Ahn, Xiang Cheng,..., Suvrit Sra
134 2023-06-01 StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual
Representation Learners
link Yonglong Tian, Lijie Fan,..., Dilip Krishnan
133 2022-11-20 Aging with GRACE: Lifelong Model Editing with Discrete Key-Value
Adaptors
link Thomas Hartvigsen, Swami Sankaranarayanan,..., Marzyeh Ghassemi
132 2023-06-12 Controlling Text-to-Image Diffusion by Orthogonal Finetuning link Zeju Qiu, Weiyang Liu,..., Bernhard Schölkopf
131 2022-12-19 Optimizing Prompts for Text-to-Image Generation link Yaru Hao, Zewen Chi,..., Furu Wei
127 2023-05-25 DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models link Ying Fan, Olivia Watkins,..., Kimin Lee
126 2021-12-24 Counterfactual Memorization in Neural Language Models link Chiyuan Zhang, Daphne Ippolito,..., Nicholas Carlini
125 2023-07-06 Focused Transformer: Contrastive Training for Context Scaling link Szymon Tworkowski, Konrad Staniszewski,..., Piotr Miłoś
120 2023-05-27 SwiftSage: A Generative Agent with Fast and Slow Thinking
for Complex Interactive Tasks
link Bill Yuchen Lin, Yicheng Fu,..., Xiang Ren
120 2023-05-29 RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths link Zeyue Xue, Guanglu Song,..., Ping Luo
118 2023-05-11 Self-Chained Image-Language Model for Video Localization and Question Answering link Shoubin Yu, Jaemin Cho,..., Mohit Bansal
118 2023-05-31 Understanding and Mitigating Copying in Diffusion Models link Gowthami Somepalli, Vasu Singla,..., Tom Goldstein
118 2023-05-02 Unlimiformer: Long-Range Transformers with Unlimited Length Input link Amanda Bertsch, Uri Alon,..., Matthew R. Gormley
118 2023-06-07 Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned
on diverse rewards
link Alexandre Rame, Guillaume Couairon,..., Matthieu Cord
117 2023-02-20 Towards Unbounded Machine Unlearning link Meghdad Kurmanji, Peter Triantafillou,..., Eleni Triantafillou
117 2023-05-01 Self-Evaluation Guided Beam Search for Reasoning link Yuxi Xie, Kenji Kawaguchi,..., Qizhe Xie
116 2022-09-14 Lossy Image Compression with Conditional Diffusion Models link Ruihan Yang, Stephan Mandt
115 2023-05-23 Diffusion Hyperfeatures: Searching Through Time and Space for Semantic
Correspondence
link Grace Luo, Lisa Dunlap,..., Trevor Darrell
114 2023-03-01 Understanding Diffusion Objectives as the ELBO with Simple Data
Augmentation
link Diederik P Kingma, Ruiqi Gao
114 2023-05-26 AdaPlanner: Adaptive Planning from Feedback with Language Models link Haotian Sun, Yuchen Zhuang,..., Chao Zhang
113 2023-05-18 Structural Pruning for Diffusion Models link Gongfan Fang, Xinyin Ma, Xinchao Wang
113 2023-06-06 Deductive Verification of Chain-of-Thought Reasoning link Zhan Ling, Yunhao Fang,..., Hao Su
111 2023-04-21 Emergent and Predictable Memorization in Large Language Models link Stella Biderman, USVSN Sai Prashanth,..., Edward Raff
110 2023-09-01 Geometry-Informed Neural Operator for Large-Scale 3D PDEs link Zongyi Li, Nikola Borislavov Kovachki,..., Anima Anandkumar
110 2023-11-10 Frequency-domain MLPs are More Effective Learners in Time Series
Forecasting
link Kun Yi, Qi Zhang,..., Zhendong Niu
110 2023-04-30 How does GPT-2 compute greater-than?: Interpreting mathematical abilities in
a pre-trained language model
link Michael Hanna, Ollie Liu, Alexandre Variengien
110 None Segment Anything in 3D with NeRFs link Jiazhong Cen, Zanwei Zhou,..., Qi Tian
109 2023-05-18 UniControl: A Unified Diffusion Model for Controllable Visual Generation
In the Wild
link Can Qin, Shu Zhang,..., Ran Xu
108 2023-02-16 DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization link Zhiqing Sun, Yiming Yang
107 2023-05-18 OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding link Minghua Liu, Ruoxi Shi,..., Hao Su
106 2023-02-02 SceneScape: Text-Driven Consistent Scene Generation link Rafail Fridman, Amit Abecasis,..., Tali Dekel
105 2022-08-08 Deep Patch Visual Odometry link Zachary Teed, Lahav Lipson, Jia Deng
104 2023-07-26 Evaluating the Moral Beliefs Encoded in LLMs link Nino Scherrer, Claudia Shi,..., David Blei
103 2023-07-04 Spike-driven Transformer link Man Yao, JiaKui Hu,..., Guoqi Li
101 2023-05-29 Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and
Diffusion Priors
link Paul Steven Scotti, Atmadeep Banerjee,..., Tanishq Mathew Abraham
101 2023-06-26 Composing Parameter-Efficient Modules with Arithmetic Operation link Jinghan Zhang, Shiqi Chen,..., Junxian He
101 2023-01-31 What Makes Good Examples for Visual In-Context Learning? link Yuanhan Zhang, Kaiyang Zhou, Ziwei Liu
100 2023-04-11 Model Sparsity Can Simplify Machine Unlearning link Jinghan Jia, Jiancheng Liu,..., Sijia Liu
100 2023-03-09 Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning link Mitsuhiko Nakamoto, Yuexiang Zhai,..., Sergey Levine
98 2023-06-06 LEACE: Perfect linear concept erasure in closed form link Nora Belrose, David Schneider-Joseph,..., Stella Biderman
97 2023-03-27 Text-to-Image Diffusion Models are Zero Shot Classifiers link Kevin Clark, Priyank Jaini
97 2023-05-18 TextDiffuser: Diffusion Models as Text Painters link Jingye Chen, Yupan Huang,..., Furu Wei
96 2023-05-22 Task Arithmetic in the Tangent Space: Improved Editing of
Pre-Trained Models
link Guillermo Ortiz-Jimenez, Alessandro Favero, Pascal Frossard
95 2023-05-17 Selective Amnesia: A Continual Learning Approach to Forgetting in
Deep Generative Models
link Alvin Heng, Harold Soh
95 2023-05-29 Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained
Diffusion Models
link Weijian Luo, Tianyang Hu,..., Zhihua Zhang
95 2023-07-07 RADAR: Robust AI-Text Detection via Adversarial Learning link Xiaomeng Hu, Pin-Yu Chen, Tsung-Yi Ho
93 2021-07-19 Epistemic Neural Networks link Ian Osband, Zheng Wen,..., Benjamin Van Roy
93 2023-05-31 Protein Design with Guided Discrete Diffusion link Nate Gruver, Samuel Don Stanton,..., Andrew Gordon Wilson
93 2023-06-13 StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and
Adversarial Training with Large Speech Language Models
link Yinghao Aaron Li, Cong Han,..., Nima Mesgarani
92 2023-06-15 Linguistic Binding in Diffusion Models: Enhancing Attribute Correspondence through
Attention Map Alignment
link Royi Rassin, Eran Hirsch,..., Gal Chechik
92 2023-05-17 Language Model Tokenizers Introduce Unfairness Between Languages link Aleksandar Petrov, Emanuele La Malfa,..., Adel Bibi
91 2023-08-11 DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models link Weijia Wu, Yuzhong Zhao,..., Chunhua Shen
91 2023-05-30 Koopa: Learning Non-stationary Time Series Dynamics with Koopman Predictors link Yong Liu, Chenyu Li,..., Mingsheng Long
90 2023-06-15 DreamSim: Learning New Dimensions of Human Visual Similarity using
Synthetic Data
link Stephanie Fu, Netanel Yakir Tamir,..., Phillip Isola
89 2023-06-28 On the Exploitability of Instruction Tuning link Manli Shu, Jiongxiao Wang,..., Tom Goldstein
89 2023-07-12 Patch n’ Pack: NaViT, a Vision Transformer for any
Aspect Ratio and Resolution
link Mostafa Dehghani, Basil Mustafa,..., Neil Houlsby
89 2023-11-10 FourierGNN: Rethinking Multivariate Time Series Forecasting from a Pure
Graph Perspective
link Kun Yi, Qi Zhang,..., Zhendong Niu
89 2023-05-29 VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset link Sihan Chen, Handong Li,..., Jing Liu
88 2023-06-30 The Clock and the Pizza: Two Stories in Mechanistic
Explanation of Neural Networks
link Ziqian Zhong, Ziming Liu,..., Jacob Andreas
88 2023-06-07 Exposing flaws of generative model evaluation metrics and their
unfair treatment of diffusion models
link George Stein, Jesse C. Cresswell,..., Gabriel Loaiza-Ganem
88 2023-02-15 Speculative Decoding with Big Little Decoder link Sehoon Kim, Karttikeya Mangalam,..., Kurt Keutzer
88 2023-07-05 RanPAC: Random Projections and Pre-trained Models for Continual Learning link Mark McDonnell, Dong Gong,..., Anton van den Hengel
88 2023-05-18 PTQD: Accurate Post-Training Quantization for Diffusion Models link Yefei He, Luping Liu,..., Bohan Zhuang
87 2023-07-04 ProPILE: Probing Privacy Leakage in Large Language Models link Siwon Kim, Sangdoo Yun,..., Seong Joon Oh
87 2023-05-18 Language Models Meet World Models: Embodied Experiences Enhance Language
Models
link Jiannan Xiang, Tianhua Tao,..., Zhiting Hu
87 2023-04-25 Patch Diffusion: Faster and More Data-Efficient Training of Diffusion
Models
link Zhendong Wang, Yifan Jiang,..., Mingyuan Zhou
87 2023-01-27 Large Language Models Are Latent Variable Models: Explaining and
Finding Good Demonstrations for In-Context Learning
link Xinyi Wang, Wanrong Zhu,..., William Yang Wang
86 2023-06-15 DreamHuman: Animatable 3D Avatars from Text link Nikos Kolotouros, Thiemo Alldieck,..., Cristian Sminchisescu
86 2023-02-22 Guiding Large Language Models via Directional Stimulus Prompting link Zekun Li, Baolin Peng,..., Xifeng Yan
86 2023-05-23 Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit
Integer Quantization
link Jeonghoon Kim, Jung Hyun Lee,..., Dongsoo Lee
86 2023-10-25 DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language
Models
link Ge Zheng, Bin Yang,..., Sibei Yang
86 2023-05-18 Weakly-Supervised Concealed Object Segmentation with SAM-based Pseudo Labeling and
Multi-scale Feature Grouping
link Chunming He, Kai Li,..., Xiu Li
85 2023-04-07 Why think step by step? Reasoning emerges from the
locality of experience
link Ben Prystawski, Michael Y. Li, Noah Goodman
85 None PromptIR: Prompting for All-in-One Image Restoration link Vaishnav Potlapalli, Syed Waqas Zamir,..., Fahad Khan
85 2023-05-24 Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large
Language Models
link Gen Luo, Yiyi Zhou,..., Rongrong Ji
84 2023-03-14 The Learnability of In-Context Learning link Noam Wies, Yoav Levine, Amnon Shashua
84 2023-05-03 Lift Yourself Up: Retrieval-augmented Text Generation with Self-Memory link Xin Cheng, Di Luo,..., Rui Yan
84 2023-03-23 Towards Better Dynamic Graph Learning: New Architecture and Unified
Library
link Le Yu, Leilei Sun,..., Weifeng Lv
83 2023-07-02 Solving Linear Inverse Problems Provably via Posterior Sampling with
Latent Diffusion Models
link Litu Rout, Negin Raoof,..., Sanjay Shakkottai
82 2023-06-20 Diffusion with Forward Models: Solving Stochastic Inverse Problems Without
Direct Supervision
link Ayush Tewari, Tianwei Yin,..., Vincent Sitzmann
82 2023-05-24 Unsupervised Semantic Correspondence Using Stable Diffusion link Eric Hedlin, Gopal Sharma,..., Kwang Moo Yi
81 2023-05-12 MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers link LILI YU, Daniel Simig,..., Mike Lewis
80 2023-06-29 Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models link Simian Luo, Chuanhao Yan,..., Hang Zhao
79 2023-02-02 SimMTM: A Simple Pre-Training Framework for Masked Time-Series Modeling link Jiaxiang Dong, Haixu Wu,..., Mingsheng Long
79 2023-06-15 Segment Any Point Cloud Sequences by Distilling Vision Foundation
Models
link Youquan Liu, Lingdong Kong,..., Ziwei Liu
79 2023-05-29 Diffusion Model is an Effective Planner and Data Synthesizer
for Multi-Task Reinforcement Learning
link Haoran He, Chenjia Bai,..., Xuelong Li
79 2023-03-07 Structured State Space Models for In-Context Reinforcement Learning link Chris Lu, Yannick Schroecker,..., Feryal Behbahani
79 2023-06-22 Quantizable Transformers: Removing Outliers by Helping Attention Heads Do
Nothing
link Yelysei Bondarenko, Markus Nagel, Tijmen Blankevoort
79 2023-06-12 Augmenting Language Models with Long-Term Memory link Weizhi Wang, Li Dong,..., Furu Wei
78 2023-06-07 Intrinsic Dimension Estimation for Robust Detection of AI-Generated Texts link Eduard Tulchinskii, Kristian Kuznetsov,..., Irina Piontkovskaya
78 2023-05-15 Interpretability at Scale: Identifying Causal Mechanisms in Alpaca link Zhengxuan Wu, Atticus Geiger,..., Noah Goodman
77 2023-06-05 Representational Strengths and Limitations of Transformers link Clayton Sanford, Daniel Hsu, Matus Telgarsky
77 2023-02-28 EvoPrompting: Language Models for Code-Level Neural Architecture Search link Angelica Chen, David Dohan, David So
77 2023-02-02 Convolutional Neural Operators for robust and accurate learning of
PDEs
link Bogdan Raonic, Roberto Molinaro,..., Emmanuel de Bezenac
77 2023-02-26 Fast Attention Requires Bounded Entries link Josh Alman, Zhao Song
76 2023-05-19 The probability flow ODE is provably fast link Sitan Chen, Sinho Chewi,..., Adil Salim
76 2023-06-29 Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned
Latent Representation
link Zibo Zhao, Wen Liu,..., Shenghua Gao
75 2023-06-01 White-Box Transformers via Sparse Rate Reduction link Yaodong Yu, Sam Buchanan,..., Yi Ma
74 2023-05-15 Privacy Auditing with One (1) Training Run link Thomas Steinke, Milad Nasr, Matthew Jagielski
74 2023-05-24 Testing the General Deductive Reasoning Capacity of Large Language
Models Using OOD Examples
link Abulhair Saparov, Richard Yuanzhe Pang,..., He He
73 2023-06-01 Birth of a Transformer: A Memory Viewpoint link Alberto Bietti, Vivien Cabannes,..., Leon Bottou
72 2023-09-25 Dataset Diffusion: Diffusion-based Synthetic Data Generation for Pixel-Level Semantic
Segmentation
link Quang Ho Nguyen, Truong Tuan Vu,..., Khoi Nguyen
71 2023-10-11 Hierarchical Decomposition of Prompt-Based Continual Learning: Rethinking Obscured Sub-optimality link Liyuan Wang, Jingyi Xie,..., Jun Zhu
71 2022-05-20 Evaluating and Inducing Personality in Pre-trained Language Models link Guangyuan Jiang, Manjie Xu,..., Yixin Zhu
70 2023-01-12 Tracr: Compiled Transformers as a Laboratory for Interpretability link David Lindner, Janos Kramar,..., Vladimir Mikulik
70 2023-10-20 DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics link Kaiwen Zheng, Cheng Lu,..., Jun Zhu
70 2023-06-30 Practical and Asymptotically Exact Conditional Sampling in Diffusion Models link Luhuan Wu, Brian L. Trippe,..., John Patrick Cunningham
70 2023-09-27 GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective
Worldwide Geo-localization
link Vicente Vivanco Cepeda, Gaurav Kumar Nayak, Mubarak Shah
70 2023-05-22 VanillaNet: the Power of Minimalism in Deep Learning link Hanting Chen, Yunhe Wang,..., Dacheng Tao
69 2023-06-26 Supervised Pretraining Can Learn In-Context Reinforcement Learning link Jonathan Lee, Annie Xie,..., Emma Brunskill
69 2023-08-10 PDE-Refiner: Achieving Accurate Long Rollouts with Neural PDE Solvers link Phillip Lippe, Bastiaan S. Veeling,..., Johannes Brandstetter
69 2023-03-23 The Quantization Model of Neural Scaling link Eric J Michaud, Ziming Liu,..., Max Tegmark
69 2022-12-19 Latent Diffusion for Language Generation link Justin Lovelace, Varsha Kishore,..., Kilian Q Weinberger
69 None Describe, Explain, Plan and Select: Interactive Planning with LLMs
Enables Open-World Multi-Task Agents
link Zihao Wang, Shaofei Cai,..., Yitao Liang
68 2023-05-25 Scan and Snap: Understanding Training Dynamics and Token Composition
in 1-layer Transformer
link Yuandong Tian, Yiping Wang,..., Simon Shaolei Du
68 2023-05-21 DreamWaltz: Make a Scene with Complex 3D Animatable Avatars link Yukun Huang, Jianan Wang,..., Lei Zhang
68 2023-05-19 PointGPT: Auto-regressively Generative Pre-training from Point Clouds link Guangyan Chen, Meiling Wang,..., Yufeng Yue
68 2023-05-29 GlyphControl: Glyph Conditional Control for Visual Text Generation link Yukang Yang, Dongnan Gui,..., Kai Chen
67 2023-06-02 The Surprising Effectiveness of Diffusion Models for Optical Flow
and Monocular Depth Estimation
link Saurabh Saxena, Charles Herrmann,..., David J. Fleet
67 2023-05-01 In-Context Learning Unlocked for Diffusion Models link Zhendong Wang, Yifan Jiang,..., Mingyuan Zhou
67 2023-05-22 To Repeat or Not To Repeat: Insights from Scaling
LLM under Token-Crisis
link Fuzhao Xue, Yao Fu,..., Yang You
67 2023-05-17 What You See is What You Read? Improving Text-Image
Alignment Evaluation
link Michal Yarom, Yonatan Bitton,..., Idan Szpektor
66 2023-06-01 STEVE-1: A Generative Model for Text-to-Behavior in Minecraft link Shalev Lifshitz, Keiran Paster,..., Sheila A. McIlraith
66 2023-05-18 LLMScore: Unveiling the Power of Large Language Models in
Text-to-Image Synthesis Evaluation
link Yujie Lu, Xianjun Yang,..., William Yang Wang
66 2023-05-31 Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias link Zhongwei Wan, Che Liu,..., Rossella Arcucci
65 2023-11-08 Hierarchically Gated Recurrent Neural Network for Sequence Modeling link Zhen Qin, Songlin Yang, Yiran Zhong
64 2023-06-26 Pretraining task diversity and the emergence of non-Bayesian in-context
learning for regression
link Allan Raventos, Mansheej Paul,..., Surya Ganguli
64 2023-05-08 Recommender Systems with Generative Retrieval link Shashank Rajput, Nikhil Mehta,..., Maheswaran Sathiamoorthy
64 2023-05-25 Diversify Your Vision Datasets with Automatic Diffusion-based Augmentation link Lisa Dunlap, Alyssa Umino,..., Trevor Darrell
64 2023-05-23 Weakly Supervised 3D Open-vocabulary Segmentation link Kunhao Liu, Fangneng Zhan,..., Shijian Lu
64 2022-06-14 Automatic Clipping: Differentially Private Deep Learning Made Easier and
Stronger
link Zhiqi Bu, Yu-Xiang Wang,..., George Karypis
64 2023-02-03 Rethinking Semi-Supervised Medical Image Segmentation: A Variance-Reduction Perspective link Chenyu You, Weicheng Dai,..., James s Duncan
63 2023-10-26 Global Structure-Aware Diffusion Process for Low-light Image Enhancement link Jinhui HOU, Zhiyu Zhu,..., Hui Yuan
63 2023-03-29 Diffusion Schrödinger Bridge Matching link Yuyang Shi, Valentin De Bortoli,..., Arnaud Doucet
62 2023-07-20 A Definition of Continual Reinforcement Learning link David Abel, Andre Barreto,..., Satinder Singh
61 2023-10-31 Unexpected Improvements to Expected Improvement for Bayesian Optimization link Sebastian Ament, Sam Daulton,..., Eytan Bakshy
61 2023-09-15 Compositional Foundation Models for Hierarchical Planning link Anurag Ajay, Seungwook Han,..., Pulkit Agrawal
61 2023-09-25 Evaluating Cognitive Maps and Planning in Large Language Models
with CogEval
link Ida Momennejad, Hosein Hasanbeig,..., Jonathan Larson
61 2023-03-12 Synthetic Experience Replay link Cong Lu, Philip J. Ball,..., Jack Parker-Holder
60 2023-06-22 Squeeze, Recover and Relabel: Dataset Condensation at ImageNet Scale
From A New Perspective
link Zeyuan Yin, Eric Xing, Zhiqiang Shen
60 2023-10-11 RoboCLIP: One Demonstration is Enough to Learn Robot Policies link Sumedh Anand Sontakke, Jesse Zhang,..., Laurent Itti
60 2023-06-08 SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions link Yuseung Lee, Kunho Kim,..., Minhyuk Sung
59 2023-05-22 Hierarchical Integration Diffusion Model for Realistic Image Deblurring link Zheng Chen, Yulun Zhang,..., Xin Yuan
59 2023-11-14 MADG: Margin-based Adversarial Learning for Domain Generalization link Aveen Dayal, Vimal K B,..., Vineeth N. Balasubramanian
58 2023-05-30 Ambient Diffusion: Learning Clean Distributions from Corrupted Data link Giannis Daras, Kulin Shah,..., Adam Klivans
58 2023-05-28 GIMLET: A Unified Graph-Text Model for Instruction-Based Molecule Zero-Shot
Learning
link Haiteng Zhao, Shengchao Liu,..., Qi Liu
58 2023-05-18 Content-based Unrestricted Adversarial Attack link Zhaoyu Chen, Bo Li,..., Wenqiang Zhang
58 2023-06-02 LoCoOp: Few-Shot Out-of-Distribution Detection via Prompt Learning link Atsuyuki Miyai, Qing Yu,..., Kiyoharu Aizawa
58 2023-05-24 Flocks of Stochastic Parrots: Differentially Private Prompt Learning for
Large Language Models
link Haonan Duan, Adam Dziedzic,..., Franziska Boenisch
57 2023-05-23 Uncertainty Quantification over Graph with Conformalized Graph Neural Networks link Kexin Huang, Ying Jin,..., Jure Leskovec
57 2023-05-21 PRODIGY: Enabling In-context Learning Over Graphs link Qian Huang, Hongyu Ren,..., Jure Leskovec
57 2023-10-23 SpecTr: Fast Speculative Decoding via Optimal Transport link Ziteng Sun, Ananda Theertha Suresh,..., Felix Yu
57 2023-03-03 Revisiting Adversarial Training for ImageNet: Architectures, Training and Generalization
across Threat Models
link Naman Deep Singh, Francesco Croce, Matthias Hein
57 2023-09-24 GraphAdapter: Tuning Vision-Language Models With Dual Knowledge Graph link Xin Li, Dongze Lian,..., Xinchao Wang
57 2023-12-07 CLadder: Assessing Causal Reasoning in Language Models link Zhijing Jin, Yuen Chen,..., Bernhard Schölkopf
57 2023-09-25 Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM
Animator
link Hanzhuo Huang, Yufan Feng,..., Sibei Yang
57 2023-07-04 DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation link Shentong Mo, Enze Xie,..., Zhenguo Li
56 2023-03-20 Object-Centric Slot Diffusion link Jindong Jiang, Fei Deng,..., Sungjin Ahn
56 2023-11-02 Align Your Prompts: Test-Time Prompting with Distribution Alignment for
Zero-Shot Generalization
link Jameel Hassan Abdul Samadh, Hanan Gani,..., Salman Khan
56 2023-05-31 Efficient Diffusion Policies For Offline Reinforcement Learning link Bingyi Kang, Xiao Ma,..., Shuicheng YAN
56 2023-10-08 FedFed: Feature Distillation against Data Heterogeneity in Federated Learning link Zhiqin Yang, Yonggang Zhang,..., Bo Han
55 2023-12-11 4M: Massively Multimodal Masked Modeling link David Mizrahi, Roman Bachmann,..., Amir Zamir
55 2023-05-27 Scalable Transformer for PDE Surrogate Modeling link Zijie Li, Dule Shu, Amir Barati Farimani
55 2023-11-03 ForecastPFN: Synthetically-Trained Zero-Shot Forecasting link Samuel Dooley, Gurnoor Singh Khurana,..., Colin White
54 2023-06-09 $S^3$: Increasing GPU Utilization during Generative Inference for Higher
Throughput
link Yunho Jin, Chun-Feng Wu,..., Gu-Yeon Wei
54 2023-06-01 Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling
and Transfer Behavior
link Shashank Subramanian, Peter Harrington,..., Amir Gholami
54 2023-05-19 Scaling laws for language encoding models in fMRI link Richard Antonello, Aditya Vaidya, Alexander Huth
53 2023-06-26 Equivariant flow matching link Leon Klein, Andreas Krämer, Frank Noe
53 2023-07-31 Conformal PID Control for Time Series Prediction link Anastasios Nikolas Angelopoulos, Emmanuel Candes, Ryan Tibshirani
53 2023-10-12 Neural Combinatorial Optimization with Heavy Decoder: Toward Large Scale
Generalization
link Fu Luo, Xi Lin,..., Zhenkun Wang
52 2023-05-19 Cinematic Mindscapes: High-quality Video Reconstruction from Brain Activity link Zijiao Chen, Jiaxin Qing, Juan Helen Zhou
52 2023-02-12 MarioGPT: Open-Ended Text2Level Generation through Large Language Models link Shyam Sudhakaran, Miguel González-Duque,..., Sebastian Risi
52 2023-06-01 Nonparametric Identifiability of Causal Representations from Unknown Interventions link Julius von Kügelgen, Michel Besserve,..., Bernhard Schölkopf
52 2023-06-07 Fine-Grained Visual Prompting link Lingfeng Yang, Yueze Wang,..., Jian Yang
52 2023-07-19 PreDiff: Precipitation Nowcasting with Latent Diffusion Models link Zhihan Gao, Xingjian Shi,..., Bernie Wang
52 2023-05-22 Textually Pretrained Speech Language Models link Michael Hassid, Tal Remez,..., Yossi Adi
52 2023-07-30 Crystal Structure Prediction by Joint Equivariant Diffusion link Rui Jiao, Wenbing Huang,..., Yang Liu
51 2023-05-25 Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers link Sotiris Anagnostidis, Dario Pavllo,..., Thomas Hofmann
51 2023-05-19 Post Hoc Explanations of Language Models Can Improve Language
Models
link Satyapriya Krishna, Jiaqi Ma,..., Himabindu Lakkaraju
51 2023-09-25 FeCAM: Exploiting the Heterogeneity of Class Distributions in Exemplar-Free
Continual Learning
link Dipam Goswami, Yuyang Liu,..., Joost van de Weijer
51 2023-07-24 Understanding the Latent Space of Diffusion Models through the
Lens of Riemannian Geometry
link Yong-Hyun Park, Mingi Kwon,..., Youngjung Uh
50 2023-05-31 From Pixels to UI Actions: Learning to Follow Instructions
via Graphical User Interfaces
link Peter Shaw, Mandar Joshi,..., Kristina Toutanova
50 2023-04-27 Convergence of Adam Under Relaxed Assumptions link Haochuan Li, Alexander Rakhlin, Ali Jadbabaie
50 2023-06-05 Structure-free Graph Condensation: From Large-scale Graphs to Condensed Graph-free
Data
link Xin Zheng, Miao Zhang,..., Shirui Pan
50 2023-04-11 Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference link Tao Lei, Junwen Bai,..., Ming-Wei Chang
50 2023-05-11 An Inverse Scaling Law for CLIP Training link Xianhang Li, Zeyu Wang, Cihang Xie
50 2023-05-28 Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive
Tasks
link Minki Kang, Seanie Lee,..., Sung Ju Hwang
50 2023-10-19 Fast Model DeBias with Machine Unlearning link Ruizhe Chen, Jianfei Yang,..., Zuozhu Liu
50 2023-10-30 One-for-All: Bridge the Gap Between Heterogeneous Architectures in Knowledge
Distillation
link Zhiwei Hao, Jianyuan Guo,..., Chang Xu
50 2023-05-23 Video Prediction Models as Rewards for Reinforcement Learning link Alejandro Escontrela, Ademi Adeniji,..., Pieter Abbeel
49 2023-05-31 Dense and Aligned Captions (DAC) Promote Compositional Reasoning in
VL Models
link Sivan Doveh, Assaf Arbelle,..., Leonid Karlinsky
49 2023-06-03 DYffusion: A Dynamics-informed Diffusion Model for Spatiotemporal Forecasting link Salva Rühling Cachay, Bo Zhao,..., Rose Yu
49 2023-07-06 MomentDiff: Generative Video Moment Retrieval from Random to Real link Pandeng Li, Chen-Wei Xie,..., Yongdong Zhang
49 2023-07-12 Identifiability Guarantees for Causal Disentanglement from Soft Interventions link Jiaqi Zhang, Kristjan Greenewald,..., Caroline Uhler
49 2023-06-01 Inserting Anybody in Diffusion Models via Celeb Basis link Ge Yuan, Xiaodong Cun,..., Huicheng Zheng
49 2023-10-13 Rank-DETR for High Quality Object Detection link Yifan Pu, Weicong Liang,..., Gao Huang
49 2023-05-22 Response Length Perception and Sequence Scheduling: An LLM-Empowered
LLM Inference Pipeline
link Zangwei Zheng, Xiaozhe Ren,..., Yang You
49 2023-09-27 Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image
Editing
link Kai Wang, Fei Yang,..., Joost van de Weijer
48 2023-06-13 Image Captioners Are Scalable Vision Learners Too link Michael Tschannen, Manoj Kumar,..., Lucas Beyer
48 2023-06-30 SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen
LLMs
link Lijun Yu, Yong Cheng,..., Lu Jiang
48 2023-06-04 Temporal Dynamic Quantization for Diffusion Models link Junhyuk So, Jungwon Lee,..., Eunhyeok Park
48 2023-05-30 Likelihood-Based Diffusion Language Models link Ishaan Gulrajani, Tatsunori Hashimoto
48 2023-05-22 Getting ViT in Shape: Scaling Laws for Compute-Optimal Model
Design
link Ibrahim Alabdulmohsin, Xiaohua Zhai,..., Lucas Beyer
48 2023-05-16 AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation link Tong Wu, Zhihao Fan,..., Weizhu Chen
48 2023-05-25 MixFormerV2: Efficient Fully Transformer Tracking link Yutao Cui, Tianhui Song,..., Limin Wang
48 2022-12-15 MAViL: Masked Audio-Video Learners link Po-Yao Huang, Vasu Sharma,..., Christoph Feichtenhofer
47 2023-07-26 Skill-it! A data-driven skills framework for understanding and training
language models
link Mayee F Chen, Nicholas Roberts,..., Christopher Re
47 2022-09-01 ID and OOD Performance Are Sometimes Inversely Correlated on
Real-world Datasets
link Damien Teney, LIN Yong,..., Ehsan Abbasnejad
47 2022-12-20 Parsel🐍: Algorithmic Reasoning with Language Models by Composing Decompositions link Eric Zelikman, Qian Huang,..., Nick Haber
47 2023-05-30 Grammar Prompting for Domain-Specific Language Generation with Large
Language Models
link Bailin Wang, Zi Wang,..., Yoon Kim
47 2023-11-01 CROMA: Remote Sensing Representations with Contrastive Radar-Optical Masked Autoencoders link Anthony Fuller, Koreen Millard, James R Green
47 2023-02-09 Read and Reap the Rewards: Learning to Play Atari
with the Help of Instruction Manuals
link Yue Wu, Yewen Fan,..., Tom Mitchell
46 2023-06-01 Exposing Attention Glitches with Flip-Flop Language Modeling link Bingbin Liu, Jordan T. Ash,..., Cyril Zhang
46 2023-05-25 Efficient Neural Music Generation link Max W. Y. Lam, Qiao Tian,..., Yuxuan Wang
46 2023-02-01 The geometry of hidden representations of large transformer models link Lucrezia Valeriani, Diego Doimo,..., Alberto Cazzaniga
46 2023-04-03 AUDIT: Audio Editing by Following Instructions with Latent Diffusion
Models
link Yuancheng Wang, Zeqian Ju,..., sheng zhao
46 2023-10-09 Domain Watermark: Effective and Harmless Dataset Copyright Protection is
Closed at Hand
link Junfeng Guo, Yiming Li,..., Bo Li
46 2023-05-24 Inverse Preference Learning: Preference-based RL without a Reward Function link Joey Hejna, Dorsa Sadigh
46 None PromptRestorer: A Prompting Image Restoration Method with Degradation Perception link Cong Wang, Jinshan Pan,..., Junyang Chen
45 2023-05-23 Siamese Masked Autoencoders link Agrim Gupta, Jiajun Wu,..., Li Fei-Fei
45 2023-05-25 Parallel Sampling of Diffusion Models link Andy Shih, Suneel Belkhale,..., Nima Anari
45 2023-05-24 Deep Reinforcement Learning with Plasticity Injection link Evgenii Nikishin, Junhyuk Oh,..., Andre Barreto
45 2023-05-25 Knowledge Diffusion for Distillation link Tao Huang, Yuan Zhang,..., Chang Xu
45 2023-09-10 SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion
Models
link Shuchen Xue, Mingyang Yi,..., Zhi-Ming Ma
45 2023-05-30 PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation link Jialu Li, Mohit Bansal
45 2022-10-06 A Logic for Expressing Log-Precision Transformers link William Merrill, Ashish Sabharwal
45 2023-05-24 Exploring Diverse In-Context Configurations for Image Captioning link Xu Yang, Yongliang Wu,..., Xin Geng
45 2023-06-12 VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models link Sheng-Yen Chou, Pin-Yu Chen, Tsung-Yi Ho
45 2023-02-27 Permutation Equivariant Neural Functionals link Allan Zhou, Kaien Yang,..., Chelsea Finn
45 None How2comm: Communication-Efficient and Collaboration-Pragmatic Multi-Agent Perception link Dingkang Yang, Kun Yang,..., Lihua Zhang
44 None When Do Graph Neural Networks Help with Node Classification?
Investigating the Homophily Principle on Node Distinguishability
link Sitao Luan, Chenqing Hua,..., Doina Precup
44 2023-10-31 Generate What You Prefer: Reshaping Sequential Recommendation via Guided
Diffusion
link Zhengyi Yang, Jiancan Wu,..., Xiangnan He
44 2023-10-25 CoDet: Co-occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection link Chuofan Ma, Yi Jiang,..., XIAOJUAN QI
44 2023-10-23 Large Language Models are Visual Reasoning Coordinators link Liangyu Chen, Bo Li,..., Ziwei Liu
44 2023-06-08 Factorized Contrastive Learning: Going Beyond Multi-view Redundancy link Paul Pu Liang, Zihao Deng,..., Russ Salakhutdinov
43 2023-05-18 SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models link Ziyi Wu, Jingyu Hu,..., Animesh Garg
43 2023-06-11 A Holistic Approach to Unifying Automatic Concept Extraction and
Concept Importance Estimation
link Thomas FEL, Victor Boutin,..., Thomas Serre
43 2023-05-05 Large Language Models for Automated Data Science: Introducing CAAFE
for Context-Aware Automated Feature Engineering
link Noah Hollmann, Samuel Müller, Frank Hutter
43 2023-02-28 Goal Driven Discovery of Distributional Differences via Language Descriptions link Ruiqi Zhong, Peter Zhang,..., Jacob Steinhardt
43 2023-05-31 Direct Diffusion Bridge using Data Consistency for Inverse Problems link Hyungjin Chung, Jeongsol Kim, Jong Chul Ye
43 2023-10-23 FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation
Models
link Lihe Yang, Xiaogang Xu,..., Hengshuang Zhao
43 2023-09-23 Dream the Impossible: Outlier Imagination with Diffusion Models link Xuefeng Du, Yiyou Sun,..., Yixuan Li
42 2023-05-26 Flow Matching for Scalable Simulation-Based Inference link Jonas Bernhard Wildberger, Maximilian Dax,..., Bernhard Schölkopf
42 2023-05-09 The emergence of clusters in self-attention dynamics link Borjan Geshkovski, Cyril Letrouit,..., Philippe Rigollet
42 2023-06-21 Training Transformers with 4-bit Integers link Haocheng Xi, ChangHao Li,..., Jun Zhu
42 2023-06-26 DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion
Models
link XiMing Xing, Chuang Wang,..., Dong Xu
42 2023-12-08 Few-Shot Class-Incremental Learning via Training-Free Prototype Calibration link Qi-Wei Wang, Da-Wei Zhou,..., Han-Jia Ye
42 2023-05-30 Joint Bayesian Inference of Graphical Structure and Parameters with
a Single Generative Flow Network
link Tristan Deleu, Mizu Nishikawa-Toomey,..., Yoshua Bengio
42 2023-06-26 Restart Sampling for Improving Generative Processes link Yilun Xu, Mingyang Deng,..., Tommi S. Jaakkola
42 2023-10-21 Contrast Everything: A Hierarchical Contrastive Framework for Medical Time-Series link Yihe Wang, Yu Han,..., Xiang Zhang
42 2023-06-04 For SALE: State-Action Representation Learning for Deep Reinforcement Learning link Scott Fujimoto, Wei-Di Chang,..., David Meger
42 2022-12-21 Hidden Poison: Machine Unlearning Enables Camouflaged Poisoning Attacks link Jimmy Z. Di, Jack Douglas,..., Ayush Sekhari
42 2023-05-26 Language Models Can Improve Event Prediction by Few-Shot Abductive
Reasoning
link Xiaoming Shi, Siqiao Xue,..., Hongyuan Mei
42 2023-06-06 Towards Label-free Scene Understanding by Vision Foundation Models link Runnan Chen, Youquan Liu,..., Wenping Wang
42 2023-05-26 CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image
Steganography
link Jiwen Yu, Xuanyu Zhang,..., Jian Zhang
41 2023-10-18 Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture link Daniel Y Fu, Simran Arora,..., Christopher Re
41 2023-10-13 Compositional Abilities Emerge Multiplicatively: Exploring Diffusion Models on a
Synthetic Task
link Maya Okawa, Ekdeep Singh Lubana,..., Hidenori Tanaka
41 2023-06-16 HiNeRV: Video Compression with Hierarchical Encoding-based Neural Representation link Ho Man Kwan, Ge Gao,..., David Bull
41 None Tree-Rings Watermarks: Invisible Fingerprints for Diffusion Images link Yuxin Wen, John Kirchenbauer,..., Tom Goldstein
41 2023-10-25 Towards Self-Interpretable Graph-Level Anomaly Detection link Yixin Liu, Kaize Ding,..., Shirui Pan
41 2023-06-04 Data Quality in Imitation Learning link Suneel Belkhale, Yuchen Cui, Dorsa Sadigh
41 2023-07-07 Scalable Membership Inference Attacks via Quantile Regression link Martin Andres Bertran, Shuai Tang,..., Steven Wu
41 2023-09-25 DeepACO: Neural-enhanced Ant Systems for Combinatorial Optimization link Haoran Ye, Jiarui Wang,..., Yong Li
41 2023-06-10 TensorNet: Cartesian Tensor Representations for Efficient Learning of Molecular
Potentials
link Guillem Simeon, Gianni De Fabritiis
41 2023-12-22 FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and Editing link Mingyuan Zhang, Huirong Li,..., Ziwei Liu
41 2022-11-25 Expanding Small-Scale Datasets with Guided Imagination link Yifan Zhang, Daquan Zhou,..., Jiashi Feng
41 2023-06-05 HeadSculpt: Crafting 3D Head Avatars with Text link Xiao Han, Yukang Cao,..., Kwan-Yee K. Wong
40 2023-06-28 Separable Physics-Informed Neural Networks link Junwoo Cho, Seungtae Nam,..., Eunbyung Park
40 2023-07-12 No Train No Gain: Revisiting Efficient Training Algorithms For
Transformer-based Language Models
link Jean Kaddour, Oscar Key,..., Matt Kusner
40 2023-02-17 Consistent Diffusion Models: Mitigating Sampling Drift by Learning to
be Consistent
link Giannis Daras, Yuval Dagan,..., Constantinos Costis Daskalakis
40 2023-06-29 Graph Denoising Diffusion for Inverse Protein Folding link Kai Yi, Bingxin Zhou,..., Yu Guang Wang
40 2023-12-12 Equivariant Flow Matching with Hybrid Probability Transport for 3D
Molecule Generation
link Yuxuan Song, Jingjing Gong,..., Wei-Ying Ma
40 2023-09-25 IEBins: Iterative Elastic Bins for Monocular Depth Estimation link Shuwei Shao, Zhongcai Pei,..., Zhengguo Li
40 2022-09-30 Universal Prompt Tuning for Graph Neural Networks link Taoran Fang, Yunchao Mercer Zhang,..., Lei CHEN
39 2023-07-22 HIQL: Offline Goal-Conditioned RL with Latent States as Actions link Seohong Park, Dibya Ghosh,..., Sergey Levine
39 2023-06-02 Spatially Resolved Gene Expression Prediction from Histology Images via
Bi-modal Contrastive Learning
link Ronald Xie, Kuan Pang,..., Gary Bader
39 2023-06-20 LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical Imaging
via Second-order Graph Matching
link Duy Minh Ho Nguyen, Hoang Nguyen,..., Mathias Niepert
39 2023-05-25 Diffusion-Based Adversarial Sample Generation for Improved Stealthiness and Controllability link Haotian Xue, Alexandre Araujo,..., Yongxin Chen
39 2023-07-21 Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time
Series Forecasting
link Marcel Kollovieh, Abdul Fatir Ansari,..., Bernie Wang
39 2023-07-07 Autodecoding Latent 3D Diffusion Models link Evangelos Ntavelis, Aliaksandr Siarohin,..., Sergey Tulyakov
38 2023-02-02 Timewarp: Transferable Acceleration of Molecular Dynamics by Learning Time-Coarsened
Dynamics
link Leon Klein, Andrew Y. K. Foong,..., Ryota Tomioka
38 2023-07-06 Pruning vs Quantization: Which is Better? link Andrey Kuzmin, Markus Nagel,..., Tijmen Blankevoort
38 2023-02-14 Energy Transformer link Benjamin Hoover, Yuchen Liang,..., Dmitry Krotov
38 2023-08-16 Towards Personalized Federated Learning via Heterogeneous Model Reassembly link Jiaqi Wang, Xingyi Yang,..., Fenglong Ma
38 2023-09-29 Asynchrony-Robust Collaborative Perception via Bird's Eye View Flow link Sizhe Wei, Yuxi Wei,..., Ya Zhang
38 2023-04-23 DiffTraj: Generating GPS Trajectory with Diffusion Probabilistic Model link Yuanshao Zhu, Yongchao Ye,..., James Yu
37 2023-05-22 On quantum backpropagation, information reuse, and cheating measurement collapse link Amira Abbas, Robbie King,..., Jarrod Ryan McClean
37 None P-Flow: A Fast and Data-Efficient Zero-Shot TTS through Speech
Prompting
link Sungwon Kim, Kevin J. Shih,..., Bryan Catanzaro
37 2024-02-01 ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields link Jiahua Dong, Yu-Xiong Wang
37 2023-03-01 Time Series as Images: Vision Transformer for Irregularly Sampled
Time Series
link Zekun Li, Shiyang Li, Xifeng Yan
37 2023-06-07 Object-Centric Learning for Real-World Videos by Predicting Temporal Feature
Similarities
link Andrii Zadaianchuk, Maximilian Seitzer, Georg Martius
36 2022-11-02 Entropic Neural Optimal Transport via Diffusion Processes link Nikita Gushchin, Alexander Kolesov,..., Evgeny Burnaev
36 2022-10-26 The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for
Implicature Resolution by LLMs
link Laura Eline Ruis, Akbir Khan,..., Edward Grefenstette
36 2023-06-23 Max-Margin Token Selection in Attention Mechanism link Davoud Ataee Tarzanagh, Yingcong Li,..., Samet Oymak
36 2023-06-23 Scaling MLPs: A Tale of Inductive Bias link Gregor Bachmann, Sotiris Anagnostidis, Thomas Hofmann
36 2023-06-15 Class-Conditional Conformal Prediction with Many Classes link Tiffany Ding, Anastasios Nikolas Angelopoulos,..., Ryan Tibshirani
36 2023-02-14 Bounding training data reconstruction in DP-SGD link Jamie Hayes, Borja Balle, Saeed Mahloujifar
36 2023-06-08 Boosting Adversarial Transferability by Achieving Flat Local Maxima link Zhijin Ge, Hongying Liu,..., Yuanyuan Liu
36 2023-04-25 Stable and low-precision training for large-scale vision-language models link Mitchell Wortsman, Tim Dettmers,..., Ludwig Schmidt
36 2023-05-29 PHOTOSWAP: Personalized Subject Swapping in Images link Jing Gu, Yilin Wang,..., Xin Eric Wang
36 2023-10-22 Hierarchical Vector Quantized Transformer for Multi-class Unsupervised Anomaly Detection link Ruiying Lu, YuJie Wu,..., Ruimin Hu
36 2023-04-02 SEENN: Towards Temporal Spiking Early Exit Neural Networks link Yuhang Li, Tamar Geller,..., Priyadarshini Panda
36 2023-04-25 Parallel Spiking Neurons with High Efficiency and Ability to
Learn Long-term Dependencies
link Wei Fang, Zhaofei Yu,..., Yonghong Tian
35 2023-07-28 AbDiffuser: full-atom generation of in-vitro functioning antibodies link Karolis Martinkus, Jan Ludwiczak,..., Andreas Loukas
35 2023-03-01 Grounded Decoding: Guiding Text Generation with Grounded Models for
Embodied Agents
link Wenlong Huang, Fei Xia,..., brian ichter
35 2022-12-06 GAUCHE: A Library for Gaussian Processes in Chemistry link Ryan-Rhys Griffiths, Leo Klarner,..., Jian Tang
35 2023-07-20 OBJECT 3DIT: Language-guided 3D-aware Image Editing link Oscar Michel, Anand Bhattad,..., Tanmay Gupta
35 2023-05-17 End-To-End Latent Variational Diffusion Models for Inverse Problems in
High Energy Physics
link Alexander Shmakov, Kevin Greif,..., Daniel Whiteson
35 2023-08-27 Revisiting Scalarization in Multi-Task Learning: A Theoretical Perspective link Yuzheng Hu, Ruicheng Xian,..., Han Zhao
35 2023-06-02 Interpretable and Explainable Logical Policies via Neurally Guided Symbolic
Abstraction
link Quentin Delfosse, Hikaru Shindo,..., Kristian Kersting
35 2023-06-29 Neural Polarizer: A Lightweight and Effective Backdoor Defense via
Purifying Poisoned Features
link Mingli Zhu, Shaokui Wei,..., Baoyuan Wu
35 None BIOT: Biosignal Transformer for Cross-data Learning in the Wild link Chaoqi Yang, M Brandon Westover, Jimeng Sun
35 2023-05-30 LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images link Viraj Uday Prabhu, Sriram Yenamandra,..., Judy Hoffman
35 2023-07-05 Elastic Decision Transformer link Yueh-Hua Wu, Xiaolong Wang, Masashi Hamaya
34 2023-07-07 When Do Transformers Shine in RL? Decoupling Memory from
Credit Assignment
link Tianwei Ni, Michel Ma,..., Pierre-Luc Bacon
34 2023-04-02 Saddle-to-Saddle Dynamics in Diagonal Linear Networks link Scott Pesme, Nicolas Flammarion
34 2023-07-10 Compositional Generalization from First Principles link Thaddäus Wiedemer, Prasanna Mayilvahanan,..., Wieland Brendel
34 2023-06-14 Training-free Diffusion Model Adaptation for Variable-Sized Text-to-Image Synthesis link Zhiyu Jin, Xuli Shen,..., Xiangyang Xue
34 2023-06-09 PoET: A generative model of protein families as sequences-of-sequences link Timothy Fei Truong Jr, Tristan Bepler
34 2022-06-27 Supply-Side Equilibria in Recommender Systems link Meena Jagadeesan, Nikhil Garg, Jacob Steinhardt
34 None Adaptive Normalization for Non-stationary Time Series Forecasting: A Temporal
Slice Perspective
link Zhiding Liu, Mingyue Cheng,..., Enhong Chen
34 2022-12-23 A-NeSI: A Scalable Approximate Method for Probabilistic Neurosymbolic Inference link Emile van Krieken, Thiviyan Thanapalasingam,..., Annette Ten Teije
34 None CrossGNN: Confronting Noisy Multivariate Time Series Via Cross Interaction
Refinement
link Qihe Huang, Lei Shen,..., Yang Wang
34 None Dynamic Personalized Federated Learning with Adaptive Differential Privacy link Xiyuan Yang, Wenke Huang, Mang Ye
34 2023-07-20 Shared Adversarial Unlearning: Backdoor Mitigation by Unlearning Shared Adversarial
Examples
link Shaokui Wei, Mingda Zhang,..., Baoyuan Wu
34 2023-07-03 Hierarchical Open-vocabulary Universal Image Segmentation link Xudong Wang, Shufan Li,..., Trevor Darrell
33 2023-05-18 Clifford Group Equivariant Neural Networks link David Ruhe, Johannes Brandstetter, Patrick Forré
33 None Let the Flows Tell: Solving Graph Combinatorial Problems
with GFlowNets
link Dinghuai Zhang, Hanjun Dai,..., Ling Pan
33 None Two-Stage Learning to Defer with Multiple Experts link Anqi Mao, Christopher Mohri,..., Yutao Zhong
33 2023-10-27 Learning to Search Feasible and Infeasible Regions of Routing
Problems with Flexible Neural k-Opt
link Yining Ma, Zhiguang Cao, Yeow Meng Chee
33 2023-05-26 Causal Component Analysis link Wendong Liang, Armin Kekić,..., Bernhard Schölkopf
33 2023-05-24 ALGO: Synthesizing Algorithmic Programs with Generated Oracle Verifiers link Kexun Zhang, Danqing Wang,..., Lei Li
33 2023-12-13 Distributed Inference and Fine-tuning of Large Language Models Over
The Internet
link Alexander Borzunov, Max Ryabinin,..., Colin Raffel
33 2023-06-13 (Amplified) Banded Matrix Factorization: A unified approach to private
training
link Christopher A. Choquette-Choo, Arun Ganesh,..., Zheng Xu
33 2023-05-26 Demo2Code: From Summarizing Demonstrations to Synthesizing Code via Extended
Chain-of-Thought
link Huaxiaoyue Wang, Gonzalo Gonzalez-Pumariega,..., Sanjiban Choudhury
33 2023-09-25 LinGCN: Structural Linearized Graph Convolutional Network for Homomorphically Encrypted
Inference
link Hongwu Peng, Ran Ran,..., Caiwen Ding
33 2023-10-23 Rethinking Tokenizer and Decoder in Masked Graph Modeling for
Molecules
link Zhiyuan Liu, Yaorui Shi,..., Tat-Seng Chua
33 2023-05-30 Intriguing Properties of Quantization at Scale link Arash Ahmadian, Saurabh Dash,..., Sara Hooker
33 2022-06-07 A*Net: A Scalable Path-based Reasoning Approach for Knowledge Graphs link Zhaocheng Zhu, Xinyu Yuan,..., Jian Tang
33 2023-06-18 Online Map Vectorization for Autonomous Driving: A Rasterization Perspective link Gongjie Zhang, Jiahao Lin,..., Zuoguan Wang
33 2023-10-04 CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for
Open-vocabulary 3D Object Detection
link Yang Cao, Yihan Zeng,..., Dan Xu
32 2023-06-01 Learning Transformer Programs link Dan Friedman, Alexander Wettig, Danqi Chen
32 2023-05-18 Smoothing the Landscape Boosts the Signal for SGD: Optimal
Sample Complexity for Learning Single Index Models
link Alex Damian, Eshaan Nichani,..., Jason D. Lee
32 2023-05-25 Demystifying Oversmoothing in Attention-Based Graph Neural Networks link Xinyi Wu, Amir Ajorlou,..., Ali Jadbabaie
32 2022-10-03 Rank-N-Contrast: Learning Continuous Representations for Regression link Kaiwen Zha, Peng Cao,..., Dina Katabi
32 2023-07-14 HyTrel: Hypergraph-enhanced Tabular Data Representation Learning link Pei Chen, Soumajyoti Sarkar,..., George Karypis
32 2023-10-24 A Unified, Scalable Framework for Neural Population Decoding link Mehdi Azabou, Vinam Arora,..., Eva L Dyer
32 2023-05-24 Reverse Engineering Self-Supervised Learning link Ido Ben-Shaul, Ravid Shwartz-Ziv,..., Yann LeCun
32 2022-03-29 Causal de Finetti: On the Identification of Invariant Causal
Structure in Exchangeable Data
link Siyuan Guo, Viktor Tóth,..., Ferenc Huszár
32 2023-03-19 Unsupervised Learning for Solving the Travelling Salesman Problem link Yimeng Min, Yiwei Bai, Carla P Gomes
32 None SwapPrompt: Test-Time Prompt Adaptation for Vision-Language Models link Xiaosong Ma, Jie ZHANG,..., Wenchao Xu
32 2023-05-26 SOC: Semantic-Assisted Object Cluster for Referring Video Object
Segmentation
link Zhuoyan Luo, Yicheng Xiao,..., Yujiu Yang
32 2023-05-16 Revisiting the Minimalist Approach to Offline Reinforcement Learning link Denis Tarasov, Vladislav Kurenkov,..., Sergey Kolesnikov
32 2023-03-23 Fairness-guided Few-shot Prompting for Large Language Models link Huan Ma, Changqing Zhang,..., Bingzhe Wu
32 2023-09-23 Deciphering Spatio-Temporal Graph Forecasting: A Causal Lens and Treatment link Yutong Xia, Yuxuan Liang,..., Roger Zimmermann
32 2023-10-13 Does Graph Distillation See Like Vision Dataset Counterpart? link Beining Yang, Kai Wang,..., Jianxin Li
32 2022-09-13 Characterizing Graph Datasets for Node Classification: Homophily-Heterophily Dichotomy and
Beyond
link Oleg Platonov, Denis Kuznedelev,..., Liudmila Prokhorenkova
32 2023-06-01 StyleGAN knows Normal, Depth, Albedo, and More link Anand Bhattad, Daniel McKee,..., David Forsyth
31 2023-02-21 Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few
Labels
link Zebin You, Yong Zhong,..., Jun Zhu
31 2023-07-13 Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement link Hui Yuan, Kaixuan Huang,..., Mengdi Wang
31 2023-10-07 VLATTACK: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained
Models
link Ziyi Yin, Muchao Ye,..., Fenglong Ma
31 2023-09-15 Towards Last-layer Retraining for Group Robustness with Fewer Annotations link Tyler LaBonte, Vidya Muthukumar, Abhishek Kumar
31 2023-05-22 Meta-in-context learning in large language models link Julian Coda-Forno, Marcel Binz,..., Eric Schulz
31 2023-08-02 ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation link Yasheng SUN, Yifan Yang,..., Hideki Koike
31 2022-11-25 Bypass Exponential Time Preprocessing: Fast Neural Network Training via
Weight-Data Correlation Preprocessing
link Josh Alman, Jiehao Liang,..., Danyang Zhuo
31 2023-03-22 EDGI: Equivariant Diffusion for Planning with Embodied Agents link Johann Brehmer, Joey Bose,..., Taco Cohen
31 None ClusterFomer: Clustering As A Universal Visual Learner link James Chenhao Liang, Yiming Cui,..., Dongfang Liu
31 2023-05-18 DiffUTE: Universal Text Editing Diffusion Model link Haoxing Chen, Zhuoer Xu,..., Weiqiang Wang
31 2023-05-30 SheetCopilot: Bringing Software Productivity to the Next Level through
Large Language Models
link Hongxin Li, Jingran Su,..., Zhaoxiang Zhang
30 2023-05-30 Real-World Image Variation by Aligning Diffusion Inversion Chain link Yuechen ZHANG, Jinbo Xing,..., Jiaya Jia
30 None IBA: Towards Irreversible Backdoor Attacks in Federated Learning link Dung Thuy Nguyen, Tuan Minh Nguyen,..., KOK SENG WONG
30 2023-10-27 Optimal Transport for Treatment Effect Estimation link Hao Wang, Jiajun Fan,..., Ruiming Tang
30 2023-02-07 Concept Algebra for (Score-Based) Text-Controlled Generative Models link Zihao Wang, Lin Gui,..., Victor Veitch
30 2023-06-21 Mass-Producing Failures of Multimodal Systems with Language Models link Shengbang Tong, Erik Jones, Jacob Steinhardt
30 2023-11-14 The Transient Nature of Emergent In-Context Learning in Transformers link Aaditya K Singh, Stephanie C.Y. Chan,..., Felix Hill
30 2023-10-29 Does Invariant Graph Learning via Environment Augmentation Learn Invariance? link Yongqiang Chen, Yatao Bian,..., James Cheng
30 2022-10-04 ASIF: Coupled Data Turns Unimodal Models to Multimodal without
Training
link Antonio Norelli, Marco Fumero,..., Francesco Locatello
30 2022-04-04 Training Fully Connected Neural Networks is $\exists\mathbb{R}$-Complete link Daniel Bertschinger, Christoph Hertrich,..., Simon Weber
30 2023-05-26 Contrast, Attend and Diffuse to Decode High-Resolution Images from
Brain Activities
link Jingyuan Sun, Mingxiao Li,..., Marie-Francine Moens
30 2023-02-08 Sample-efficient Multi-objective Molecular Optimization with GFlowNets link Yiheng Zhu, Jialu Wu,..., Jian Wu
30 2023-05-26 The Curious Price of Distributional Robustness in Reinforcement Learning
with a Generative Model
link Laixi Shi, Gen Li,..., Yuejie Chi
30 2023-10-14 STORM: Efficient Stochastic Transformer based World Models for Reinforcement
Learning
link Weipu Zhang, Gang Wang,..., Gao Huang
30 2023-05-15 Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with
Foundation Models
link Zhimin Chen, Longlong Jing,..., Bing Li
30 2023-04-10 H2RBox-v2: Incorporating Symmetry for Boosting Horizontal Box Supervised Oriented
Object Detection
link Yi Yu, Xue Yang,..., Junchi Yan
29 2023-06-02 Convex and Non-convex Optimization Under Generalized Smoothness link Haochuan Li, Jian Qian,..., Ali Jadbabaie
29 None DeWave: Discrete Encoding of EEG Waves for EEG to
Text Translation
link Yiqun Duan, Charles Zhou,..., Chin-teng Lin
29 2023-05-31 Not All Neuro-Symbolic Concepts Are Created Equal: Analysis and
Mitigation of Reasoning Shortcuts
link Emanuele Marconato, Stefano Teso,..., Andrea Passerini
29 2024-01-17 POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images link Antonín Vobecký, Oriane Siméoni,..., Josef Sivic
29 2023-06-12 Transformers learn through gradual rank increase link Enric Boix-Adserà, Etai Littwin,..., Joshua M. Susskind
29 2023-03-13 Transformer-based Planning for Symbolic Regression link Parshin Shojaee, Kazem Meidani,..., Chandan K. Reddy
29 2023-10-02 Disentangling Voice and Content with Self-Supervision for Speaker Recognition link Tianchi Liu, Kong Aik Lee,..., Haizhou Li
29 2023-06-06 The Emergence of Essential Sparsity in Large Pre-trained Models:
The Weights that Matter
link AJAY KUMAR JAISWAL, Shiwei Liu,..., Zhangyang Wang
29 None Conformal Prediction for Uncertainty-Aware Planning with Diffusion Dynamics Model link Jiankai Sun, Yiqi Jiang,..., Mac Schwager
29 2023-02-08 Taming Local Effects in Graph-based Spatiotemporal Forecasting link Andrea Cini, Ivan Marisca,..., Cesare Alippi
29 2022-11-05 Unleashing the Power of Graph Data Augmentation on Covariate
Distribution Shift
link Yongduo Sui, Qitian Wu,..., Xiangnan He
29 2023-06-02 Demystifying Structural Disparity in Graph Neural Networks: Can One
Size Fit All?
link Haitao Mao, Zhikai Chen,..., Jiliang Tang
29 2023-06-19 Beyond Normal: On the Evaluation of Mutual Information Estimators link Paweł Czyż, Frederic Grabowski,..., Alexander Marx
29 2023-06-18 Score-based Data Assimilation link François Rozet, Gilles Louppe
29 2023-06-01 DiffPack: A Torsional Diffusion Model for Autoregressive Protein Side-Chain
Packing
link Yangtian Zhang, Zuobai Zhang,..., Jian Tang
29 2024-01-04 Improving Diffusion-Based Image Synthesis with Context Prediction link Ling Yang, Jingwei Liu,..., Bin CUI
29 2023-06-08 Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via
Temporal Alignment
link Zihui Xue, Kristen Grauman
29 2024-02-12 A Closer Look at the Robustness of Contrastive Language-Image
Pre-Training (CLIP)
link Weijie Tu, Weijian Deng, Tom Gedeon
28 2023-09-04 Memory Efficient Optimizers with 4-bit States link Bingrui Li, Jianfei Chen, Jun Zhu
28 2023-01-26 Break It Down: Evidence for Structural Compositionality in
Neural Networks
link Michael A. Lepori, Thomas Serre, Ellie Pavlick
28 None QuantSR: Accurate Low-bit Quantization for Efficient Image Super-Resolution link Haotong Qin, Yulun Zhang,..., Fisher Yu
28 2023-06-12 Operator Learning with Neural Fields: Tackling PDEs on General
Geometries
link Louis Serrano, Lise Le Boudec,..., patrick gallinari
28 2023-05-31 A Unified Framework for U-Net Design and Analysis link Christopher Williams, Fabian Falck,..., Saifuddin Syed
28 None Not All Out-of-Distribution Data Are Harmful to Open-Set Active
Learning
link Yang Yang, Yuxuan Zhang,..., Yi Xu
28 2023-04-07 A new perspective on building efficient and expressive 3D
equivariant graph neural networks
link weitao Du, Yuanqi Du,..., Zhi-Ming Ma
28 2023-11-03 On the Generalization Properties of Diffusion Models link Puheng Li, Zhong Li,..., Jiang Bian
28 2020-10-13 Unified Lower Bounds for Interactive High-dimensional Estimation under Information
Constraints
link Jayadev Acharya, Clement Louis Canonne,..., Himanshu Tyagi
28 2023-05-16 Double Pessimism is Provably Efficient for Distributionally Robust Offline
Reinforcement Learning: Generic Algorithm and Robust Partial Coverage
link Jose Blanchet, Miao Lu,..., Han Zhong
28 2023-04-10 Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition link Shuhuai Ren, Aston Zhang,..., Xu Sun
28 2023-05-30 Complex Query Answering on Eventuality Knowledge Graph with Implicit
Logical Constraints
link Jiaxin Bai, Xin Liu,..., Yangqiu Song
28 2023-10-17 DELIFFAS: Deformable Light Fields for Fast Avatar Synthesis link YoungJoong Kwon, Lingjie Liu,..., Christian Theobalt
28 2023-09-14 Where2Explore: Few-shot Affordance Learning for Unseen Novel Categories of
Articulated Objects
link Chuanruo Ning, Ruihai Wu,..., Hao Dong
28 None StyleDrop: Text-to-Image Synthesis of Any Style link Kihyuk Sohn, Lu Jiang,..., Daniel Castro Chin
28 2023-05-22 Neural Functional Transformers link Allan Zhou, Kaien Yang,..., Chelsea Finn
28 None Q-DM: An Efficient Low-bit Quantized Diffusion Model link Yanjing Li, Sheng Xu,..., Baochang Zhang
28 2023-07-04 On the Constrained Time-Series Generation Problem link Andrea Coletta, Sriram Gopalakrishnan,..., Svitlana Vyetrenko
28 2023-10-23 Data Pruning via Moving-one-Sample-out link Haoru Tan, Sitong Wu,..., XIAOJUAN QI
28 2023-11-03 Learning to Augment Distributions for Out-of-distribution Detection link Qizhou Wang, Zhen Fang,..., Bo Han
28 2023-05-30 Multi-modal Queried Object Detection in the Wild link Yifan Xu, Mengdan Zhang,..., Changsheng Xu
28 2023-11-28 No Representation Rules Them All in Category Discovery link Sagar Vaze, Andrea Vedaldi, Andrew Zisserman
28 2023-02-23 Quantifying & Modeling Multimodal Interactions: An Information Decomposition Framework link Paul Pu Liang, Yun Cheng,..., Louis-Philippe Morency
27 2023-06-02 Towards In-context Scene Understanding link Ivana Balazevic, David Steiner,..., Olivier J Henaff
27 2023-05-20 A Scalable Neural Network for DSIC Affine Maximizer Auction
Design
link Zhijian Duan, Haoran Sun,..., Xiaotie Deng
27 2023-06-09 Topology-Aware Uncertainty for Image Segmentation link Saumya Gupta, Yikai Zhang,..., Chao Chen
27 2023-05-17 Explain Any Concept: Segment Anything Meets Concept-Based Explanation link Ao Sun, Pingchuan Ma,..., Shuai Wang
27 2023-10-17 Towards Generic Semi-Supervised Framework for Volumetric Medical Image Segmentation link Haonan Wang, Xiaomeng Li
27 2023-06-12 TrojLLM: A Black-box Trojan Prompt Attack on Large Language
Models
link Jiaqi Xue, Mengxin Zheng,..., Qian Lou
27 2023-09-22 OneNet: Enhancing Time Series Forecasting Models under Concept Drift
by Online Ensembling
link YiFan Zhang, Qingsong Wen,..., Tieniu Tan
27 2023-05-31 FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses
via Pixel-Aligned Scene Flow
link Cameron Omid Smith, Yilun Du,..., Vincent Sitzmann
27 2023-12-11 TabMT: Generating tabular data with masked transformers link Manbir S Gulati, Paul F Roysdon
27 2023-09-23 Defending Pre-trained Language Models as Few-shot Learners against Backdoor
Attacks
link Zhaohan Xi, Tianyu Du,..., Ting Wang
27 2023-06-01 Lightweight Vision Transformer with Bidirectional Interaction link Qihang Fan, Huaibo Huang,..., Ran He
27 2023-04-04 The expressive power of pooling in Graph Neural Networks link Filippo Maria Bianchi, Veronica Lachi
27 2023-05-29 LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and
Unlabeled Image Collections
link Muhammad Jehanzeb Mirza, Leonid Karlinsky,..., Horst Bischof
26 2023-04-19 Bridging RL Theory and Practice with the Effective Horizon link Cassidy Laidlaw, Stuart Russell, Anca Dragan
26 2023-04-06 Dynamics of Finite Width Kernel and Prediction Fluctuations in
Mean Field Neural Networks
link Blake Bordelon, Cengiz Pehlevan
26 2023-10-29 Label Poisoning is All You Need link Rishi Dev Jha, Jonathan Hayase, Sewoong Oh
26 2023-10-30 MoCa: Measuring Human-Language Model Alignment on Causal and Moral
Judgment Tasks
link Allen Nie, Yuhui Zhang,..., Tobias Gerstenberg
26 2023-03-02 SHAP-IQ: Unified Approximation of any-order Shapley Interactions link Fabian Fumagalli, Maximilian Muschalik,..., Barbara Eva Hammer
26 2023-01-09 BQ-NCO: Bisimulation Quotienting for Efficient Neural Combinatorial Optimization link Darko Drakulic, Sofia Michel,..., Jean-Marc Andreoli
26 2023-06-08 RDumb: A simple approach that questions our progress in
continual test-time adaptation
link Ori Press, Steffen Schneider,..., Matthias Bethge
26 2023-06-30 The Shaped Transformer: Attention Models in the Infinite Depth-and-Width
Limit
link Lorenzo Noci, Chuning Li,..., Daniel M. Roy
26 2023-06-01 Make Pre-trained Model Reversible: From Parameter to Memory Efficient
Fine-Tuning
link Baohao Liao, Shaomu Tan, Christof Monz
26 2023-07-17 Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature
Connectivity
link Zhanpeng Zhou, Yongyi Yang,..., Wei Hu
26 2023-10-31 BasisFormer: Attention-based Time Series Forecasting with Learnable and Interpretable
Basis
link Zelin Ni, Hang Yu,..., Weiyao Lin
26 2023-11-13 A Data-Free Approach to Mitigate Catastrophic Forgetting in Federated
Class Incremental Learning for Vision Tasks
link Sara Babakniya, Zalan Fabian,..., Salman Avestimehr
26 2023-11-05 Scenario Diffusion: Controllable Driving Scenario Generation With Diffusion link Ethan Pronovost, Meghana Reddy Ganesina,..., Nicholas Roy
26 2023-06-06 FAMO: Fast Adaptive Multitask Optimization link Bo Liu, Yihao Feng,..., qiang liu
26 2023-05-31 Representation Equivalent Neural Operators: a Framework for Alias-free Operator
Learning
link Francesca Bartolucci, Emmanuel de Bezenac,..., Rima Alaifari
26 2023-05-20 Brain encoding models based on multimodal transformers can transfer
across language and vision
link Jerry Tang, Meng Du,..., Alexander Huth
26 2023-11-03 Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection link Haibao Yu, Yingjuan Tang,..., Zaiqing Nie
26 None Removing Hidden Confounding in Recommendation: A Unified Multi-Task Learning
Approach
link Haoxuan Li, Kunhan Wu,..., Peng Wu
26 2023-06-07 Improving neural network representations using human similarity judgments link Lukas Muttenthaler, Lorenz Linhardt,..., Simon Kornblith
26 2023-07-24 Described Object Detection: Liberating Object Detection with Flexible Expressions link Chi Xie, Zhao Zhang,..., Shuang Liang
26 2023-11-02 Act As You Wish: Fine-Grained Control of Motion Diffusion
Model with Hierarchical Semantic Graphs
link Peng Jin, Yang Wu,..., Li Yuan
26 2023-10-31 HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection
in Point Clouds
link Gang Zhang, Chen Junnan,..., Xiaolin Hu
26 2023-12-12 One-Step Diffusion Distillation via Deep Equilibrium Models link Zhengyang Geng, Ashwini Pokle, J Zico Kolter
26 2023-05-31 Spontaneous symmetry breaking in generative diffusion models link Gabriel Raya, Luca Ambrogioni
26 2023-09-19 PGDiff: Guiding Diffusion Models for Versatile Face Restoration via
Partial Guidance
link Peiqing Yang, Shangchen Zhou,..., Chen Change Loy
25 2023-07-20 Sharpness Minimization Algorithms Do Not Only Minimize Sharpness To
Achieve Better Generalization
link Kaiyue Wen, Zhiyuan Li, Tengyu Ma
25 2023-06-01 Thought Cloning: Learning to Think while Acting by Imitating
Human Thinking
link Shengran Hu, Jeff Clune
25 2023-12-03 Honesty Is the Best Policy: Defining and Mitigating AI
Deception
link Francis Rhys Ward, Francesca Toni,..., Tom Everitt
25 2023-10-07 Subspace Identification for Multi-Source Domain Adaptation link Zijian Li, Ruichu Cai,..., Kun Zhang
25 2023-01-27 Alignment with human representations supports robust few-shot learning link Ilia Sucholutsky, Thomas L. Griffiths
25 2023-12-07 Differentiable Registration of Images and LiDAR Point Clouds with
VoxelPoint-to-Pixel Matching
link Junsheng Zhou, Baorui Ma,..., Zhizhong Han
25 2023-12-06 Language Model Alignment with Elastic Reset link Michael Noukhovitch, Samuel Lavoie,..., Aaron Courville
25 2022-10-07 Winner Takes It All: Training Performant RL Populations for
Combinatorial Optimization
link Nathan Grinsztajn, Daniel Furelos-Blanco,..., Thomas D Barrett
25 None Learning Topology-Agnostic EEG Representations with Geometry-Aware Modeling link Ke Yi, Yansen Wang,..., Dongsheng Li
25 2023-10-28 This Looks Like Those: Illuminating Prototypical Concepts Using Multiple
Visualizations
link Chiyu Ma, Brandon Zhao,..., Cynthia Rudin
25 2023-06-07 Stochastic Collapse: How Gradient Noise Attracts SGD Dynamics Towards
Simpler Subnetworks
link Feng Chen, Daniel Kunin,..., Surya Ganguli
25 2023-09-22 Neural Data Transformer 2: Multi-context Pretraining for Neural Spiking
Activity
link Joel Ye, Jennifer L Collinger,..., Robert Gaunt
25 2023-04-04 Privacy Amplification via Compression: Achieving the Optimal Privacy-Accuracy-Communication Trade-off
in Distributed Mean Estimation
link Wei-Ning Chen, Dan Song,..., Peter Kairouz
25 2023-10-19 Real-Time Motion Prediction via Heterogeneous Polyline Transformer with Relative
Pose Encoding
link Zhejun Zhang, Alexander Liniger,..., Luc Van Gool
25 2022-12-15 Joint processing of linguistic properties in brains and language
models
link SUBBA REDDY OOTA, Manish Gupta, Mariya Toneva
25 2023-05-15 A Theoretical Analysis of Optimistic Proximal Policy Optimization in
Linear Markov Decision Processes
link Han Zhong, Tong Zhang
25 None Achieving Cross Modal Generalization with Multimodal Unified Representation link Yan Xia, Hai Huang,..., Zhou Zhao
25 2023-04-06 Graph Mixture of Experts: Learning on Large-Scale Graphs with
Explicit Diversity Modeling
link Haotao Wang, Ziyu Jiang,..., Zhangyang Wang
25 2023-02-21 Adversarial Model for Offline Reinforcement Learning link Mohak Bhardwaj, Tengyang Xie,..., Ching-An Cheng
25 2023-06-02 DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model link Xiuye Gu, Yin Cui,..., David A Ross
25 2023-10-21 Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation link Jianing Zhu, Geng Yu,..., Bo Han
25 None MonoUNI: A Unified Vehicle and Infrastructure-side Monocular 3D Object
Detection Network with Sufficient Depth Clues
link Jinrang Jia, Zhenjia Li, Yifeng Shi
25 2023-05-29 Aligning Optimization Trajectories with Diffusion Models for Constrained Design
Generation
link Giorgio Giannone, Akash Srivastava,..., Faez Ahmed
24 2023-05-18 Paxion: Patching Action Knowledge in Video-Language Foundation Models link Zhenhailong Wang, Ansel Blume,..., Heng Ji
24 2023-10-02 Equivariant Adaptation of Large Pretrained Models link Arnab Kumar Mondal, Siba Smarak Panigrahi,..., Siamak Ravanbakhsh
24 2023-06-21 Joint Prompt Optimization of Stacked LLMs using Variational Inference link Alessandro Sordoni, Xingdi Yuan,..., Nicolas Le Roux
24 2023-12-22 Energy-based learning algorithms for analog computing: a comparative study link Benjamin Scellier, Maxence Ernoult,..., Suhas Kumar
24 2023-01-30 Direct Preference-based Policy Optimization without Reward Modeling link Gaon An, Junhyeok Lee,..., Hyun Oh Song
24 2023-06-19 PLASTIC: Improving Input and Label Plasticity for Sample Efficient
Reinforcement Learning
link Hojoon Lee, Hanseul Cho,..., Chulhee Yun