3952 |
2023-04-17 |
Visual Instruction Tuning |
link |
Haotian Liu, Chunyuan Li,..., Yong Jae Lee |
2993 |
2023-05-29 |
Direct Preference Optimization: Your Language Model is Secretly a Reward Model |
link |
Rafael Rafailov, Archit Sharma,..., Chelsea Finn |
2122 |
2023-05-23 |
QLoRA: Efficient Finetuning of Quantized LLMs |
link |
Tim Dettmers, Artidoro Pagnoni,..., Luke Zettlemoyer |
1808 |
2023-05-11 |
InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning |
link |
Wenliang Dai, Junnan Li,..., Steven Hoi |
1564 |
2023-05-17 |
Tree of Thoughts: Deliberate Problem Solving with Large Language Models |
link |
Shunyu Yao, Dian Yu,..., Karthik R Narasimhan |
1493 |
2023-02-09 |
Toolformer: Language Models Can Teach Themselves to Use Tools |
link |
Timo Schick, Jane Dwivedi-Yu,..., Thomas Scialom |
1306 |
2023-03-30 |
Self-Refine: Iterative Refinement with Self-Feedback |
link |
Aman Madaan, Niket Tandon,..., Peter Clark |
986 |
2023-03-20 |
Reflexion: language agents with verbal reinforcement learning |
link |
Noah Shinn, Federico Cassano,..., Shunyu Yao |
789 |
None |
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face |
link |
Yongliang Shen, Kaitao Song,..., Yueting Zhuang |
775 |
2023-07-05 |
Jailbroken: How Does LLM Safety Training Fail? |
link |
Alexander Wei, Nika Haghtalab, Jacob Steinhardt |
764 |
2023-05-25 |
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation |
link |
Zhengyi Wang, Cheng Lu,..., Jun Zhu |
724 |
2023-05-18 |
LIMA: Less Is More for Alignment |
link |
Chunting Zhou, Pengfei Liu,..., Omer Levy |
691 |
2023-05-02 |
Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation |
link |
Jiawei Liu, Chunqiu Steven Xia,..., Lingming Zhang |
512 |
2023-02-27 |
Language Is Not All You Need: Aligning Perception with Language Models |
link |
Shaohan Huang, Li Dong,..., Furu Wei |
507 |
2023-05-22 |
AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback |
link |
Yann Dubois, Xuechen Li,..., Tatsunori Hashimoto |
447 |
2023-06-06 |
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model |
link |
Kenneth Li, Oam Patel,..., Martin Wattenberg |
429 |
2023-05-18 |
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks |
link |
Wenhai Wang, Zhe Chen,..., Jifeng Dai |
425 |
2023-04-13 |
Segment Everything Everywhere All at Once |
link |
Xueyan Zou, Jianwei Yang,..., Yong Jae Lee |
410 |
2023-06-29 |
One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization |
link |
Minghua Liu, Chao Xu,..., Hao Su |
369 |
None |
Are Emergent Abilities of Large Language Models a Mirage? |
link |
Rylan Schaeffer, Brando Miranda, Sanmi Koyejo |
346 |
2023-05-07 |
Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting |
link |
Miles Turpin, Julian Michael,..., Samuel R. Bowman |
345 |
2023-03-31 |
CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society |
link |
Guohao Li, Hasan Abed Al Kader Hammoud,..., Bernard Ghanem |
332 |
2023-02-23 |
One Fits All: Power General Time Series Analysis by Pretrained LM |
link |
Tian Zhou, Peisong Niu,..., Rong Jin |
332 |
2023-05-19 |
LLM-Pruner: On the Structural Pruning of Large Language Models |
link |
Xinyin Ma, Gongfan Fang, Xinchao Wang |
328 |
2023-04-11 |
RRHF: Rank Responses to Align Language Models with Human Feedback |
link |
Hongyi Yuan, Zheng Yuan,..., Fei Huang |
325 |
2023-05-02 |
Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation |
link |
Yuval Kirstain, Adam Polyak,..., Omer Levy |
323 |
2023-06-08 |
Simple and Controllable Music Generation |
link |
Jade Copet, Felix Kreuk,..., Alexandre Défossez |
322 |
2023-02-13 |
Symbolic Discovery of Optimization Algorithms |
link |
Xiangning Chen, Chen Liang,..., Quoc V Le |
317 |
2023-03-30 |
Language Models can Solve Computer Tasks |
link |
Geunwoo Kim, Pierre Baldi, Stephen Marcus McAleer |
313 |
2023-05-29 |
Faith and Fate: Limits of Transformers on Compositionality |
link |
Nouha Dziri, Ximing Lu,..., Yejin Choi |
304 |
2023-06-03 |
VideoComposer: Compositional Video Synthesis with Motion Controllability |
link |
Xiang Wang, Hangjie Yuan,..., Jingren Zhou |
296 |
2023-05-04 |
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision |
link |
Zhiqing Sun, Yikang Shen,..., Chuang Gan |
284 |
2023-04-19 |
Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models |
link |
Pan Lu, Baolin Peng,..., Jianfeng Gao |
284 |
2023-06-02 |
Segment Anything in High Quality |
link |
Lei Ke, Mingqiao Ye,..., Fisher Yu |
282 |
2023-06-02 |
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training |
link |
Zeqiu Wu, Yushi Hu,..., Hannaneh Hajishirzi |
282 |
2023-10-11 |
Large Language Models Are Zero-Shot Time Series Forecasters |
link |
Nate Gruver, Marc Anton Finzi,..., Andrew Gordon Wilson |
280 |
2023-04-21 |
DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction |
link |
Mohammadreza Pourreza, Davood Rafiei |
280 |
2023-05-24 |
BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing |
link |
Dongxu Li, Junnan Li, Steven Hoi |
274 |
2023-04-12 |
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation |
link |
Jiazheng Xu, Xiao Liu,..., Yuxiao Dong |
268 |
2023-03-23 |
Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense |
link |
Kalpesh Krishna, Yixiao Song,..., Mohit Iyyer |
266 |
2023-06-11 |
High-Fidelity Audio Compression with Improved RVQGAN |
link |
Rithesh Kumar, Prem Seetharaman,..., Kundan Kumar |
258 |
2023-04-28 |
Towards Automated Circuit Discovery for Mechanistic Interpretability |
link |
Arthur Conmy, Augustine N. Mavor-Parker,..., Adrià Garriga-Alonso |
250 |
2022-08-19 |
Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise |
link |
Arpit Bansal, Eitan Borgnia,..., Tom Goldstein |
248 |
2023-06-26 |
MotionGPT: Human Motion as a Foreign Language |
link |
Biao Jiang, Xin Chen,..., Tao Chen |
244 |
2023-06-23 |
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale |
link |
Matthew Le, Apoorv Vyas,..., Wei-Ning Hsu |
230 |
2023-06-06 |
Emergent Correspondence from Image Diffusion |
link |
Luming Tang, Menglin Jia,..., Bharath Hariharan |
228 |
2023-02-07 |
Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery |
link |
Yuxin Wen, Neel Jain,..., Tom Goldstein |
225 |
2023-06-01 |
Diffusion Self-Guidance for Controllable Image Generation |
link |
Dave Epstein, Allan Jabri,..., Aleksander Holynski |
224 |
None |
3D-LLM: Injecting the 3D World into Large Language Models |
link |
Yining Hong, Haoyu Zhen,..., Chuang Gan |
224 |
2023-06-02 |
TIES-Merging: Resolving Interference When Merging Models |
link |
Prateek Yadav, Derek Tam,..., Mohit Bansal |
220 |
2023-06-24 |
H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models |
link |
Zhenyu Zhang, Ying Sheng,..., Beidi Chen |
219 |
2023-05-26 |
Generating Images with Multimodal Language Models |
link |
Jing Yu Koh, Daniel Fried, Ruslan Salakhutdinov |
217 |
2023-05-25 |
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models |
link |
Shihao Zhao, Dongdong Chen,..., Kwan-Yee K. Wong |
214 |
2023-06-26 |
Are aligned neural networks adversarially aligned? |
link |
Nicholas Carlini, Milad Nasr,..., Ludwig Schmidt |
203 |
2023-05-25 |
On the Planning Abilities of Large Language Models - A Critical Investigation |
link |
Karthik Valmeekam, Matthew Marquez,..., Subbarao Kambhampati |
201 |
2023-05-24 |
EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought |
link |
Yao Mu, Qinglong Zhang,..., Ping Luo |
200 |
2023-06-27 |
HyenaDNA: Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution |
link |
Eric Nguyen, Michael Poli,..., Stephen Baccus |
199 |
2023-05-24 |
Towards Revealing the Mystery behind Chain of Thought: A Theoretical Perspective |
link |
Guhao Feng, Bohang Zhang,..., Liwei Wang |
197 |
2023-05-30 |
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction |
link |
Rui Yang, Lin Song,..., Ying Shan |
188 |
2023-05-26 |
Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time |
link |
Zichang Liu, Aditya Desai,..., Anshumali Shrivastava |
188 |
2023-02-09 |
UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models |
link |
Wenliang Zhao, Lujia Bai,..., Jiwen Lu |
181 |
2023-07-23 |
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting |
link |
Zongsheng Yue, Jianyi Wang, Chen Change Loy |
177 |
2023-04-01 |
Subject-driven Text-to-Image Generation via Apprenticeship Learning |
link |
Wenhu Chen, Hexiang Hu,..., William W. Cohen |
176 |
2023-05-25 |
Scaling Data-Constrained Language Models |
link |
Niklas Muennighoff, Alexander M Rush,..., Colin Raffel |
171 |
2023-05-23 |
Large Language Models as Commonsense Knowledge for Large-Scale Task Planning |
link |
Zirui Zhao, Wee Sun Lee, David Hsu |
167 |
2023-07-25 |
QuIP: 2-Bit Quantization of Large Language Models With Guarantees |
link |
Jerry Chee, Yaohui Cai,..., Christopher De Sa |
167 |
2023-09-20 |
Gold-YOLO: Efficient Object Detector via Gather-and-Distribute Mechanism |
link |
Chengcheng Wang, Wei He,..., Kai Han |
164 |
2023-05-27 |
Fine-Tuning Language Models with Just Forward Passes |
link |
Sadhika Malladi, Tianyu Gao,..., Sanjeev Arora |
164 |
2023-05-31 |
The Impact of Positional Encoding on Length Generalization in Transformers |
link |
Amirhossein Kazemnejad, Inkit Padhi,..., Siva Reddy |
162 |
2023-05-17 |
Can Language Models Solve Graph Problems in Natural Language? |
link |
Heng Wang, Shangbin Feng,..., Yulia Tsvetkov |
160 |
2023-06-16 |
Scaling Open-Vocabulary Object Detection |
link |
Matthias Minderer, Alexey A. Gritsenko, Neil Houlsby |
160 |
2023-05-19 |
Any-to-Any Generation via Composable Diffusion |
link |
Zineng Tang, Ziyi Yang,..., Mohit Bansal |
160 |
2023-03-31 |
Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence? |
link |
Arjun Majumdar, Karmesh Yadav,..., Franziska Meier |
159 |
2023-05-19 |
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings |
link |
Shibo Hao, Tianyang Liu,..., Zhiting Hu |
159 |
2023-01-10 |
Does Localization Inform Editing? Surprising Differences in Causality-Based Localization vs. Knowledge Editing in Language Models |
link |
Peter Hase, Mohit Bansal,..., Asma Ghandeharioun |
158 |
2023-05-17 |
DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining |
link |
Sang Michael Xie, Hieu Pham,..., Adams Wei Yu |
158 |
2023-05-29 |
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models |
link |
Yuchao Gu, Xintao Wang,..., Mike Zheng Shou |
157 |
2023-05-24 |
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence |
link |
Junyi Zhang, Charles Herrmann,..., Ming-Hsuan Yang |
157 |
2023-02-06 |
Data Selection for Language Models via Importance Resampling |
link |
Sang Michael Xie, Shibani Santurkar,..., Percy Liang |
156 |
2023-05-26 |
On Evaluating Adversarial Robustness of Large Vision-Language Models |
link |
Yunqing Zhao, Tianyu Pang,..., Min Lin |
153 |
2023-06-07 |
Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection |
link |
Yu Bai, Fan Chen,..., Song Mei |
150 |
2023-06-23 |
OpenMask3D: Open-Vocabulary 3D Instance Segmentation |
link |
Ayça Takmaz, Elisabetta Fedele,..., Francis Engelmann |
146 |
2023-05-19 |
Pengi: An Audio Language Model for Audio Tasks |
link |
Soham Deshmukh, Benjamin Elizalde,..., Huaming Wang |
144 |
2023-05-24 |
Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning |
link |
Lin Guan, Karthik Valmeekam,..., Subbarao Kambhampati |
143 |
2023-07-03 |
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion |
link |
Shitao Tang, Fuyang Zhang,..., Yasutaka Furukawa |
143 |
2023-05-24 |
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models |
link |
Weixi Feng, Wanrong Zhu,..., William Yang Wang |
141 |
2023-05-31 |
Improving CLIP Training with Language Rewrites |
link |
Lijie Fan, Dilip Krishnan,..., Yonglong Tian |
140 |
2023-05-24 |
In-Context Impersonation Reveals Large Language Models' Strengths and Biases |
link |
Leonard Salewski, Stephan Alaniz,..., Zeynep Akata |
140 |
2023-06-01 |
SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds |
link |
Yanyu Li, Huan Wang,..., Jian Ren |
139 |
2023-06-01 |
Transformers learn to implement preconditioned gradient descent for in-context learning |
link |
Kwangjun Ahn, Xiang Cheng,..., Suvrit Sra |
134 |
2023-06-01 |
StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners |
link |
Yonglong Tian, Lijie Fan,..., Dilip Krishnan |
133 |
2022-11-20 |
Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors |
link |
Thomas Hartvigsen, Swami Sankaranarayanan,..., Marzyeh Ghassemi |
132 |
2023-06-12 |
Controlling Text-to-Image Diffusion by Orthogonal Finetuning |
link |
Zeju Qiu, Weiyang Liu,..., Bernhard Schölkopf |
131 |
2022-12-19 |
Optimizing Prompts for Text-to-Image Generation |
link |
Yaru Hao, Zewen Chi,..., Furu Wei |
127 |
2023-05-25 |
DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models |
link |
Ying Fan, Olivia Watkins,..., Kimin Lee |
126 |
2021-12-24 |
Counterfactual Memorization in Neural Language Models |
link |
Chiyuan Zhang, Daphne Ippolito,..., Nicholas Carlini |
125 |
2023-07-06 |
Focused Transformer: Contrastive Training for Context Scaling |
link |
Szymon Tworkowski, Konrad Staniszewski,..., Piotr Miłoś |
120 |
2023-05-27 |
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks |
link |
Bill Yuchen Lin, Yicheng Fu,..., Xiang Ren |
120 |
2023-05-29 |
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths |
link |
Zeyue Xue, Guanglu Song,..., Ping Luo |
118 |
2023-05-11 |
Self-Chained Image-Language Model for Video Localization and Question Answering |
link |
Shoubin Yu, Jaemin Cho,..., Mohit Bansal |
118 |
2023-05-31 |
Understanding and Mitigating Copying in Diffusion Models |
link |
Gowthami Somepalli, Vasu Singla,..., Tom Goldstein |
118 |
2023-05-02 |
Unlimiformer: Long-Range Transformers with Unlimited Length Input |
link |
Amanda Bertsch, Uri Alon,..., Matthew R. Gormley |
118 |
2023-06-07 |
Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards |
link |
Alexandre Rame, Guillaume Couairon,..., Matthieu Cord |
117 |
2023-02-20 |
Towards Unbounded Machine Unlearning |
link |
Meghdad Kurmanji, Peter Triantafillou,..., Eleni Triantafillou |
117 |
2023-05-01 |
Self-Evaluation Guided Beam Search for Reasoning |
link |
Yuxi Xie, Kenji Kawaguchi,..., Qizhe Xie |
116 |
2022-09-14 |
Lossy Image Compression with Conditional Diffusion Models |
link |
Ruihan Yang, Stephan Mandt |
115 |
2023-05-23 |
Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence |
link |
Grace Luo, Lisa Dunlap,..., Trevor Darrell |
114 |
2023-03-01 |
Understanding Diffusion Objectives as the ELBO with Simple Data Augmentation |
link |
Diederik P Kingma, Ruiqi Gao |
114 |
2023-05-26 |
AdaPlanner: Adaptive Planning from Feedback with Language Models |
link |
Haotian Sun, Yuchen Zhuang,..., Chao Zhang |
113 |
2023-05-18 |
Structural Pruning for Diffusion Models |
link |
Gongfan Fang, Xinyin Ma, Xinchao Wang |
113 |
2023-06-06 |
Deductive Verification of Chain-of-Thought Reasoning |
link |
Zhan Ling, Yunhao Fang,..., Hao Su |
111 |
2023-04-21 |
Emergent and Predictable Memorization in Large Language Models |
link |
Stella Biderman, USVSN Sai Prashanth,..., Edward Raff |
110 |
2023-09-01 |
Geometry-Informed Neural Operator for Large-Scale 3D PDEs |
link |
Zongyi Li, Nikola Borislavov Kovachki,..., Anima Anandkumar |
110 |
2023-11-10 |
Frequency-domain MLPs are More Effective Learners in Time Series Forecasting |
link |
Kun Yi, Qi Zhang,..., Zhendong Niu |
110 |
2023-04-30 |
How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model |
link |
Michael Hanna, Ollie Liu, Alexandre Variengien |
110 |
None |
Segment Anything in 3D with NeRFs |
link |
Jiazhong Cen, Zanwei Zhou,..., Qi Tian |
109 |
2023-05-18 |
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild |
link |
Can Qin, Shu Zhang,..., Ran Xu |
108 |
2023-02-16 |
DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization |
link |
Zhiqing Sun, Yiming Yang |
107 |
2023-05-18 |
OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding |
link |
Minghua Liu, Ruoxi Shi,..., Hao Su |
106 |
2023-02-02 |
SceneScape: Text-Driven Consistent Scene Generation |
link |
Rafail Fridman, Amit Abecasis,..., Tali Dekel |
105 |
2022-08-08 |
Deep Patch Visual Odometry |
link |
Zachary Teed, Lahav Lipson, Jia Deng |
104 |
2023-07-26 |
Evaluating the Moral Beliefs Encoded in LLMs |
link |
Nino Scherrer, Claudia Shi,..., David Blei |
103 |
2023-07-04 |
Spike-driven Transformer |
link |
Man Yao, JiaKui Hu,..., Guoqi Li |
101 |
2023-05-29 |
Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors |
link |
Paul Steven Scotti, Atmadeep Banerjee,..., Tanishq Mathew Abraham |
101 |
2023-06-26 |
Composing Parameter-Efficient Modules with Arithmetic Operation |
link |
Jinghan Zhang, Shiqi Chen,..., Junxian He |
101 |
2023-01-31 |
What Makes Good Examples for Visual In-Context Learning? |
link |
Yuanhan Zhang, Kaiyang Zhou, Ziwei Liu |
100 |
2023-04-11 |
Model Sparsity Can Simplify Machine Unlearning |
link |
Jinghan Jia, Jiancheng Liu,..., Sijia Liu |
100 |
2023-03-09 |
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning |
link |
Mitsuhiko Nakamoto, Yuexiang Zhai,..., Sergey Levine |
98 |
2023-06-06 |
LEACE: Perfect linear concept erasure in closed form |
link |
Nora Belrose, David Schneider-Joseph,..., Stella Biderman |
97 |
2023-03-27 |
Text-to-Image Diffusion Models are Zero Shot Classifiers |
link |
Kevin Clark, Priyank Jaini |
97 |
2023-05-18 |
TextDiffuser: Diffusion Models as Text Painters |
link |
Jingye Chen, Yupan Huang,..., Furu Wei |
96 |
2023-05-22 |
Task Arithmetic in the Tangent Space: Improved Editing of Pre-Trained Models |
link |
Guillermo Ortiz-Jimenez, Alessandro Favero, Pascal Frossard |
95 |
2023-05-17 |
Selective Amnesia: A Continual Learning Approach to Forgetting in Deep Generative Models |
link |
Alvin Heng, Harold Soh |
95 |
2023-05-29 |
Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models |
link |
Weijian Luo, Tianyang Hu,..., Zhihua Zhang |
95 |
2023-07-07 |
RADAR: Robust AI-Text Detection via Adversarial Learning |
link |
Xiaomeng Hu, Pin-Yu Chen, Tsung-Yi Ho |
93 |
2021-07-19 |
Epistemic Neural Networks |
link |
Ian Osband, Zheng Wen,..., Benjamin Van Roy |
93 |
2023-05-31 |
Protein Design with Guided Discrete Diffusion |
link |
Nate Gruver, Samuel Don Stanton,..., Andrew Gordon Wilson |
93 |
2023-06-13 |
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models |
link |
Yinghao Aaron Li, Cong Han,..., Nima Mesgarani |
92 |
2023-06-15 |
Linguistic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment |
link |
Royi Rassin, Eran Hirsch,..., Gal Chechik |
92 |
2023-05-17 |
Language Model Tokenizers Introduce Unfairness Between Languages |
link |
Aleksandar Petrov, Emanuele La Malfa,..., Adel Bibi |
91 |
2023-08-11 |
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models |
link |
Weijia Wu, Yuzhong Zhao,..., Chunhua Shen |
91 |
2023-05-30 |
Koopa: Learning Non-stationary Time Series Dynamics with Koopman Predictors |
link |
Yong Liu, Chenyu Li,..., Mingsheng Long |
90 |
2023-06-15 |
DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data |
link |
Stephanie Fu, Netanel Yakir Tamir,..., Phillip Isola |
89 |
2023-06-28 |
On the Exploitability of Instruction Tuning |
link |
Manli Shu, Jiongxiao Wang,..., Tom Goldstein |
89 |
2023-07-12 |
Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution |
link |
Mostafa Dehghani, Basil Mustafa,..., Neil Houlsby |
89 |
2023-11-10 |
FourierGNN: Rethinking Multivariate Time Series Forecasting from a Pure Graph Perspective |
link |
Kun Yi, Qi Zhang,..., Zhendong Niu |
89 |
2023-05-29 |
VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset |
link |
Sihan Chen, Handong Li,..., Jing Liu |
88 |
2023-06-30 |
The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks |
link |
Ziqian Zhong, Ziming Liu,..., Jacob Andreas |
88 |
2023-06-07 |
Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models |
link |
George Stein, Jesse C. Cresswell,..., Gabriel Loaiza-Ganem |
88 |
2023-02-15 |
Speculative Decoding with Big Little Decoder |
link |
Sehoon Kim, Karttikeya Mangalam,..., Kurt Keutzer |
88 |
2023-07-05 |
RanPAC: Random Projections and Pre-trained Models for Continual Learning |
link |
Mark McDonnell, Dong Gong,..., Anton van den Hengel |
88 |
2023-05-18 |
PTQD: Accurate Post-Training Quantization for Diffusion Models |
link |
Yefei He, Luping Liu,..., Bohan Zhuang |
87 |
2023-07-04 |
ProPILE: Probing Privacy Leakage in Large Language Models |
link |
Siwon Kim, Sangdoo Yun,..., Seong Joon Oh |
87 |
2023-05-18 |
Language Models Meet World Models: Embodied Experiences Enhance Language Models |
link |
Jiannan Xiang, Tianhua Tao,..., Zhiting Hu |
87 |
2023-04-25 |
Patch Diffusion: Faster and More Data-Efficient Training of Diffusion Models |
link |
Zhendong Wang, Yifan Jiang,..., Mingyuan Zhou |
87 |
2023-01-27 |
Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning |
link |
Xinyi Wang, Wanrong Zhu,..., William Yang Wang |
86 |
2023-06-15 |
DreamHuman: Animatable 3D Avatars from Text |
link |
Nikos Kolotouros, Thiemo Alldieck,..., Cristian Sminchisescu |
86 |
2023-02-22 |
Guiding Large Language Models via Directional Stimulus Prompting |
link |
Zekun Li, Baolin Peng,..., Xifeng Yan |
86 |
2023-05-23 |
Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer Quantization |
link |
Jeonghoon Kim, Jung Hyun Lee,..., Dongsoo Lee |
86 |
2023-10-25 |
DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models |
link |
Ge Zheng, Bin Yang,..., Sibei Yang |
86 |
2023-05-18 |
Weakly-Supervised Concealed Object Segmentation with SAM-based Pseudo Labeling and Multi-scale Feature Grouping |
link |
Chunming He, Kai Li,..., Xiu Li |
85 |
2023-04-07 |
Why think step by step? Reasoning emerges from the locality of experience |
link |
Ben Prystawski, Michael Y. Li, Noah Goodman |
85 |
None |
PromptIR: Prompting for All-in-One Image Restoration |
link |
Vaishnav Potlapalli, Syed Waqas Zamir,..., Fahad Khan |
85 |
2023-05-24 |
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models |
link |
Gen Luo, Yiyi Zhou,..., Rongrong Ji |
84 |
2023-03-14 |
The Learnability of In-Context Learning |
link |
Noam Wies, Yoav Levine, Amnon Shashua |
84 |
2023-05-03 |
Lift Yourself Up: Retrieval-augmented Text Generation with Self-Memory |
link |
Xin Cheng, Di Luo,..., Rui Yan |
84 |
2023-03-23 |
Towards Better Dynamic Graph Learning: New Architecture and Unified Library |
link |
Le Yu, Leilei Sun,..., Weifeng Lv |
83 |
2023-07-02 |
Solving Linear Inverse Problems Provably via Posterior Sampling with Latent Diffusion Models |
link |
Litu Rout, Negin Raoof,..., Sanjay Shakkottai |
82 |
2023-06-20 |
Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision |
link |
Ayush Tewari, Tianwei Yin,..., Vincent Sitzmann |
82 |
2023-05-24 |
Unsupervised Semantic Correspondence Using Stable Diffusion |
link |
Eric Hedlin, Gopal Sharma,..., Kwang Moo Yi |
81 |
2023-05-12 |
MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers |
link |
Lili Yu, Daniel Simig,..., Mike Lewis |
80 |
2023-06-29 |
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models |
link |
Simian Luo, Chuanhao Yan,..., Hang Zhao |
79 |
2023-02-02 |
SimMTM: A Simple Pre-Training Framework for Masked Time-Series Modeling |
link |
Jiaxiang Dong, Haixu Wu,..., Mingsheng Long |
79 |
2023-06-15 |
Segment Any Point Cloud Sequences by Distilling Vision Foundation Models |
link |
Youquan Liu, Lingdong Kong,..., Ziwei Liu |
79 |
2023-05-29 |
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning |
link |
Haoran He, Chenjia Bai,..., Xuelong Li |
79 |
2023-03-07 |
Structured State Space Models for In-Context Reinforcement Learning |
link |
Chris Lu, Yannick Schroecker,..., Feryal Behbahani |
79 |
2023-06-22 |
Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing |
link |
Yelysei Bondarenko, Markus Nagel, Tijmen Blankevoort |
79 |
2023-06-12 |
Augmenting Language Models with Long-Term Memory |
link |
Weizhi Wang, Li Dong,..., Furu Wei |
78 |
2023-06-07 |
Intrinsic Dimension Estimation for Robust Detection of AI-Generated Texts |
link |
Eduard Tulchinskii, Kristian Kuznetsov,..., Irina Piontkovskaya |
78 |
2023-05-15 |
Interpretability at Scale: Identifying Causal Mechanisms in Alpaca |
link |
Zhengxuan Wu, Atticus Geiger,..., Noah Goodman |
77 |
2023-06-05 |
Representational Strengths and Limitations of Transformers |
link |
Clayton Sanford, Daniel Hsu, Matus Telgarsky |
77 |
2023-02-28 |
EvoPrompting: Language Models for Code-Level Neural Architecture Search |
link |
Angelica Chen, David Dohan, David So |
77 |
2023-02-02 |
Convolutional Neural Operators for robust and accurate learning of PDEs |
link |
Bogdan Raonic, Roberto Molinaro,..., Emmanuel de Bezenac |
77 |
2023-02-26 |
Fast Attention Requires Bounded Entries |
link |
Josh Alman, Zhao Song |
76 |
2023-05-19 |
The probability flow ODE is provably fast |
link |
Sitan Chen, Sinho Chewi,..., Adil Salim |
76 |
2023-06-29 |
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation |
link |
Zibo Zhao, Wen Liu,..., Shenghua Gao |
75 |
2023-06-01 |
White-Box Transformers via Sparse Rate Reduction |
link |
Yaodong Yu, Sam Buchanan,..., Yi Ma |
74 |
2023-05-15 |
Privacy Auditing with One (1) Training Run |
link |
Thomas Steinke, Milad Nasr, Matthew Jagielski |
74 |
2023-05-24 |
Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples |
link |
Abulhair Saparov, Richard Yuanzhe Pang,..., He He |
73 |
2023-06-01 |
Birth of a Transformer: A Memory Viewpoint |
link |
Alberto Bietti, Vivien Cabannes,..., Leon Bottou |
72 |
2023-09-25 |
Dataset Diffusion: Diffusion-based Synthetic Data Generation for Pixel-Level Semantic Segmentation |
link |
Quang Ho Nguyen, Truong Tuan Vu,..., Khoi Nguyen |
71 |
2023-10-11 |
Hierarchical Decomposition of Prompt-Based Continual Learning: Rethinking Obscured Sub-optimality |
link |
Liyuan Wang, Jingyi Xie,..., Jun Zhu |
71 |
2022-05-20 |
Evaluating and Inducing Personality in Pre-trained Language Models |
link |
Guangyuan Jiang, Manjie Xu,..., Yixin Zhu |
70 |
2023-01-12 |
Tracr: Compiled Transformers as a Laboratory for Interpretability |
link |
David Lindner, Janos Kramar,..., Vladimir Mikulik |
70 |
2023-10-20 |
DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics |
link |
Kaiwen Zheng, Cheng Lu,..., Jun Zhu |
70 |
2023-06-30 |
Practical and Asymptotically Exact Conditional Sampling in Diffusion Models |
link |
Luhuan Wu, Brian L. Trippe,..., John Patrick Cunningham |
70 |
2023-09-27 |
GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization |
link |
Vicente Vivanco Cepeda, Gaurav Kumar Nayak, Mubarak Shah |
70 |
2023-05-22 |
VanillaNet: the Power of Minimalism in Deep Learning |
link |
Hanting Chen, Yunhe Wang,..., Dacheng Tao |
69 |
2023-06-26 |
Supervised Pretraining Can Learn In-Context Reinforcement Learning |
link |
Jonathan Lee, Annie Xie,..., Emma Brunskill |
69 |
2023-08-10 |
PDE-Refiner: Achieving Accurate Long Rollouts with Neural PDE Solvers |
link |
Phillip Lippe, Bastiaan S. Veeling,..., Johannes Brandstetter |
69 |
2023-03-23 |
The Quantization Model of Neural Scaling |
link |
Eric J Michaud, Ziming Liu,..., Max Tegmark |
69 |
2022-12-19 |
Latent Diffusion for Language Generation |
link |
Justin Lovelace, Varsha Kishore,..., Kilian Q Weinberger |
69 |
None |
Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task Agents |
link |
Zihao Wang, Shaofei Cai,..., Yitao Liang |
68 |
2023-05-25 |
Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer |
link |
Yuandong Tian, Yiping Wang,..., Simon Shaolei Du |
68 |
2023-05-21 |
DreamWaltz: Make a Scene with Complex 3D Animatable Avatars |
link |
Yukun Huang, Jianan Wang,..., Lei Zhang |
68 |
2023-05-19 |
PointGPT: Auto-regressively Generative Pre-training from Point Clouds |
link |
Guangyan Chen, Meiling Wang,..., Yufeng Yue |
68 |
2023-05-29 |
GlyphControl: Glyph Conditional Control for Visual Text Generation |
link |
Yukang Yang, Dongnan Gui,..., Kai Chen |
67 |
2023-06-02 |
The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation |
link |
Saurabh Saxena, Charles Herrmann,..., David J. Fleet |
67 |
2023-05-01 |
In-Context Learning Unlocked for Diffusion Models |
link |
Zhendong Wang, Yifan Jiang,..., Mingyuan Zhou |
67 |
2023-05-22 |
To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis |
link |
Fuzhao Xue, Yao Fu,..., Yang You |
67 |
2023-05-17 |
What You See is What You Read? Improving Text-Image Alignment Evaluation |
link |
Michal Yarom, Yonatan Bitton,..., Idan Szpektor |
66 |
2023-06-01 |
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft |
link |
Shalev Lifshitz, Keiran Paster,..., Sheila A. McIlraith |
66 |
2023-05-18 |
LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation |
link |
Yujie Lu, Xianjun Yang,..., William Yang Wang |
66 |
2023-05-31 |
Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias |
link |
Zhongwei Wan, Che Liu,..., Rossella Arcucci |
65 |
2023-11-08 |
Hierarchically Gated Recurrent Neural Network for Sequence Modeling |
link |
Zhen Qin, Songlin Yang, Yiran Zhong |
64 |
2023-06-26 |
Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression |
link |
Allan Raventos, Mansheej Paul,..., Surya Ganguli |
64 |
2023-05-08 |
Recommender Systems with Generative Retrieval |
link |
Shashank Rajput, Nikhil Mehta,..., Maheswaran Sathiamoorthy |
64 |
2023-05-25 |
Diversify Your Vision Datasets with Automatic Diffusion-based Augmentation |
link |
Lisa Dunlap, Alyssa Umino,..., Trevor Darrell |
64 |
2023-05-23 |
Weakly Supervised 3D Open-vocabulary Segmentation |
link |
Kunhao Liu, Fangneng Zhan,..., Shijian Lu |
64 |
2022-06-14 |
Automatic Clipping: Differentially Private Deep Learning Made Easier and Stronger |
link |
Zhiqi Bu, Yu-Xiang Wang,..., George Karypis |
64 |
2023-02-03 |
Rethinking Semi-Supervised Medical Image Segmentation: A Variance-Reduction Perspective |
link |
Chenyu You, Weicheng Dai,..., James S Duncan |
63 |
2023-10-26 |
Global Structure-Aware Diffusion Process for Low-light Image Enhancement |
link |
Jinhui HOU, Zhiyu Zhu,..., Hui Yuan |
63 |
2023-03-29 |
Diffusion Schrödinger Bridge Matching |
link |
Yuyang Shi, Valentin De Bortoli,..., Arnaud Doucet |
62 |
2023-07-20 |
A Definition of Continual Reinforcement Learning |
link |
David Abel, Andre Barreto,..., Satinder Singh |
61 |
2023-10-31 |
Unexpected Improvements to Expected Improvement for Bayesian Optimization |
link |
Sebastian Ament, Sam Daulton,..., Eytan Bakshy |
61 |
2023-09-15 |
Compositional Foundation Models for Hierarchical Planning |
link |
Anurag Ajay, Seungwook Han,..., Pulkit Agrawal |
61 |
2023-09-25 |
Evaluating Cognitive Maps and Planning in Large Language Models with CogEval |
link |
Ida Momennejad, Hosein Hasanbeig,..., Jonathan Larson |
61 |
2023-03-12 |
Synthetic Experience Replay |
link |
Cong Lu, Philip J. Ball,..., Jack Parker-Holder |
60 |
2023-06-22 |
Squeeze, Recover and Relabel: Dataset Condensation at ImageNet Scale From A New Perspective |
link |
Zeyuan Yin, Eric Xing, Zhiqiang Shen |
60 |
2023-10-11 |
RoboCLIP: One Demonstration is Enough to Learn Robot Policies |
link |
Sumedh Anand Sontakke, Jesse Zhang,..., Laurent Itti |
60 |
2023-06-08 |
SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions |
link |
Yuseung Lee, Kunho Kim,..., Minhyuk Sung |
59 |
2023-05-22 |
Hierarchical Integration Diffusion Model for Realistic Image Deblurring |
link |
Zheng Chen, Yulun Zhang,..., Xin Yuan |
59 |
2023-11-14 |
MADG: Margin-based Adversarial Learning for Domain Generalization |
link |
Aveen Dayal, Vimal K B,..., Vineeth N. Balasubramanian |
58 |
2023-05-30 |
Ambient Diffusion: Learning Clean Distributions from Corrupted Data |
link |
Giannis Daras, Kulin Shah,..., Adam Klivans |
58 |
2023-05-28 |
GIMLET: A Unified Graph-Text Model for Instruction-Based Molecule Zero-Shot Learning |
link |
Haiteng Zhao, Shengchao Liu,..., Qi Liu |
58 |
2023-05-18 |
Content-based Unrestricted Adversarial Attack |
link |
Zhaoyu Chen, Bo Li,..., Wenqiang Zhang |
58 |
2023-06-02 |
LoCoOp: Few-Shot Out-of-Distribution Detection via Prompt Learning |
link |
Atsuyuki Miyai, Qing Yu,..., Kiyoharu Aizawa |
58 |
2023-05-24 |
Flocks of Stochastic Parrots: Differentially Private Prompt Learning for Large Language Models |
link |
Haonan Duan, Adam Dziedzic,..., Franziska Boenisch |
57 |
2023-05-23 |
Uncertainty Quantification over Graph with Conformalized Graph Neural Networks |
link |
Kexin Huang, Ying Jin,..., Jure Leskovec |
57 |
2023-05-21 |
PRODIGY: Enabling In-context Learning Over Graphs |
link |
Qian Huang, Hongyu Ren,..., Jure Leskovec |
57 |
2023-10-23 |
SpecTr: Fast Speculative Decoding via Optimal Transport |
link |
Ziteng Sun, Ananda Theertha Suresh,..., Felix Yu |
57 |
2023-03-03 |
Revisiting Adversarial Training for ImageNet: Architectures, Training and Generalization across Threat Models |
link |
Naman Deep Singh, Francesco Croce, Matthias Hein |
57 |
2023-09-24 |
GraphAdapter: Tuning Vision-Language Models With Dual Knowledge Graph |
link |
Xin Li, Dongze Lian,..., Xinchao Wang |
57 |
2023-12-07 |
CLadder: Assessing Causal Reasoning in Language Models |
link |
Zhijing Jin, Yuen Chen,..., Bernhard Schölkopf |
57 |
2023-09-25 |
Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator |
link |
Hanzhuo Huang, Yufan Feng,..., Sibei Yang |
57 |
2023-07-04 |
DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation |
link |
Shentong Mo, Enze Xie,..., Zhenguo Li |
56 |
2023-03-20 |
Object-Centric Slot Diffusion |
link |
Jindong Jiang, Fei Deng,..., Sungjin Ahn |
56 |
2023-11-02 |
Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization |
link |
Jameel Hassan Abdul Samadh, Hanan Gani,..., Salman Khan |
56 |
2023-05-31 |
Efficient Diffusion Policies For Offline Reinforcement Learning |
link |
Bingyi Kang, Xiao Ma,..., Shuicheng YAN |
56 |
2023-10-08 |
FedFed: Feature Distillation against Data Heterogeneity in Federated Learning |
link |
Zhiqin Yang, Yonggang Zhang,..., Bo Han |
55 |
2023-12-11 |
4M: Massively Multimodal Masked Modeling |
link |
David Mizrahi, Roman Bachmann,..., Amir Zamir |
55 |
2023-05-27 |
Scalable Transformer for PDE Surrogate Modeling |
link |
Zijie Li, Dule Shu, Amir Barati Farimani |
55 |
2023-11-03 |
ForecastPFN: Synthetically-Trained Zero-Shot Forecasting |
link |
Samuel Dooley, Gurnoor Singh Khurana,..., Colin White |
54 |
2023-06-09 |
$S^3$: Increasing GPU Utilization during Generative Inference for Higher Throughput |
link |
Yunho Jin, Chun-Feng Wu,..., Gu-Yeon Wei |
54 |
2023-06-01 |
Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior |
link |
Shashank Subramanian, Peter Harrington,..., Amir Gholami |
54 |
2023-05-19 |
Scaling laws for language encoding models in fMRI |
link |
Richard Antonello, Aditya Vaidya, Alexander Huth |
53 |
2023-06-26 |
Equivariant flow matching |
link |
Leon Klein, Andreas Krämer, Frank Noe |
53 |
2023-07-31 |
Conformal PID Control for Time Series Prediction |
link |
Anastasios Nikolas Angelopoulos, Emmanuel Candes, Ryan Tibshirani |
53 |
2023-10-12 |
Neural Combinatorial Optimization with Heavy Decoder: Toward Large Scale Generalization |
link |
Fu Luo, Xi Lin,..., Zhenkun Wang |
52 |
2023-05-19 |
Cinematic Mindscapes: High-quality Video Reconstruction from Brain Activity |
link |
Zijiao Chen, Jiaxin Qing, Juan Helen Zhou |
52 |
2023-02-12 |
MarioGPT: Open-Ended Text2Level Generation through Large Language Models |
link |
Shyam Sudhakaran, Miguel González-Duque,..., Sebastian Risi |
52 |
2023-06-01 |
Nonparametric Identifiability of Causal Representations from Unknown Interventions |
link |
Julius von Kügelgen, Michel Besserve,..., Bernhard Schölkopf |
52 |
2023-06-07 |
Fine-Grained Visual Prompting |
link |
Lingfeng Yang, Yueze Wang,..., Jian Yang |
52 |
2023-07-19 |
PreDiff: Precipitation Nowcasting with Latent Diffusion Models |
link |
Zhihan Gao, Xingjian Shi,..., Bernie Wang |
52 |
2023-05-22 |
Textually Pretrained Speech Language Models |
link |
Michael Hassid, Tal Remez,..., Yossi Adi |
52 |
2023-07-30 |
Crystal Structure Prediction by Joint Equivariant Diffusion |
link |
Rui Jiao, Wenbing Huang,..., Yang Liu |
51 |
2023-05-25 |
Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers |
link |
Sotiris Anagnostidis, Dario Pavllo,..., Thomas Hofmann |
51 |
2023-05-19 |
Post Hoc Explanations of Language Models Can Improve Language Models |
link |
Satyapriya Krishna, Jiaqi Ma,..., Himabindu Lakkaraju |
51 |
2023-09-25 |
FeCAM: Exploiting the Heterogeneity of Class Distributions in Exemplar-Free Continual Learning |
link |
Dipam Goswami, Yuyang Liu,..., Joost van de Weijer |
51 |
2023-07-24 |
Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry |
link |
Yong-Hyun Park, Mingi Kwon,..., Youngjung Uh |
50 |
2023-05-31 |
From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces |
link |
Peter Shaw, Mandar Joshi,..., Kristina Toutanova |
50 |
2023-04-27 |
Convergence of Adam Under Relaxed Assumptions |
link |
Haochuan Li, Alexander Rakhlin, Ali Jadbabaie |
50 |
2023-06-05 |
Structure-free Graph Condensation: From Large-scale Graphs to Condensed Graph-free Data |
link |
Xin Zheng, Miao Zhang,..., Shirui Pan |
50 |
2023-04-11 |
Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference |
link |
Tao Lei, Junwen Bai,..., Ming-Wei Chang |
50 |
2023-05-11 |
An Inverse Scaling Law for CLIP Training |
link |
Xianhang Li, Zeyu Wang, Cihang Xie |
50 |
2023-05-28 |
Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive Tasks |
link |
Minki Kang, Seanie Lee,..., Sung Ju Hwang |
50 |
2023-10-19 |
Fast Model DeBias with Machine Unlearning |
link |
Ruizhe Chen, Jianfei Yang,..., Zuozhu Liu |
50 |
2023-10-30 |
One-for-All: Bridge the Gap Between Heterogeneous Architectures in Knowledge Distillation |
link |
Zhiwei Hao, Jianyuan Guo,..., Chang Xu |
50 |
2023-05-23 |
Video Prediction Models as Rewards for Reinforcement Learning |
link |
Alejandro Escontrela, Ademi Adeniji,..., Pieter Abbeel |
49 |
2023-05-31 |
Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models |
link |
Sivan Doveh, Assaf Arbelle,..., Leonid Karlinsky |
49 |
2023-06-03 |
DYffusion: A Dynamics-informed Diffusion Model for Spatiotemporal Forecasting |
link |
Salva Rühling Cachay, Bo Zhao,..., Rose Yu |
49 |
2023-07-06 |
MomentDiff: Generative Video Moment Retrieval from Random to Real |
link |
Pandeng Li, Chen-Wei Xie,..., Yongdong Zhang |
49 |
2023-07-12 |
Identifiability Guarantees for Causal Disentanglement from Soft Interventions |
link |
Jiaqi Zhang, Kristjan Greenewald,..., Caroline Uhler |
49 |
2023-06-01 |
Inserting Anybody in Diffusion Models via Celeb Basis |
link |
Ge Yuan, Xiaodong Cun,..., Huicheng Zheng |
49 |
2023-10-13 |
Rank-DETR for High Quality Object Detection |
link |
Yifan Pu, Weicong Liang,..., Gao Huang |
49 |
2023-05-22 |
Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline |
link |
Zangwei Zheng, Xiaozhe Ren,..., Yang You |
49 |
2023-09-27 |
Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing |
link |
Kai Wang, Fei Yang,..., Joost van de Weijer |
48 |
2023-06-13 |
Image Captioners Are Scalable Vision Learners Too |
link |
Michael Tschannen, Manoj Kumar,..., Lucas Beyer |
48 |
2023-06-30 |
SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs |
link |
Lijun Yu, Yong Cheng,..., Lu Jiang |
48 |
2023-06-04 |
Temporal Dynamic Quantization for Diffusion Models |
link |
Junhyuk So, Jungwon Lee,..., Eunhyeok Park |
48 |
2023-05-30 |
Likelihood-Based Diffusion Language Models |
link |
Ishaan Gulrajani, Tatsunori Hashimoto |
48 |
2023-05-22 |
Getting ViT in Shape: Scaling Laws for Compute-Optimal Model Design |
link |
Ibrahim Alabdulmohsin, Xiaohua Zhai,..., Lucas Beyer |
48 |
2023-05-16 |
AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation |
link |
Tong Wu, Zhihao Fan,..., Weizhu Chen |
48 |
2023-05-25 |
MixFormerV2: Efficient Fully Transformer Tracking |
link |
Yutao Cui, Tianhui Song,..., Limin Wang |
48 |
2022-12-15 |
MAViL: Masked Audio-Video Learners |
link |
Po-Yao Huang, Vasu Sharma,..., Christoph Feichtenhofer |
47 |
2023-07-26 |
Skill-it! A data-driven skills framework for understanding and training language models |
link |
Mayee F Chen, Nicholas Roberts,..., Christopher Re |
47 |
2022-09-01 |
ID and OOD Performance Are Sometimes Inversely Correlated on Real-world Datasets |
link |
Damien Teney, LIN Yong,..., Ehsan Abbasnejad |
47 |
2022-12-20 |
Parsel🐍: Algorithmic Reasoning with Language Models by Composing Decompositions |
link |
Eric Zelikman, Qian Huang,..., Nick Haber |
47 |
2023-05-30 |
Grammar Prompting for Domain-Specific Language Generation with Large Language Models |
link |
Bailin Wang, Zi Wang,..., Yoon Kim |
47 |
2023-11-01 |
CROMA: Remote Sensing Representations with Contrastive Radar-Optical Masked Autoencoders |
link |
Anthony Fuller, Koreen Millard, James R Green |
47 |
2023-02-09 |
Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals |
link |
Yue Wu, Yewen Fan,..., Tom Mitchell |
46 |
2023-06-01 |
Exposing Attention Glitches with Flip-Flop Language Modeling |
link |
Bingbin Liu, Jordan T. Ash,..., Cyril Zhang |
46 |
2023-05-25 |
Efficient Neural Music Generation |
link |
Max W. Y. Lam, Qiao Tian,..., Yuxuan Wang |
46 |
2023-02-01 |
The geometry of hidden representations of large transformer models |
link |
Lucrezia Valeriani, Diego Doimo,..., Alberto Cazzaniga |
46 |
2023-04-03 |
AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models |
link |
Yuancheng Wang, Zeqian Ju,..., Sheng Zhao |
46 |
2023-10-09 |
Domain Watermark: Effective and Harmless Dataset Copyright Protection is Closed at Hand |
link |
Junfeng Guo, Yiming Li,..., Bo Li |
46 |
2023-05-24 |
Inverse Preference Learning: Preference-based RL without a Reward Function |
link |
Joey Hejna, Dorsa Sadigh |
46 |
None |
PromptRestorer: A Prompting Image Restoration Method with Degradation Perception |
link |
Cong Wang, Jinshan Pan,..., Junyang Chen |
45 |
2023-05-23 |
Siamese Masked Autoencoders |
link |
Agrim Gupta, Jiajun Wu,..., Li Fei-Fei |
45 |
2023-05-25 |
Parallel Sampling of Diffusion Models |
link |
Andy Shih, Suneel Belkhale,..., Nima Anari |
45 |
2023-05-24 |
Deep Reinforcement Learning with Plasticity Injection |
link |
Evgenii Nikishin, Junhyuk Oh,..., Andre Barreto |
45 |
2023-05-25 |
Knowledge Diffusion for Distillation |
link |
Tao Huang, Yuan Zhang,..., Chang Xu |
45 |
2023-09-10 |
SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models |
link |
Shuchen Xue, Mingyang Yi,..., Zhi-Ming Ma |
45 |
2023-05-30 |
PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation |
link |
Jialu Li, Mohit Bansal |
45 |
2022-10-06 |
A Logic for Expressing Log-Precision Transformers |
link |
William Merrill, Ashish Sabharwal |
45 |
2023-05-24 |
Exploring Diverse In-Context Configurations for Image Captioning |
link |
Xu Yang, Yongliang Wu,..., Xin Geng |
45 |
2023-06-12 |
VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models |
link |
Sheng-Yen Chou, Pin-Yu Chen, Tsung-Yi Ho |
45 |
2023-02-27 |
Permutation Equivariant Neural Functionals |
link |
Allan Zhou, Kaien Yang,..., Chelsea Finn |
45 |
None |
How2comm: Communication-Efficient and Collaboration-Pragmatic Multi-Agent Perception |
link |
Dingkang Yang, Kun Yang,..., Lihua Zhang |
44 |
None |
When Do Graph Neural Networks Help with Node Classification? Investigating the Homophily Principle on Node Distinguishability |
link |
Sitao Luan, Chenqing Hua,..., Doina Precup |
44 |
2023-10-31 |
Generate What You Prefer: Reshaping Sequential Recommendation via Guided Diffusion |
link |
Zhengyi Yang, Jiancan Wu,..., Xiangnan He |
44 |
2023-10-25 |
CoDet: Co-occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection |
link |
Chuofan Ma, Yi Jiang,..., Xiaojuan Qi |
44 |
2023-10-23 |
Large Language Models are Visual Reasoning Coordinators |
link |
Liangyu Chen, Bo Li,..., Ziwei Liu |
44 |
2023-06-08 |
Factorized Contrastive Learning: Going Beyond Multi-view Redundancy |
link |
Paul Pu Liang, Zihao Deng,..., Russ Salakhutdinov |
43 |
2023-05-18 |
SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models |
link |
Ziyi Wu, Jingyu Hu,..., Animesh Garg |
43 |
2023-06-11 |
A Holistic Approach to Unifying Automatic Concept Extraction and Concept Importance Estimation |
link |
Thomas FEL, Victor Boutin,..., Thomas Serre |
43 |
2023-05-05 |
Large Language Models for Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering |
link |
Noah Hollmann, Samuel Müller, Frank Hutter |
43 |
2023-02-28 |
Goal Driven Discovery of Distributional Differences via Language Descriptions |
link |
Ruiqi Zhong, Peter Zhang,..., Jacob Steinhardt |
43 |
2023-05-31 |
Direct Diffusion Bridge using Data Consistency for Inverse Problems |
link |
Hyungjin Chung, Jeongsol Kim, Jong Chul Ye |
43 |
2023-10-23 |
FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models |
link |
Lihe Yang, Xiaogang Xu,..., Hengshuang Zhao |
43 |
2023-09-23 |
Dream the Impossible: Outlier Imagination with Diffusion Models |
link |
Xuefeng Du, Yiyou Sun,..., Yixuan Li |
42 |
2023-05-26 |
Flow Matching for Scalable Simulation-Based Inference |
link |
Jonas Bernhard Wildberger, Maximilian Dax,..., Bernhard Schölkopf |
42 |
2023-05-09 |
The emergence of clusters in self-attention dynamics |
link |
Borjan Geshkovski, Cyril Letrouit,..., Philippe Rigollet |
42 |
2023-06-21 |
Training Transformers with 4-bit Integers |
link |
Haocheng Xi, ChangHao Li,..., Jun Zhu |
42 |
2023-06-26 |
DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models |
link |
XiMing Xing, Chuang Wang,..., Dong Xu |
42 |
2023-12-08 |
Few-Shot Class-Incremental Learning via Training-Free Prototype Calibration |
link |
Qi-Wei Wang, Da-Wei Zhou,..., Han-Jia Ye |
42 |
2023-05-30 |
Joint Bayesian Inference of Graphical Structure and Parameters with a Single Generative Flow Network |
link |
Tristan Deleu, Mizu Nishikawa-Toomey,..., Yoshua Bengio |
42 |
2023-06-26 |
Restart Sampling for Improving Generative Processes |
link |
Yilun Xu, Mingyang Deng,..., Tommi S. Jaakkola |
42 |
2023-10-21 |
Contrast Everything: A Hierarchical Contrastive Framework for Medical Time-Series |
link |
Yihe Wang, Yu Han,..., Xiang Zhang |
42 |
2023-06-04 |
For SALE: State-Action Representation Learning for Deep Reinforcement Learning |
link |
Scott Fujimoto, Wei-Di Chang,..., David Meger |
42 |
2022-12-21 |
Hidden Poison: Machine Unlearning Enables Camouflaged Poisoning Attacks |
link |
Jimmy Z. Di, Jack Douglas,..., Ayush Sekhari |
42 |
2023-05-26 |
Language Models Can Improve Event Prediction by Few-Shot Abductive Reasoning |
link |
Xiaoming Shi, Siqiao Xue,..., Hongyuan Mei |
42 |
2023-06-06 |
Towards Label-free Scene Understanding by Vision Foundation Models |
link |
Runnan Chen, Youquan Liu,..., Wenping Wang |
42 |
2023-05-26 |
CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganography |
link |
Jiwen Yu, Xuanyu Zhang,..., Jian Zhang |
41 |
2023-10-18 |
Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture |
link |
Daniel Y Fu, Simran Arora,..., Christopher Re |
41 |
2023-10-13 |
Compositional Abilities Emerge Multiplicatively: Exploring Diffusion Models on a Synthetic Task |
link |
Maya Okawa, Ekdeep Singh Lubana,..., Hidenori Tanaka |
41 |
2023-06-16 |
HiNeRV: Video Compression with Hierarchical Encoding-based Neural Representation |
link |
Ho Man Kwan, Ge Gao,..., David Bull |
41 |
None |
Tree-Rings Watermarks: Invisible Fingerprints for Diffusion Images |
link |
Yuxin Wen, John Kirchenbauer,..., Tom Goldstein |
41 |
2023-10-25 |
Towards Self-Interpretable Graph-Level Anomaly Detection |
link |
Yixin Liu, Kaize Ding,..., Shirui Pan |
41 |
2023-06-04 |
Data Quality in Imitation Learning |
link |
Suneel Belkhale, Yuchen Cui, Dorsa Sadigh |
41 |
2023-07-07 |
Scalable Membership Inference Attacks via Quantile Regression |
link |
Martin Andres Bertran, Shuai Tang,..., Steven Wu |
41 |
2023-09-25 |
DeepACO: Neural-enhanced Ant Systems for Combinatorial Optimization |
link |
Haoran Ye, Jiarui Wang,..., Yong Li |
41 |
2023-06-10 |
TensorNet: Cartesian Tensor Representations for Efficient Learning of Molecular Potentials |
link |
Guillem Simeon, Gianni De Fabritiis |
41 |
2023-12-22 |
FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and Editing |
link |
Mingyuan Zhang, Huirong Li,..., Ziwei Liu |
41 |
2022-11-25 |
Expanding Small-Scale Datasets with Guided Imagination |
link |
Yifan Zhang, Daquan Zhou,..., Jiashi Feng |
41 |
2023-06-05 |
HeadSculpt: Crafting 3D Head Avatars with Text |
link |
Xiao Han, Yukang Cao,..., Kwan-Yee K. Wong |
40 |
2023-06-28 |
Separable Physics-Informed Neural Networks |
link |
Junwoo Cho, Seungtae Nam,..., Eunbyung Park |
40 |
2023-07-12 |
No Train No Gain: Revisiting Efficient Training Algorithms For Transformer-based Language Models |
link |
Jean Kaddour, Oscar Key,..., Matt Kusner |
40 |
2023-02-17 |
Consistent Diffusion Models: Mitigating Sampling Drift by Learning to be Consistent |
link |
Giannis Daras, Yuval Dagan,..., Constantinos Costis Daskalakis |
40 |
2023-06-29 |
Graph Denoising Diffusion for Inverse Protein Folding |
link |
Kai Yi, Bingxin Zhou,..., Yu Guang Wang |
40 |
2023-12-12 |
Equivariant Flow Matching with Hybrid Probability Transport for 3D Molecule Generation |
link |
Yuxuan Song, Jingjing Gong,..., Wei-Ying Ma |
40 |
2023-09-25 |
IEBins: Iterative Elastic Bins for Monocular Depth Estimation |
link |
Shuwei Shao, Zhongcai Pei,..., Zhengguo Li |
40 |
2022-09-30 |
Universal Prompt Tuning for Graph Neural Networks |
link |
Taoran Fang, Yunchao Mercer Zhang,..., Lei CHEN |
39 |
2023-07-22 |
HIQL: Offline Goal-Conditioned RL with Latent States as Actions |
link |
Seohong Park, Dibya Ghosh,..., Sergey Levine |
39 |
2023-06-02 |
Spatially Resolved Gene Expression Prediction from Histology Images via Bi-modal Contrastive Learning |
link |
Ronald Xie, Kuan Pang,..., Gary Bader |
39 |
2023-06-20 |
LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical Imaging via Second-order Graph Matching |
link |
Duy Minh Ho Nguyen, Hoang Nguyen,..., Mathias Niepert |
39 |
2023-05-25 |
Diffusion-Based Adversarial Sample Generation for Improved Stealthiness and Controllability |
link |
Haotian Xue, Alexandre Araujo,..., Yongxin Chen |
39 |
2023-07-21 |
Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series Forecasting |
link |
Marcel Kollovieh, Abdul Fatir Ansari,..., Bernie Wang |
39 |
2023-07-07 |
Autodecoding Latent 3D Diffusion Models |
link |
Evangelos Ntavelis, Aliaksandr Siarohin,..., Sergey Tulyakov |
38 |
2023-02-02 |
Timewarp: Transferable Acceleration of Molecular Dynamics by Learning Time-Coarsened Dynamics |
link |
Leon Klein, Andrew Y. K. Foong,..., Ryota Tomioka |
38 |
2023-07-06 |
Pruning vs Quantization: Which is Better? |
link |
Andrey Kuzmin, Markus Nagel,..., Tijmen Blankevoort |
38 |
2023-02-14 |
Energy Transformer |
link |
Benjamin Hoover, Yuchen Liang,..., Dmitry Krotov |
38 |
2023-08-16 |
Towards Personalized Federated Learning via Heterogeneous Model Reassembly |
link |
Jiaqi Wang, Xingyi Yang,..., Fenglong Ma |
38 |
2023-09-29 |
Asynchrony-Robust Collaborative Perception via Bird's Eye View Flow |
link |
Sizhe Wei, Yuxi Wei,..., Ya Zhang |
38 |
2023-04-23 |
DiffTraj: Generating GPS Trajectory with Diffusion Probabilistic Model |
link |
Yuanshao Zhu, Yongchao Ye,..., James Yu |
37 |
2023-05-22 |
On quantum backpropagation, information reuse, and cheating measurement collapse |
link |
Amira Abbas, Robbie King,..., Jarrod Ryan McClean |
37 |
None |
P-Flow: A Fast and Data-Efficient Zero-Shot TTS through Speech Prompting |
link |
Sungwon Kim, Kevin J. Shih,..., Bryan Catanzaro |
37 |
2024-02-01 |
ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields |
link |
Jiahua Dong, Yu-Xiong Wang |
37 |
2023-03-01 |
Time Series as Images: Vision Transformer for Irregularly Sampled Time Series |
link |
Zekun Li, Shiyang Li, Xifeng Yan |
37 |
2023-06-07 |
Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities |
link |
Andrii Zadaianchuk, Maximilian Seitzer, Georg Martius |
36 |
2022-11-02 |
Entropic Neural Optimal Transport via Diffusion Processes |
link |
Nikita Gushchin, Alexander Kolesov,..., Evgeny Burnaev |
36 |
2022-10-26 |
The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs |
link |
Laura Eline Ruis, Akbir Khan,..., Edward Grefenstette |
36 |
2023-06-23 |
Max-Margin Token Selection in Attention Mechanism |
link |
Davoud Ataee Tarzanagh, Yingcong Li,..., Samet Oymak |
36 |
2023-06-23 |
Scaling MLPs: A Tale of Inductive Bias |
link |
Gregor Bachmann, Sotiris Anagnostidis, Thomas Hofmann |
36 |
2023-06-15 |
Class-Conditional Conformal Prediction with Many Classes |
link |
Tiffany Ding, Anastasios Nikolas Angelopoulos,..., Ryan Tibshirani |
36 |
2023-02-14 |
Bounding training data reconstruction in DP-SGD |
link |
Jamie Hayes, Borja Balle, Saeed Mahloujifar |
36 |
2023-06-08 |
Boosting Adversarial Transferability by Achieving Flat Local Maxima |
link |
Zhijin Ge, Hongying Liu,..., Yuanyuan Liu |
36 |
2023-04-25 |
Stable and low-precision training for large-scale vision-language models |
link |
Mitchell Wortsman, Tim Dettmers,..., Ludwig Schmidt |
36 |
2023-05-29 |
PHOTOSWAP: Personalized Subject Swapping in Images |
link |
Jing Gu, Yilin Wang,..., Xin Eric Wang |
36 |
2023-10-22 |
Hierarchical Vector Quantized Transformer for Multi-class Unsupervised Anomaly Detection |
link |
Ruiying Lu, YuJie Wu,..., Ruimin Hu |
36 |
2023-04-02 |
SEENN: Towards Temporal Spiking Early Exit Neural Networks |
link |
Yuhang Li, Tamar Geller,..., Priyadarshini Panda |
36 |
2023-04-25 |
Parallel Spiking Neurons with High Efficiency and Ability to Learn Long-term Dependencies |
link |
Wei Fang, Zhaofei Yu,..., Yonghong Tian |
35 |
2023-07-28 |
AbDiffuser: full-atom generation of in-vitro functioning antibodies |
link |
Karolis Martinkus, Jan Ludwiczak,..., Andreas Loukas |
35 |
2023-03-01 |
Grounded Decoding: Guiding Text Generation with Grounded Models for Embodied Agents |
link |
Wenlong Huang, Fei Xia,..., Brian Ichter |
35 |
2022-12-06 |
GAUCHE: A Library for Gaussian Processes in Chemistry |
link |
Ryan-Rhys Griffiths, Leo Klarner,..., Jian Tang |
35 |
2023-07-20 |
OBJECT 3DIT: Language-guided 3D-aware Image Editing |
link |
Oscar Michel, Anand Bhattad,..., Tanmay Gupta |
35 |
2023-05-17 |
End-To-End Latent Variational Diffusion Models for Inverse Problems in High Energy Physics |
link |
Alexander Shmakov, Kevin Greif,..., Daniel Whiteson |
35 |
2023-08-27 |
Revisiting Scalarization in Multi-Task Learning: A Theoretical Perspective |
link |
Yuzheng Hu, Ruicheng Xian,..., Han Zhao |
35 |
2023-06-02 |
Interpretable and Explainable Logical Policies via Neurally Guided Symbolic Abstraction |
link |
Quentin Delfosse, Hikaru Shindo,..., Kristian Kersting |
35 |
2023-06-29 |
Neural Polarizer: A Lightweight and Effective Backdoor Defense via Purifying Poisoned Features |
link |
Mingli Zhu, Shaokui Wei,..., Baoyuan Wu |
35 |
None |
BIOT: Biosignal Transformer for Cross-data Learning in the Wild |
link |
Chaoqi Yang, M Brandon Westover, Jimeng Sun |
35 |
2023-05-30 |
LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images |
link |
Viraj Uday Prabhu, Sriram Yenamandra,..., Judy Hoffman |
35 |
2023-07-05 |
Elastic Decision Transformer |
link |
Yueh-Hua Wu, Xiaolong Wang, Masashi Hamaya |
34 |
2023-07-07 |
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment |
link |
Tianwei Ni, Michel Ma,..., Pierre-Luc Bacon |
34 |
2023-04-02 |
Saddle-to-Saddle Dynamics in Diagonal Linear Networks |
link |
Scott Pesme, Nicolas Flammarion |
34 |
2023-07-10 |
Compositional Generalization from First Principles |
link |
Thaddäus Wiedemer, Prasanna Mayilvahanan,..., Wieland Brendel |
34 |
2023-06-14 |
Training-free Diffusion Model Adaptation for Variable-Sized Text-to-Image Synthesis |
link |
Zhiyu Jin, Xuli Shen,..., Xiangyang Xue |
34 |
2023-06-09 |
PoET: A generative model of protein families as sequences-of-sequences |
link |
Timothy Fei Truong Jr, Tristan Bepler |
34 |
2022-06-27 |
Supply-Side Equilibria in Recommender Systems |
link |
Meena Jagadeesan, Nikhil Garg, Jacob Steinhardt |
34 |
None |
Adaptive Normalization for Non-stationary Time Series Forecasting: A Temporal Slice Perspective |
link |
Zhiding Liu, Mingyue Cheng,..., Enhong Chen |
34 |
2022-12-23 |
A-NeSI: A Scalable Approximate Method for Probabilistic Neurosymbolic Inference |
link |
Emile van Krieken, Thiviyan Thanapalasingam,..., Annette Ten Teije |
34 |
None |
CrossGNN: Confronting Noisy Multivariate Time Series Via Cross Interaction Refinement |
link |
Qihe Huang, Lei Shen,..., Yang Wang |
34 |
None |
Dynamic Personalized Federated Learning with Adaptive Differential Privacy |
link |
Xiyuan Yang, Wenke Huang, Mang Ye |
34 |
2023-07-20 |
Shared Adversarial Unlearning: Backdoor Mitigation by Unlearning Shared Adversarial Examples |
link |
Shaokui Wei, Mingda Zhang,..., Baoyuan Wu |
34 |
2023-07-03 |
Hierarchical Open-vocabulary Universal Image Segmentation |
link |
Xudong Wang, Shufan Li,..., Trevor Darrell |
33 |
2023-05-18 |
Clifford Group Equivariant Neural Networks |
link |
David Ruhe, Johannes Brandstetter, Patrick Forré |
33 |
None |
Let the Flows Tell: Solving Graph Combinatorial Problems with GFlowNets |
link |
Dinghuai Zhang, Hanjun Dai,..., Ling Pan |
33 |
None |
Two-Stage Learning to Defer with Multiple Experts |
link |
Anqi Mao, Christopher Mohri,..., Yutao Zhong |
33 |
2023-10-27 |
Learning to Search Feasible and Infeasible Regions of Routing Problems with Flexible Neural k-Opt |
link |
Yining Ma, Zhiguang Cao, Yeow Meng Chee |
33 |
2023-05-26 |
Causal Component Analysis |
link |
Wendong Liang, Armin Kekić,..., Bernhard Schölkopf |
33 |
2023-05-24 |
ALGO: Synthesizing Algorithmic Programs with Generated Oracle Verifiers |
link |
Kexun Zhang, Danqing Wang,..., Lei Li |
33 |
2023-12-13 |
Distributed Inference and Fine-tuning of Large Language Models Over The Internet |
link |
Alexander Borzunov, Max Ryabinin,..., Colin Raffel |
33 |
2023-06-13 |
(Amplified) Banded Matrix Factorization: A unified approach to private training |
link |
Christopher A. Choquette-Choo, Arun Ganesh,..., Zheng Xu |
33 |
2023-05-26 |
Demo2Code: From Summarizing Demonstrations to Synthesizing Code via Extended Chain-of-Thought |
link |
Huaxiaoyue Wang, Gonzalo Gonzalez-Pumariega,..., Sanjiban Choudhury |
33 |
2023-09-25 |
LinGCN: Structural Linearized Graph Convolutional Network for Homomorphically Encrypted Inference |
link |
Hongwu Peng, Ran Ran,..., Caiwen Ding |
33 |
2023-10-23 |
Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules |
link |
Zhiyuan Liu, Yaorui Shi,..., Tat-Seng Chua |
33 |
2023-05-30 |
Intriguing Properties of Quantization at Scale |
link |
Arash Ahmadian, Saurabh Dash,..., Sara Hooker |
33 |
2022-06-07 |
A*Net: A Scalable Path-based Reasoning Approach for Knowledge Graphs |
link |
Zhaocheng Zhu, Xinyu Yuan,..., Jian Tang |
33 |
2023-06-18 |
Online Map Vectorization for Autonomous Driving: A Rasterization Perspective |
link |
Gongjie Zhang, Jiahao Lin,..., Zuoguan Wang |
33 |
2023-10-04 |
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection |
link |
Yang Cao, Yihan Zeng,..., Dan Xu |
32 |
2023-06-01 |
Learning Transformer Programs |
link |
Dan Friedman, Alexander Wettig, Danqi Chen |
32 |
2023-05-18 |
Smoothing the Landscape Boosts the Signal for SGD: Optimal Sample Complexity for Learning Single Index Models |
link |
Alex Damian, Eshaan Nichani,..., Jason D. Lee |
32 |
2023-05-25 |
Demystifying Oversmoothing in Attention-Based Graph Neural Networks |
link |
Xinyi Wu, Amir Ajorlou,..., Ali Jadbabaie |
32 |
2022-10-03 |
Rank-N-Contrast: Learning Continuous Representations for Regression |
link |
Kaiwen Zha, Peng Cao,..., Dina Katabi |
32 |
2023-07-14 |
HyTrel: Hypergraph-enhanced Tabular Data Representation Learning |
link |
Pei Chen, Soumajyoti Sarkar,..., George Karypis |
32 |
2023-10-24 |
A Unified, Scalable Framework for Neural Population Decoding |
link |
Mehdi Azabou, Vinam Arora,..., Eva L Dyer |
32 |
2023-05-24 |
Reverse Engineering Self-Supervised Learning |
link |
Ido Ben-Shaul, Ravid Shwartz-Ziv,..., Yann LeCun |
32 |
2022-03-29 |
Causal de Finetti: On the Identification of Invariant Causal Structure in Exchangeable Data |
link |
Siyuan Guo, Viktor Tóth,..., Ferenc Huszár |
32 |
2023-03-19 |
Unsupervised Learning for Solving the Travelling Salesman Problem |
link |
Yimeng Min, Yiwei Bai, Carla P Gomes |
32 |
None |
SwapPrompt: Test-Time Prompt Adaptation for Vision-Language Models |
link |
Xiaosong Ma, Jie ZHANG,..., Wenchao Xu |
32 |
2023-05-26 |
SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation |
link |
Zhuoyan Luo, Yicheng Xiao,..., Yujiu Yang |
32 |
2023-05-16 |
Revisiting the Minimalist Approach to Offline Reinforcement Learning |
link |
Denis Tarasov, Vladislav Kurenkov,..., Sergey Kolesnikov |
32 |
2023-03-23 |
Fairness-guided Few-shot Prompting for Large Language Models |
link |
Huan Ma, Changqing Zhang,..., Bingzhe Wu |
32 |
2023-09-23 |
Deciphering Spatio-Temporal Graph Forecasting: A Causal Lens and Treatment |
link |
Yutong Xia, Yuxuan Liang,..., Roger Zimmermann |
32 |
2023-10-13 |
Does Graph Distillation See Like Vision Dataset Counterpart? |
link |
Beining Yang, Kai Wang,..., Jianxin Li |
32 |
2022-09-13 |
Characterizing Graph Datasets for Node Classification: Homophily-Heterophily Dichotomy and Beyond |
link |
Oleg Platonov, Denis Kuznedelev,..., Liudmila Prokhorenkova |
32 |
2023-06-01 |
StyleGAN knows Normal, Depth, Albedo, and More |
link |
Anand Bhattad, Daniel McKee,..., David Forsyth |
31 |
2023-02-21 |
Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels |
link |
Zebin You, Yong Zhong,..., Jun Zhu |
31 |
2023-07-13 |
Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement |
link |
Hui Yuan, Kaixuan Huang,..., Mengdi Wang |
31 |
2023-10-07 |
VLATTACK: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models |
link |
Ziyi Yin, Muchao Ye,..., Fenglong Ma |
31 |
2023-09-15 |
Towards Last-layer Retraining for Group Robustness with Fewer Annotations |
link |
Tyler LaBonte, Vidya Muthukumar, Abhishek Kumar |
31 |
2023-05-22 |
Meta-in-context learning in large language models |
link |
Julian Coda-Forno, Marcel Binz,..., Eric Schulz |
31 |
2023-08-02 |
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation |
link |
Yasheng Sun, Yifan Yang,..., Hideki Koike |
31 |
2022-11-25 |
Bypass Exponential Time Preprocessing: Fast Neural Network Training via Weight-Data Correlation Preprocessing |
link |
Josh Alman, Jiehao Liang,..., Danyang Zhuo |
31 |
2023-03-22 |
EDGI: Equivariant Diffusion for Planning with Embodied Agents |
link |
Johann Brehmer, Joey Bose,..., Taco Cohen |
31 |
None |
ClusterFormer: Clustering As A Universal Visual Learner |
link |
James Chenhao Liang, Yiming Cui,..., Dongfang Liu |
31 |
2023-05-18 |
DiffUTE: Universal Text Editing Diffusion Model |
link |
Haoxing Chen, Zhuoer Xu,..., Weiqiang Wang |
31 |
2023-05-30 |
SheetCopilot: Bringing Software Productivity to the Next Level through Large Language Models |
link |
Hongxin Li, Jingran Su,..., Zhaoxiang Zhang |
30 |
2023-05-30 |
Real-World Image Variation by Aligning Diffusion Inversion Chain |
link |
Yuechen Zhang, Jinbo Xing,..., Jiaya Jia |
30 |
None |
IBA: Towards Irreversible Backdoor Attacks in Federated Learning |
link |
Dung Thuy Nguyen, Tuan Minh Nguyen,..., Kok Seng Wong |
30 |
2023-10-27 |
Optimal Transport for Treatment Effect Estimation |
link |
Hao Wang, Jiajun Fan,..., Ruiming Tang |
30 |
2023-02-07 |
Concept Algebra for (Score-Based) Text-Controlled Generative Models |
link |
Zihao Wang, Lin Gui,..., Victor Veitch |
30 |
2023-06-21 |
Mass-Producing Failures of Multimodal Systems with Language Models |
link |
Shengbang Tong, Erik Jones, Jacob Steinhardt |
30 |
2023-11-14 |
The Transient Nature of Emergent In-Context Learning in Transformers |
link |
Aaditya K Singh, Stephanie C.Y. Chan,..., Felix Hill |
30 |
2023-10-29 |
Does Invariant Graph Learning via Environment Augmentation Learn Invariance? |
link |
Yongqiang Chen, Yatao Bian,..., James Cheng |
30 |
2022-10-04 |
ASIF: Coupled Data Turns Unimodal Models to Multimodal without Training |
link |
Antonio Norelli, Marco Fumero,..., Francesco Locatello |
30 |
2022-04-04 |
Training Fully Connected Neural Networks is $\exists\mathbb{R}$-Complete |
link |
Daniel Bertschinger, Christoph Hertrich,..., Simon Weber |
30 |
2023-05-26 |
Contrast, Attend and Diffuse to Decode High-Resolution Images from Brain Activities |
link |
Jingyuan Sun, Mingxiao Li,..., Marie-Francine Moens |
30 |
2023-02-08 |
Sample-efficient Multi-objective Molecular Optimization with GFlowNets |
link |
Yiheng Zhu, Jialu Wu,..., Jian Wu |
30 |
2023-05-26 |
The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model |
link |
Laixi Shi, Gen Li,..., Yuejie Chi |
30 |
2023-10-14 |
STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning |
link |
Weipu Zhang, Gang Wang,..., Gao Huang |
30 |
2023-05-15 |
Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models |
link |
Zhimin Chen, Longlong Jing,..., Bing Li |
30 |
2023-04-10 |
H2RBox-v2: Incorporating Symmetry for Boosting Horizontal Box Supervised Oriented Object Detection |
link |
Yi Yu, Xue Yang,..., Junchi Yan |
29 |
2023-06-02 |
Convex and Non-convex Optimization Under Generalized Smoothness |
link |
Haochuan Li, Jian Qian,..., Ali Jadbabaie |
29 |
None |
DeWave: Discrete Encoding of EEG Waves for EEG to Text Translation |
link |
Yiqun Duan, Charles Zhou,..., Chin-teng Lin |
29 |
2023-05-31 |
Not All Neuro-Symbolic Concepts Are Created Equal: Analysis and Mitigation of Reasoning Shortcuts |
link |
Emanuele Marconato, Stefano Teso,..., Andrea Passerini |
29 |
2024-01-17 |
POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images |
link |
Antonín Vobecký, Oriane Siméoni,..., Josef Sivic |
29 |
2023-06-12 |
Transformers learn through gradual rank increase |
link |
Enric Boix-Adserà, Etai Littwin,..., Joshua M. Susskind |
29 |
2023-03-13 |
Transformer-based Planning for Symbolic Regression |
link |
Parshin Shojaee, Kazem Meidani,..., Chandan K. Reddy |
29 |
2023-10-02 |
Disentangling Voice and Content with Self-Supervision for Speaker Recognition |
link |
Tianchi Liu, Kong Aik Lee,..., Haizhou Li |
29 |
2023-06-06 |
The Emergence of Essential Sparsity in Large Pre-trained Models: The Weights that Matter |
link |
Ajay Kumar Jaiswal, Shiwei Liu,..., Zhangyang Wang |
29 |
None |
Conformal Prediction for Uncertainty-Aware Planning with Diffusion Dynamics Model |
link |
Jiankai Sun, Yiqi Jiang,..., Mac Schwager |
29 |
2023-02-08 |
Taming Local Effects in Graph-based Spatiotemporal Forecasting |
link |
Andrea Cini, Ivan Marisca,..., Cesare Alippi |
29 |
2022-11-05 |
Unleashing the Power of Graph Data Augmentation on Covariate Distribution Shift |
link |
Yongduo Sui, Qitian Wu,..., Xiangnan He |
29 |
2023-06-02 |
Demystifying Structural Disparity in Graph Neural Networks: Can One Size Fit All? |
link |
Haitao Mao, Zhikai Chen,..., Jiliang Tang |
29 |
2023-06-19 |
Beyond Normal: On the Evaluation of Mutual Information Estimators |
link |
Paweł Czyż, Frederic Grabowski,..., Alexander Marx |
29 |
2023-06-18 |
Score-based Data Assimilation |
link |
François Rozet, Gilles Louppe |
29 |
2023-06-01 |
DiffPack: A Torsional Diffusion Model for Autoregressive Protein Side-Chain Packing |
link |
Yangtian Zhang, Zuobai Zhang,..., Jian Tang |
29 |
2024-01-04 |
Improving Diffusion-Based Image Synthesis with Context Prediction |
link |
Ling Yang, Jingwei Liu,..., Bin Cui |
29 |
2023-06-08 |
Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment |
link |
Zihui Xue, Kristen Grauman |
29 |
2024-02-12 |
A Closer Look at the Robustness of Contrastive Language-Image Pre-Training (CLIP) |
link |
Weijie Tu, Weijian Deng, Tom Gedeon |
28 |
2023-09-04 |
Memory Efficient Optimizers with 4-bit States |
link |
Bingrui Li, Jianfei Chen, Jun Zhu |
28 |
2023-01-26 |
Break It Down: Evidence for Structural Compositionality in Neural Networks |
link |
Michael A. Lepori, Thomas Serre, Ellie Pavlick |
28 |
None |
QuantSR: Accurate Low-bit Quantization for Efficient Image Super-Resolution |
link |
Haotong Qin, Yulun Zhang,..., Fisher Yu |
28 |
2023-06-12 |
Operator Learning with Neural Fields: Tackling PDEs on General Geometries |
link |
Louis Serrano, Lise Le Boudec,..., Patrick Gallinari |
28 |
2023-05-31 |
A Unified Framework for U-Net Design and Analysis |
link |
Christopher Williams, Fabian Falck,..., Saifuddin Syed |
28 |
None |
Not All Out-of-Distribution Data Are Harmful to Open-Set Active Learning |
link |
Yang Yang, Yuxuan Zhang,..., Yi Xu |
28 |
2023-04-07 |
A new perspective on building efficient and expressive 3D equivariant graph neural networks |
link |
Weitao Du, Yuanqi Du,..., Zhi-Ming Ma |
28 |
2023-11-03 |
On the Generalization Properties of Diffusion Models |
link |
Puheng Li, Zhong Li,..., Jiang Bian |
28 |
2020-10-13 |
Unified Lower Bounds for Interactive High-dimensional Estimation under Information Constraints |
link |
Jayadev Acharya, Clement Louis Canonne,..., Himanshu Tyagi |
28 |
2023-05-16 |
Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage |
link |
Jose Blanchet, Miao Lu,..., Han Zhong |
28 |
2023-04-10 |
Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition |
link |
Shuhuai Ren, Aston Zhang,..., Xu Sun |
28 |
2023-05-30 |
Complex Query Answering on Eventuality Knowledge Graph with Implicit Logical Constraints |
link |
Jiaxin Bai, Xin Liu,..., Yangqiu Song |
28 |
2023-10-17 |
DELIFFAS: Deformable Light Fields for Fast Avatar Synthesis |
link |
YoungJoong Kwon, Lingjie Liu,..., Christian Theobalt |
28 |
2023-09-14 |
Where2Explore: Few-shot Affordance Learning for Unseen Novel Categories of Articulated Objects |
link |
Chuanruo Ning, Ruihai Wu,..., Hao Dong |
28 |
None |
StyleDrop: Text-to-Image Synthesis of Any Style |
link |
Kihyuk Sohn, Lu Jiang,..., Daniel Castro Chin |
28 |
2023-05-22 |
Neural Functional Transformers |
link |
Allan Zhou, Kaien Yang,..., Chelsea Finn |
28 |
None |
Q-DM: An Efficient Low-bit Quantized Diffusion Model |
link |
Yanjing Li, Sheng Xu,..., Baochang Zhang |
28 |
2023-07-04 |
On the Constrained Time-Series Generation Problem |
link |
Andrea Coletta, Sriram Gopalakrishnan,..., Svitlana Vyetrenko |
28 |
2023-10-23 |
Data Pruning via Moving-one-Sample-out |
link |
Haoru Tan, Sitong Wu,..., Xiaojuan Qi |
28 |
2023-11-03 |
Learning to Augment Distributions for Out-of-distribution Detection |
link |
Qizhou Wang, Zhen Fang,..., Bo Han |
28 |
2023-05-30 |
Multi-modal Queried Object Detection in the Wild |
link |
Yifan Xu, Mengdan Zhang,..., Changsheng Xu |
28 |
2023-11-28 |
No Representation Rules Them All in Category Discovery |
link |
Sagar Vaze, Andrea Vedaldi, Andrew Zisserman |
28 |
2023-02-23 |
Quantifying & Modeling Multimodal Interactions: An Information Decomposition Framework |
link |
Paul Pu Liang, Yun Cheng,..., Louis-Philippe Morency |
27 |
2023-06-02 |
Towards In-context Scene Understanding |
link |
Ivana Balazevic, David Steiner,..., Olivier J Henaff |
27 |
2023-05-20 |
A Scalable Neural Network for DSIC Affine Maximizer Auction Design |
link |
Zhijian Duan, Haoran Sun,..., Xiaotie Deng |
27 |
2023-06-09 |
Topology-Aware Uncertainty for Image Segmentation |
link |
Saumya Gupta, Yikai Zhang,..., Chao Chen |
27 |
2023-05-17 |
Explain Any Concept: Segment Anything Meets Concept-Based Explanation |
link |
Ao Sun, Pingchuan Ma,..., Shuai Wang |
27 |
2023-10-17 |
Towards Generic Semi-Supervised Framework for Volumetric Medical Image Segmentation |
link |
Haonan Wang, Xiaomeng Li |
27 |
2023-06-12 |
TrojLLM: A Black-box Trojan Prompt Attack on Large Language Models |
link |
Jiaqi Xue, Mengxin Zheng,..., Qian Lou |
27 |
2023-09-22 |
OneNet: Enhancing Time Series Forecasting Models under Concept Drift by Online Ensembling |
link |
YiFan Zhang, Qingsong Wen,..., Tieniu Tan |
27 |
2023-05-31 |
FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses via Pixel-Aligned Scene Flow |
link |
Cameron Omid Smith, Yilun Du,..., Vincent Sitzmann |
27 |
2023-12-11 |
TabMT: Generating tabular data with masked transformers |
link |
Manbir S Gulati, Paul F Roysdon |
27 |
2023-09-23 |
Defending Pre-trained Language Models as Few-shot Learners against Backdoor Attacks |
link |
Zhaohan Xi, Tianyu Du,..., Ting Wang |
27 |
2023-06-01 |
Lightweight Vision Transformer with Bidirectional Interaction |
link |
Qihang Fan, Huaibo Huang,..., Ran He |
27 |
2023-04-04 |
The expressive power of pooling in Graph Neural Networks |
link |
Filippo Maria Bianchi, Veronica Lachi |
27 |
2023-05-29 |
LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections |
link |
Muhammad Jehanzeb Mirza, Leonid Karlinsky,..., Horst Bischof |
26 |
2023-04-19 |
Bridging RL Theory and Practice with the Effective Horizon |
link |
Cassidy Laidlaw, Stuart Russell, Anca Dragan |
26 |
2023-04-06 |
Dynamics of Finite Width Kernel and Prediction Fluctuations in Mean Field Neural Networks |
link |
Blake Bordelon, Cengiz Pehlevan |
26 |
2023-10-29 |
Label Poisoning is All You Need |
link |
Rishi Dev Jha, Jonathan Hayase, Sewoong Oh |
26 |
2023-10-30 |
MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks |
link |
Allen Nie, Yuhui Zhang,..., Tobias Gerstenberg |
26 |
2023-03-02 |
SHAP-IQ: Unified Approximation of any-order Shapley Interactions |
link |
Fabian Fumagalli, Maximilian Muschalik,..., Barbara Eva Hammer |
26 |
2023-01-09 |
BQ-NCO: Bisimulation Quotienting for Efficient Neural Combinatorial Optimization |
link |
Darko Drakulic, Sofia Michel,..., Jean-Marc Andreoli |
26 |
2023-06-08 |
RDumb: A simple approach that questions our progress in continual test-time adaptation |
link |
Ori Press, Steffen Schneider,..., Matthias Bethge |
26 |
2023-06-30 |
The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit |
link |
Lorenzo Noci, Chuning Li,..., Daniel M. Roy |
26 |
2023-06-01 |
Make Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning |
link |
Baohao Liao, Shaomu Tan, Christof Monz |
26 |
2023-07-17 |
Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity |
link |
Zhanpeng Zhou, Yongyi Yang,..., Wei Hu |
26 |
2023-10-31 |
BasisFormer: Attention-based Time Series Forecasting with Learnable and Interpretable Basis |
link |
Zelin Ni, Hang Yu,..., Weiyao Lin |
26 |
2023-11-13 |
A Data-Free Approach to Mitigate Catastrophic Forgetting in Federated Class Incremental Learning for Vision Tasks |
link |
Sara Babakniya, Zalan Fabian,..., Salman Avestimehr |
26 |
2023-11-05 |
Scenario Diffusion: Controllable Driving Scenario Generation With Diffusion |
link |
Ethan Pronovost, Meghana Reddy Ganesina,..., Nicholas Roy |
26 |
2023-06-06 |
FAMO: Fast Adaptive Multitask Optimization |
link |
Bo Liu, Yihao Feng,..., Qiang Liu |
26 |
2023-05-31 |
Representation Equivalent Neural Operators: a Framework for Alias-free Operator Learning |
link |
Francesca Bartolucci, Emmanuel de Bezenac,..., Rima Alaifari |
26 |
2023-05-20 |
Brain encoding models based on multimodal transformers can transfer across language and vision |
link |
Jerry Tang, Meng Du,..., Alexander Huth |
26 |
2023-11-03 |
Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection |
link |
Haibao Yu, Yingjuan Tang,..., Zaiqing Nie |
26 |
None |
Removing Hidden Confounding in Recommendation: A Unified Multi-Task Learning Approach |
link |
Haoxuan Li, Kunhan Wu,..., Peng Wu |
26 |
2023-06-07 |
Improving neural network representations using human similarity judgments |
link |
Lukas Muttenthaler, Lorenz Linhardt,..., Simon Kornblith |
26 |
2023-07-24 |
Described Object Detection: Liberating Object Detection with Flexible Expressions |
link |
Chi Xie, Zhao Zhang,..., Shuang Liang |
26 |
2023-11-02 |
Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs |
link |
Peng Jin, Yang Wu,..., Li Yuan |
26 |
2023-10-31 |
HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point Clouds |
link |
Gang Zhang, Chen Junnan,..., Xiaolin Hu |
26 |
2023-12-12 |
One-Step Diffusion Distillation via Deep Equilibrium Models |
link |
Zhengyang Geng, Ashwini Pokle, J Zico Kolter |
26 |
2023-05-31 |
Spontaneous symmetry breaking in generative diffusion models |
link |
Gabriel Raya, Luca Ambrogioni |
26 |
2023-09-19 |
PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance |
link |
Peiqing Yang, Shangchen Zhou,..., Chen Change Loy |
25 |
2023-07-20 |
Sharpness Minimization Algorithms Do Not Only Minimize Sharpness To Achieve Better Generalization |
link |
Kaiyue Wen, Zhiyuan Li, Tengyu Ma |
25 |
2023-06-01 |
Thought Cloning: Learning to Think while Acting by Imitating Human Thinking |
link |
Shengran Hu, Jeff Clune |
25 |
2023-12-03 |
Honesty Is the Best Policy: Defining and Mitigating AI Deception |
link |
Francis Rhys Ward, Francesca Toni,..., Tom Everitt |
25 |
2023-10-07 |
Subspace Identification for Multi-Source Domain Adaptation |
link |
Zijian Li, Ruichu Cai,..., Kun Zhang |
25 |
2023-01-27 |
Alignment with human representations supports robust few-shot learning |
link |
Ilia Sucholutsky, Thomas L. Griffiths |
25 |
2023-12-07 |
Differentiable Registration of Images and LiDAR Point Clouds with VoxelPoint-to-Pixel Matching |
link |
Junsheng Zhou, Baorui Ma,..., Zhizhong Han |
25 |
2023-12-06 |
Language Model Alignment with Elastic Reset |
link |
Michael Noukhovitch, Samuel Lavoie,..., Aaron Courville |
25 |
2022-10-07 |
Winner Takes It All: Training Performant RL Populations for Combinatorial Optimization |
link |
Nathan Grinsztajn, Daniel Furelos-Blanco,..., Thomas D Barrett |
25 |
None |
Learning Topology-Agnostic EEG Representations with Geometry-Aware Modeling |
link |
Ke Yi, Yansen Wang,..., Dongsheng Li |
25 |
2023-10-28 |
This Looks Like Those: Illuminating Prototypical Concepts Using Multiple Visualizations |
link |
Chiyu Ma, Brandon Zhao,..., Cynthia Rudin |
25 |
2023-06-07 |
Stochastic Collapse: How Gradient Noise Attracts SGD Dynamics Towards Simpler Subnetworks |
link |
Feng Chen, Daniel Kunin,..., Surya Ganguli |
25 |
2023-09-22 |
Neural Data Transformer 2: Multi-context Pretraining for Neural Spiking Activity |
link |
Joel Ye, Jennifer L Collinger,..., Robert Gaunt |
25 |
2023-04-04 |
Privacy Amplification via Compression: Achieving the Optimal Privacy-Accuracy-Communication Trade-off in Distributed Mean Estimation |
link |
Wei-Ning Chen, Dan Song,..., Peter Kairouz |
25 |
2023-10-19 |
Real-Time Motion Prediction via Heterogeneous Polyline Transformer with Relative Pose Encoding |
link |
Zhejun Zhang, Alexander Liniger,..., Luc Van Gool |
25 |
2022-12-15 |
Joint processing of linguistic properties in brains and language models |
link |
Subba Reddy Oota, Manish Gupta, Mariya Toneva |
25 |
2023-05-15 |
A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes |
link |
Han Zhong, Tong Zhang |
25 |
None |
Achieving Cross Modal Generalization with Multimodal Unified Representation |
link |
Yan Xia, Hai Huang,..., Zhou Zhao |
25 |
2023-04-06 |
Graph Mixture of Experts: Learning on Large-Scale Graphs with Explicit Diversity Modeling |
link |
Haotao Wang, Ziyu Jiang,..., Zhangyang Wang |
25 |
2023-02-21 |
Adversarial Model for Offline Reinforcement Learning |
link |
Mohak Bhardwaj, Tengyang Xie,..., Ching-An Cheng |
25 |
2023-06-02 |
DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model |
link |
Xiuye Gu, Yin Cui,..., David A Ross |
25 |
2023-10-21 |
Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation |
link |
Jianing Zhu, Geng Yu,..., Bo Han |
25 |
None |
MonoUNI: A Unified Vehicle and Infrastructure-side Monocular 3D Object Detection Network with Sufficient Depth Clues |
link |
Jinrang Jia, Zhenjia Li, Yifeng Shi |
25 |
2023-05-29 |
Aligning Optimization Trajectories with Diffusion Models for Constrained Design Generation |
link |
Giorgio Giannone, Akash Srivastava,..., Faez Ahmed |
24 |
2023-05-18 |
Paxion: Patching Action Knowledge in Video-Language Foundation Models |
link |
Zhenhailong Wang, Ansel Blume,..., Heng Ji |
24 |
2023-10-02 |
Equivariant Adaptation of Large Pretrained Models |
link |
Arnab Kumar Mondal, Siba Smarak Panigrahi,..., Siamak Ravanbakhsh |
24 |
2023-06-21 |
Joint Prompt Optimization of Stacked LLMs using Variational Inference |
link |
Alessandro Sordoni, Xingdi Yuan,..., Nicolas Le Roux |
24 |
2023-12-22 |
Energy-based learning algorithms for analog computing: a comparative study |
link |
Benjamin Scellier, Maxence Ernoult,..., Suhas Kumar |
24 |
2023-01-30 |
Direct Preference-based Policy Optimization without Reward Modeling |
link |
Gaon An, Junhyeok Lee,..., Hyun Oh Song |
24 |
2023-06-19 |
PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning |
link |
Hojoon Lee, Hanseul Cho,..., Chulhee Yun |