2087 |
2023-05-29 |
link |
Direct Preference Optimization: Your Language Model is Secretly a Reward Model |
Rafael Rafailov, Archit Sharma,..., Chelsea Finn |
1705 |
2023-05-23 |
link |
QLoRA: Efficient Finetuning of Quantized LLMs |
Tim Dettmers, Artidoro Pagnoni,..., Luke Zettlemoyer |
1457 |
2023-05-11 |
link |
InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning |
Wenliang Dai, Junnan Li,..., Steven Hoi |
1264 |
2023-02-09 |
link |
Toolformer: Language Models Can Teach Themselves to Use Tools |
Timo Schick, Jane Dwivedi-Yu,..., Thomas Scialom |
1180 |
2023-05-17 |
link |
Tree of Thoughts: Deliberate Problem Solving with Large Language Models |
Shunyu Yao, Dian Yu,..., Karthik R Narasimhan |
1003 |
2023-03-30 |
link |
Self-Refine: Iterative Refinement with Self-Feedback |
Aman Madaan, Niket Tandon,..., Peter Clark |
732 |
2023-03-20 |
link |
Reflexion: language agents with verbal reinforcement learning |
Noah Shinn, Federico Cassano,..., Shunyu Yao |
689 |
None |
link |
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face |
Yongliang Shen, Kaitao Song,..., Yueting Zhuang |
618 |
2023-07-05 |
link |
Jailbroken: How Does LLM Safety Training Fail? |
Alexander Wei, Nika Haghtalab, Jacob Steinhardt |
609 |
2023-05-25 |
link |
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation |
Zhengyi Wang, Cheng Lu,..., Jun Zhu |
603 |
2023-05-18 |
link |
LIMA: Less Is More for Alignment |
Chunting Zhou, Pengfei Liu,..., Omer Levy |
516 |
2023-05-02 |
link |
Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation |
Jiawei Liu, Chunqiu Steven Xia,..., LINGMING ZHANG |
451 |
2023-02-27 |
link |
Language Is Not All You Need: Aligning Perception with Language Models |
Shaohan Huang, Li Dong,..., Furu Wei |
427 |
2023-05-22 |
link |
AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback |
Yann Dubois, Xuechen Li,..., Tatsunori Hashimoto |
357 |
2023-05-18 |
link |
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks |
Wenhai Wang, Zhe Chen,..., Jifeng Dai |
335 |
2023-06-06 |
link |
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model |
Kenneth Li, Oam Patel,..., Martin Wattenberg |
332 |
2023-04-13 |
link |
Segment Everything Everywhere All at Once |
Xueyan Zou, Jianwei Yang,..., Yong Jae Lee |
327 |
2023-06-29 |
link |
One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization |
Minghua Liu, Chao Xu,..., Hao Su |
298 |
None |
link |
Are Emergent Abilities of Large Language Models a Mirage? |
Rylan Schaeffer, Brando Miranda, Sanmi Koyejo |
287 |
2023-05-07 |
link |
Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting |
Miles Turpin, Julian Michael,..., Samuel R. Bowman |
276 |
2023-04-11 |
link |
RRHF: Rank Responses to Align Language Models with Human Feedback without tears |
Hongyi Yuan, Zheng Yuan,..., Fei Huang |
259 |
2023-05-04 |
link |
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision |
Zhiqing Sun, Yikang Shen,..., Chuang Gan |
259 |
2023-03-30 |
link |
Language Models can Solve Computer Tasks |
Geunwoo Kim, Pierre Baldi, Stephen Marcus McAleer |
256 |
2023-05-29 |
link |
Faith and Fate: Limits of Transformers on Compositionality |
Nouha Dziri, Ximing Lu,..., Yejin Choi |
256 |
2023-02-13 |
link |
Symbolic Discovery of Optimization Algorithms |
Xiangning Chen, Chen Liang,..., Quoc V Le |
255 |
2023-05-19 |
link |
LLM-Pruner: On the Structural Pruning of Large Language Models |
Xinyin Ma, Gongfan Fang, Xinchao Wang |
250 |
None |
link |
CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society |
Guohao Li, Hasan Abed Al Kader Hammoud,..., Bernard Ghanem |
236 |
2023-04-19 |
link |
Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models |
Pan Lu, Baolin Peng,..., Jianfeng Gao |
234 |
2023-06-02 |
link |
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training |
Zeqiu Wu, Yushi Hu,..., Hannaneh Hajishirzi |
223 |
2023-03-23 |
link |
Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense |
Kalpesh Krishna, Yixiao Song,..., Mohit Iyyer |
217 |
2023-06-03 |
link |
VideoComposer: Compositional Video Synthesis with Motion Controllability |
Xiang Wang, Hangjie Yuan,..., Jingren Zhou |
216 |
2022-08-19 |
link |
Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise |
Arpit Bansal, Eitan Borgnia,..., Tom Goldstein |
215 |
2023-05-02 |
link |
Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation |
Yuval Kirstain, Adam Polyak,..., Omer Levy |
211 |
2023-04-21 |
link |
DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction |
Mohammadreza Pourreza, Davood Rafiei |
210 |
2023-06-02 |
link |
Segment Anything in High Quality |
Lei Ke, Mingqiao Ye,..., Fisher Yu |
209 |
2023-10-11 |
link |
Large Language Models Are Zero-Shot Time Series Forecasters |
Nate Gruver, Marc Anton Finzi,..., Andrew Gordon Wilson |
208 |
2023-05-24 |
link |
BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing |
Dongxu Li, Junnan Li, Steven Hoi |
200 |
2023-06-23 |
link |
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale |
Matthew Le, Apoorv Vyas,..., Wei-Ning Hsu |
190 |
2023-06-11 |
link |
High-Fidelity Audio Compression with Improved RVQGAN |
Rithesh Kumar, Prem Seetharaman,..., Kundan Kumar |
187 |
2023-04-12 |
link |
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation |
Jiazheng Xu, Xiao Liu,..., Yuxiao Dong |
182 |
2023-05-26 |
link |
Generating Images with Multimodal Language Models |
Jing Yu Koh, Daniel Fried, Ruslan Salakhutdinov |
181 |
2023-02-07 |
link |
Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery |
Yuxin Wen, Neel Jain,..., Tom Goldstein |
177 |
2023-06-26 |
link |
Are aligned neural networks adversarially aligned? |
Nicholas Carlini, Milad Nasr,..., Ludwig Schmidt |
174 |
None |
link |
3D-LLM: Injecting the 3D World into Large Language Models |
Yining Hong, Haoyu Zhen,..., Chuang Gan |
171 |
2023-06-01 |
link |
Diffusion Self-Guidance for Controllable Image Generation |
Dave Epstein, Allan Jabri,..., Aleksander Holynski |
168 |
2023-06-26 |
link |
MotionGPT: Human Motion as a Foreign Language |
Biao Jiang, Xin Chen,..., Tao Chen |
165 |
2023-05-25 |
link |
On the Planning Abilities of Large Language Models - A Critical Investigation |
Karthik Valmeekam, Matthew Marquez,..., Subbarao Kambhampati |
164 |
2023-06-06 |
link |
Emergent Correspondence from Image Diffusion |
Luming Tang, Menglin Jia,..., Bharath Hariharan |
158 |
2023-05-30 |
link |
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction |
Rui Yang, Lin Song,..., Ying Shan |
156 |
2023-05-25 |
link |
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models |
Shihao Zhao, Dongdong Chen,..., Kwan-Yee K. Wong |
152 |
2023-05-24 |
link |
EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought |
Yao Mu, Qinglong Zhang,..., Ping Luo |
148 |
2023-04-01 |
link |
Subject-driven Text-to-Image Generation via Apprenticeship Learning |
Wenhu Chen, Hexiang Hu,..., William W. Cohen |
142 |
2023-05-24 |
link |
Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective |
Guhao Feng, Bohang Zhang,..., Liwei Wang |
140 |
2023-05-19 |
link |
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings |
Shibo Hao, Tianyang Liu,..., Zhiting Hu |
139 |
2023-06-02 |
link |
TIES-Merging: Resolving Interference When Merging Models |
Prateek Yadav, Derek Tam,..., Mohit Bansal |
138 |
2023-02-09 |
link |
UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models |
Wenliang Zhao, Lujia Bai,..., Jiwen Lu |
135 |
2023-06-07 |
link |
Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection |
Yu Bai, Fan Chen,..., Song Mei |
134 |
2023-01-10 |
link |
Does Localization Inform Editing? Surprising Differences in Causality-Based Localization vs. Knowledge Editing in Language Models |
Peter Hase, Mohit Bansal,..., Asma Ghandeharioun |
130 |
2023-03-31 |
link |
Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence? |
Arjun Majumdar, Karmesh Yadav,..., Franziska Meier |
128 |
2023-05-27 |
link |
Fine-Tuning Language Models with Just Forward Passes |
Sadhika Malladi, Tianyu Gao,..., Sanjeev Arora |
127 |
2023-06-24 |
link |
H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models |
Zhenyu Zhang, Ying Sheng,..., Beidi Chen |
127 |
2023-05-17 |
link |
DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining |
Sang Michael Xie, Hieu Pham,..., Adams Wei Yu |
127 |
2023-05-17 |
link |
Can Language Models Solve Graph Problems in Natural Language? |
Heng Wang, Shangbin Feng,..., Yulia Tsvetkov |
126 |
2023-05-31 |
link |
The Impact of Positional Encoding on Length Generalization in Transformers |
Amirhossein Kazemnejad, Inkit Padhi,..., Siva Reddy |
126 |
2023-02-06 |
link |
Data Selection for Language Models via Importance Resampling |
Sang Michael Xie, Shibani Santurkar,..., Percy Liang |
123 |
2023-05-23 |
link |
Large Language Models as Commonsense Knowledge for Large-Scale Task Planning |
Zirui Zhao, Wee Sun Lee, David Hsu |
123 |
2023-05-19 |
link |
Any-to-Any Generation via Composable Diffusion |
Zineng Tang, Ziyi Yang,..., Mohit Bansal |
121 |
2023-05-26 |
link |
Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time |
Zichang Liu, Aditya Desai,..., Anshumali Shrivastava |
121 |
2023-07-25 |
link |
QuIP: 2-Bit Quantization of Large Language Models With Guarantees |
Jerry Chee, Yaohui Cai,..., Christopher De Sa |
117 |
2023-05-19 |
link |
Pengi: An Audio Language Model for Audio Tasks |
Soham Deshmukh, Benjamin Elizalde,..., Huaming Wang |
115 |
2023-05-24 |
link |
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence |
Junyi Zhang, Charles Herrmann,..., Ming-Hsuan Yang |
115 |
2023-07-03 |
link |
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion |
Shitao Tang, Fuyang Zhang,..., Yasutaka Furukawa |
115 |
2023-06-16 |
link |
Scaling Open-Vocabulary Object Detection |
Matthias Minderer, Alexey A. Gritsenko, Neil Houlsby |
114 |
2021-12-24 |
link |
Counterfactual Memorization in Neural Language Models |
Chiyuan Zhang, Daphne Ippolito,..., Nicholas Carlini |
111 |
2023-06-23 |
link |
OpenMask3D: Open-Vocabulary 3D Instance Segmentation |
Ayça Takmaz, Elisabetta Fedele,..., Francis Engelmann |
110 |
2023-05-31 |
link |
Improving CLIP Training with Language Rewrites |
Lijie Fan, Dilip Krishnan,..., Yonglong Tian |
110 |
2023-05-24 |
link |
In-Context Impersonation Reveals Large Language Models' Strengths and Biases |
Leonard Salewski, Stephan Alaniz,..., Zeynep Akata |
108 |
2023-05-26 |
link |
On Evaluating Adversarial Robustness of Large Vision-Language Models |
Yunqing Zhao, Tianyu Pang,..., Min Lin |
108 |
2023-06-01 |
link |
SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds |
Yanyu Li, Huan Wang,..., Jian Ren |
107 |
2023-05-24 |
link |
Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning |
Lin Guan, Karthik Valmeekam,..., Subbarao Kambhampati |
107 |
2023-07-06 |
link |
Focused Transformer: Contrastive Training for Context Scaling |
Szymon Tworkowski, Konrad Staniszewski,..., Piotr Miłoś |
105 |
2023-07-23 |
link |
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting |
Zongsheng Yue, Jianyi Wang, Chen Change Loy |
105 |
2023-05-24 |
link |
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models |
Weixi Feng, Wanrong Zhu,..., William Yang Wang |
105 |
2022-12-19 |
link |
Optimizing Prompts for Text-to-Image Generation |
Yaru Hao, Zewen Chi,..., Furu Wei |
105 |
2023-05-27 |
link |
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks |
Bill Yuchen Lin, Yicheng Fu,..., Xiang Ren |
103 |
2022-11-20 |
link |
Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors |
Thomas Hartvigsen, Swami Sankaranarayanan,..., Marzyeh Ghassemi |
101 |
2023-06-01 |
link |
StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners |
Yonglong Tian, Lijie Fan,..., Dilip Krishnan |
100 |
2023-05-02 |
link |
Unlimiformer: Long-Range Transformers with Unlimited Length Input |
Amanda Bertsch, Uri Alon,..., Matthew R. Gormley |
98 |
2023-06-07 |
link |
Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards |
Alexandre Rame, Guillaume Couairon,..., Matthieu Cord |
97 |
2023-05-29 |
link |
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths |
Zeyue Xue, Guanglu Song,..., Ping Luo |
95 |
2023-04-21 |
link |
Emergent and Predictable Memorization in Large Language Models |
Stella Biderman, USVSN Sai Prashanth,..., Edward Raff |
93 |
2023-05-31 |
link |
Understanding and Mitigating Copying in Diffusion Models |
Gowthami Somepalli, Vasu Singla,..., Tom Goldstein |
92 |
2023-06-12 |
link |
Controlling Text-to-Image Diffusion by Orthogonal Finetuning |
Zeju Qiu, Weiyang Liu,..., Bernhard Schölkopf |
90 |
2023-05-26 |
link |
AdaPlanner: Adaptive Planning from Feedback with Language Models |
Haotian Sun, Yuchen Zhuang,..., Chao Zhang |
89 |
None |
link |
Grounded Decoding: Guiding Text Generation with Grounded Models for Robot Control |
Wenlong Huang, Fei Xia,..., brian ichter |
88 |
2023-02-02 |
link |
SceneScape: Text-Driven Consistent Scene Generation |
Rafail Fridman, Amit Abecasis,..., Tali Dekel |
86 |
2023-06-06 |
link |
Deductive Verification of Chain-of-Thought Reasoning |
Zhan Ling, Yunhao Fang,..., Hao Su |
85 |
2023-09-20 |
link |
Gold-YOLO: Efficient Object Detector via Gather-and-Distribute Mechanism |
Chengcheng Wang, Wei He,..., Kai Han |
85 |
2023-06-26 |
link |
Composing Parameter-Efficient Modules with Arithmetic Operations |
Jinghan Zhang, Shiqi Chen,..., Junxian He |
84 |
2023-03-09 |
link |
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning |
Mitsuhiko Nakamoto, Yuexiang Zhai,..., Sergey Levine |
84 |
2023-02-20 |
link |
Towards Unbounded Machine Unlearning |
Meghdad Kurmanji, Peter Triantafillou,..., Eleni Triantafillou |
83 |
2023-06-06 |
link |
LEACE: Perfect linear concept erasure in closed form |
Nora Belrose, David Schneider-Joseph,..., Stella Biderman |
83 |
2023-09-01 |
link |
Geometry-Informed Neural Operator for Large-Scale 3D PDEs |
Zongyi Li, Nikola Borislavov Kovachki,..., Anima Anandkumar |
82 |
2023-05-23 |
link |
Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence |
Grace Luo, Lisa Dunlap,..., Trevor Darrell |
81 |
2023-05-11 |
link |
Self-Chained Image-Language Model for Video Localization and Question Answering |
Shoubin Yu, Jaemin Cho,..., Mohit Bansal |
80 |
2023-01-31 |
link |
What Makes Good Examples for Visual In-Context Learning? |
Yuanhan Zhang, Kaiyang Zhou, Ziwei Liu |
79 |
2023-05-18 |
link |
Structural Pruning for Diffusion Models |
Gongfan Fang, Xinyin Ma, Xinchao Wang |
78 |
2023-07-26 |
link |
Evaluating the Moral Beliefs Encoded in LLMs |
Nino Scherrer, Claudia Shi,..., David Blei |
78 |
2023-03-27 |
link |
Text-to-Image Diffusion Models are Zero-Shot Classifiers |
Kevin Clark, Priyank Jaini |
77 |
2023-05-18 |
link |
TextDiffuser: Diffusion Models as Text Painters |
Jingye Chen, Yupan Huang,..., Furu Wei |
77 |
2022-08-08 |
link |
Deep Patch Visual Odometry |
Zachary Teed, Lahav Lipson, Jia Deng |
77 |
2023-01-27 |
link |
Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning |
Xinyi Wang, Wanrong Zhu,..., William Yang Wang |
76 |
2023-05-18 |
link |
OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding |
Minghua Liu, Ruoxi Shi,..., Hao Su |
75 |
2023-05-29 |
link |
Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors |
Paul Steven Scotti, Atmadeep Banerjee,..., Tanishq Mathew Abraham |
75 |
2023-06-28 |
link |
On the Exploitability of Instruction Tuning |
Manli Shu, Jiongxiao Wang,..., Tom Goldstein |
74 |
2023-06-15 |
link |
Linguistic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment |
Royi Rassin, Eran Hirsch,..., Gal Chechik |
74 |
2023-05-31 |
link |
Protein Design with Guided Discrete Diffusion |
Nate Gruver, Samuel Don Stanton,..., Andrew Gordon Wilson |
73 |
2023-02-16 |
link |
DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization |
Zhiqing Sun, Yiming Yang |
73 |
2023-05-29 |
link |
Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models |
Weijian Luo, Tianyang Hu,..., Zhihua Zhang |
72 |
2023-03-14 |
link |
The Learnability of In-Context Learning |
Noam Wies, Yoav Levine, Amnon Shashua |
72 |
2023-05-24 |
link |
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models |
Gen Luo, Yiyi Zhou,..., Rongrong Ji |
72 |
2023-03-01 |
link |
Understanding Diffusion Objectives as the ELBO with Simple Data Augmentation |
Diederik P Kingma, Ruiqi Gao |
72 |
2023-05-22 |
link |
Task Arithmetic in the Tangent Space: Improved Editing of Pre-Trained Models |
Guillermo Ortiz-Jimenez, Alessandro Favero, Pascal Frossard |
71 |
2023-02-22 |
link |
Guiding Large Language Models via Directional Stimulus Prompting |
Zekun Li, Baolin Peng,..., Xifeng Yan |
70 |
2023-05-23 |
link |
Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer Quantization |
Jeonghoon Kim, Jung Hyun Lee,..., Dongsoo Lee |
70 |
2023-05-12 |
link |
MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers |
LILI YU, Daniel Simig,..., Mike Lewis |
69 |
2023-06-20 |
link |
Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision |
Ayush Tewari, Tianwei Yin,..., Vincent Sitzmann |
69 |
2023-05-18 |
link |
Language Models Meet World Models: Embodied Experiences Enhance Language Models |
Jiannan Xiang, Tianhua Tao,..., Zhiting Hu |
69 |
2023-06-15 |
link |
DreamHuman: Animatable 3D Avatars from Text |
Nikos Kolotouros, Thiemo Alldieck,..., Cristian Sminchisescu |
68 |
2023-06-30 |
link |
The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks |
Ziqian Zhong, Ziming Liu,..., Jacob Andreas |
68 |
2023-07-02 |
link |
Solving Linear Inverse Problems Provably via Posterior Sampling with Latent Diffusion Models |
Litu Rout, Negin Raoof,..., Sanjay Shakkottai |
68 |
2023-05-18 |
link |
Weakly-Supervised Concealed Object Segmentation with SAM-based Pseudo Labeling and Multi-scale Feature Grouping |
Chunming He, Kai Li,..., Xiu Li |
67 |
2023-08-11 |
link |
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models |
Weijia Wu, Yuzhong Zhao,..., Chunhua Shen |
67 |
2023-04-25 |
link |
Patch Diffusion: Faster and More Data-Efficient Training of Diffusion Models |
Zhendong Wang, Yifan Jiang,..., Mingyuan Zhou |
67 |
2023-05-17 |
link |
Language Model Tokenizers Introduce Unfairness Between Languages |
Aleksandar Petrov, Emanuele La Malfa,..., Adel Bibi |
67 |
2023-07-07 |
link |
RADAR: Robust AI-Text Detection via Adversarial Learning |
Xiaomeng Hu, Pin-Yu Chen, Tsung-Yi Ho |
67 |
2023-05-15 |
link |
Interpretability at Scale: Identifying Causal Mechanisms in Alpaca |
Zhengxuan Wu, Atticus Geiger,..., Noah Goodman |
66 |
2023-06-22 |
link |
PromptIR: Prompting for All-in-One Blind Image Restoration |
Vaishnav Potlapalli, Syed Waqas Zamir,..., Fahad Khan |
66 |
2023-05-17 |
link |
Selective Amnesia: A Continual Learning Approach to Forgetting in Deep Generative Models |
Alvin Heng, Harold Soh |
66 |
2023-03-07 |
link |
Structured State Space Models for In-Context Reinforcement Learning |
Chris Lu, Yannick Schroecker,..., Feryal Behbahani |
65 |
2023-06-13 |
link |
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models |
Yinghao Aaron Li, Cong Han,..., Nima Mesgarani |
65 |
2023-06-12 |
link |
Augmenting Language Models with Long-Term Memory |
Weizhi Wang, Li Dong,..., Furu Wei |
65 |
2023-04-11 |
link |
Model Sparsity Can Simplify Machine Unlearning |
Jinghan Jia, Jiancheng Liu,..., Sijia Liu |
64 |
2023-02-26 |
link |
Fast Attention Requires Bounded Entries |
Josh Alman, Zhao Song |
64 |
2023-06-15 |
link |
DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data |
Stephanie Fu, Netanel Yakir Tamir,..., Phillip Isola |
63 |
2023-06-22 |
link |
Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing |
Yelysei Bondarenko, Markus Nagel, Tijmen Blankevoort |
63 |
2023-05-24 |
link |
Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples |
Abulhair Saparov, Richard Yuanzhe Pang,..., He He |
62 |
2023-05-18 |
link |
PTQD: Accurate Post-Training Quantization for Diffusion Models |
Yefei He, Luping Liu,..., Bohan Zhuang |
61 |
2023-06-01 |
link |
Birth of a Transformer: A Memory Viewpoint |
Alberto Bietti, Vivien Cabannes,..., Leon Bottou |
61 |
2023-05-25 |
link |
Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer |
Yuandong Tian, Yiping Wang,..., Simon Shaolei Du |
61 |
2023-05-19 |
link |
The probability flow ODE is provably fast |
Sitan Chen, Sinho Chewi,..., Adil Salim |
61 |
2023-03-23 |
link |
Towards Better Dynamic Graph Learning: New Architecture and Unified Library |
Le Yu, Leilei Sun,..., Weifeng Lv |
61 |
2023-05-29 |
link |
VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset |
Sihan Chen, Handong Li,..., Jing Liu |
61 |
2023-05-30 |
link |
Koopa: Learning Non-stationary Time Series Dynamics with Koopman Predictors |
Yong Liu, Chenyu Li,..., Mingsheng Long |
60 |
2023-06-15 |
link |
Segment Any Point Cloud Sequences by Distilling Vision Foundation Models |
Youquan Liu, Lingdong Kong,..., Ziwei Liu |
60 |
2023-03-23 |
link |
The Quantization Model of Neural Scaling |
Eric J Michaud, Ziming Liu,..., Max Tegmark |
60 |
2023-05-03 |
link |
Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory |
Xin Cheng, Di Luo,..., Rui Yan |
59 |
2023-05-01 |
link |
In-Context Learning Unlocked for Diffusion Models |
Zhendong Wang, Yifan Jiang,..., Mingyuan Zhou |
59 |
2023-07-04 |
link |
Spike-driven Transformer |
Man Yao, JiaKui Hu,..., Guoqi Li |
58 |
2023-06-07 |
link |
Intrinsic Dimension Estimation for Robust Detection of AI-Generated Texts |
Eduard Tulchinskii, Kristian Kuznetsov,..., Irina Piontkovskaya |
58 |
2023-02-02 |
link |
SimMTM: A Simple Pre-Training Framework for Masked Time-Series Modeling |
Jiaxiang Dong, Haixu Wu,..., Mingsheng Long |
58 |
2023-05-15 |
link |
Privacy Auditing with One (1) Training Run |
Thomas Steinke, Milad Nasr, Matthew Jagielski |
58 |
2023-05-22 |
link |
To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis |
Fuzhao Xue, Yao Fu,..., Yang You |
58 |
2023-05-24 |
link |
Unsupervised Semantic Correspondence Using Stable Diffusion |
Eric Hedlin, Gopal Sharma,..., Kwang Moo Yi |
58 |
2023-01-12 |
link |
Tracr: Compiled Transformers as a Laboratory for Interpretability |
David Lindner, Janos Kramar,..., Vladimir Mikulik |
57 |
2023-06-01 |
link |
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft |
Shalev Lifshitz, Keiran Paster,..., Sheila A. McIlraith |
57 |
2023-06-29 |
link |
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models |
Simian Luo, Chuanhao Yan,..., Hang Zhao |
56 |
2023-05-29 |
link |
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning |
Haoran He, Chenjia Bai,..., Xuelong Li |
56 |
2023-07-12 |
link |
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution |
Mostafa Dehghani, Basil Mustafa,..., Neil Houlsby |
56 |
2023-05-31 |
link |
Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias |
Zhongwei Wan, Che Liu,..., Rossella Arcucci |
56 |
2023-06-07 |
link |
Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models |
George Stein, Jesse C. Cresswell,..., Gabriel Loaiza-Ganem |
55 |
2023-06-01 |
link |
White-Box Transformers via Sparse Rate Reduction |
Yaodong Yu, Sam Buchanan,..., Yi Ma |
54 |
2023-06-26 |
link |
Supervised Pretraining Can Learn In-Context Reinforcement Learning |
Jonathan Lee, Annie Xie,..., Emma Brunskill |
52 |
2023-05-17 |
link |
What You See is What You Read? Improving Text-Image Alignment Evaluation |
Michal Yarom, Yonatan Bitton,..., Idan Szpektor |
52 |
2023-06-26 |
link |
Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression |
Allan Raventos, Mansheej Paul,..., Surya Ganguli |
51 |
2023-06-02 |
link |
The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation |
Saurabh Saxena, Charles Herrmann,..., David J. Fleet |
51 |
2023-07-05 |
link |
RanPAC: Random Projections and Pre-trained Models for Continual Learning |
Mark McDonnell, Dong Gong,..., Anton van den Hengel |
51 |
2022-06-14 |
link |
Automatic Clipping: Differentially Private Deep Learning Made Easier and Stronger |
Zhiqi Bu, Yu-Xiang Wang,..., George Karypis |
51 |
2023-05-29 |
link |
GlyphControl: Glyph Conditional Control for Visual Text Generation |
Yukang Yang, Dongnan Gui,..., Kai Chen |
51 |
2022-12-19 |
link |
Latent Diffusion for Language Generation |
Justin Lovelace, Varsha Kishore,..., Kilian Q Weinberger |
51 |
2023-02-02 |
link |
Convolutional Neural Operators for robust and accurate learning of PDEs |
Bogdan Raonic, Roberto Molinaro,..., Emmanuel de Bezenac |
51 |
2023-05-25 |
link |
Diversify Your Vision Datasets with Automatic Diffusion-Based Augmentation |
Lisa Dunlap, Alyssa Umino,..., Trevor Darrell |
50 |
2023-05-18 |
link |
LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation |
Yujie Lu, Xianjun Yang,..., William Yang Wang |
49 |
2023-05-24 |
link |
Flocks of Stochastic Parrots: Differentially Private Prompt Learning for Large Language Models |
Haonan Duan, Adam Dziedzic,..., Franziska Boenisch |
49 |
2023-10-23 |
link |
SpecTr: Fast Speculative Decoding via Optimal Transport |
Ziteng Sun, Ananda Theertha Suresh,..., Felix Yu |
49 |
2022-05-20 |
link |
Evaluating and Inducing Personality in Pre-trained Language Models |
Guangyuan Jiang, Manjie Xu,..., Yixin Zhu |
48 |
2023-05-22 |
link |
VanillaNet: the Power of Minimalism in Deep Learning |
Hanting Chen, Yunhe Wang,..., Dacheng Tao |
48 |
2023-09-25 |
link |
Evaluating Cognitive Maps and Planning in Large Language Models with CogEval |
Ida Momennejad, Hosein Hasanbeig,..., Jonathan Larson |
48 |
2023-08-10 |
link |
PDE-Refiner: Achieving Accurate Long Rollouts with Neural PDE Solvers |
Phillip Lippe, Bastiaan S. Veeling,..., Johannes Brandstetter |
48 |
2023-05-19 |
link |
PointGPT: Auto-regressively Generative Pre-training from Point Clouds |
Guangyan Chen, Meiling Wang,..., Yufeng Yue |
47 |
2023-05-08 |
link |
Recommender Systems with Generative Retrieval |
Shashank Rajput, Nikhil Mehta,..., Maheswaran Sathiamoorthy |
47 |
2023-06-08 |
link |
SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions |
Yuseung Lee, Kunho Kim,..., Minhyuk Sung |
47 |
2023-03-12 |
link |
Synthetic Experience Replay |
Cong Lu, Philip J. Ball,..., Jack Parker-Holder |
46 |
2023-05-23 |
link |
Weakly Supervised 3D Open-vocabulary Segmentation |
Kunhao Liu, Fangneng Zhan,..., Shijian Lu |
46 |
2023-06-22 |
link |
Squeeze, Recover and Relabel: Dataset Condensation at ImageNet Scale From A New Perspective |
Zeyuan Yin, Eric Xing, Zhiqiang Shen |
46 |
2023-09-25 |
link |
Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator |
Hanzhuo Huang, Yufan Feng,..., Sibei Yang |
45 |
2023-06-02 |
link |
LoCoOp: Few-Shot Out-of-Distribution Detection via Prompt Learning |
Atsuyuki Miyai, Qing Yu,..., Kiyoharu Aizawa |
45 |
2023-10-11 |
link |
Hierarchical Decomposition of Prompt-Based Continual Learning: Rethinking Obscured Sub-optimality |
Liyuan Wang, Jingyi Xie,..., Jun Zhu |
45 |
2023-03-03 |
link |
Revisiting Adversarial Training for ImageNet: Architectures, Training and Generalization across Threat Models |
Naman Deep Singh, Francesco Croce, Matthias Hein |
44 |
2023-07-20 |
link |
A Definition of Continual Reinforcement Learning |
David Abel, Andre Barreto,..., Satinder Singh |
44 |
2023-06-30 |
link |
Practical and Asymptotically Exact Conditional Sampling in Diffusion Models |
Luhuan Wu, Brian L. Trippe,..., David Blei |
44 |
2023-05-21 |
link |
PRODIGY: Enabling In-context Learning Over Graphs |
Qian Huang, Hongyu Ren,..., Jure Leskovec |
44 |
2023-03-20 |
link |
Object-Centric Slot Diffusion |
Jindong Jiang, Fei Deng,..., Sungjin Ahn |
44 |
2023-05-28 |
link |
GIMLET: A Unified Graph-Text Model for Instruction-Based Molecule Zero-Shot Learning |
Haiteng Zhao, Shengchao Liu,..., Qi Liu |
43 |
2023-05-22 |
link |
Hierarchical Integration Diffusion Model for Realistic Image Deblurring |
Zheng Chen, Yulun Zhang,..., Xin Yuan |
43 |
2023-05-11 |
link |
An Inverse Scaling Law for CLIP Training |
Xianhang Li, Zeyu Wang, Cihang Xie |
43 |
2023-04-03 |
link |
AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models |
Yuancheng Wang, Zeqian Ju,..., sheng zhao |
43 |
2023-02-03 |
link |
Rethinking Semi-Supervised Medical Image Segmentation: A Variance-Reduction Perspective |
Chenyu You, Weicheng Dai,..., James s Duncan |
43 |
2023-09-15 |
link |
Compositional Foundation Models for Hierarchical Planning |
Anurag Ajay, Seungwook Han,..., Pulkit Agrawal |
43 |
2023-05-30 |
link |
Ambient Diffusion: Learning Clean Distributions from Corrupted Data |
Giannis Daras, Kulin Shah,..., Adam Klivans |
42 |
2022-12-15 |
link |
MAViL: Masked Audio-Video Learners |
Po-Yao Huang, Vasu Sharma,..., Christoph Feichtenhofer |
42 |
2023-07-31 |
link |
Conformal PID Control for Time Series Prediction |
Anastasios Nikolas Angelopoulos, Emmanuel Candes, Ryan Tibshirani |
42 |
2023-05-27 |
link |
Scalable Transformer for PDE Surrogate Modeling |
Zijie Li, Dule Shu, Amir Barati Farimani |
42 |
2023-06-01 |
link |
Nonparametric Identifiability of Causal Representations from Unknown Interventions |
Julius von Kügelgen, Michel Besserve,..., Bernhard Schölkopf |
42 |
2023-09-27 |
link |
GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization |
Vicente Vivanco Cepeda, Gaurav Kumar Nayak, Mubarak Shah |
42 |
2023-04-11 |
link |
Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference |
Tao Lei, Junwen Bai,..., Ming-Wei Chang |
42 |
2023-10-20 |
link |
DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics |
Kaiwen Zheng, Cheng Lu,..., Jun Zhu |
41 |
2023-06-05 |
link |
Structure-free Graph Condensation: From Large-scale Graphs to Condensed Graph-free Data |
Xin Zheng, Miao Zhang,..., Shirui Pan |
41 |
2022-12-20 |
link |
Parsel🦆: Algorithmic Reasoning with Language Models by Composing Decompositions |
Eric Zelikman, Qian Huang,..., Nick Haber |
41 |
2023-02-09 |
link |
Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals |
Yue Wu, Yewen Fan,..., Tom Mitchell |
41 |
2023-06-01 |
link |
Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior |
Shashank Subramanian, Peter Harrington,..., Amir Gholami |
41 |
2023-05-31 |
link |
Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models |
Sivan Doveh, Assaf Arbelle,..., Leonid Karlinsky |
41 |
2023-05-23 |
link |
Uncertainty Quantification over Graph with Conformalized Graph Neural Networks |
Kexin Huang, Ying Jin,..., Jure Leskovec |
41 |
2023-06-07 |
link |
Fine-Grained Visual Prompting |
Lingfeng Yang, Yueze Wang,..., Jian Yang |
40 |
2023-06-13 |
link |
Image Captioners Are Scalable Vision Learners Too |
Michael Tschannen, Manoj Kumar,..., Lucas Beyer |
40 |
2023-05-18 |
link |
Content-based Unrestricted Adversarial Attack |
Zhaoyu Chen, Bo Li,..., Wenqiang Zhang |
40 |
2023-05-19 |
link |
Cinematic Mindscapes: High-quality Video Reconstruction from Brain Activity |
Zijiao Chen, Jiaxin Qing, Juan Helen Zhou |
40 |
2023-05-28 |
link |
Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive Tasks |
Minki Kang, Seanie Lee,..., Sung Ju Hwang |
40 |
2023-10-11 |
link |
RoboCLIP: One Demonstration is Enough to Learn Robot Policies |
Sumedh Anand Sontakke, Jesse Zhang,..., Laurent Itti |
40 |
2023-05-25 |
link |
Efficient Neural Music Generation |
Max W. Y. Lam, Qiao Tian,..., Yuxuan Wang |
39 |
2023-05-31 |
link |
Efficient Diffusion Policies for Offline Reinforcement Learning |
Bingyi Kang, Xiao Ma,..., Shuicheng YAN |
39 |
2023-06-09 |
link |
S3: Increasing GPU Utilization during Generative Inference for Higher Throughput |
Yunho Jin, Chun-Feng Wu,..., Gu-Yeon Wei |
39 |
2023-05-19 |
link |
Scaling laws for language encoding models in fMRI |
Richard Antonello, Aditya Vaidya, Alexander Huth |
39 |
2023-06-01 |
link |
Inserting Anybody in Diffusion Models via Celeb Basis |
Ge Yuan, Xiaodong Cun,..., Huicheng Zheng |
39 |
2023-04-27 |
link |
Convergence of Adam Under Relaxed Assumptions |
Haochuan Li, Alexander Rakhlin, Ali Jadbabaie |
39 |
2023-06-26 |
link |
Equivariant flow matching |
Leon Klein, Andreas Krämer, Frank Noe |
39 |
2023-05-19 |
link |
Post Hoc Explanations of Language Models Can Improve Language Models |
Satyapriya Krishna, Jiaqi Ma,..., Himabindu Lakkaraju |
38 |
2023-02-12 |
link |
MarioGPT: Open-Ended Text2Level Generation through Large Language Models |
Shyam Sudhakaran, Miguel González-Duque,..., Sebastian Risi |
38 |
2023-09-24 |
link |
GraphAdapter: Tuning Vision-Language Models With Dual Knowledge Graph |
Xin Li, Dongze Lian,..., Xinchao Wang |
38 |
2022-12-21 |
link |
Hidden Poison: Machine Unlearning Enables Camouflaged Poisoning Attacks |
Jimmy Z. Di, Jack Douglas,..., Ayush Sekhari |
38 |
2023-06-19 |
link |
Simplifying and Empowering Transformers for Large-Graph Representations |
Qitian Wu, Wentao Zhao,..., Junchi Yan |
37 |
2023-05-30 |
link |
Joint Bayesian Inference of Graphical Structure and Parameters with a Single Generative Flow Network |
Tristan Deleu, Mizu Nishikawa-Toomey,..., Yoshua Bengio |
37 |
2023-05-31 |
link |
From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces |
Peter Shaw, Mandar Joshi,..., Kristina Toutanova |
37 |
2023-02-01 |
link |
The geometry of hidden representations of large transformer models |
Lucrezia Valeriani, Diego Doimo,..., Alberto Cazzaniga |
37 |
2023-10-26 |
link |
Global Structure-Aware Diffusion Process for Low-Light Image Enhancement |
Jinhui HOU, Zhiyu Zhu,..., Hui Yuan |
37 |
2023-07-04 |
link |
DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation |
Shentong Mo, Enze Xie,..., Zhenguo Li |
36 |
2023-06-06 |
link |
Towards Label-free Scene Understanding by Vision Foundation Models |
Runnan Chen, Youquan Liu,..., Wenping Wang |
36 |
2023-05-24 |
link |
Inverse Preference Learning: Preference-based RL without a Reward Function |
Joey Hejna, Dorsa Sadigh |
36 |
2023-07-19 |
link |
PreDiff: Precipitation Nowcasting with Latent Diffusion Models |
Zhihan Gao, Xingjian Shi,..., Bernie Wang |
36 |
2023-10-09 |
link |
Domain Watermark: Effective and Harmless Dataset Copyright Protection is Closed at Hand |
Junfeng Guo, Yiming Li,..., Bo Li |
36 |
2023-06-03 |
link |
DYffusion: A Dynamics-informed Diffusion Model for Spatiotemporal Forecasting |
Salva Rühling Cachay, Bo Zhao,..., Rose Yu |
35 |
2023-05-23 |
link |
Video Prediction Models as Rewards for Reinforcement Learning |
Alejandro Escontrela, Ademi Adeniji,..., Pieter Abbeel |
35 |
2023-06-26 |
link |
Restart Sampling for Improving Generative Processes |
Yilun Xu, Mingyang Deng,..., Tommi S. Jaakkola |
35 |
2023-05-26 |
link |
Flow Matching for Scalable Simulation-Based Inference |
Jonas Bernhard Wildberger, Maximilian Dax,..., Bernhard Schölkopf |
35 |
2023-05-24 |
link |
Deep Reinforcement Learning with Plasticity Injection |
Evgenii Nikishin, Junhyuk Oh,..., Andre Barreto |
35 |
2023-06-01 |
link |
Exposing Attention Glitches with Flip-Flop Language Modeling |
Bingbin Liu, Jordan T. Ash,..., Cyril Zhang |
35 |
2023-09-27 |
link |
Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing |
Kai Wang, Fei Yang,..., Joost van de Weijer |
35 |
2023-07-26 |
link |
Skill-it! A Data-Driven Skills Framework for Understanding and Training Language Models |
Mayee F Chen, Nicholas Roberts,..., Christopher Re |
35 |
2023-06-30 |
link |
SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs |
Lijun Yu, Yong Cheng,..., Lu Jiang |
35 |
2023-05-23 |
link |
Siamese Masked Autoencoders |
Agrim Gupta, Jiajun Wu,..., Li Fei-Fei |
35 |
2023-06-05 |
link |
HeadSculpt: Crafting 3D Head Avatars with Text |
Xiao Han, Yukang Cao,..., Kwan-Yee K. Wong |
34 |
2022-11-25 |
link |
Expanding Small-Scale Datasets with Guided Imagination |
Yifan Zhang, Daquan Zhou,..., Jiashi Feng |
34 |
2023-05-31 |
link |
Direct Diffusion Bridge using Data Consistency for Inverse Problems |
Hyungjin Chung, Jeongsol Kim, Jong Chul Ye |
34 |
2023-05-25 |
link |
Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers |
Sotiris Anagnostidis, Dario Pavllo,..., Thomas Hofmann |
34 |
2023-10-13 |
link |
Rank-DETR for High Quality Object Detection |
Yifan Pu, Weicong Liang,..., Gao Huang |
34 |
2023-07-07 |
link |
AutoDecoding Latent 3D Diffusion Models |
Evangelos Ntavelis, Aliaksandr Siarohin,..., Sergey Tulyakov |
34 |
2023-10-18 |
link |
Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture |
Daniel Y Fu, Simran Arora,..., Christopher Re |
33 |
2023-05-30 |
link |
Grammar Prompting for Domain-Specific Language Generation with Large Language Models |
Bailin Wang, Zi Wang,..., Yoon Kim |
33 |
2023-10-23 |
link |
Large Language Models are Visual Reasoning Coordinators |
Liangyu Chen, Bo Li,..., Ziwei Liu |
33 |
2023-05-25 |
link |
Parallel Sampling of Diffusion Models |
Andy Shih, Suneel Belkhale,..., Nima Anari |
33 |
2023-07-24 |
link |
Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry |
Yong-Hyun Park, Mingi Kwon,..., Youngjung Uh |
33 |
2023-02-17 |
link |
Consistent Diffusion Models: Mitigating Sampling Drift by Learning to be Consistent |
Giannis Daras, Yuval Dagan,..., Constantinos Costis Daskalakis |
33 |
2023-02-28 |
link |
Goal Driven Discovery of Distributional Differences via Language Descriptions |
Ruiqi Zhong, Peter Zhang,..., Jacob Steinhardt |
33 |
2023-02-27 |
link |
Permutation Equivariant Neural Functionals |
Allan Zhou, Kaien Yang,..., Chelsea Finn |
33 |
2023-05-16 |
link |
AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation |
Tong Wu, Zhihao Fan,..., Weizhu Chen |
33 |
2023-05-09 |
link |
The emergence of clusters in self-attention dynamics |
Borjan Geshkovski, Cyril Letrouit,..., Philippe Rigollet |
32 |
2023-05-29 |
link |
Photoswap: Personalized Subject Swapping in Images |
Jing Gu, Yilin Wang,..., Xin Eric Wang |
32 |
2023-10-25 |
link |
Towards Self-Interpretable Graph-Level Anomaly Detection |
Yixin Liu, Kaize Ding,..., Shirui Pan |
32 |
2023-10-21 |
link |
Contrast Everything: A Hierarchical Contrastive Framework for Medical Time-Series |
Yihe Wang, Yu Han,..., Xiang Zhang |
32 |
2023-05-18 |
link |
SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models |
Ziyi Wu, Jingyu Hu,..., Animesh Garg |
32 |
2023-05-25 |
link |
MixFormerV2: Efficient Fully Transformer Tracking |
Yutao Cui, Tianhui Song,..., Limin Wang |
32 |
2023-09-25 |
link |
DeepACO: Neural-enhanced Ant Systems for Combinatorial Optimization |
Haoran Ye, Jiarui Wang,..., Yong Li |
32 |
2023-05-22 |
link |
On quantum backpropagation, information reuse, and cheating measurement collapse |
Amira Abbas, Robbie King,..., Jarrod Ryan McClean |
32 |
2023-10-12 |
link |
Neural Combinatorial Optimization with Heavy Decoder: Toward Large Scale Generalization |
Fu Luo, Xi Lin,..., Zhenkun Wang |
32 |
2022-06-27 |
link |
Supply-Side Equilibria in Recommender Systems |
Meena Jagadeesan, Nikhil Garg, Jacob Steinhardt |
32 |
2023-06-04 |
link |
For SALE: State-Action Representation Learning for Deep Reinforcement Learning |
Scott Fujimoto, Wei-Di Chang,..., David Meger |
31 |
2023-10-25 |
link |
CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection |
Chuofan Ma, Yi Jiang,..., XIAOJUAN QI |
31 |
2022-10-06 |
link |
A Logic for Expressing Log-Precision Transformers |
William Merrill, Ashish Sabharwal |
31 |
2023-06-29 |
link |
Graph Denoising Diffusion for Inverse Protein Folding |
Kai Yi, Bingxin Zhou,..., Yu Guang Wang |
31 |
2023-06-23 |
link |
Max-Margin Token Selection in Attention Mechanism |
Davoud Ataee Tarzanagh, Yingcong Li,..., Samet Oymak |
31 |
2023-06-08 |
link |
Factorized Contrastive Learning: Going Beyond Multi-view Redundancy |
Paul Pu Liang, Zihao Deng,..., Russ Salakhutdinov |
31 |
2023-05-18 |
link |
Clifford Group Equivariant Neural Networks |
David Ruhe, Johannes Brandstetter, Patrick Forré |
31 |
2023-09-23 |
link |
Dream the Impossible: Outlier Imagination with Diffusion Models |
Xuefeng Du, Yiyou Sun,..., Yixuan Li |
30 |
2023-02-14 |
link |
Bounding Training Data Reconstruction in DP-SGD |
Jamie Hayes, Borja Balle, Saeed Mahloujifar |
30 |
2023-05-30 |
link |
PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation |
Jialu Li, Mohit Bansal |
30 |
2023-05-26 |
link |
Language Models Can Improve Event Prediction by Few-Shot Abductive Reasoning |
Xiaoming Shi, Siqiao Xue,..., Hongyuan Mei |
30 |
2023-07-06 |
link |
MomentDiff: Generative Video Moment Retrieval from Random to Real |
Pandeng Li, Chen-Wei Xie,..., Yongdong Zhang |
30 |
2023-06-04 |
link |
Temporal Dynamic Quantization for Diffusion Models |
Junhyuk So, Jungwon Lee,..., Eunhyeok Park |
30 |
2023-10-08 |
link |
FedFed: Feature Distillation against Data Heterogeneity in Federated Learning |
Zhiqin Yang, Yonggang Zhang,..., Bo Han |
30 |
2023-02-02 |
link |
Timewarp: Transferable Acceleration of Molecular Dynamics by Learning Time-Coarsened Dynamics |
Leon Klein, Andrew Y. K. Foong,..., Ryota Tomioka |
30 |
2023-09-10 |
link |
SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models |
Shuchen Xue, Mingyang Yi,..., Zhi-Ming Ma |
30 |
2023-10-31 |
link |
Unexpected Improvements to Expected Improvement for Bayesian Optimization |
Sebastian Ament, Sam Daulton,..., Eytan Bakshy |
30 |
2023-05-22 |
link |
Getting ViT in Shape: Scaling Laws for Compute-Optimal Model Design |
Ibrahim Alabdulmohsin, Xiaohua Zhai,..., Lucas Beyer |
30 |
2023-06-23 |
link |
Scaling MLPs: A Tale of Inductive Bias |
Gregor Bachmann, Sotiris Anagnostidis, Thomas Hofmann |
30 |
2023-10-23 |
link |
FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models |
Lihe Yang, Xiaogang Xu,..., Hengshuang Zhao |
30 |
2023-04-02 |
link |
Saddle-to-Saddle Dynamics in Diagonal Linear Networks |
Scott Pesme, Nicolas Flammarion |
29 |
2023-05-30 |
link |
LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images |
Viraj Uday Prabhu, Sriram Yenamandra,..., Judy Hoffman |
29 |
2023-09-25 |
link |
LinGCN: Structural Linearized Graph Convolutional Network for Homomorphically Encrypted Inference |
Hongwu Peng, Ran Ran,..., Caiwen Ding |
29 |
2023-07-22 |
link |
HIQL: Offline Goal-Conditioned RL with Latent States as Actions |
Seohong Park, Dibya Ghosh,..., Sergey Levine |
29 |
2023-06-21 |
link |
Training Transformers with 4-bit Integers |
Haocheng Xi, ChangHao Li,..., Jun Zhu |
29 |
2023-05-05 |
link |
Large Language Models for Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering |
Noah Hollmann, Samuel Müller, Frank Hutter |
29 |
2023-05-30 |
link |
Intriguing Properties of Quantization at Scale |
Arash Ahmadian, Saurabh Dash,..., Sara Hooker |
29 |
2023-06-10 |
link |
TensorNet: Cartesian Tensor Representations for Efficient Learning of Molecular Potentials |
Guillem Simeon, Gianni De Fabritiis |
29 |
2023-05-17 |
link |
End-To-End Latent Variational Diffusion Models for Inverse Problems in High Energy Physics |
Alexander Shmakov, Kevin Greif,..., Daniel Whiteson |
29 |
2022-03-29 |
link |
Causal de Finetti: On the Identification of Invariant Causal Structure in Exchangeable Data |
Siyuan Guo, Viktor Tóth,..., Ferenc Huszár |
29 |
2023-05-24 |
link |
Reverse Engineering Self-Supervised Learning |
Ido Ben-Shaul, Ravid Shwartz-Ziv,..., Yann LeCun |
29 |
2023-06-11 |
link |
A Holistic Approach to Unifying Automatic Concept Extraction and Concept Importance Estimation |
Thomas FEL, Victor Boutin,..., Thomas Serre |
29 |
2023-07-28 |
link |
AbDiffuser: Full-Atom Generation of In-Vitro Functioning Antibodies |
Karolis Martinkus, Jan Ludwiczak,..., Andreas Loukas |
29 |
2023-06-04 |
link |
Data Quality in Imitation Learning |
Suneel Belkhale, Yuchen Cui, Dorsa Sadigh |
28 |
2023-06-01 |
link |
Learning Transformer Programs |
Dan Friedman, Alexander Wettig, Danqi Chen |
28 |
2023-05-22 |
link |
Meta-in-context learning in large language models |
Julian Coda-Forno, Marcel Binz,..., Eric Schulz |
28 |
2023-09-25 |
link |
FeCAM: Exploiting the Heterogeneity of Class Distributions in Exemplar-Free Continual Learning |
Dipam Goswami, Yuyang Liu,..., Joost van de Weijer |
28 |
2023-05-24 |
link |
ALGO: Synthesizing Algorithmic Programs with Generated Oracle Verifiers |
Kexun Zhang, Danqing Wang,..., Lei Li |
28 |
2023-06-16 |
link |
HiNeRV: Video Compression with Hierarchical Encoding based Neural Representation |
Ho Man Kwan, Ge Gao,..., David Bull |
28 |
2023-05-26 |
link |
Demo2Code: From Summarizing Demonstrations to Synthesizing Code via Extended Chain-of-Thought |
Huaxiaoyue Wang, Gonzalo Gonzalez-Pumariega,..., Sanjiban Choudhury |
28 |
2023-05-26 |
link |
Causal Component Analysis |
Wendong Liang, Armin Kekić,..., Bernhard Schölkopf |
28 |
2022-11-02 |
link |
Entropic Neural Optimal Transport via Diffusion Processes |
Nikita Gushchin, Alexander Kolesov,..., Evgeny Burnaev |
28 |
2023-07-03 |
link |
Hierarchical Open-vocabulary Universal Image Segmentation |
Xudong Wang, Shufan Li,..., Trevor Darrell |
28 |
2022-12-23 |
link |
A-NeSI: A Scalable Approximate Method for Probabilistic Neurosymbolic Inference |
Emile van Krieken, Thiviyan Thanapalasingam,..., Annette Ten Teije |
27 |
2023-05-25 |
link |
Diffusion-Based Adversarial Sample Generation for Improved Stealthiness and Controllability |
Haotian Xue, Alexandre Araujo,..., Yongxin Chen |
27 |
2023-08-27 |
link |
Revisiting Scalarization in Multi-Task Learning: A Theoretical Perspective |
Yuzheng Hu, Ruicheng Xian,..., Han Zhao |
27 |
2022-09-30 |
link |
Universal Prompt Tuning for Graph Neural Networks |
Taoran Fang, Yunchao Mercer Zhang,..., Lei CHEN |
27 |
2023-05-26 |
link |
CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganography |
Jiwen Yu, Xuanyu Zhang,..., Jian Zhang |
27 |
2023-06-20 |
link |
LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical Imaging via Second-order Graph Matching |
Duy Minh Ho Nguyen, Hoang Nguyen,..., Mathias Niepert |
27 |
2023-05-30 |
link |
Likelihood-Based Diffusion Language Models |
Ishaan Gulrajani, Tatsunori Hashimoto |
27 |
2023-04-10 |
link |
Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition |
Shuhuai Ren, Aston Zhang,..., Xu Sun |
27 |
2023-06-12 |
link |
VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models |
Sheng-Yen Chou, Pin-Yu Chen, Tsung-Yi Ho |
27 |
2023-07-05 |
link |
Elastic Decision Transformer |
Yueh-Hua Wu, Xiaolong Wang, Masashi Hamaya |
27 |
2023-06-28 |
link |
Separable Physics-Informed Neural Networks |
Junwoo Cho, Seungtae Nam,..., Eunbyung Park |
27 |
2023-07-30 |
link |
Crystal Structure Prediction by Joint Equivariant Diffusion |
Rui Jiao, Wenbing Huang,..., Yang Liu |
27 |
2023-04-25 |
link |
Stable and low-precision training for large-scale vision-language models |
Mitchell Wortsman, Tim Dettmers,..., Ludwig Schmidt |
26 |
2023-06-12 |
link |
Transformers learn through gradual rank increase |
Emmanuel Abbe, Samy Bengio,..., Joshua M. Susskind |
26 |
2023-05-30 |
link |
SheetCopilot: Bringing Software Productivity to the Next Level through Large Language Models |
Hongxin Li, Jingran Su,..., Zhaoxiang Zhang |
26 |
2023-06-02 |
link |
Demystifying Structural Disparity in Graph Neural Networks: Can One Size Fit All? |
Haitao Mao, Zhikai Chen,..., Jiliang Tang |
26 |
2023-09-23 |
link |
Deciphering Spatio-Temporal Graph Forecasting: A Causal Lens and Treatment |
Yutong Xia, Yuxuan Liang,..., Roger Zimmermann |
26 |
None |
link |
CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graphs |
Guangyao Zhai, Evin Pinar Örnek,..., Benjamin Busam |
26 |
2023-06-15 |
link |
Class-Conditional Conformal Prediction With Many Classes |
Tiffany Ding, Anastasios Nikolas Angelopoulos,..., Ryan Tibshirani |
26 |
2023-04-25 |
link |
Parallel Spiking Neurons with High Efficiency and Ability to Learn Long-term Dependencies |
Wei Fang, Zhaofei Yu,..., Yonghong Tian |
26 |
2023-04-02 |
link |
SEENN: Towards Temporal Spiking Early-Exit Neural Networks |
Yuhang Li, Tamar Geller,..., Priyadarshini Panda |
26 |
2022-06-07 |
link |
A*Net: A Scalable Path-based Reasoning Approach for Knowledge Graphs |
Zhaocheng Zhu, Xinyu Yuan,..., Jian Tang |
26 |
2023-06-06 |
link |
The Emergence of Essential Sparsity in Large Pre-trained Models: The Weights that Matter |
AJAY KUMAR JAISWAL, Shiwei Liu,..., Zhangyang Wang |
25 |
2023-06-12 |
link |
Operator Learning with Neural Fields: Tackling PDEs on General Geometries |
Louis Serrano, Lise Le Boudec,..., patrick gallinari |
25 |
2023-05-26 |
link |
The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model |
Laixi Shi, Gen Li,..., Yuejie Chi |
25 |
2023-06-02 |
link |
Interpretable and Explainable Logical Policies via Neurally Guided Symbolic Abstraction |
Quentin Delfosse, Hikaru Shindo,..., Kristian Kersting |
25 |
2023-07-06 |
link |
Pruning vs Quantization: Which is Better? |
Andrey Kuzmin, Markus Nagel,..., Tijmen Blankevoort |
25 |
2023-06-14 |
link |
Training-free Diffusion Model Adaptation for Variable-Sized Text-to-Image Synthesis |
Zhiyu Jin, Xuli Shen,..., Xiangyang Xue |
25 |
2022-11-25 |
link |
Bypass Exponential Time Preprocessing: Fast Neural Network Training via Weight-Data Correlation Preprocessing |
Josh Alman, Jiehao Liang,..., Danyang Zhuo |
25 |
2023-07-13 |
link |
Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement |
Hui Yuan, Kaixuan Huang,..., Mengdi Wang |
25 |
2023-06-02 |
link |
Spatially Resolved Gene Expression Prediction from H&E Histology Images via Bi-modal Contrastive Learning |
Ronald Xie, Kuan Pang,..., Gary Bader |
25 |
2023-05-25 |
link |
Knowledge Diffusion for Distillation |
Tao Huang, Yuan Zhang,..., Chang Xu |
25 |
2023-10-23 |
link |
Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules |
Zhiyuan Liu, Yaorui Shi,..., Tat-Seng Chua |
25 |
2023-07-20 |
link |
Shared Adversarial Unlearning: Backdoor Mitigation by Unlearning Shared Adversarial Examples |
Shaokui Wei, Mingda Zhang,..., Baoyuan Wu |
25 |
2022-12-06 |
link |
GAUCHE: A Library for Gaussian Processes in Chemistry |
Ryan-Rhys Griffiths, Leo Klarner,..., Jian Tang |
24 |
2023-05-30 |
link |
Complex Query Answering on Eventuality Knowledge Graph with Implicit Logical Constraints |
Jiaxin Bai, Xin Liu,..., Yangqiu Song |
24 |
2023-05-24 |
link |
Exploring Diverse In-Context Configurations for Image Captioning |
Xu Yang, Yongliang Wu,..., Xin Geng |
24 |
2023-10-02 |
link |
Disentangling Voice and Content with Self-Supervision for Speaker Recognition |
TIANCHI LIU, Kong Aik Lee,..., Haizhou Li |
24 |
2023-05-29 |
link |
LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections |
Muhammad Jehanzeb Mirza, Leonid Karlinsky,..., Horst Bischof |
24 |
2023-10-04 |
link |
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection |
Yang Cao, Yihan Zeng,..., Dan Xu |
24 |
2023-06-21 |
link |
Mass-Producing Failures of Multimodal Systems with Language Models |
Shengbang Tong, Erik Jones, Jacob Steinhardt |
24 |
2023-02-21 |
link |
Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels |
Zebin You, Yong Zhong,..., Jun Zhu |
24 |
2023-05-26 |
link |
Contrast, Attend and Diffuse to Decode High-Resolution Images from Brain Activities |
Jingyuan Sun, Mingxiao Li,..., Shaonan Wang |
24 |
2023-06-18 |
link |
Online Map Vectorization for Autonomous Driving: A Rasterization Perspective |
Gongjie Zhang, Jiahao Lin,..., Zuoguan Wang |
24 |
2023-06-01 |
link |
StyleGAN knows Normal, Depth, Albedo, and More |
Anand Bhattad, Daniel McKee,..., David Forsyth |
24 |
2023-10-13 |
link |
Does Graph Distillation See Like Vision Dataset Counterpart? |
Beining Yang, Kai Wang,..., Jianxin Li |
24 |
2023-05-16 |
link |
Revisiting the Minimalist Approach to Offline Reinforcement Learning |
Denis Tarasov, Vladislav Kurenkov,..., Sergey Kolesnikov |
24 |
2023-05-18 |
link |
Smoothing the Landscape Boosts the Signal for SGD: Optimal Sample Complexity for Learning Single Index Models |
Alex Damian, Eshaan Nichani,..., Jason D. Lee |
24 |
2023-03-23 |
link |
Fairness-guided Few-shot Prompting for Large Language Models |
Huan Ma, Changqing Zhang,..., Bingzhe Wu |
24 |
2023-09-25 |
link |
IEBins: Iterative Elastic Bins for Monocular Depth Estimation |
Shuwei Shao, Zhongcai Pei,..., Zhengguo Li |
24 |
2023-10-31 |
link |
Generate What You Prefer: Reshaping Sequential Recommendation via Guided Diffusion |
Zhengyi Yang, Jiancan Wu,..., Xiangnan He |
24 |
2022-09-13 |
link |
Characterizing Graph Datasets for Node Classification: Homophily-Heterophily Dichotomy and Beyond |
Oleg Platonov, Denis Kuznedelev,..., Liudmila Prokhorenkova |
24 |
2023-10-17 |
link |
DELIFFAS: Deformable Light Fields for Fast Avatar Synthesis |
YoungJoong Kwon, Lingjie Liu,..., Christian Theobalt |
23 |
2023-03-22 |
link |
EDGI: Equivariant Diffusion for Planning with Embodied Agents |
Johann Brehmer, Joey Bose,..., Taco Cohen |
23 |
2023-07-20 |
link |
Sharpness Minimization Algorithms Do Not Only Minimize Sharpness To Achieve Better Generalization |
Kaiyue Wen, Zhiyuan Li, Tengyu Ma |
23 |
2023-04-07 |
link |
A new perspective on building efficient and expressive 3D equivariant graph neural networks |
weitao Du, Yuanqi Du,..., Zhi-Ming Ma |
23 |
2023-06-02 |
link |
DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model |
Xiuye Gu, Yin Cui,..., David A Ross |
23 |
2023-10-27 |
link |
Learning to Search Feasible and Infeasible Regions of Routing Problems with Flexible Neural k-Opt |
Yining Ma, Zhiguang Cao, Yeow Meng Chee |
23 |
2023-05-18 |
link |
DiffUTE: Universal Text Editing Diffusion Model |
Haoxing Chen, Zhuoer Xu,..., Weiqiang Wang |
23 |
2023-08-16 |
link |
Towards Personalized Federated Learning via Heterogeneous Model Reassembly |
Jiaqi Wang, Xingyi Yang,..., Fenglong Ma |
23 |
2023-10-07 |
link |
VLAttack: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models |
Ziyi Yin, Muchao Ye,..., Fenglong Ma |
23 |
2023-06-01 |
link |
DiffPack: A Torsional Diffusion Model for Autoregressive Protein Side-Chain Packing |
Yangtian Zhang, Zuobai Zhang,..., Jian Tang |
23 |
2023-11-01 |
link |
CROMA: Remote Sensing Representations with Contrastive Radar-Optical Masked Autoencoders |
Anthony Fuller, Koreen Millard, James R Green |
23 |
2023-04-04 |
link |
The expressive power of pooling in Graph Neural Networks |
Filippo Maria Bianchi, Veronica Lachi |
22 |
2023-06-29 |
link |
Neural Polarizer: A Lightweight and Effective Backdoor Defense via Purifying Poisoned Features |
Mingli Zhu, Shaokui Wei,..., Baoyuan Wu |
22 |
2023-05-16 |
link |
Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage |
Jose Blanchet, Miao Lu,..., Han Zhong |
22 |
2023-04-10 |
link |
H2RBox-v2: Incorporating Symmetry for Boosting Horizontal Box Supervised Oriented Object Detection |
Yi Yu, Xue Yang,..., Junchi Yan |
22 |
2023-02-07 |
link |
Concept Algebra for (Score-Based) Text-Controlled Generative Models |
Zihao Wang, Lin Gui,..., Victor Veitch |
22 |
2023-10-29 |
link |
Does Invariant Graph Learning via Environment Augmentation Learn Invariance? |
Yongqiang Chen, Yatao Bian,..., James Cheng |
22 |
2023-05-22 |
link |
Neural Functional Transformers |
Allan Zhou, Kaien Yang,..., Chelsea Finn |
22 |
2023-05-25 |
link |
Sharpness-Aware Minimization Leads to Low-Rank Features |
Maksym Andriushchenko, Dara Bahri,..., Nicolas Flammarion |
22 |
2023-09-29 |
link |
Asynchrony-Robust Collaborative Perception via Bird's Eye View Flow |
Sizhe Wei, Yuxi Wei,..., Ya Zhang |
22 |
2023-06-02 |
link |
Towards In-context Scene Understanding |
Ivana Balazevic, David Steiner,..., Olivier J Henaff |
22 |
2023-06-07 |
link |
Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities |
Andrii Zadaianchuk, Maximilian Seitzer, Georg Martius |
22 |
2023-05-31 |
link |
FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses via Pixel-Aligned Scene Flow |
Cameron Omid Smith, Yilun Du,..., Vincent Sitzmann |
22 |
2023-05-15 |
link |
A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes |
Han Zhong, Tong Zhang |
22 |
2023-06-02 |
link |
Convex and Non-Convex Optimization under Generalized Smoothness |
Haochuan Li, Jian Qian,..., Ali Jadbabaie |
21 |
2023-06-09 |
link |
PoET: A generative model of protein families as sequences-of-sequences |
Timothy Fei Truong Jr, Tristan Bepler |
21 |
2023-06-18 |
link |
Score-based Data Assimilation |
François Rozet, Gilles Louppe |
21 |
2023-04-19 |
link |
Bridging RL Theory and Practice with the Effective Horizon |
Cassidy Laidlaw, Stuart Russell, Anca Dragan |
21 |
2023-10-09 |
link |
Improved Communication Efficiency in Federated Natural Policy Gradient via ADMM-based Gradient Updates |
Guangchen Lan, Han Wang,..., Vaneet Aggarwal |
21 |
2023-07-10 |
link |
Compositional Generalization from First Principles |
Thaddäus Wiedemer, Prasanna Mayilvahanan,..., Wieland Brendel |
21 |
2023-09-22 |
link |
OneNet: Enhancing Time Series Forecasting Models under Concept Drift by Online Ensembling |
YiFan Zhang, Qingsong Wen,..., Tieniu Tan |
21 |
2023-01-27 |
link |
Alignment with human representations supports robust few-shot learning |
Ilia Sucholutsky, Thomas L. Griffiths |
21 |
2022-12-15 |
link |
Joint processing of linguistic properties in brains and language models |
SUBBA REDDY OOTA, Manish Gupta, Mariya Toneva |
21 |
2023-08-02 |
link |
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation |
Yasheng SUN, Yifan Yang,..., Hideki Koike |
21 |
2023-10-23 |
link |
Data Pruning via Moving-one-Sample-out |
Haoru Tan, Sitong Wu,..., XIAOJUAN QI |
21 |
2023-09-14 |
link |
Where2Explore: Few-shot Affordance Learning for Unseen Novel Categories of Articulated Objects |
Chuanruo Ning, Ruihai Wu,..., Hao Dong |
21 |
2023-05-20 |
link |
Brain encoding models based on multimodal transformers can transfer across language and vision |
Jerry Tang, Meng Du,..., Alexander Huth |
21 |
2023-05-30 |
link |
Multi-modal Queried Object Detection in the Wild |
Yifan Xu, Mengdan Zhang,..., Changsheng Xu |
21 |
2023-10-24 |
link |
A Unified, Scalable Framework for Neural Population Decoding |
Mehdi Azabou, Vinam Arora,..., Eva L Dyer |
21 |
2023-06-10 |
link |
Neural Injective Functions for Multisets, Measures and Graphs via a Finite Witness Theorem |
Tal Amir, Steven J. Gortler,..., Nadav Dym |
21 |
2023-03-19 |
link |
Unsupervised Learning for Solving the Travelling Salesman Problem |
Yimeng Min, Yiwei Bai, Carla P Gomes |
21 |
2023-06-20 |
link |
Sampling from Gaussian Process Posteriors using Stochastic Gradient Descent |
Jihao Andreas Lin, Javier Antoran,..., Alexander Terenin |
21 |
2023-06-07 |
link |
Improving neural network representations using human similarity judgments |
Lukas Muttenthaler, Lorenz Linhardt,..., Simon Kornblith |
20 |
2023-07-26 |
link |
Visual Instruction Inversion: Image Editing via Visual Prompting |
Thao Nguyen, Yuheng Li,..., Yong Jae Lee |
20 |
2023-06-08 |
link |
RDumb: A simple approach that questions our progress in continual test-time adaptation |
Ori Press, Steffen Schneider,..., Matthias Bethge |
20 |
2023-02-02 |
link |
Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment |
Hao Liu, Wilson Yan, Pieter Abbeel |
20 |
2023-05-31 |
link |
A Unified Framework for U-Net Design and Analysis |
Christopher Williams, Fabian Falck,..., Saifuddin Syed |
20 |
2023-05-22 |
link |
A Fractional Graph Laplacian Approach to Oversmoothing |
Sohir Maskey, Raffaele Paolino,..., Gitta Kutyniok |
20 |
2023-05-20 |
link |
A Scalable Neural Network for DSIC Affine Maximizer Auction Design |
Zhijian Duan, Haoran Sun,..., Xiaotie Deng |
20 |
2023-07-14 |
link |
HYTREL: Hypergraph-enhanced Tabular Data Representation Learning |
Pei Chen, Soumajyoti Sarkar,..., George Karypis |
20 |
2023-04-06 |
link |
Dynamics of Finite Width Kernel and Prediction Fluctuations in Mean Field Neural Networks |
Blake Bordelon, Cengiz Pehlevan |
20 |
2023-03-02 |
link |
SHAP-IQ: Unified Approximation of any-order Shapley Interactions |
Fabian Fumagalli, Maximilian Muschalik,..., Barbara Eva Hammer |
19 |
2023-06-01 |
link |
Thought Cloning: Learning to Think while Acting by Imitating Human Thinking |
Shengran Hu, Jeff Clune |
19 |
2023-07-11 |
link |
Self-Supervised Learning with Lie Symmetries for Partial Differential Equations |
Grégoire Mialon, Quentin Garrido,..., Bobak Kiani |
19 |
2023-10-31 |
link |
BasisFormer: Attention-based Time Series Forecasting with Learnable and Interpretable Basis |
Zelin Ni, Hang Yu,..., Weiyao Lin |
19 |
2023-03-01 |
link |
Continuous-Time Functional Diffusion Processes |
Giulio Franzese, Giulio Corallo,..., Pietro Michiardi |
19 |
2023-06-01 |
link |
Lightweight Vision Transformer with Bidirectional Interaction |
Qihang Fan, Huaibo Huang,..., Ran He |
19 |
2023-04-19 |
link |
Long-Term Fairness with Unknown Dynamics |
Tongxin Yin, Reilly Raab,..., Yang Liu |
19 |
2023-02-04 |
link |
AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis |
Susan Liang, Chao Huang,..., Chenliang Xu |
19 |
2023-07-21 |
link |
What can a Single Attention Layer Learn? A Study Through the Random Features Lens |
Hengyu Fu, Tianyu Guo,..., Song Mei |
19 |
2023-05-18 |
link |
Paxion: Patching Action Knowledge in Video-Language Foundation Models |
Zhenhailong Wang, Ansel Blume,..., Heng Ji |
19 |
2023-05-27 |
link |
Approximation-Generalization Trade-offs under (Approximate) Group Equivariance |
Mircea Petrache, Shubhendu Trivedi |
19 |
2023-06-01 |
link |
Joint Learning of Label and Environment Causal Independence for Graph Out-of-Distribution Generalization |
Shurui Gui, Meng Liu,..., Shuiwang Ji |
19 |
2023-10-30 |
link |
MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks |
Allen Nie, Yuhui Zhang,..., Tobias Gerstenberg |
19 |
2023-01-27 |
link |
D2CSG: Unsupervised Learning of Compact CSG Trees with Dual Complements and Dropouts |
Fenggen Yu, Qimin Chen,..., Hao Zhang |
19 |
2023-09-14 |
link |
Learning Environment-Aware Affordance for 3D Articulated Object Manipulation under Occlusions |
Ruihai Wu, Kai Cheng,..., Hao Dong |
19 |
2022-03-08 |
link |
Geodesic Multi-Modal Mixup for Robust Fine-Tuning |
Changdae Oh, Junhyuk So,..., Kyungwoo Song |
19 |
2023-09-22 |
link |
On Sparse Modern Hopfield Model |
Jerry Yao-Chieh Hu, Donglin Yang,..., Han Liu |
19 |
2023-10-19 |
link |
Real-Time Motion Prediction via Heterogeneous Polyline Transformer with Relative Pose Encoding |
Zhejun Zhang, Alexander Liniger,..., Luc Van Gool |
19 |
2023-05-29 |
link |
Aligning Optimization Trajectories with Diffusion Models for Constrained Design Generation |
Giorgio Giannone, Akash Srivastava,..., Faez Ahmed |
19 |
2023-06-22 |
link |
Rethinking the Backward Propagation for Adversarial Transferability |
Xiaosen Wang, Kangheng Tong, Kun He |
19 |
2023-09-19 |
link |
PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance |
Peiqing Yang, Shangchen Zhou,..., Chen Change Loy |
19 |
2023-03-13 |
link |
Transformer-based Planning for Symbolic Regression |
Parshin Shojaee, Kazem Meidani,..., Chandan K. Reddy |
19 |
2023-10-14 |
link |
STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning |
Weipu Zhang, Gang Wang,..., Gao Huang |
19 |
2023-03-01 |
link |
Time Series as Images: Vision Transformer for Irregularly Sampled Time Series |
Zekun Li, Shiyang Li, Xifeng Yan |
19 |
2023-05-28 |
link |
Disentanglement via Latent Quantization |
Kyle Hsu, Will Dorrell,..., Chelsea Finn |
19 |
2023-04-06 |
link |
Graph Mixture of Experts: Learning on Large-Scale Graphs with Explicit Diversity Modeling |
Haotao Wang, Ziyu Jiang,..., Zhangyang Wang |
19 |
2023-10-27 |
link |
Optimal Transport for Treatment Effect Estimation |
Hao Wang, Jiajun Fan,..., Ruiming Tang |
19 |
2023-05-31 |
link |
Spontaneous symmetry breaking in generative diffusion models |
Gabriel Raya, Luca Ambrogioni |
19 |
2023-07-04 |
link |
Collaborative Score Distillation for Consistent Visual Synthesis |
Subin Kim, Kyungmin Lee,..., Jinwoo Shin |
18 |
2023-05-17 |
link |
Explain Any Concept: Segment Anything Meets Concept-Based Explanation |
Ao Sun, Pingchuan Ma,..., Shuai Wang |
18 |
2023-05-24 |
link |
Generative Modeling through the Semi-dual Formulation of Unbalanced Optimal Transport |
Jaemoo Choi, Jaewoong Choi, Myungjoo Kang |
18 |
2023-01-09 |
link |
BQ-NCO: Bisimulation Quotienting for Efficient Neural Combinatorial Optimization |
Darko Drakulic, Sofia Michel,..., Jean-Marc Andreoli |
18 |
2023-06-07 |
link |
Stochastic Collapse: How Gradient Noise Attracts SGD Dynamics Towards Simpler Subnetworks |
Feng Chen, Daniel Kunin,..., Surya Ganguli |
18 |
2023-10-17 |
link |
Towards Generic Semi-Supervised Framework for Volumetric Medical Image Segmentation |
Haonan Wang, Xiaomeng Li |
18 |
2023-05-29 |
link |
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration |
Zhihan Liu, Miao Lu,..., Zhaoran Wang |
18 |
2023-06-09 |
link |
Topology-Aware Uncertainty for Image Segmentation |
Saumya Gupta, Yikai Zhang,..., Chao Chen |
18 |
2023-06-19 |
link |
Beyond Normal: On the Evaluation of Mutual Information Estimators |
Paweł Czyż, Frederic Grabowski,..., Alexander Marx |
18 |
2023-06-01 |
link |
Make Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning |
Baohao Liao, Shaomu Tan, Christof Monz |
18 |
2023-06-08 |
link |
SNAP: Self-Supervised Neural Maps for Visual Positioning and Semantic Understanding |
Paul-Edouard Sarlin, Eduard Trulls,..., Simon Lynen |
18 |
2022-12-19 |
link |
Continual Learning for Instruction Following from Realtime Feedback |
Alane Suhr, Yoav Artzi |
18 |
2023-09-04 |
link |
Memory Efficient Optimizers with 4-bit States |
Bingrui Li, Jianfei Chen, Jun Zhu |
18 |
2023-08-20 |
link |
SE(3) Equivariant Augmented Coupling Flows |
Laurence Illing Midgley, Vincent Stimper,..., José Miguel Hernández-Lobato |
18 |
2023-06-06 |
link |
Large Language Models of Code Fail at Completing Code with Potential Bugs |
Tuan Dinh, Jinman Zhao,..., George Karypis |
18 |
2023-05-30 |
link |
Smooth, exact rotational symmetrization for deep learning on point clouds |
Sergey Pozdnyakov, Michele Ceriotti |
18 |
2023-05-28 |
link |
Feature-Learning Networks Are Consistent Across Widths At Realistic Scales |
Nikhil Vyas, Alexander Atanasov,..., Cengiz Pehlevan |
18 |
2023-05-25 |
link |
Demystifying Oversmoothing in Attention-Based Graph Neural Networks |
Xinyi Wu, Amir Ajorlou,..., Ali Jadbabaie |
18 |
2023-09-23 |
link |
State-space Models with Layer-wise Nonlinearity are Universal Approximators with Exponential Decaying Memory |
Shida Wang, Beichen Xue |
18 |
2023-07-24 |
link |
On the Connection between Pre-training Data Diversity and Fine-tuning Robustness |
Vivek Ramanujan, Thao Nguyen,..., Ludwig Schmidt |
18 |
2023-06-08 |
link |
Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment |
Zihui Xue, Kristen Grauman |
18 |
2023-06-16 |
link |
Beyond Geometry: Comparing the Temporal Structure of Computation in Neural Circuits with Dynamical Similarity Analysis |
Mitchell Ostrow, Adam Joseph Eisen,..., Ila R Fiete |
18 |
2022-06-02 |
link |
Offline Reinforcement Learning with Differential Privacy |
Dan Qiao, Yu-Xiang Wang |
18 |
2023-05-02 |
link |
Don't Stop Pretraining? Make Prompt-based Fine-tuning Powerful Learner |
Zhengxiang Shi, Aldo Lipani |
18 |
2023-05-30 |
link |
Compression with Bayesian Implicit Neural Representations |
Zongyu Guo, Gergely Flamich,..., José Miguel Hernández-Lobato |
18 |
2023-12-03 |
link |
Honesty Is the Best Policy: Defining and Mitigating AI Deception |
Francis Rhys Ward, Francesca Toni,..., Tom Everitt |
18 |
2023-05-30 |
link |
Functional-Group-Based Diffusion for Pocket-Specific Molecule Generation and Elaboration |
Haitao Lin, Yufei Huang,..., Stan Z. Li |
18 |
2023-05-30 |
link |
DäRF: Boosting Radiance Fields from Sparse Inputs with Monocular Depth Adaptation |
Jiuhn Song, Seonghoon Park,..., Seungryong Kim |
18 |
2023-06-07 |
link |
ARTIC3D: Learning Robust Articulated 3D Shapes from Noisy Web Image Collections |
Chun-Han Yao, Amit Raj,..., Varun Jampani |
18 |
2023-06-09 |
link |
Explaining Predictive Uncertainty with Information Theoretic Shapley Values |
David Watson, Joshua O'Hara,..., Ido Guy |
18 |
2023-03-17 |
link |
Data-Centric Learning from Unlabeled Graphs with Diffusion Model |
Gang Liu, Eric Inae,..., Meng Jiang |
17 |
2023-02-21 |
link |
Adversarial Model for Offline Reinforcement Learning |
Mohak Bhardwaj, Tengyang Xie,..., Ching-An Cheng |
17 |
2023-05-22 |
link |
Deep Neural Collapse Is Provably Optimal for the Deep Unconstrained Features Model |
Peter Súkeník, Marco Mondelli, Christoph H Lampert |
17 |
2023-06-14 |
link |
Explore In-Context Learning for 3D Point Cloud Understanding |
Zhongbin Fang, Xiangtai Li,..., Mengyuan Liu |
17 |
2023-09-23 |
link |
Defending Pre-trained Language Models as Few-shot Learners against Backdoor Attacks |
Zhaohan Xi, Tianyu Du,..., Ting Wang |
17 |
2023-10-20 |
link |
ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection |
Zhongzhan Huang, Pan Zhou,..., Liang Lin |
17 |
2023-07-13 |
link |
Bootstrapping Vision-Language Learning with Decoupled Language Pre-training |
Yiren Jian, Chongyang Gao, Soroush Vosoughi |
17 |
2023-06-06 |
link |
Fine-grained Expressivity of Graph Neural Networks |
Jan Böker, Ron Levie,..., Christopher Morris |
17 |
2023-05-17 |
link |
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning |
Alexander H. Liu, Heng-Jui Chang,..., James R. Glass |
17 |
2023-05-26 |
link |
Distributionally Robust Linear Quadratic Control |
Bahar Taskesen, Dan Andrei Iancu,..., Daniel Kuhn |
17 |
2023-06-13 |
link |
Binary Radiance Fields |
Seungjoo Shin, Jaesik Park |
17 |
2023-10-22 |
link |
Hierarchical Vector Quantized Transformer for Multi-class Unsupervised Anomaly Detection |
Ruiying Lu, YuJie Wu,..., Ruimin Hu |
17 |
2023-07-17 |
link |
Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity |
Zhanpeng Zhou, Yongyi Yang,..., Wei Hu |
17 |
2023-10-07 |
link |
Subspace Identification for Multi-Source Domain Adaptation |
Zijian Li, Ruichu Cai,..., Kun Zhang |
17 |
2023-06-01 |
link |
AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset |
Jiakang Yuan, Bo Zhang,..., Yu Qiao |
17 |
2023-05-25 |
link |
Non-adversarial training of Neural SDEs with signature kernel scores |
Zacharia Issa, Blanka Horvath,..., Cristopher Salvi |
17 |
2023-06-04 |
link |
Systematic Visual Reasoning through Object-Centric Relational Abstraction |
Taylor Whittington Webb, Shanka Subhra Mondal, Jonathan Cohen |
17 |
2023-10-19 |
link |
CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation |
Sihan Xu, Ziqiao Ma,..., Joyce Chai |
17 |
2023-08-02 |
link |
From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion |
Robin San Roman, Yossi Adi,..., Alexandre Défossez |
17 |
2022-11-07 |
link |
Multi-Head Adapter Routing for Cross-Task Generalization |
Lucas Caccia, Edoardo Ponti,..., Alessandro Sordoni |
17 |
2022-09-30 |
link |
Multi-Prompt Alignment for Multi-source Unsupervised Domain Adaptation |
Haoran Chen, Xintong Han,..., Yu-Gang Jiang |
17 |
2023-01-30 |
link |
Direct Preference-based Policy Optimization without Reward Modeling |
Gaon An, Junhyeok Lee,..., Hyun Oh Song |
17 |
2023-01-31 |
link |
DisDiff: Unsupervised Disentanglement of Diffusion Probabilistic Models |
Tao Yang, Yuwang Wang,..., Nanning Zheng |
17 |
2023-05-15 |
link |
Parameter-efficient Tuning of Large-scale Multimodal Foundation Model |
Haixin Wang, Xinlong Yang,..., Qi Tian |
17 |
2023-05-23 |
link |
SEEDS: Exponential SDE Solvers for Fast High-Quality Sampling from Diffusion Models |
Martin Gonzalez, Nelson Fernandez,..., Nader Masmoudi |
17 |
2023-10-12 |
link |
Every Parameter Matters: Ensuring the Convergence of Federated Learning with Dynamic Heterogeneous Models Reduction |
Hanhan Zhou, Tian Lan,..., Wenbo Ding |
17 |
2023-05-29 |
link |
Unleashing the Power of Randomization in Auditing Differentially Private ML |
Krishna Pillutla, Galen Andrew,..., Sewoong Oh |
17 |
2023-02-08 |
link |
DynGFN: Towards Bayesian Inference of Gene Regulatory Networks with GFlowNets |
Lazar Atanackovic, Alexander Tong,..., Jason Hartford |
17 |
2023-06-08 |
link |
Robust Learning with Progressive Data Expansion Against Spurious Correlation |
Yihe Deng, Yu Yang,..., Quanquan Gu |
17 |
2023-01-19 |
link |
AtMan: Understanding Transformer Predictions Through Memory Efficient Attention Manipulation |
Björn Deiseroth, Mayukh Deb,..., Kristian Kersting |
17 |
2023-10-29 |
link |
Label Poisoning is All You Need |
Rishi Dev Jha, Jonathan Hayase, Sewoong Oh |
17 |
2023-01-11 |
link |
Private estimation algorithms for stochastic block models and mixture models |
Hongjie Chen, Vincent Cohen-Addad,..., Stefan Tiegel |
17 |
2022-12-29 |
link |
Normalizing flow neural networks by JKO scheme |
Chen Xu, Xiuyuan Cheng, Yao Xie |
17 |
2023-05-26 |
link |
Inverse Dynamics Pretraining Learns Good Representations for Multitask Imitation |
David Brandfonbrener, Ofir Nachum, Joan Bruna |
16 |
2023-06-14 |
link |
Generalizable One-shot Neural Head Avatar |
Xueting Li, Shalini De Mello,..., Jan Kautz |
16 |
2023-04-17 |
link |
Bridging Discrete and Backpropagation: Straight-Through and Beyond |
Liyuan Liu, Chengyu Dong,..., Jianfeng Gao |
16 |
2023-01-26 |
link |
On the Convergence of No-Regret Learning Dynamics in Time-Varying Games |
Ioannis Anagnostides, Ioannis Panageas,..., Tuomas Sandholm |
16 |
2023-05-25 |
link |
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning |
Kaiwen Wang, Kevin Zhou,..., Wen Sun |
16 |
2021-10-18 |
link |
Path Regularization: A Convexity and Sparsity Inducing Regularization for Parallel ReLU Networks |
Tolga Ergen, Mert Pilanci |
16 |
2023-05-31 |
link |
Not All Neuro-Symbolic Concepts Are Created Equal: Analysis and Mitigation of Reasoning Shortcuts |
Emanuele Marconato, Stefano Teso,..., Andrea Passerini |
16 |
2023-03-13 |
link |
Meet in the Middle: A New Pre-training Paradigm |
Anh Tuan Nguyen, Nikos Karampatziakis, Weizhu Chen |
16 |
2023-05-29 |
link |
The Rise of AI Language Pathologists: Exploring Two-level Prompt Learning for Few-shot Weakly-supervised Whole Slide Image Classification |
Linhao Qu, xiaoyuan Luo,..., Zhijian Song |
16 |
2023-09-27 |
link |
Enhancing Sharpness-Aware Optimization Through Variance Suppression |
Bingcong Li, Georgios B. Giannakis |
16 |
2023-10-09 |
link |
Aligning Language Models with Human Preferences via a Bayesian Approach |
Jiashuo WANG, Haozhao Wang,..., Wenjie Li |
16 |
2023-05-21 |
link |
Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models |
Lin Li, Jun Xiao,..., Long Chen |
16 |
2023-05-31 |
link |
Three-Way Trade-Off in Multi-Objective Learning: Optimization, Generalization and Conflict-Avoidance |
Lisha Chen, Heshan Devaka Fernando,..., Tianyi Chen |
16 |
2023-05-24 |
link |
Debias Coarsely, Sample Conditionally: Statistical Downscaling through Optimal Transport and Probabilistic Diffusion Models |
Zhong Yi Wan, Ricardo Baptista,..., Leonardo Zepeda-Nunez |
16 |
2023-10-20 |
link |
Assumption violations in causal discovery and the robustness of score matching |
Francesco Montagna, Atalanti A. Mastakouri,..., Francesco Locatello |
16 |
2020-11-23 |
link |
On the Overlooked Pitfalls of Weight Decay and How to Mitigate Them: A Gradient-Norm Perspective |
Zeke Xie, zhiqiang xu,..., Masashi Sugiyama |
16 |
2023-06-08 |
link |
Exact Optimality of Communication-Privacy-Utility Tradeoffs in Distributed Mean Estimation |
Berivan Isik, Wei-Ning Chen,..., Albert No |
16 |
2023-05-25 |
link |
DoWG Unleashed: An Efficient Universal Parameter-Free Gradient Descent Method |
Ahmed Khaled, Konstantin Mishchenko, Chi Jin |
16 |
2023-09-22 |
link |
Spatial-frequency channels, shape bias, and adversarial robustness |
Ajay Subramanian, Elena Sizikova,..., Denis G. Pelli |
16 |
2023-06-01 |
link |
Rotating Features for Object Discovery |
Sindy Löwe, Phillip Lippe,..., Max Welling |
16 |
2023-05-31 |
link |
A Unified Conditional Framework for Diffusion-based Image Restoration |
Yi Zhang, Xiaoyu Shi,..., Hongsheng Li |
16 |
2023-05-11 |
link |
Generalization bounds for neural ordinary differential equations and deep residual networks |
Pierre Marion |
16 |
2023-01-25 |
link |
Unsupervised Protein-Ligand Binding Energy Prediction via Neural Euler's Rotation Equation |
Wengong Jin, Siranush Sarkizova,..., Caroline Uhler |
16 |
2023-05-25 |
link |
A Guide Through the Zoo of Biased SGD |
Yury Demidovich, Grigory Malinovsky,..., Peter Richtárik |
16 |
2023-10-17 |
link |
Understanding Contrastive Learning via Distributionally Robust Optimization |
Junkang Wu, Jiawei Chen,..., Xiangnan He |
16 |
2023-03-06 |
link |
Students Parrot Their Teachers: Membership Inference on Model Distillation |
Matthew Jagielski, Milad Nasr,..., Florian Tramèr |
16 |
2023-10-08 |
link |
Prompt-augmented Temporal Point Process for Streaming Event Sequence |
Siqiao Xue, Yan Wang,..., JUN ZHOU |
16 |
2023-06-13 |
link |
3D molecule generation by denoising voxel grids |
Pedro O. Pinheiro, Joshua Rackers,..., Saeed Saremi |
16 |
2023-10-24 |
link |
What's Left? Concept Grounding with Logic-Enhanced Foundation Models |
Joy Hsu, Jiayuan Mao,..., Jiajun Wu |
16 |
2023-05-27 |
link |
Towards Consistent Video Editing with Text-to-Image Diffusion Models |
Zicheng Zhang, Bonan Li,..., Luoqi Liu |
16 |
2023-04-30 |
link |
Domain Agnostic Fourier Neural Operators |
Ning Liu, Siavash Jafarzadeh, Yue Yu |
16 |
2023-10-29 |
link |
Simple and Asymmetric Graph Contrastive Learning without Augmentations |
Teng Xiao, Huaisheng Zhu,..., Suhang Wang |
16 |
2023-10-04 |
link |
Full-Atom Protein Pocket Design via Iterative Refinement |
ZAIXI ZHANG, Zepu Lu,..., Qi Liu |
15 |
2023-05-30 |
link |
When Does Optimizing a Proper Loss Yield Calibration? |
Jarosław Błasiok, Parikshit Gopalan,..., Preetum Nakkiran |
15 |
2023-04-17 |
link |
Leveraging sparse and shared feature activations for disentangled representation learning |
Marco Fumero, Florian Wenzel,..., Francesco Locatello |
15 |
2023-05-27 |
link |
Toward Understanding Generative Data Augmentation |
Chenyu Zheng, Guoqiang Wu, Chongxuan Li |
15 |
2023-06-02 |
link |
PolyDiffuse: Polygonal Shape Reconstruction via Guided Set Diffusion Models |
Jiacheng Chen, Ruizhi Deng, Yasutaka Furukawa |
15 |
2023-06-28 |
link |
DiffComplete: Diffusion-based Generative 3D Shape Completion |
Ruihang Chu, Enze Xie,..., Jiaya Jia |
15 |
2023-10-03 |
link |
Towards Stable Backdoor Purification through Feature Shift Tuning |
Rui Min, Zeyu Qin,..., Minhao Cheng |
15 |
2023-06-24 |
link |
Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets |
Anirudhan Badrinath, Yannis Flet-Berliac,..., Emma Brunskill |
15 |
2023-05-28 |
link |
Conditional score-based diffusion models for Bayesian inference in infinite dimensions |
Lorenzo Baldassari, Ali Siahkoohi,..., Maarten V. de Hoop |
15 |
2023-05-29 |
link |
Beyond Confidence: Reliable Models Should Also Consider Atypicality |
Mert Yuksekgonul, Linjun Zhang,..., Carlos Guestrin |
15 |
2023-03-02 |
link |
The Double-Edged Sword of Implicit Bias: Generalization vs. Robustness in ReLU Networks |
Spencer Frei, Gal Vardi,..., Nathan Srebro |
15 |
2023-02-11 |
link |
Is Distance Matrix Enough for Geometric Deep Learning? |
Zian Li, Xiyuan Wang,..., Muhan Zhang |
15 |
2023-06-15 |
link |
Text Promptable Surgical Instrument Segmentation with Vision-Language Models |
Zijian Zhou, Oluwatosin Alabi,..., Miaojing Shi |
15 |
2023-06-16 |
link |
Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models |
Geon Yeong Park, Jeongsol Kim,..., Jong Chul Ye |
15 |
2023-05-21 |
link |
Two Sides of One Coin: the Limits of Untuned SGD and the Power of Adaptive Methods |
Junchi YANG, Xiang Li,..., Niao He |
15 |
2023-06-14 |
link |
MMD-FUSE: Learning and Combining Kernels for Two-Sample Testing Without Data Splitting |
Felix Biggs, Antonin Schrab, Arthur Gretton |
15 |
2023-06-05 |
link |
Bootstrapped Training of Score-Conditioned Generator for Offline Design of Biological Sequences |
Minsu Kim, Federico Berto,..., Jinkyoo Park |
15 |
2023-09-22 |
link |
Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning |
Jianzhun Shao, Yun Qu,..., Xiangyang Ji |
15 |
2023-02-17 |
link |
Universality laws for Gaussian mixtures in generalized linear models |
Yatin Dandi, Ludovic Stephan,..., Lenka Zdeborova |
15 |
2023-10-26 |
link |
Masked Space-Time Hash Encoding for Efficient Dynamic Scene Reconstruction |
Feng Wang, Zilong Chen,..., Huaping Liu |
15 |
2023-09-29 |
link |
Fine-grained Late-interaction Multi-modal Retrieval for Retrieval Augmented Visual Question Answering |
Weizhe Lin, Jinghong Chen,..., Bill Byrne |
15 |
2023-10-11 |
link |
Self-supervised Object-Centric Learning for Videos |
Görkay Aydemir, Weidi Xie, Fatma Guney |
15 |
2023-02-03 |
link |
Sharp Spectral Rates for Koopman Operator Learning |
Vladimir R Kostic, Karim Lounici,..., massimiliano pontil |
15 |
2023-05-15 |
link |
Nearly Optimal VC-Dimension and Pseudo-Dimension Bounds for Deep Neural Network Derivatives |
Yahong Yang, Haizhao Yang, Yang Xiang |
15 |
2023-10-29 |
link |
Analyzing Vision Transformers for Image Classification in Class Embedding Space |
Martina G. Vilas, Timothy Schaumlöffel, Gemma Roig |
15 |
2023-03-05 |
link |
Uncoupled and Convergent Learning in Two-Player Zero-Sum Markov Games |
Yang Cai, Haipeng Luo,..., Weiqiang Zheng |
15 |
2023-09-22 |
link |
Neural Data Transformer 2: Multi-context Pretraining for Neural Spiking Activity |
Joel Ye, Jennifer L Collinger,..., Robert Gaunt |
15 |
2023-10-02 |
link |
Equivariant Adaptation of Large Pretrained Models |
Arnab Kumar Mondal, Siba Smarak Panigrahi,..., Siamak Ravanbakhsh |
15 |
2023-08-21 |
link |
Approximately Equivariant Graph Networks |
Ningyuan Teresa Huang, Ron Levie, Soledad Villar |
15 |
2023-05-24 |
link |
Momentum Provably Improves Error Feedback! |
Ilyas Fatkhullin, Alexander Tyurin, Peter Richtárik |
15 |
2023-06-01 |
link |
Addressing Negative Transfer in Diffusion Models |
Hyojun Go, Jinyoung Kim,..., Seungtaek Choi |
15 |
2023-10-21 |
link |
Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation |
Jianing Zhu, Geng Yu,..., Bo Han |
15 |
2023-01-26 |
link |
Joint Training of Deep Ensembles Fails Due to Learner Collusion |
Alan Jeffares, Tennison Liu,..., Mihaela van der Schaar |
15 |
2023-05-31 |
link |
Primal-Attention: Self-attention through Asymmetric Kernel SVD in Primal Representation |
Yingyi Chen, Qinghua Tao,..., Johan Suykens |
15 |
2023-06-04 |
link |
Provable convergence guarantees for black-box variational inference |
Justin Domke, Robert M. Gower, Guillaume Garrigos |