3084 |
2023-01-30 |
link |
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models |
Junnan Li, Dongxu Li,..., Steven Hoi |
2441 |
2022-12-06 |
link |
Robust Speech Recognition via Large-Scale Weak Supervision |
Alec Radford, Jong Wook Kim,..., Ilya Sutskever |
1249 |
2023-03-06 |
link |
PaLM-E: An Embodied Multimodal Language Model |
Danny Driess, Fei Xia,..., Pete Florence |
949 |
2023-04-03 |
link |
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling |
Stella Biderman, Hailey Schoelkopf,..., Oskar van der Wal |
532 |
2022-11-18 |
link |
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models |
Guangxuan Xiao, Ji Lin,..., song han |
528 |
2023-01-31 |
link |
The Flan Collection: Designing Data and Methods for Effective Instruction Tuning |
Shayne Longpre, Le Hou,..., Adam Roberts |
452 |
2023-02-10 |
link |
Scaling Vision Transformers to 22 Billion Parameters |
Mostafa Dehghani, Josip Djolonga,..., Neil Houlsby |
448 |
2023-01-26 |
link |
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature |
Eric Mitchell, Yoonho Lee,..., Chelsea Finn |
442 |
2023-01-02 |
link |
SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot |
Elias Frantar, Dan Alistarh |
437 |
2023-01-02 |
link |
Muse: Text-To-Image Generation via Masked Generative Transformers |
Huiwen Chang, Han Zhang,..., Dilip Krishnan |
418 |
2022-11-30 |
link |
Fast Inference from Transformers via Speculative Decoding |
Yaniv Leviathan, Matan Kalman, Yossi Matias |
412 |
2023-01-31 |
link |
Large Language Models Can Be Easily Distracted by Irrelevant Context |
Freda Shi, Xinyun Chen,..., Denny Zhou |
398 |
2023-01-29 |
link |
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models |
Haohe Liu, Zehua Chen,..., Mark D Plumbley |
363 |
2022-11-18 |
link |
PAL: Program-aided Language Models |
Luyu Gao, Aman Madaan,..., Graham Neubig |
359 |
2023-01-24 |
link |
A Watermark for Large Language Models |
John Kirchenbauer, Jonas Geiping,..., Tom Goldstein |
355 |
2022-10-19 |
link |
Scaling Laws for Reward Model Overoptimization |
Leo Gao, John Schulman, Jacob Hilton |
349 |
2022-12-15 |
link |
Transformers learn in-context by gradient descent |
Johannes Von Oswald, Eyvind Niklasson,..., Max Vladymyrov |
280 |
2022-11-15 |
link |
Large Language Models Struggle to Learn Long-Tail Knowledge |
Nikhil Kandpal, Haikang Deng,..., Colin Raffel |
274 |
2023-02-16 |
link |
MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation |
Omer Bar-Tal, Lior Yariv,..., Tali Dekel |
273 |
2023-03-30 |
link |
Whose Opinions Do Language Models Reflect? |
Shibani Santurkar, Esin Durmus,..., Tatsunori Hashimoto |
244 |
2022-08-18 |
link |
Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies |
Gati Aher, Rosa I. Arriaga, Adam Tauman Kalai |
243 |
2023-01-30 |
link |
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models |
Rongjie Huang, Jiawei Huang,..., Zhou Zhao |
235 |
2023-02-20 |
link |
Composer: Creative and Controllable Image Synthesis with Composable Conditions |
Lianghua Huang, Di Chen,..., Jingren Zhou |
232 |
2022-11-18 |
link |
DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation |
Yuhang Lai, Chengxi Li,..., Tao Yu |
220 |
2023-02-21 |
link |
Hyena Hierarchy: Towards Larger Convolutional Language Models |
Michael Poli, Stefano Massaroli,..., Christopher Re |
213 |
2022-10-07 |
link |
Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding |
Kenton Lee, Mandar Joshi,..., Kristina Toutanova |
210 |
2023-01-17 |
link |
Prompting Large Language Model for Machine Translation: A Case Study |
Biao Zhang, Barry Haddow, Alexandra Birch |
202 |
2023-03-11 |
link |
Resurrecting Recurrent Neural Networks for Long Sequences |
Antonio Orvieto, Samuel L Smith,..., Soham De |
196 |
2023-01-24 |
link |
ClimaX: A foundation model for weather and climate |
Tung Nguyen, Johannes Brandstetter,..., Aditya Grover |
193 |
2023-01-30 |
link |
Specializing Smaller Language Models towards Multi-Step Reasoning |
Yao Fu, Hao Peng,..., Tushar Khot |
191 |
2022-12-18 |
link |
BEATs: Audio Pre-Training with Acoustic Tokenizers |
Sanyuan Chen, Yu Wu,..., Furu Wei |
189 |
2023-01-26 |
link |
simple diffusion: End-to-end diffusion for high resolution images |
Emiel Hoogeboom, Jonathan Heek, Tim Salimans |
180 |
2023-02-16 |
link |
Pretraining Language Models with Human Preferences |
Tomasz Korbak, Kejian Shi,..., Ethan Perez |
173 |
2022-12-19 |
link |
The case for 4-bit precision: k-bit Inference Scaling Laws |
Tim Dettmers, Luke Zettlemoyer |
172 |
2023-01-23 |
link |
StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis |
Axel Sauer, Tero Karras,..., Timo Aila |
168 |
2023-02-16 |
link |
LEVER: Learning to Verify Language-to-Code Generation with Execution |
Ansong Ni, Srini Iyer,..., Xi Victoria Lin |
163 |
2022-09-30 |
link |
TabDDPM: Modelling Tabular Data with Diffusion Models |
Akim Kotelnikov, Dmitry Baranchuk,..., Artem Babenko |
162 |
2023-02-09 |
link |
Better Diffusion Models Further Improve Adversarial Training |
Zekai Wang, Tianyu Pang,..., Shuicheng YAN |
155 |
2023-02-05 |
link |
SE(3) diffusion model with application to protein backbone generation |
Jason Yim, Brian L. Trippe,..., Tommi S. Jaakkola |
152 |
2023-04-14 |
link |
Cross-Entropy Loss Functions: Theoretical Analysis and Applications |
Anqi Mao, Mehryar Mohri, Yutao Zhong |
150 |
2023-05-01 |
link |
Poisoning Language Models During Instruction Tuning |
Alexander Wan, Eric Wallace,..., Dan Klein |
149 |
2023-02-20 |
link |
NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion |
Jiatao Gu, Alex Trevithick,..., Ravi Ramamoorthi |
149 |
2022-06-17 |
link |
VectorMapNet: End-to-end Vectorized HD Map Learning |
Yicheng Liu, Tianyuan Yuan,..., Hang Zhao |
140 |
2023-10-26 |
link |
Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time |
Zichang Liu, Jue WANG,..., Beidi Chen |
137 |
2023-01-26 |
link |
Principled Reinforcement Learning with Human Feedback from Pairwise or K-wise Comparisons |
Banghua Zhu, Michael Jordan, Jiantao Jiao |
133 |
2023-02-06 |
link |
Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning |
Thomas Carta, Clément ROMAC,..., Pierre-Yves Oudeyer |
125 |
2023-03-08 |
link |
Automatically Auditing Large Language Models via Discrete Optimization |
Erik Jones, Anca Dragan,..., Jacob Steinhardt |
124 |
2023-02-01 |
link |
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video |
Haiyang Xu, Qinghao Ye,..., Jingren Zhou |
122 |
2023-03-12 |
link |
One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale |
Fan Bao, Shen Nie,..., Jun Zhu |
120 |
2023-01-17 |
link |
Transformers as Algorithms: Generalization and Stability in In-context Learning |
Yingcong Li, Muhammed Emrullah Ildiz,..., Samet Oymak |
118 |
2023-01-27 |
link |
Image Restoration with Mean-Reverting Stochastic Differential Equations |
Ziwei Luo, Fredrik K. Gustafsson,..., Thomas B. Schön |
116 |
2022-11-27 |
link |
SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation |
Huaishao Luo, Junwei Bao,..., Tianrui Li |
115 |
2023-01-26 |
link |
Text-To-4D Dynamic Scene Generation |
Uriel Singer, Shelly Sheynin,..., Yaniv Taigman |
113 |
2022-11-24 |
link |
Fast Sampling of Diffusion Models via Operator Learning |
Hongkai Zheng, Weili Nie,..., Anima Anandkumar |
107 |
2023-02-12 |
link |
I2SB: Image-to-Image Schrödinger Bridge |
Guan-Horng Liu, Arash Vahdat,..., Anima Anandkumar |
106 |
2022-06-26 |
link |
Repository-Level Prompt Generation for Large Language Models of Code |
Disha Shrivastava, Hugo Larochelle, Daniel Tarlow |
104 |
2023-02-28 |
link |
GNOT: A General Neural Operator Transformer for Operator Learning |
Zhongkai Hao, Zhengyi Wang,..., Jun Zhu |
104 |
2023-04-06 |
link |
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark |
Alexander Pan, Jun Shern Chan,..., Dan Hendrycks |
103 |
2023-02-22 |
link |
Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMC |
Yilun Du, Conor Durkan,..., Will Sussman Grathwohl |
102 |
2023-03-24 |
link |
TRAK: Attributing Model Behavior at Scale |
Sung Min Park, Kristian Georgiev,..., Aleksander Madry |
96 |
2022-06-20 |
link |
Global Context Vision Transformers |
Ali Hatamizadeh, Hongxu Yin,..., Pavlo Molchanov |
95 |
2023-05-02 |
link |
Geometric Latent Diffusion Models for 3D Molecule Generation |
Minkai Xu, Alexander S Powers,..., Jure Leskovec |
94 |
2023-01-31 |
link |
Grounding Language Models to Images for Multimodal Inputs and Outputs |
Jing Yu Koh, Ruslan Salakhutdinov, Daniel Fried |
94 |
2022-11-03 |
link |
Improved Analysis of Score-based Generative Modeling: User-Friendly Bounds under Minimal Smoothness Assumptions |
Hongrui Chen, Holden Lee, Jianfeng Lu |
94 |
2023-01-18 |
link |
Human-Timescale Adaptation in an Open-Ended Task Space |
Jakob Bauer, Kate Baumli,..., Lei M Zhang |
93 |
2023-06-01 |
link |
LIV: Language-Image Representations and Rewards for Robotic Control |
Yecheng Jason Ma, Vikash Kumar,..., Dinesh Jayaraman |
92 |
2023-04-28 |
link |
Multisample Flow Matching: Straightening Flows with Minibatch Couplings |
Aram-Alexandre Pooladian, Heli Ben-Hamu,..., Ricky T. Q. Chen |
91 |
2022-12-22 |
link |
Scalable Adaptive Computation for Iterative Generation |
Allan Jabri, David J. Fleet, Ting Chen |
91 |
2023-03-09 |
link |
Cones: Concept Neurons in Diffusion Models for Customized Generation |
Zhiheng Liu, Ruili Feng,..., Yang Cao |
90 |
2023-06-01 |
link |
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles |
Chaitanya Ryali, Yuan-Ting Hu,..., Christoph Feichtenhofer |
89 |
2023-02-06 |
link |
On Over-Squashing in Message Passing Neural Networks: The Impact of Width, Depth, and Topology |
Francesco Di Giovanni, Lorenzo Giusti,..., Michael M. Bronstein |
89 |
2023-06-06 |
link |
Spherical Fourier Neural Operators: Learning Stable Dynamics on the Sphere |
Boris Bonev, Thorsten Kurth,..., Anima Anandkumar |
89 |
2023-02-02 |
link |
The unreasonable effectiveness of few-shot learning for machine translation |
Xavier Garcia, Yamini Bansal,..., Orhan Firat |
88 |
2023-02-11 |
link |
Compositional Exemplars for In-context Learning |
Jiacheng Ye, Zhiyong Wu,..., Lingpeng Kong |
85 |
2023-05-01 |
link |
Deep Graph Representation Learning and Optimization for Influence Maximization |
Chen Ling, Junji Jiang,..., Liang Zhao |
84 |
2023-02-02 |
link |
Are Diffusion Models Vulnerable to Membership Inference Attacks? |
Jinhao Duan, Fei Kong,..., Kaidi Xu |
83 |
2023-01-30 |
link |
Looped Transformers as Programmable Computers |
Angeliki Giannou, Shashank Rajput,..., Dimitris Papailiopoulos |
83 |
2023-02-09 |
link |
Adversarial Example Does Good: Preventing Painting Imitation from Diffusion Models via Adversarial Examples |
Chumeng Liang, Xiaoyu Wu,..., Haibing Guan |
81 |
2022-11-29 |
link |
Coder Reviewer Reranking for Code Generation |
Tianyi Zhang, Tao Yu,..., Sida Wang |
80 |
2023-02-06 |
link |
A Toy Model of Universality: Reverse Engineering How Networks Learn Group Operations |
Bilal Chughtai, Lawrence Chan, Neel Nanda |
80 |
2023-02-13 |
link |
Raising the Cost of Malicious AI-Powered Image Editing |
Hadi Salman, Alaa Khaddaj,..., Aleksander Madry |
80 |
None |
link |
Retrieval-Augmented Multimodal Language Modeling |
Michihiro Yasunaga, Armen Aghajanyan,..., Wen-tau Yih |
78 |
2023-01-10 |
link |
Scaling Laws for Generative Mixed-Modal Language Models |
Armen Aghajanyan, LILI YU,..., Luke Zettlemoyer |
75 |
2023-02-21 |
link |
On Provable Copyright Protection for Generative Models |
Nikhil Vyas, Sham M. Kakade, Boaz Barak |
75 |
None |
link |
Loss-Guided Diffusion Models for Plug-and-Play Controllable Generation |
Jiaming Song, Qinsheng Zhang,..., Arash Vahdat |
74 |
2023-01-30 |
link |
ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation |
Kaiwen Zhou, Kaizhi Zheng,..., Xin Eric Wang |
74 |
2022-12-28 |
link |
Cramming: Training a Language Model on a Single GPU in One Day |
Jonas Geiping, Tom Goldstein |
73 |
2023-01-23 |
link |
On the Expressive Power of Geometric Graph Neural Networks |
Chaitanya K. Joshi, Cristian Bodnar,..., Pietro Lio |
73 |
2023-03-10 |
link |
Exphormer: Sparse Transformers for Graphs |
Hamed Shirzad, Ameya Velingker,..., Ali Kemal Sinop |
72 |
2023-03-02 |
link |
Understanding plasticity in neural networks |
Clare Lyle, Zeyu Zheng,..., Will Dabney |
72 |
2022-10-23 |
link |
Multi-Objective GFlowNets |
Moksh Jain, Sharath Chandra Raparthy,..., Emmanuel Bengio |
71 |
2023-02-03 |
link |
AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners |
Zhixuan Liang, Yao Mu,..., Ping Luo |
70 |
2022-11-28 |
link |
Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models |
Dongjun Kim, Yeongmin Kim,..., Il-chul Moon |
69 |
2022-12-14 |
link |
Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language |
Alexei Baevski, Arun Babu,..., Michael Auli |
69 |
2022-09-24 |
link |
Interventional Causal Representation Learning |
Kartik Ahuja, Divyat Mahajan,..., Yoshua Bengio |
69 |
2023-02-14 |
link |
Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data |
Minshuo Chen, Kaixuan Huang,..., Mengdi Wang |
68 |
2023-02-07 |
link |
Exploring the Benefits of Training Expert Language Models over Instruction Tuning |
Joel Jang, Seungone Kim,..., Minjoon Seo |
68 |
2023-05-27 |
link |
Graph Inductive Biases in Transformers without Message Passing |
Liheng Ma, Chen Lin,..., Ser-Nam Lim |
68 |
2022-12-27 |
link |
A Generalization of ViT/MLP-Mixer to Graphs |
Xiaoxin He, Bryan Hooi,..., Xavier Bresson |
67 |
2023-02-24 |
link |
The Dormant Neuron Phenomenon in Deep Reinforcement Learning |
Ghada Sokar, Rishabh Agarwal,..., Utku Evci |
67 |
2023-02-03 |
link |
Structure-informed Language Models Are Protein Designers |
Zaixiang Zheng, Yifan Deng,..., Quanquan Gu |
66 |
2023-04-27 |
link |
Controlled Text Generation with Natural Language Instructions |
Wangchunshu Zhou, Yuchen Eleanor Jiang,..., Mrinmaya Sachan |
66 |
2023-01-30 |
link |
A theory of continuous generative flow networks |
Salem Lahlou, Tristan Deleu,..., Nikolay Malkin |
65 |
2023-05-30 |
link |
Bigger, Better, Faster: Human-level Atari with human-level efficiency |
Max Schwarzer, Johan Samir Obando Ceron,..., Pablo Samuel Castro |
65 |
2023-02-06 |
link |
Protecting Language Generation Models via Invisible Watermarking |
Xuandong Zhao, Yu-Xiang Wang, Lei Li |
65 |
2023-03-03 |
link |
Diffusion Models are Minimax Optimal Distribution Estimators |
Kazusato Oko, Shunta Akiyama, Taiji Suzuki |
64 |
2022-10-05 |
link |
RankMe: Assessing the downstream performance of pretrained self-supervised representations by their rank |
Quentin Garrido, Randall Balestriero,..., Yann LeCun |
64 |
2023-01-29 |
link |
Unifying Molecular and Textual Representations via Multi-task Language Modelling |
Dimitrios Christofidellis, Giorgio Giannone,..., Matteo Manica |
64 |
2023-02-07 |
link |
Reducing SO(3) Convolutions to SO(2) for Efficient Equivariant GNNs |
Saro Passaro, C. Lawrence Zitnick |
63 |
2023-02-06 |
link |
CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets |
Zachary Novack, Julian McAuley,..., Saurabh Garg |
63 |
2023-01-18 |
link |
Learning-Rate-Free Learning by D-Adaptation |
Aaron Defazio, Konstantin Mishchenko |
63 |
2022-08-18 |
link |
Open-Vocabulary Universal Image Segmentation with MaskCLIP |
Zheng Ding, Jieke Wang, Zhuowen Tu |
61 |
2022-12-20 |
link |
Model Ratatouille: Recycling Diverse Models for Out-of-Distribution Generalization |
Alexandre Rame, Kartik Ahuja,..., David Lopez-Paz |
60 |
2023-01-28 |
link |
Do Embodied Agents Dream of Pixelated Sheep?: Embodied Decision Making using Language Guided World Modelling |
Kolby Nottingham, Prithviraj Ammanabrolu,..., Roy Fox |
60 |
2023-06-01 |
link |
In or Out? Fixing ImageNet Out-of-Distribution Detection Evaluation |
Julian Bitterwolf, Maximilian Müller, Matthias Hein |
60 |
2023-01-29 |
link |
Generating Novel, Designable, and Diverse Protein Structures by Equivariantly Diffusing Oriented Residue Clouds |
Yeqing Lin, Mohammed AlQuraishi |
59 |
2022-11-29 |
link |
SinDDM: A Single Image Denoising Diffusion Model |
Vladimir Kulikov, Shahar Yadin,..., Tomer Michaeli |
58 |
2022-09-08 |
link |
Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL |
Taku Yamagata, Ahmed Khalil, Raul Santos-Rodriguez |
55 |
2022-12-12 |
link |
On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline |
Nicklas Hansen, Zhecheng Yuan,..., Xiaolong Wang |
55 |
2022-06-06 |
link |
Exploring Chemical Space with Score-based Out-of-distribution Generation |
Seul Lee, Jaehyeong Jo, Sung Ju Hwang |
55 |
2023-02-13 |
link |
Task-Specific Skill Localization in Fine-tuned Language Models |
Abhishek Panigrahi, Nikunj Saunshi,..., Sanjeev Arora |
54 |
2023-03-06 |
link |
Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-Type Samplers |
Sitan Chen, Giannis Daras, Alex Dimakis |
54 |
2022-11-29 |
link |
Linear Causal Disentanglement via Interventions |
Chandler Squires, Anna Seigal,..., Caroline Uhler |
54 |
2022-11-21 |
link |
SinFusion: Training Diffusion Models on a Single Image or Video |
Yaniv Nikankin, Niv Haim, michal Irani |
54 |
2023-02-16 |
link |
Aligning Language Models with Preferences through f-divergence Minimization |
Dongyoung Go, Tomasz Korbak,..., Marc Dymetman |
53 |
2023-02-01 |
link |
Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models |
Zhihong Shao, Yeyun Gong,..., Weizhu Chen |
52 |
None |
link |
Fourmer: An Efficient Global Modeling Paradigm for Image Restoration |
man zhou, Jie Huang,..., Chongyi Li |
52 |
2023-02-08 |
link |
PFGM++: Unlocking the Potential of Physics-Inspired Generative Models |
Yilun Xu, Ziming Liu,..., Tommi S. Jaakkola |
52 |
2023-02-02 |
link |
CLIPood: Generalizing CLIP to Out-of-Distributions |
Yang Shu, Xingzhuo Guo,..., Mingsheng Long |
51 |
2023-06-26 |
link |
LongCoder: A Long-Range Pre-trained Language Model for Code Completion |
Daya Guo, Canwen Xu,..., Julian McAuley |
51 |
2023-01-26 |
link |
A Fully First-Order Method for Stochastic Bilevel Optimization |
Jeongyeol Kwon, Dohyun Kwon,..., Robert D Nowak |
51 |
2023-05-10 |
link |
XTab: Cross-table Pretraining for Tabular Transformers |
Bingzhao Zhu, Xingjian Shi,..., Mahsa Shoaran |
50 |
2022-10-04 |
link |
Less is More: Task-aware Layer-wise Distillation for Language Model Compression |
Chen Liang, Simiao Zuo,..., Tuo Zhao |
50 |
2023-02-03 |
link |
Better Training of GFlowNets with Local Credit and Incomplete Trajectories |
Ling Pan, Nikolay Malkin,..., Yoshua Bengio |
50 |
2022-12-12 |
link |
Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes |
Jiafan He, Heyang Zhao,..., Quanquan Gu |
49 |
2022-10-11 |
link |
A Kernel-Based View of Language Model Fine-Tuning |
Sadhika Malladi, Alexander Wettig,..., Sanjeev Arora |
48 |
2023-04-13 |
link |
Language Instructed Reinforcement Learning for Human-AI Coordination |
Hengyuan Hu, Dorsa Sadigh |
48 |
2023-05-13 |
link |
DRew: Dynamically Rewired Message Passing with Delay |
Benjamin Gutteridge, Xiaowen Dong,..., Francesco Di Giovanni |
47 |
2023-02-14 |
link |
Understanding Oversquashing in GNNs through the Lens of Effective Resistance |
Mitchell Black, Zhengchao Wan,..., Yusu Wang |
47 |
2022-11-11 |
link |
Equivariance with Learned Canonicalization Functions |
Sékou-Oumar Kaba, Arnab Kumar Mondal,..., Siamak Ravanbakhsh |
47 |
2023-07-17 |
link |
Autoregressive Diffusion Model for Graph Generation |
Lingkai Kong, Jiaming Cui,..., Chao Zhang |
47 |
2022-11-28 |
link |
Revisiting Over-smoothing and Over-squashing using Ollivier's Ricci Curvature |
Khang Nguyen, Nong Minh Hieu,..., Tan Minh Nguyen |
47 |
2023-02-13 |
link |
Simple Hardware-Efficient Long Convolutions for Sequence Modeling |
Daniel Y Fu, Elliot L Epstein,..., Christopher Re |
46 |
2023-02-08 |
link |
DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule |
Maor Ivgi, Oliver Hinder, Yair Carmon |
46 |
2023-06-02 |
link |
Probabilistic Concept Bottleneck Models |
Eunji Kim, Dahuin Jung,..., Sungroh Yoon |
46 |
2023-06-20 |
link |
LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation |
Yixiao Li, Yifan Yu,..., Tuo Zhao |
46 |
2023-04-06 |
link |
Towards Coherent Image Inpainting Using Denoising Diffusion Implicit Models |
Guanhua Zhang, Jiabao Ji,..., Shiyu Chang |
46 |
2021-12-01 |
link |
The Price of Differential Privacy under Continual Observation |
Palak Jain, Sofya Raskhodnikova,..., Adam Smith |
46 |
2022-10-11 |
link |
SGD with large step sizes learns sparse features |
Maksym Andriushchenko, Aditya Vardhan Varre,..., Nicolas Flammarion |
45 |
2022-12-22 |
link |
Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise |
Zhenghao Lin, Yeyun Gong,..., Weizhu Chen |
45 |
2023-02-14 |
link |
A modern look at the relationship between sharpness and generalization |
Maksym Andriushchenko, Francesco Croce,..., Nicolas Flammarion |
44 |
2023-01-30 |
link |
Equivariant Architectures for Learning in Deep Weight Spaces |
Aviv Navon, Aviv Shamsian,..., Haggai Maron |
44 |
2023-04-10 |
link |
Reflected Diffusion Models |
Aaron Lou, Stefano Ermon |
44 |
2023-05-01 |
link |
Discover and Cure: Concept-aware Mitigation of Spurious Correlation |
Shirley Wu, Mert Yuksekgonul,..., James Zou |
43 |
None |
link |
Can Large Language Models Reason about Program Invariants? |
Kexin Pei, David Bieber,..., Pengcheng Yin |
43 |
2023-01-27 |
link |
Input Perturbation Reduces Exposure Bias in Diffusion Models |
Mang Ning, Enver Sangineto,..., Rita Cucchiara |
43 |
2023-02-14 |
link |
A Complete Expressiveness Hierarchy for Subgraph GNNs via Subgraph Weisfeiler-Lehman Tests |
Bohang Zhang, Guhao Feng,..., Liwei Wang |
43 |
2022-05-27 |
link |
Architecture-Agnostic Masked Image Modeling - From ViT back to CNN |
Siyuan Li, Di Wu,..., Stan Z. Li |
42 |
2023-01-27 |
link |
Minimizing Trajectory Curvature of ODE-based Generative Models |
Sangyun Lee, Beomsu Kim, Jong Chul Ye |
42 |
2023-02-06 |
link |
Domain Adaptation for Time Series Under Feature and Label Shifts |
Huan He, Owen Queen,..., Marinka Zitnik |
42 |
2024-02-26 |
link |
DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design |
Jiaqi Guan, Xiangxin Zhou,..., Quanquan Gu |
42 |
2023-02-05 |
link |
Multi-View Masked World Models for Visual Robotic Manipulation |
Younggyo Seo, Junsu Kim,..., Pieter Abbeel |
42 |
2023-05-11 |
link |
Towards Understanding and Improving GFlowNet Training |
Max W Shen, Emmanuel Bengio,..., Tommaso Biancalani |
42 |
None |
link |
DDGR: Continual Learning with Deep Diffusion-based Generative Replay |
Rui Gao, Weiwei Liu |
42 |
2023-04-18 |
link |
Hyperbolic Image-Text Representations |
Karan Desai, Maximilian Nickel,..., Shanmukha Ramakrishna Vedantam |
41 |
2023-02-09 |
link |
Bag of Tricks for Training Data Extraction from Language Models |
Weichen Yu, Tianyu Pang,..., Shuicheng YAN |
41 |
None |
link |
Fast Federated Machine Unlearning with Nonlinear Functional Theory |
Tianshi Che, Yang Zhou,..., Jun Huan |
41 |
2023-05-30 |
link |
FedDisco: Federated Learning with Discrepancy-Aware Collaboration |
Rui Ye, Mingkai Xu,..., Yanfeng Wang |
41 |
2023-05-03 |
link |
CLUSTSEG: Clustering for Universal Segmentation |
James Chenhao Liang, Tianfei Zhou,..., Wenguan Wang |
41 |
2023-02-10 |
link |
On Penalty-based Bilevel Gradient Descent Method |
Han Shen, Tianyi Chen |
41 |
2023-01-30 |
link |
Generalization on the Unseen, Logic Reasoning and Degree Curriculum |
Emmanuel Abbe, Samy Bengio,..., Kevin Rizk |
40 |
2023-02-10 |
link |
The Wisdom of Hindsight Makes Language Models Better Instruction Followers |
Tianjun Zhang, Fangchen Liu,..., Joseph E. Gonzalez |
40 |
2021-05-29 |
link |
Diffusion Based Representation Learning |
Sarthak Mittal, Korbinian Abstreiter,..., Arash Mehrjou |
40 |
2022-09-30 |
link |
Differentially Private Optimization on Large Model at Small Cost |
Zhiqi Bu, Yu-Xiang Wang,..., George Karypis |
39 |
2023-02-08 |
link |
Improving the Model Consistency of Decentralized Federated Learning |
Yifan Shi, Li Shen,..., Dacheng Tao |
39 |
2023-04-25 |
link |
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning |
Cheng Lu, Huayu Chen,..., Jun Zhu |
39 |
2022-03-14 |
link |
Invariance in Policy Optimisation and Partial Identifiability in Reward Learning |
Joar Max Viktor Skalse, Matthew Farrugia-Roberts,..., Adam Gleave |
38 |
2023-01-31 |
link |
Optimizing DDPM Sampling with Shortcut Fine-Tuning |
Ying Fan, Kangwook Lee |
38 |
2023-02-14 |
link |
Revisiting Weighted Aggregation in Federated Learning with Neural Networks |
Zexi Li, Tao Lin,..., Chao Wu |
38 |
2023-06-08 |
link |
Multi-Modal Classifiers for Open-Vocabulary Object Detection |
Prannay Kaul, Weidi Xie, Andrew Zisserman |
38 |
2022-09-29 |
link |
Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments |
Yixuan Wang, Simon Sinong Zhan,..., Qi Zhu |
37 |
2023-07-18 |
link |
Can Neural Network Memorization Be Localized? |
Pratyush Maini, Michael Curtis Mozer,..., Chiyuan Zhang |
37 |
2022-10-25 |
link |
Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models |
Hong Liu, Sang Michael Xie,..., Tengyu Ma |
37 |
None |
link |
Feature learning in deep classifiers through Intermediate Neural Collapse |
Akshay Rangamani, Marius Lindegaard,..., tomaso a poggio |
36 |
2022-10-17 |
link |
Forget Unlearning: Towards True Data-Deletion in Machine Learning |
Rishav Chourasia, Neil Shah |
36 |
2023-05-30 |
link |
How Does Information Bottleneck Help Deep Learning? |
Kenji Kawaguchi, Zhun Deng,..., Jiaoyang Huang |
36 |
2023-01-27 |
link |
On the Connection Between MPNN and Graph Transformer |
Chen Cai, Truong Son Hy,..., Yusu Wang |
36 |
2023-01-28 |
link |
A Closer Look at Few-shot Classification Again |
Xu Luo, Hao Wu,..., Jingkuan Song |
36 |
2022-11-12 |
link |
Multi-Epoch Matrix Factorization Mechanisms for Private Machine Learning |
Christopher A. Choquette-Choo, Hugh Brendan McMahan,..., Abhradeep Guha Thakurta |
36 |
2023-06-06 |
link |
On Pitfalls of Test-Time Adaptation |
Hao Zhao, Yuejiang Liu,..., Tao Lin |
36 |
2023-01-30 |
link |
GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration |
Naoki Murata, Koichi Saito,..., Stefano Ermon |
36 |
2022-10-07 |
link |
TAN without a burn: Scaling Laws of DP-SGD |
Tom Sander, Pierre Stock, Alexandre Sablayrolles |
36 |
2022-11-06 |
link |
Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning |
Yu Meng, Martin Michalski,..., Jiawei Han |
36 |
2022-11-15 |
link |
Mechanistic Mode Connectivity |
Ekdeep Singh Lubana, Eric J Bigelow,..., Hidenori Tanaka |
36 |
2023-02-10 |
link |
Controllability-Aware Unsupervised Skill Discovery |
Seohong Park, Kimin Lee,..., Pieter Abbeel |
36 |
2023-05-02 |
link |
Revisiting Gradient Clipping: Stochastic bias and tight convergence guarantees |
Anastasia Koloskova, Hadrien Hendrikx, Sebastian U Stich |
35 |
2023-05-01 |
link |
CSP: Self-Supervised Contrastive Spatial Pre-Training for Geospatial-Visual Representations |
Gengchen Mai, Ni Lao,..., Stefano Ermon |
35 |
2023-05-18 |
link |
Dirichlet Diffusion Score Model for Biological Sequence Generation |
Pavel Avdeyev, Chenlai Shi,..., Jian Zhou |
35 |
2023-05-04 |
link |
Efficient Personalized Federated Learning via Sparse Model-Adaptation |
Daoyuan Chen, Liuyi Yao,..., Yaliang Li |
35 |
2023-02-02 |
link |
High-Probability Bounds for Stochastic Optimization and Variational Inequalities: the Case of Unbounded Variance |
Abdurakhmon Sadiev, Marina Danilova,..., Peter Richtárik |
35 |
2023-05-28 |
link |
Dink-Net: Neural Clustering on Large Graphs |
Yue Liu, KE LIANG,..., Stan Z. Li |
35 |
2023-06-06 |
link |
On the Role of Attention in Prompt-tuning |
Samet Oymak, Ankit Singh Rawat,..., Christos Thrampoulidis |
35 |
2023-02-15 |
link |
Improved Online Conformal Prediction via Strongly Adaptive Online Learning |
Aadyot Bhatnagar, Huan Wang,..., Yu Bai |
35 |
2023-06-08 |
link |
CoCo: A Coupled Contrastive Framework for Unsupervised Domain Adaptive Graph Classification |
Nan Yin, Li Shen,..., Xiao Luo |
34 |
2023-01-26 |
link |
GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency |
Min-Seop Kwak, Jiuhn Song, Seungryong Kim |
34 |
2023-02-13 |
link |
GFlowNet-EM for learning compositional latent variable models |
Edward J Hu, Nikolay Malkin,..., Yoshua Bengio |
34 |
2023-05-31 |
link |
Towards Omni-generalizable Neural Methods for Vehicle Routing Problems |
Jianan Zhou, Yaoxin Wu,..., Jie Zhang |
34 |
2023-02-01 |
link |
End-to-End Full-Atom Antibody Design |
Xiangzhe Kong, Wenbing Huang, Yang Liu |
34 |
2023-04-25 |
link |
CoDi: Co-evolving Contrastive Diffusion Models for Mixed-type Tabular Synthesis |
Chaejeong Lee, Jayoung Kim, Noseong Park |
34 |
2022-11-21 |
link |
Neural networks trained with SGD learn distributions of increasing complexity |
Maria Refinetti, Alessandro Ingrosso, Sebastian Goldt |
34 |
2023-04-10 |
link |
Reinforcement Learning from Passive Data via Latent Intentions |
Dibya Ghosh, Chethan Anand Bhateja, Sergey Levine |
34 |
2022-07-05 |
link |
Mitigating Propagation Failures in Physics-informed Neural Networks using Retain-Resample-Release (R3) Sampling |
Arka Daw, Jie Bu,..., Anuj Karpatne |
34 |
2023-02-05 |
link |
KDEformer: Accelerating Transformers via Kernel Density Estimation |
Amir Zandieh, Insu Han,..., Amin Karbasi |
34 |
2023-03-06 |
link |
Enhancing Activity Prediction Models in Drug Discovery with the Ability to Understand Human Language |
Philipp Seidl, Andreu Vall,..., Günter Klambauer |
34 |
2023-05-20 |
link |
Lifelong Language Pretraining with Distribution-Specialized Experts |
Wuyang Chen, Yanqi Zhou,..., Claire Cui |
34 |
2023-06-12 |
link |
Diffusion Models for Black-Box Optimization |
Siddarth Krishnamoorthy, Satvik Mehul Mashkaria, Aditya Grover |
34 |
2022-09-30 |
link |
Data Poisoning Attacks Against Multimodal Encoders |
Ziqing Yang, Xinlei He,..., Yang Zhang |
34 |
2023-05-11 |
link |
MolDiff: Addressing the Atom-Bond Inconsistency Problem in 3D Molecule Diffusion Generation |
Xingang Peng, Jiaqi Guan,..., Jianzhu Ma |
34 |
2023-02-16 |
link |
Tuning computer vision models with task rewards |
André Susano Pinto, Alexander Kolesnikov,..., Xiaohua Zhai |
34 |
2023-05-06 |
link |
Efficient and Degree-Guided Graph Generation via Discrete Diffusion Modeling |
Xiaohui Chen, Jiaxing He,..., Liping Liu |
33 |
2023-02-28 |
link |
High Probability Convergence of Stochastic Gradient Methods |
Zijian Liu, Ta Duy Nguyen,..., Huy Nguyen |
33 |
2023-04-28 |
link |
FAENet: Frame Averaging Equivariant GNN for Materials Modeling |
Alexandre AGM Duval, Victor Schmidt,..., David Rolnick |
33 |
2023-01-26 |
link |
BiBench: Benchmarking and Analyzing Network Binarization |
Haotong Qin, Mingyuan Zhang,..., Xianglong Liu |
33 |
2023-06-19 |
link |
Simple and Fast Group Robustness by Automatic Feature Reweighting |
Shikai Qiu, Andres Potapczynski,..., Andrew Gordon Wilson |
33 |
2022-06-21 |
link |
Personalized Subgraph Federated Learning |
Jinheon Baek, Wonyong Jeong,..., Sung Ju Hwang |
33 |
2022-11-09 |
link |
Leveraging Offline Data in Online Reinforcement Learning |
Andrew Wagenmaker, Aldo Pacchiano |
32 |
2023-05-31 |
link |
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL |
Fei Ni, Jianye HAO,..., Zhixuan Liang |
32 |
2023-04-25 |
link |
AdaNPC: Exploring Non-Parametric Classifier for Test-Time Adaptation |
YiFan Zhang, xue wang,..., Tieniu Tan |
32 |
2023-06-03 |
link |
Provable Dynamic Fusion for Low-Quality Multimodal Data |
Qingyang Zhang, Haitao Wu,..., Xi Peng |
32 |
2022-06-08 |
link |
Neural Diffusion Processes |
Vincent Dutordoir, Alan Saul,..., Fergus Simpson |
32 |
None |
link |
Personalized Federated Learning with Inferred Collaboration Graphs |
Rui Ye, Zhenyang Ni,..., Yanfeng Wang |
32 |
2023-01-26 |
link |
Neural Inverse Operators for Solving PDE Inverse Problems |
Roberto Molinaro, Yunan Yang,..., Siddhartha Mishra |
32 |
2023-02-03 |
link |
Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies |
Ilyas Fatkhullin, Anas Barakat,..., Niao He |
32 |
2022-09-28 |
link |
Causal Proxy Models for Concept-Based Model Explanations |
Zhengxuan Wu, Karel D'Oosterlinck,..., Christopher Potts |
32 |
2023-06-15 |
link |
Feed Two Birds with One Scone: Exploiting Wild Data for Both Out-of-Distribution Generalization and Detection |
Haoyue Bai, Gregory Canal,..., Yixuan Li |
32 |
2022-06-10 |
link |
Bayesian Estimation of Differential Privacy |
Santiago Zanella-Beguelin, Lukas Wutschitz,..., Daniel Jones |
32 |
2023-05-04 |
link |
Masked Trajectory Models for Prediction, Representation, and Control |
Philipp Wu, Arjun Majumdar,..., Aravind Rajeswaran |
31 |
2023-01-31 |
link |
UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers |
Dachuan Shi, Chaofan Tao,..., Jiaqi Wang |
31 |
2023-02-09 |
link |
An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning |
Jaesik Yoon, Yi-Fu Wu,..., Sungjin Ahn |
31 |
2023-05-15 |
link |
Straightening Out the Straight-Through Estimator: Overcoming Optimization Challenges in Vector Quantized Networks |
Minyoung Huh, Brian Cheung,..., Phillip Isola |
31 |
2023-01-27 |
link |
Direct Parameterization of Lipschitz-Bounded Deep Networks |
Ruigang Wang, Ian Manchester |
30 |
None |
link |
GOAT: A Global Transformer on Large-scale Graphs |
Kezhi Kong, Jiuhai Chen,..., Tom Goldstein |
30 |
2023-02-19 |
link |
Why Is Public Pretraining Necessary for Private Model Training? |
Arun Ganesh, MAHDI HAGHIFAM,..., Lun Wang |
30 |
2022-09-08 |
link |
Data Feedback Loops: Model-driven Amplification of Dataset Biases |
Rohan Taori, Tatsunori Hashimoto |
29 |
2023-02-11 |
link |
Cross-Modal Fine-Tuning: Align then Refine |
Junhong Shen, Liam Li,..., Ameet Talwalkar |
29 |
2023-05-24 |
link |
Reconstructive Neuron Pruning for Backdoor Defense |
Yige Li, Xixiang Lyu,..., Yu-Gang Jiang |
29 |
2023-06-26 |
link |
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer |
Yao Lai, Jinxin Liu,..., Ping Luo |
29 |
2023-04-27 |
link |
Interpretable Neural-Symbolic Concept Reasoning |
Pietro Barbiero, Gabriele Ciravegna,..., Giuseppe Marra |
29 |
2022-09-30 |
link |
Fed-CBS: A Heterogeneity-Aware Client Sampling Mechanism for Federated Learning via Class-Imbalance Reduction |
Jianyi Zhang, Ang Li,..., Hai Li |
29 |
2023-05-10 |
link |
Text-To-Concept (and Back) via Cross-Model Alignment |
Mazda Moayeri, Keivan Rezaei,..., Soheil Feizi |
29 |
2022-12-07 |
link |
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion |
Hanqing Zhao, Dianmo Sheng,..., Nenghai Yu |
29 |
2022-05-28 |
link |
A Closer Look at Self-supervised Lightweight Vision Transformers |
Shaoru Wang, Jin Gao,..., Weiming Hu |
29 |
2023-01-27 |
link |
Understanding Incremental Learning of Gradient Descent: A Fine-grained Analysis of Matrix Sensing |
Jikai Jin, Zhiyuan Li,..., Jason D. Lee |
29 |
2022-10-11 |
link |
Linkless Link Prediction via Relational Distillation |
Zhichun Guo, William Shiao,..., Tong Zhao |
29 |
2023-05-01 |
link |
Personalized Federated Learning under Mixture of Distributions |
Yue Wu, SHUAICHENG ZHANG,..., Wei Cheng |
28 |
2022-11-29 |
link |
On the power of foundation models |
Yang Yuan |
28 |
2023-01-30 |
link |
Solving High-Dimensional PDEs with Latent Spectral Models |
Haixu Wu, Tengge Hu,..., Mingsheng Long |
28 |
2023-05-22 |
link |
Learning Subpocket Prototypes for Generalizable Structure-based Drug Design |
ZAIXI ZHANG, Qi Liu |
28 |
2022-12-24 |
link |
Deep Latent State Space Models for Time-Series Generation |
Linqi Zhou, Michael Poli,..., Stefano Ermon |
28 |
2023-02-02 |
link |
The Power of Preconditioning in Overparameterized Low-Rank Matrix Sensing |
Xingyu Xu, Yandi Shen,..., Cong Ma |
27 |
2023-02-07 |
link |
Optimal Stochastic Non-smooth Non-convex Optimization through Online-to-Non-convex Conversion |
Ashok Cutkosky, Harsh Mehta, Francesco Orabona |
27 |
2022-10-13 |
link |
Action Matching: Learning Stochastic Dynamics from Samples |
Kirill Neklyudov, Rob Brekelmans,..., Alireza Makhzani |
27 |
2023-05-23 |
link |
Provably Learning Object-Centric Representations |
Jack Brady, Roland S. Zimmermann,..., Wieland Brendel |
27 |
2022-10-13 |
link |
On the Identifiability and Estimation of Causal Location-Scale Noise Models |
Alexander Immer, Christoph Schultheiss,..., Alexander Marx |
27 |
2023-01-31 |
link |
Transformers Meet Directed Graphs |
Simon Geisler, Yujia Li,..., Cosmin Paduraru |
27 |
2023-03-02 |
link |
Dropout Reduces Underfitting |
Zhuang Liu, Zhiqiu Xu,..., Trevor Darrell |
27 |
2023-06-07 |
link |
Effective Neural Topic Modeling with Embedding Clustering Regularization |
Xiaobao Wu, Xinshuai Dong,..., Anh Tuan Luu |
27 |
2023-02-13 |
link |
A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models |
James Urquhart Allingham, Jie Ren,..., Balaji Lakshminarayanan |
27 |
2023-02-22 |
link |
Equivariant Polynomials for Graph Neural Networks |
Omri Puny, Derek Lim,..., Yaron Lipman |
27 |
2023-05-02 |
link |
On Uni-Modal Feature Learning in Supervised Multi-Modal Learning |
Chenzhuang Du, Jiaye Teng,..., Hang Zhao |
26 |
2023-06-27 |
link |
High Fidelity Image Counterfactuals with Probabilistic Causal Models |
Fabio De Sousa Ribeiro, Tian Xia,..., Ben Glocker |
26 |
2023-04-03 |
link |
Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning |
Tongzhou Wang, Antonio Torralba,..., Amy Zhang |
26 |
2023-02-13 |
link |
Geometric Clifford Algebra Networks |
David Ruhe, Jayesh K Gupta,..., Johannes Brandstetter |
26 |
2022-10-15 |
link |
Deep Regression Unlearning |
Ayush Kumar Tarun, Vikram Singh Chundawat,..., Mohan Kankanhalli |
26 |
2023-01-31 |
link |
Image Shortcut Squeezing: Countering Perturbative Availability Poisons with Compression |
Zhuoran Liu, Zhengyu Zhao, Martha Larson |
26 |
2023-02-09 |
link |
The Monge Gap: A Regularizer to Learn All Transport Maps |
Théo Uscidda, marco cuturi |
26 |
2023-05-31 |
link |
InGram: Inductive Knowledge Graph Embedding via Relation Graphs |
Jaejun Lee, Chanyoung Chung, Joyce Jiyoung Whang |
26 |
2022-11-03 |
link |
Fair and Optimal Classification via Post-Processing |
Ruicheng Xian, Lang Yin, Han Zhao |
26 |
2023-02-06 |
link |
The SSL Interplay: Augmentations, Inductive Bias, and Generalization |
Vivien Cabannes, Bobak Kiani,..., Alberto Bietti |
26 |
None |
link |
Propensity Matters: Measuring and Enhancing Balancing for Recommendation |
Haoxuan Li, Yanghao Xiao,..., Peng Cui |
26 |
2023-02-25 |
link |
Does a Neural Network Really Encode Symbolic Concept? |
Mingjie Li, Quanshi Zhang |
26 |
2022-06-02 |
link |
Faster Rates of Convergence to Stationary Points in Differentially Private Optimization |
Raman Arora, Raef Bassily,..., Enayat Ullah |
26 |
2023-05-16 |
link |
Mimetic Initialization of Self-Attention Layers |
Asher Trockman, J Zico Kolter |
26 |
None |
link |
VIMA: Robot Manipulation with Multimodal Prompts |
Yunfan Jiang, Agrim Gupta,..., Linxi Fan |
25 |
2023-05-25 |
link |
Beyond Reward: Offline Preference-guided Policy Optimization |
Yachen Kang, Diyuan Shi,..., Donglin Wang |
25 |
2022-12-06 |
link |
Understanding Self-Predictive Learning for Reinforcement Learning |
Yunhao Tang, Zhaohan Daniel Guo,..., Michal Valko |
25 |
2022-09-30 |
link |
Implicit Neural Spatial Representations for Time-dependent PDEs |
Honglin Chen, Rundi Wu,..., Peter Yichen Chen |
25 |
2023-05-19 |
link |
Dynamic Regularized Sharpness Aware Minimization in Federated Learning: Approaching Global Consistency and Smooth Landscape |
Yan Sun, Li Shen,..., Dacheng Tao |
25 |
2023-06-05 |
link |
Structural Re-weighting Improves Graph Domain Adaptation |
Shikun Liu, Tianchun Li,..., Pan Li |
25 |
2023-06-14 |
link |
InfoDiffusion: Representation Learning Using Information Maximizing Diffusion Models |
Yingheng Wang, Yair Schiff,..., Volodymyr Kuleshov |
25 |
2023-06-15 |
link |
OMS-DPM: Optimizing the Model Schedule for Diffusion Probabilistic Models |
Enshu Liu, Xuefei Ning,..., Yu Wang |
25 |
2023-01-27 |
link |
PLay: Parametrically Conditioned Layout Generation using Latent Diffusion |
Chin-Yi Cheng, Forrest Huang,..., Yang Li |
25 |
2022-11-28 |
link |
Topologically faithful image segmentation via induced matching of persistence barcodes |
Nico Daniel Stucki, Johannes C. Paetzold,..., Ulrich Bauer |
25 |
2023-03-27 |
link |
On the stepwise nature of self-supervised learning |
James B Simon, Maksis Knutins,..., Joshua Albrecht |
25 |
2023-01-01 |
link |
Neural Collapse in Deep Linear Networks: From Balanced to Imbalanced Data |
Hien Dang, Tho Tran Huu,..., Tan Minh Nguyen |
25 |
2023-04-08 |
link |
Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning |
Yu Yang, Besmira Nushi,..., Baharan Mirzasoleiman |
25 |
2023-06-09 |
link |
Group Equivariant Fourier Neural Operators for Partial Differential Equations |
Jacob Helwig, Xuan Zhang,..., Shuiwang Ji |
25 |
2023-02-09 |
link |
Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames |
Ondrej Biza, Sjoerd van Steenkiste,..., Thomas Kipf |
25 |
2023-03-03 |
link |
Uncertainty Estimation by Fisher Information-based Evidential Deep Learning |
Danruo DENG, Guangyong Chen,..., Pheng-Ann Heng |
25 |
2023-05-03 |
link |
Beyond Homophily: Reconstructing Structure for Graph-agnostic Clustering |
Erlin Pan, zhao kang |
25 |
2023-02-20 |
link |
Unsupervised Out-of-Distribution Detection with Diffusion Inpainting |
Zhenzhen Liu, Jin Peng Zhou,..., Kilian Q Weinberger |
24 |
2023-05-19 |
link |
Robust Counterfactual Explanations for Neural Networks With Probabilistic Guarantees |
Faisal Hamman, Erfaun Noorani,..., Sanghamitra Dutta |
24 |
None |
link |
Conformal Prediction Sets for Graph Neural Networks |
Soroush H. Zargarbashi, Simone Antonelli, Aleksandar Bojchevski |
24 |
2022-12-07 |
link |
Sequential Predictive Conformal Inference for Time Series |
Chen Xu, Yao Xie |
24 |
2023-02-12 |
link |
Theory on Forgetting and Generalization of Continual Learning |
Sen Lin, Peizhong Ju,..., Ness Shroff |
24 |
2022-10-10 |
link |
Second-order regression models exhibit progressive sharpening to the edge of stability |
Atish Agarwala, Fabian Pedregosa, Jeffrey Pennington |
24 |
2023-06-08 |
link |
Improving Visual Prompt Tuning for Self-supervised Vision Transformers |
Seungryong Yoo, Eunji Kim,..., Sungroh Yoon |
24 |
2023-02-06 |
link |
Sampling-Based Accuracy Testing of Posterior Estimators for General Inference |
Pablo Lemos, Adam Coogan,..., Laurence Perreault-Levasseur |
24 |
2022-10-09 |
link |
FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation |
Chieh-Hsin Lai, Yuhta Takida,..., Stefano Ermon |
24 |
2023-06-13 |
link |
Finding the Missing-half: Graph Complementary Learning for Homophily-prone and Heterophily-prone Graphs |
YIZHEN ZHENG, He Zhang,..., Shirui Pan |
24 |
2023-05-28 |
link |
A Group Symmetric Stochastic Differential Equation Model for Molecule Multi-modal Pretraining |
Shengchao Liu, weitao Du,..., Jian Tang |
24 |
2022-11-04 |
link |
Modeling Temporal Data as Continuous Functions with Stochastic Process Diffusion |
Marin Biloš, Kashif Rasul,..., Stephan Günnemann |
24 |
2023-05-16 |
link |
Synthetic data, real errors: how (not) to publish and use synthetic data |
Boris van Breugel, Zhaozhi Qian, Mihaela van der Schaar |
24 |
None |
link |
IRNeXt: Rethinking Convolutional Network Design for Image Restoration |
Yuning Cui, Wenqi Ren,..., Alois Knoll |
24 |
2022-10-15 |
link |
Sketching for First Order Method: Efficient Algorithm for Low-Bandwidth Channel and Vulnerability |
Zhao Song, Yitan Wang,..., Lichen Zhang |
24 |
2023-03-26 |
link |
Inverse Reinforcement Learning without Reinforcement Learning |
Gokul Swamy, David Wu,..., Steven Wu |
24 |
2023-02-06 |
link |
RLSbench: Domain Adaptation Under Relaxed Label Shift |
Saurabh Garg, Nick Erickson,..., Zachary Chase Lipton |
24 |
2023-01-27 |
link |
SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient |
Max Ryabinin, Tim Dettmers,..., Alexander Borzunov |
24 |
2023-02-14 |
link |
Self-supervised learning of Split Invariant Equivariant representations |
Quentin Garrido, Laurent Najman, Yann LeCun |
23 |
2022-09-12 |
link |
Cocktail Party Attack: Breaking Aggregation-Based Privacy in Federated Learning using Independent Component Analysis |
Sanjay Kariyappa, Chuan Guo,..., Hsien-Hsin S. Lee |
23 |
2023-06-12 |
link |
Efficient Approximations of Complete Interatomic Potentials for Crystal Property Prediction |
Yuchao Lin, Keqiang Yan,..., Shuiwang Ji |
23 |
None |
link |
QASA: Advanced Question Answering on Scientific Articles |
Yoonjoo Lee, Kyungjae Lee,..., Moontae Lee |
23 |
2023-06-08 |
link |
Non-autoregressive Conditional Diffusion Models for Time Series Prediction |
Lifeng Shen, James Kwok |
23 |
None |
link |
ODS: Test-Time Adaptation in the Presence of Open-World Data Shift |
Zhi Zhou, Lan-Zhe Guo,..., Yu-Feng Li |
23 |
2023-02-01 |
link |
Deterministic equivalent and error universality of deep random features learning |
Dominik Schröder, Hugo Cui,..., Bruno Loureiro |
23 |
2023-02-06 |
link |
On the Convergence of Federated Averaging with Cyclic Client Participation |
Yae Jee Cho, Pranay Sharma,..., Tong Zhang |
23 |
2022-10-29 |
link |
Perturbation Analysis of Neural Collapse |
Tom Tirer, Haoxiang Huang, Jonathan Niles-Weed |
23 |
2023-07-19 |
link |
Rethinking Backdoor Attacks |
Alaa Khaddaj, Guillaume Leclerc,..., Aleksander Madry |
23 |
2023-02-13 |
link |
Near-Optimal Cryptographic Hardness of Agnostically Learning Halfspaces and ReLU Regression under Gaussian Marginals |
Ilias Diakonikolas, Daniel Kane, Lisheng Ren |
23 |
2023-02-20 |
link |
Neural Algorithmic Reasoning with Causal Regularisation |
Beatrice Bevilacqua, Kyriacos Nikiforou,..., Petar Veličković |
23 |
None |
link |
Hierarchical Diffusion for Offline Decision Making |
Wenhao Li, Xiangfeng Wang,..., Hongyuan Zha |
23 |
2023-03-26 |
link |
Prototype-Sample Relation Distillation: Towards Replay-Free Continual Learning |
Nader Asadi, MohammadReza Davari,..., Eugene Belilovsky |
23 |
2022-12-29 |
link |
Policy Mirror Ascent for Efficient and Independent Learning in Mean Field Games |
Batuhan Yardim, Semih Cayci,..., Niao He |
23 |
2023-02-19 |
link |
Scaling Laws for Multilingual Neural Machine Translation |
Patrick Fernandes, Behrooz Ghorbani,..., Orhan Firat |
22 |
2023-06-01 |
link |
FlexRound: Learnable Rounding based on Element-wise Division for Post-Training Quantization |
Jung Hyun Lee, Jeonghoon Kim,..., Dongsoo Lee |
22 |
2023-06-26 |
link |
Parameter-Level Soft-Masking for Continual Learning |
Tatsuya Konishi, Mori Kurokawa,..., Bing Liu |
22 |
None |
link |
CocktailSGD: Fine-tuning Foundation Models over 500Mbps Networks |
Jue WANG, Yucheng Lu,..., Ce Zhang |
22 |
2023-01-26 |
link |
Rigid body flows for sampling molecular crystal structures |
Jonas Köhler, Michele Invernizzi,..., Frank Noe |
22 |
2023-06-04 |
link |
Towards Deep Attention in Graph Neural Networks: Problems and Remedies |
Soo Yong Lee, Fanchen Bu,..., Kijung Shin |
22 |
2023-05-26 |
link |
Emergent Agentic Transformer from Chain of Hindsight Experience |
Hao Liu, Pieter Abbeel |
22 |
2023-05-11 |
link |
Continual Vision-Language Representation Learning with Off-Diagonal Information |
Zixuan Ni, Longhui Wei,..., Qi Tian |
22 |
2023-03-30 |
link |
Neural signature kernels as infinite-width-depth-limits of controlled ResNets |
Nicola Muca Cirone, Maud Lemercier, Cristopher Salvi |
22 |
2023-02-28 |
link |
Stochastic Gradient Descent under Markovian Sampling Schemes |
Mathieu Even |
22 |
2023-02-06 |
link |
In Search of Insights, Not Magic Bullets: Towards Demystification of the Model Selection Dilemma in Heterogeneous Treatment Effect Estimation |
Alicia Curth, Mihaela van der Schaar |
22 |
2022-12-08 |
link |
Mitigating Memorization of Noisy Labels by Clipping the Model Prediction |
Hongxin Wei, HUIPING ZHUANG,..., Yixuan Li |
22 |
2023-02-03 |
link |
From Robustness to Privacy and Back |
Hilal Asi, Jonathan Ullman, Lydia Zakynthinou |
22 |
2023-01-23 |
link |
Modality-Agnostic Variational Compression of Implicit Neural Representations |
Jonathan Richard Schwarz, Jihoon Tack,..., Jinwoo Shin |
22 |
None |
link |
HOPE: High-order Graph ODE For Modeling Interacting Dynamics |
Xiao Luo, Jingyang Yuan,..., Yizhou Sun |
21 |
2023-02-05 |
link |
Revisiting Discriminative vs. Generative Classifiers: Theory and Implications |
Chenyu Zheng, Guoqiang Wu,..., Jun Zhu |
21 |
2023-06-02 |
link |
Hyperparameters in Reinforcement Learning and How To Tune Them |
Theresa Eimer, Marius Lindauer, Roberta Raileanu |
21 |
2022-06-10 |
link |
Meta Optimal Transport |
Brandon Amos, Giulia Luise,..., Ievgen Redko |
21 |
2023-02-03 |
link |
Measuring The Impact Of Programming Language Distribution |
Gabriel Orlanski, Kefan Xiao,..., Michele Catasta |
21 |
2023-06-02 |
link |
Towards Sustainable Learning: Coresets for Data-efficient Deep Learning |
Yu Yang, Kang Hao, Baharan Mirzasoleiman |
21 |
2022-10-23 |
link |
Decentralized Stochastic Bilevel Optimization with Improved Per-Iteration Complexity |
Xuxing Chen, Minhui Huang,..., Krishna Balasubramanian |
21 |
2022-01-28 |
link |
Improving Expert Predictions with Conformal Prediction |
Eleni Straitouri, Lequn Wang,..., Manuel Gomez Rodriguez |
21 |
2022-10-05 |
link |
Reprogramming Pretrained Language Models for Antibody Sequence Infilling |
Igor Melnyk, Vijil Chenthamarakshan,..., Devleena Das |
21 |
2023-04-20 |
link |
B-Learner: Quasi-Oracle Bounds on Heterogeneous Causal Effects Under Hidden Confounding |
Miruna Oprescu, Jacob Dorn,..., Uri Shalit |
21 |
None |
link |
Random Shuffle Transformer for Image Restoration |
Jie Xiao, Xueyang Fu,..., Zheng-Jun Zha |
21 |
2023-06-08 |
link |
Efficient and Equivariant Graph Networks for Predicting Quantum Hamiltonian |
Haiyang Yu, Zhao Xu,..., Shuiwang Ji |
21 |
2022-07-22 |
link |
Discrete Key-Value Bottleneck |
Frederik Träuble, Anirudh Goyal,..., Bernhard Schölkopf |
21 |
2022-12-19 |
link |
Answering Complex Logical Queries on Knowledge Graphs via Query Computation Tree Optimization |
Yushi Bai, Xin Lv,..., Lei Hou |
21 |
2023-01-26 |
link |
WL meet VC |
Christopher Morris, Floris Geerts,..., Martin Grohe |
21 |
2022-05-26 |
link |
FedBR: Improving Federated Learning on Heterogeneous Data via Local Learning Bias Reduction |
Yongxin Guo, Xiaoying Tang, Tao Lin |
21 |
2023-05-06 |
link |
Improved Techniques for Maximum Likelihood Estimation for Diffusion ODEs |
Kaiwen Zheng, Cheng Lu,..., Jun Zhu |
21 |
2023-02-04 |
link |
Oscillation-free Quantization for Low-bit Vision Transformers |
Shih-yang Liu, Zechun Liu, Kwang-Ting Cheng |
21 |
2023-02-07 |
link |
Averaged Method of Multipliers for Bi-Level Optimization without Lower-Level Strong Convexity |
Risheng Liu, Yaohua Liu,..., Jin Zhang |
20 |
2023-02-07 |
link |
How to Trust Your Diffusion Model: A Convex Optimization Approach to Conformal Risk Control |
Jacopo Teneggi, Matthew Tivnan,..., Jeremias Sulam |
20 |
2023-06-09 |
link |
Quantifying the Knowledge in GNNs for Reliable Distillation into MLPs |
Lirong Wu, Haitao Lin,..., Stan Z. Li |
20 |
2023-05-29 |
link |
Crafting Training Degradation Distribution for the Accuracy-Generalization Trade-off in Real-World Super-Resolution |
Ruofan Zhang, Jinjin Gu,..., Wenming Yang |
20 |
2022-11-25 |
link |
GREAD: Graph Neural Reaction-Diffusion Networks |
Jeongwhan Choi, Seoyoung Hong,..., Sung-Bae Cho |
20 |
2023-03-06 |
link |
Generalized-Smooth Nonconvex Optimization is As Efficient As Smooth Nonconvex Optimization |
Ziyi Chen, Yi Zhou,..., Zhaosong Lu |
20 |
2023-03-08 |
link |
Ewald-based Long-Range Message Passing for Molecular Graphs |
Arthur Kosmala, Johannes Gasteiger,..., Stephan Günnemann |
20 |
2023-05-30 |
link |
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? |
Rui Yang, LIN Yong,..., Tong Zhang |
20 |
2023-05-29 |
link |
Brainformers: Trading Simplicity for Efficiency |
Yanqi Zhou, Nan Du,..., Jeff Dean |
20 |
2023-04-29 |
link |
POUF: Prompt-oriented unsupervised fine-tuning for large pre-trained models |
Korawat Tanwisuth, Shujian Zhang,..., Mingyuan Zhou |
20 |
2022-07-13 |
link |
Learning Deep Time-index Models for Time Series Forecasting |
Gerald Woo, Chenghao Liu,..., Steven Hoi |
20 |
None |
link |
CodeIPPrompt: Intellectual Property Infringement Assessment of Code Language Models |
Zhiyuan Yu, Yuhao Wu,..., Chaowei Xiao |
20 |
2023-04-29 |
link |
The Ideal Continual Learner: An Agent That Never Forgets |
Liangzu Peng, Paris Giampouras, Rene Vidal |
20 |
2023-02-27 |
link |
Internet Explorer: Targeted Representation Learning on the Open Web |
Alexander Cong Li, Ellis Langham Brown,..., Deepak Pathak |
20 |
2022-07-28 |
link |
Regret Minimization and Convergence to Equilibria in General-sum Markov Games |
Liad Erez, Tal Lancewicki,..., Yishay Mansour |
20 |
2023-02-24 |
link |
Graph Neural Networks with Learnable and Optimal Polynomial Bases |
Yuhe Guo, Zhewei Wei |
20 |
2022-06-22 |
link |
Generative Pretraining for Black-Box Optimization |
Satvik Mehul Mashkaria, Siddarth Krishnamoorthy, Aditya Grover |
19 |
2022-07-13 |
link |
Hindsight Learning for MDPs with Exogenous Inputs |
Sean R. Sinclair, Felipe Vieira Frujeri,..., Adith Swaminathan |
19 |
2023-02-04 |
link |
Counterfactual Identifiability of Bijective Causal Models |
Arash Nasr-Esfahany, Mohammad Alizadeh, Devavrat Shah |
19 |
2023-07-05 |
link |
DeSRA: Detect and Delete the Artifacts of GAN-based Real-World Super-Resolution Models |
Liangbin Xie, Xintao Wang,..., Chao Dong |
19 |
2023-02-03 |
link |
Multi-channel Autobidding with Budget and ROI Constraints |
Yuan Deng, Negin Golrezaei,..., Vahab Mirrokni |
19 |
2023-01-27 |
link |
Pre-training for Speech Translation: CTC Meets Optimal Transport |
Phuong-Hang Le, Hongyu Gong,..., Didier Schwab |
19 |
2023-07-01 |
link |
Actor-Critic Alignment for Offline-to-Online Reinforcement Learning |
Zishun Yu, Xinhua Zhang |
19 |
2023-05-27 |
link |
The Implicit Regularization of Dynamical Stability in Stochastic Gradient Descent |
Lei Wu, Weijie J Su |
19 |
2023-06-09 |
link |
Path Neural Networks: Expressive and Accurate Graph Neural Networks |
Gaspard Michel, Giannis Nikolentzos,..., Michalis Vazirgiannis |
19 |
2022-09-30 |
link |
EF21-P and Friends: Improved Theoretical Communication Complexity for Distributed Optimization with Bidirectional Compression |
Kaja Gruntkowska, Alexander Tyurin, Peter Richtárik |
19 |
2022-12-20 |
link |
Settling the Reward Hypothesis |
Michael Bowling, John D Martin,..., Will Dabney |
19 |
2022-10-20 |
link |
Sketching Meets Differential Privacy: Fast Algorithm for Dynamic Kronecker Projection Maintenance |
Zhao Song, Xin Yang,..., Lichen Zhang |
19 |
2023-04-10 |
link |
Forward-backward Gaussian variational inference via JKO in the Bures-Wasserstein Space |
Michael Ziyang Diao, Krishna Balasubramanian,..., Adil Salim |
19 |
2023-01-31 |
link |
Anti-Exploration by Random Network Distillation |
Alexander Nikulin, Vladislav Kurenkov,..., Sergey Kolesnikov |
19 |
2023-02-28 |
link |
A Closer Look at the Intervention Procedure of Concept Bottleneck Models |
Sungbin Shin, Yohan Jo,..., Namhoon Lee |
19 |
2023-06-11 |
link |
On Kinetic Optimal Probability Paths for Generative Models |
Neta Shaul, Ricky T. Q. Chen,..., Yaron Lipman |
19 |
2022-12-17 |
link |
Two-Scale Gradient Descent Ascent Dynamics Finds Mixed Nash Equilibria of Continuous Games: A Mean-Field Perspective |
Yulong Lu |
19 |
2023-04-28 |
link |
Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation |
Wenqing Zheng, S P Sharan,..., Zhangyang Wang |
19 |
2023-01-29 |
link |
Distilling Internet-Scale Vision-Language Models into Embodied Agents |
Theodore Sumers, Kenneth Marino,..., Ishita Dasgupta |
19 |
2023-06-07 |
link |
On the Generalization of Multi-modal Contrastive Learning |
Qi Zhang, Yifei Wang, Yisen Wang |
19 |
2023-05-25 |
link |
Fascinating Supervisory Signals and Where to Find Them: Deep Anomaly Detection with Scale Learning |
Hongzuo Xu, Yijie Wang,..., Ning Liu |
19 |
2023-06-11 |
link |
Graph Mixup with Soft Alignments |
Hongyi Ling, Zhimeng Jiang,..., Na Zou |
18 |
2022-10-05 |
link |
Temporally Consistent Transformers for Video Generation |
Wilson Yan, Danijar Hafner,..., Pieter Abbeel |
18 |
2023-01-26 |
link |
Minimax estimation of discontinuous optimal transport maps: The semi-discrete case |
Aram-Alexandre Pooladian, Vincent Divol, Jonathan Niles-Weed |
18 |
2022-10-18 |
link |
Improving Medical Predictions by Irregular Multimodal Electronic Health Records Modeling |
Xinlu Zhang, Shiyang Li,..., Linda Ruth Petzold |
18 |
2023-04-25 |
link |
Chameleon: Adapting to Peer Images for Planting Durable Backdoors in Federated Learning |
Yanbo Dai, Songze Li |
18 |
2023-02-09 |
link |
Cooperative Open-ended Learning Framework for Zero-shot Coordination |
Yang Li, Shao Zhang,..., Wei Pan |
18 |
None |
link |
Adaptive Smoothing Gradient Learning for Spiking Neural Networks |
Ziming Wang, Runhao Jiang,..., Huajin Tang |
18 |
2023-05-26 |
link |
Future-conditioned Unsupervised Pretraining for Decision Transformer |
Zhihui Xie, Zichuan Lin,..., Shuai Li |
18 |
2023-01-31 |
link |
Learning in POMDPs is Sample-Efficient with Hindsight Observability |
Jonathan Lee, Alekh Agarwal,..., Tong Zhang |
18 |
2022-10-31 |
link |
The Numerical Stability of Hyperbolic Representation Learning |
Gal Mishne, Zhengchao Wan,..., Sheng Yang |
18 |
2023-02-17 |
link |
SGD with AdaGrad Stepsizes: Full Adaptivity with High Probability to Unknown Parameters, Unbounded Gradients and Affine Variance |
Amit Attia, Tomer Koren |
18 |
2023-01-07 |
link |
Why do Nearest Neighbor Language Models Work? |
Frank F. Xu, Uri Alon, Graham Neubig |
18 |
2023-02-23 |
link |
Out-of-Domain Robustness via Targeted Augmentations |
Irena Gao, Shiori Sagawa,..., Percy Liang |
18 |
None |
link |
Curriculum Co-disentangled Representation Learning across Multiple Environments for Social Recommendation |
Xin Wang, Zirui Pan,..., Wenwu Zhu |
18 |
2023-06-18 |
link |
Instant Soup: Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models |
AJAY KUMAR JAISWAL, Shiwei Liu,..., Zhangyang Wang |
18 |
2023-02-03 |
link |
Robust Camera Pose Refinement for Multi-Resolution Hash Encoding |
Hwan Heo, Taekyung Kim,..., Jin-Hwa Kim |
18 |
2023-06-02 |
link |
Learning Signed Distance Functions from Noisy 3D Point Clouds via Noise to Noise Mapping |
Baorui Ma, Yu-Shen Liu, Zhizhong Han |
18 |
2023-02-03 |
link |
How Bad is Top-K Recommendation under Competing Content Creators? |
Fan Yao, Chuanhao Li,..., Haifeng Xu |
18 |
2022-10-18 |
link |
Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models |
Nikolaos Dimitriadis, Pascal Frossard, François Fleuret |
18 |
2023-02-22 |
link |
Deep Generative Symbolic Regression with Monte-Carlo-Tree-Search |
Pierre-Alexandre Kamienny, Guillaume Lample,..., Marco Virgolin |
18 |
2022-11-22 |
link |
ModelDiff: A Framework for Comparing Learning Algorithms |
Harshay Shah, Sung Min Park,..., Aleksander Madry |
17 |
2023-04-10 |
link |
For Pre-Trained Vision Models in Motor Control, Not All Policy Learning Methods are Created Equal |
Yingdong Hu, Renhao Wang,..., Yang Gao |
17 |
2022-10-19 |
link |
Oracles & Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning |
Matthias Gerstgrasser, David C. Parkes |
17 |
2022-11-26 |
link |
Distribution Free Prediction Sets for Node Classification |
Jase Clarkson |
17 |
2023-06-05 |
link |
Conformal Prediction with Missing Values |
Margaux Zaffran, Aymeric Dieuleveut,..., Yaniv Romano |
17 |
2022-10-22 |
link |
SurCo: Learning Linear Surrogates For Combinatorial Nonlinear Optimization Problems |
Aaron M Ferber, Taoan Huang,..., Yuandong Tian |
17 |
2023-02-28 |
link |
Fast as CHITA: Neural Network Pruning with Combinatorial Optimization |
Riade Benbaki, Wenyu Chen,..., Rahul Mazumder |
17 |
2023-02-08 |
link |
Generalizing Neural Wave Functions |
Nicholas Gao, Stephan Günnemann |
17 |
2023-08-07 |
link |
Do You Remember? Overcoming Catastrophic Forgetting for Fake Audio Detection |
XiaoHui Zhang, Jiangyan Yi,..., Chu Yuan Zhang |
17 |
None |
link |
Trustworthy Policy Learning under the Counterfactual No-Harm Criterion |
Haoxuan Li, Chunyuan Zheng,..., Peng Wu |
17 |
2023-05-27 |
link |
Federated Conformal Predictors for Distributed Uncertainty Quantification |
Charles Lu, Yaodong Yu,..., Ramesh Raskar |
17 |
None |
link |
H-Consistency Bounds for Pairwise Misranking Loss Surrogates |
Anqi Mao, Mehryar Mohri, Yutao Zhong |
17 |
2023-02-07 |
link |
Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR |
Kaiwen Wang, Nathan Kallus, Wen Sun |
17 |
2023-02-08 |
link |
Algorithmic Collective Action in Machine Learning |
Moritz Hardt, Eric Mazumdar,..., Tijana Zrnic |
17 |
2023-04-26 |
link |
FedVS: Straggler-Resilient and Privacy-Preserving Vertical Federated Learning for Split Models |
Songze Li, Duanyi YAO, Jin Liu |
17 |
2023-02-09 |
link |
On the Privacy-Robustness-Utility Trilemma in Distributed Learning |
Youssef Allouah, Rachid Guerraoui,..., John Stephan |
17 |
2023-04-12 |
link |
Representation Learning with Multi-Step Inverse Kinematics: An Efficient and Optimal Approach to Rich-Observation RL |
Zakaria Mhammedi, Dylan J Foster, Alexander Rakhlin |
17 |
2023-02-01 |
link |
Generative Adversarial Symmetry Discovery |
Jianke Yang, Robin Walters,..., Rose Yu |
17 |
2022-10-12 |
link |
Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories |
Qinqing Zheng, Mikael Henaff,..., Aditya Grover |
17 |
2022-10-06 |
link |
Weak Proxies are Sufficient and Preferable for Fairness with Missing Sensitive Attributes |
Zhaowei Zhu, Yuanshun Yao,..., Yang Liu |
17 |
2023-05-29 |
link |
Trompt: Towards a Better Deep Neural Network for Tabular Data |
Kuan-Yu Chen, Ping-Han Chiang,..., Tien-Hao Chang |
17 |
2022-10-25 |
link |
Convergence of Proximal Point and Extragradient-Based Methods Beyond Monotonicity: the Case of Negative Comonotonicity |
Eduard Gorbunov, Adrien Taylor,..., Gauthier Gidel |
17 |
None |
link |
Model-Bellman Inconsistency for Model-based Offline Reinforcement Learning |
Yihao Sun, Jiaji Zhang,..., Yang Yu |
17 |
2023-04-29 |
link |
Conditional Graph Information Bottleneck for Molecular Relational Learning |
Namkyeong Lee, Dongmin Hyun,..., Chanyoung Park |
17 |
2023-02-17 |
link |
SAM operates far from home: eigenvalue regularization as a dynamical phenomenon |
Atish Agarwala, Yann Dauphin |
17 |
2022-10-24 |
link |
GFlowOut: Dropout with Generative Flow Networks |
Dianbo Liu, Moksh Jain,..., Yoshua Bengio |
17 |
2022-10-24 |
link |
Analyzing Privacy Leakage in Machine Learning via Multiple Hypothesis Testing: A Lesson From Fano |
Chuan Guo, Alexandre Sablayrolles, Maziar Sanjabi |
17 |
2023-05-30 |
link |
A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition |
Shentong Mo, Pedro Morgado |
16 |
2022-09-13 |
link |
Normalizing Flows for Interventional Density Estimation |
Valentyn Melnychuk, Dennis Frauen, Stefan Feuerriegel |
16 |
2022-10-19 |
link |
Towards Explaining Distribution Shifts |
Sean Kulinski, David I. Inouye |
16 |
2023-02-02 |
link |
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs |
Ted Moskovitz, Brendan O'Donoghue,..., Tom Zahavy |
16 |
2023-01-31 |
link |
Multicalibration as Boosting for Regression |
Ira Globus-Harris, Declan Harrison,..., Jessica Sorrell |
16 |
2023-05-08 |
link |
BiRT: Bio-inspired Replay in Vision Transformers for Continual Learning |
Kishaan Jeeveswaran, Prashant Shivaram Bhat,..., Elahe Arani |
16 |
2022-12-14 |
link |
Sequential Kernelized Independence Testing |
Aleksandr Podkopaev, Patrick Blöbaum,..., Aaditya Ramdas |
16 |
2022-12-13 |
link |
Quantum Policy Gradient Algorithm with Optimized Action Decoding |
Nico Meyer, Daniel D. Scherer,..., Michael J. Hartmann |
16 |
2023-03-07 |
link |
Gradient-Free Structured Pruning with Unlabeled Data |
Azade Nova, Hanjun Dai, Dale Schuurmans |
16 |
2023-05-23 |
link |
Statistical Indistinguishability of Learning Algorithms |
Alkis Kalavasis, Amin Karbasi,..., Grigoris Velegkas |
16 |
2023-02-13 |
link |
In Search for a Generalizable Method for Source Free Domain Adaptation |
Malik Boudiaf, tom denton,..., Eleni Triantafillou |
16 |
2023-05-31 |
link |
AbODE: Ab Initio Antibody Design using Conjoined ODEs |
Yogesh Verma, Markus Heinonen, Vikas Garg |
16 |
2023-08-09 |
link |
When and How Does Known Class Help Discover Unknown Ones? Provable Understanding Through Spectral Analysis |
Yiyou Sun, Zhenmei Shi,..., Yixuan Li |
16 |
2023-06-09 |
link |
Hidden symmetries of ReLU networks |
Elisenda Grigsby, Kathryn Lindsey, David Rolnick |
16 |
2022-07-25 |
link |
Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy |
Xiyao Wang, Wichayaporn Wongkamjan,..., Furong Huang |
16 |
2022-06-25 |
link |
A Fast, Well-Founded Approximation to the Empirical Neural Tangent Kernel |
Mohamad Amin Mohamadi, Wonho Bae, Danica J. Sutherland |
16 |
2022-09-28 |
link |
Compositional Score Modeling for Simulation-Based Inference |
Tomas Geffner, George Papamakarios, Andriy Mnih |
16 |
2023-02-10 |
link |
Achieving Linear Speedup in Non-IID Federated Bilevel Learning |
Minhui Huang, Dewei Zhang, Kaiyi Ji |
16 |
2023-06-05 |
link |
Spatial Implicit Neural Representations for Global-Scale Species Mapping |
Elijah Cole, Grant Van Horn,..., Oisin Mac Aodha |
16 |
2023-03-07 |
link |
Polynomial Time and Private Learning of Unbounded Gaussian Mixture Models |
Jamil Arbas, Hassan Ashtiani, Christopher Liaw |
16 |
2022-11-01 |
link |
Adversarial Policies Beat Superhuman Go AIs |
Tony Tong Wang, Adam Gleave,..., Stuart Russell |
15 |
None |
link |
QuantumDARTS: Differentiable Quantum Architecture Search for Variational Quantum Algorithms |
Wenjie Wu, Ge Yan,..., Junchi Yan |
15 |
2023-05-18 |
link |
The Blessing of Heterogeneity in Federated Q-learning: Linear Speedup and Beyond |
Jiin Woo, Gauri Joshi, Yuejie Chi |
15 |
2023-01-30 |
link |
On Second-Order Scoring Rules for Epistemic Uncertainty Quantification |
Viktor Bengs, Eyke Hüllermeier, Willem Waegeman |
15 |
2022-11-20 |
link |
Adversarial Cheap Talk |
Chris Lu, Timon Willi,..., Jakob Nicolaus Foerster |
15 |
2022-10-21 |
link |
Men Also Do Laundry: Multi-Attribute Bias Amplification |
Dora Zhao, Jerone Andrews, Alice Xiang |
15 |
2023-03-13 |
link |
Tighter Lower Bounds for Shuffling SGD: Random Permutations and Beyond |
Jaeyoung Cha, Jaewook Lee, Chulhee Yun |
15 |
2022-09-24 |
link |
Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels |
Sai Rajeswar, Pietro Mazzaglia,..., Alexandre Lacoste |
15 |
None |
link |
Analysis of Error Feedback in Federated Non-Convex Optimization with Biased Compression: Fast Convergence and Partial Participation |
Xiaoyun Li, Ping Li |
15 |
None |
link |
BPipe: Memory-Balanced Pipeline Parallelism for Training Large Language Models |
Taebum Kim, Hyoungjoo Kim,..., Byung-Gon Chun |
15 |
2023-05-25 |
link |
Which Features are Learnt by Contrastive Learning? On the Role of Simplicity Bias in Class Collapse and Feature Suppression |
Yihao Xue, Siddharth Joshi,..., Baharan Mirzasoleiman |
15 |
2023-05-27 |
link |
A Hybrid Quantum-Classical Approach based on the Hadamard Transform for the Convolutional Layer |
Hongyi Pan, Xin Zhu,..., Ahmet Cetin |
15 |
2023-06-10 |
link |
Optimizing the Collaboration Structure in Cross-Silo Federated Learning |
Wenxuan Bao, Haohan Wang,..., Jingrui He |
15 |
2022-06-30 |
link |
Performative Reinforcement Learning |
Debmalya Mandal, Stelios Triantafyllou, Goran Radanovic |
15 |
2023-06-01 |
link |
Effective Structured Prompting by Meta-Learning and Representative Verbalizer |
Weisen Jiang, Yu Zhang, James Kwok |
15 |
2023-03-05 |
link |
Streaming Active Learning with Deep Neural Networks |
Akanksha Saran, Safoora Yousefi,..., Jordan T. Ash |
15 |
2022-11-22 |
link |
OpenFE: Automated Feature Generation with Expert-level Performance |
Tianping Zhang, Zheyu Zhang,..., Jian Li |
15 |
None |
link |
Detecting Out-of-distribution Data through In-distribution Class Prior |
Xue Jiang, Feng Liu,..., Bo Han |
15 |
2023-07-20 |
link |
Fractional Denoising for 3D Molecular Pre-training |
Shikun Feng, Yuyan Ni,..., Weiying Ma |
15 |
2022-12-20 |
link |
Policy Gradient in Robust MDPs with Global Convergence Guarantee |
Qiuhao Wang, Chin Pang Ho, Marek Petrik |
15 |
2023-06-01 |
link |
MEWL: Few-shot multimodal word learning with referential uncertainty |
Guangyuan Jiang, Manjie Xu,..., Yixin Zhu |
15 |
2022-05-23 |
link |
Why does Throwing Away Data Improve Worst-Group Error? |
Kamalika Chaudhuri, Kartik Ahuja,..., David Lopez-Paz |
15 |
2023-06-16 |
link |
Semi-Offline Reinforcement Learning for Optimized Text Generation |
Changyu Chen, Xiting Wang,..., Rui Yan |
14 |
2023-02-03 |
link |
Searching Large Neighborhoods for Integer Linear Programs with Contrastive Learning |
Taoan Huang, Aaron M Ferber,..., Benoit Steiner |
14 |
2023-03-30 |
link |
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations |
Anqi Li, Byron Boots, Ching-An Cheng |
14 |
2023-02-05 |
link |
Run-Off Election: Improved Provable Defense against Data Poisoning Attacks |
Keivan Rezaei, Kiarash Banihashem,..., Soheil Feizi |
14 |
2023-02-06 |
link |
Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs |
Michael Kirchhof, Enkelejda Kasneci, Seong Joon Oh |
14 |
2023-03-02 |
link |
Chemically Transferable Generative Backmapping of Coarse-Grained Proteins |
Soojung Yang, Rafael Gomez-Bombarelli |
14 |
2023-06-05 |
link |
Generating Private Synthetic Data with Genetic Algorithms |
Terrance Liu, Jingwu Tang,..., Steven Wu |
14 |
2023-06-15 |
link |
On Strengthening and Defending Graph Reconstruction Attack with Markov Chain Approximation |
Zhanke Zhou, Chenyu Zhou,..., Bo Han |
14 |
2023-06-05 |
link |
Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optimization |
Jiwoo Son, Minsu Kim,..., Jinkyoo Park |
14 |
2023-01-26 |
link |
Deep Laplacian-based Options for Temporally-Extended Exploration |
Martin Klissarov, Marlos C. Machado |
14 |
2023-03-10 |
link |
Sliced-Wasserstein on Symmetric Positive Definite Matrices for M/EEG Signals |
Clément Bonet, Benoît Malézieux,..., Nicolas Courty |
14 |
None |
link |
Disentangled Multiplex Graph Representation Learning |
Yujie Mo, Yajie Lei,..., Xiaofeng Zhu |
14 |
2023-02-01 |
link |
DoCoFL: Downlink Compression for Cross-Device Federated Learning |
Ron Dorfman, Shay Vargaftik,..., Kfir Yehuda Levy |
14 |
2023-01-27 |
link |
SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement Learning |
Dongseok Shim, Seungjae Lee, H. Jin Kim |
14 |
2023-06-08 |
link |
Unconstrained Online Learning with Unbounded Losses |
Andrew Jacobsen, Ashok Cutkosky |
14 |
2023-06-11 |
link |
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning |
Yuhang Ran, Yi-Chen Li,..., Yang Yu |
14 |
None |
link |
Out-of-Distribution Generalization of Federated Learning via Implicit Invariant Relationships |
Yaming Guo, Kai Guo,..., Yi Chang |
14 |
2023-01-30 |
link |
Adaptive Computation with Elastic Input Sequence |
Fuzhao Xue, Valerii Likhosherstov,..., Yang You |
14 |
2023-05-14 |
link |
Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling |
Yuta Saito, Qingyang Ren, Thorsten Joachims |
14 |
2023-02-09 |
link |
Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization |
Sijia Chen, Wei-Wei Tu,..., Lijun Zhang |
14 |
2023-07-01 |
link |
Half-Hop: A graph upsampling approach for slowing down message passing |
Mehdi Azabou, Venkataramana Ganesh,..., Eva L Dyer |
14 |
2022-06-21 |
link |
Beyond Uniform Lipschitz Condition in Differentially Private Optimization |
Rudrajit Das, Satyen Kale,..., sujay sanghavi |
14 |
2023-10-03 |
link |
Nugget: Neural Agglomerative Embeddings of Text |
Guanghui Qin, Benjamin Van Durme |
14 |
2023-02-19 |
link |
Distributional Offline Policy Evaluation with Predictive Error Guarantees |
Runzhe Wu, Masatoshi Uehara, Wen Sun |
14 |
2023-06-13 |
link |
User-defined Event Sampling and Uncertainty Quantification in Diffusion Models for Physical Dynamical Systems |
Marc Anton Finzi, Anudhyan Boral,..., Leonardo Zepeda-Nunez |
14 |
None |
link |
Facial Expression Recognition with Adaptive Frame Rate based on Multiple Testing Correction |
Andrey Savchenko |
14 |
2023-01-30 |
link |
Are Random Decompositions all we need in High Dimensional Bayesian Optimisation? |
Juliusz Krzysztof Ziomek, Haitham Bou Ammar |
14 |
2023-03-02 |
link |
Optimal Rates and Efficient Algorithms for Online Bayesian Persuasion |
Martino Bernasconi, Matteo Castiglioni,..., Nicola Gatti |
13 |
2023-04-30 |
link |
Importance Weighted Expectation-Maximization for Protein Sequence Design |
Zhenqiao Song, Lei Li |
13 |
2023-06-25 |
link |
Towards Trustworthy Explanation: On Causal Rationalization |
Wenbo Zhang, TONG WU,..., Hengrui Cai |
13 |
2023-01-25 |
link |
Pre-computed memory or on-the-fly encoding? A hybrid approach to retrieval augmentation makes the most of your compute |
Michiel de Jong, Yury Zemlyanskiy,..., William W. Cohen |
13 |
2023-06-12 |
link |
Evolving Semantic Prototype Improves Generative Zero-Shot Learning |
Shiming Chen, Wenjin Hou,..., Kun Zhang |
13 |
2023-05-19 |
link |
Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization |
Zi-Hao Qiu, Quanqi Hu,..., Tianbao Yang |
13 |
2023-02-01 |
link |
Width and Depth Limits Commute in Residual Networks |
Soufiane Hayou, Greg Yang |
13 |
2023-03-15 |
link |
The Benefits of Mixup for Feature Learning |
Difan Zou, Yuan Cao,..., Quanquan Gu |
13 |
2023-07-20 |
link |
Identifying Interpretable Subspaces in Image Representations |
Neha Kalibhat, Shweta Bhardwaj,..., Soheil Feizi |
13 |
2024-02-20 |
link |
Towards Robust Graph Incremental Learning on Evolving Graphs |
Junwei Su, Difan Zou,..., Chuan Wu |
13 |
2022-12-01 |
link |
Second-order optimization with lazy Hessians |
Nikita Doikov, El Mahdi Chayti, Martin Jaggi |
13 |
2023-06-12 |
link |
Slot-VAE: Object-Centric Scene Generation with Slot Attention |
Yanbo Wang, Letao Liu, Justin Dauwels |
13 |
None |
link |
RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolution |
Pengyi Li, Jianye HAO,..., Xian Fu |
13 |
2023-05-26 |
link |
Optimizing NOTEARS Objectives via Topological Swaps |
Chang Deng, Kevin Bello,..., Pradeep Kumar Ravikumar |
13 |
2023-02-05 |
link |
Tighter Information-Theoretic Generalization Bounds from Supersamples |
Ziqiao Wang, Yongyi Mao |
13 |
2022-10-19 |
link |
"Why did the Model Fail?": Attributing Model Performance Changes to Distribution Shifts |
Haoran Zhang, Harvineet Singh,..., Shalmali Joshi |
13 |
None |
link |
Shape-Guided Dual-Memory Learning for 3D Anomaly Detection |
Yu-Min Chu, Liu Chieh,..., Tyng-Luh Liu |
13 |
2022-10-05 |
link |
Why Random Pruning Is All We Need to Start Sparse |
Advait Harshal Gadhikar, Sohom Mukherjee, Rebekka Burkholz |
13 |
2022-12-24 |
link |
Understanding the Complexity Gains of Single-Task RL with a Curriculum |
Qiyang Li, Yuexiang Zhai,..., Sergey Levine |
13 |
2022-10-31 |
link |
Improving Graph Neural Networks with Learnable Propagation Operators |
Moshe Eliasof, Lars Ruthotto, Eran Treister |
13 |
None |
link |
Linearly Constrained Bilevel Optimization: A Smoothed Implicit Gradient Approach |
Prashant Khanduri, Ioannis Tsaknakis,..., Mingyi Hong |
13 |
2023-02-04 |
link |
Reinforcement Learning in Low-Rank MDPs with Density Features |
Audrey Huang, Jinglin Chen, Nan Jiang |
13 |
None |
link |
Demystifying Uneven Vulnerability of Link Stealing Attacks against Graph Neural Networks |
He Zhang, Bang Wu,..., Xingliang YUAN |
13 |
2023-06-12 |
link |
Can Forward Gradient Match Backpropagation? |
Louis Fournier, Stephane Rivaud,..., Edouard Oyallon |
13 |
2022-08-07 |
link |
Federated Adversarial Learning: A Framework with Convergence Analysis |
Xiaoxiao Li, Zhao Song, Jiaming Yang |
13 |
2022-05-26 |
link |
DevFormer: A Symmetric Transformer for Context-Aware Device Placement |
Haeyeon Kim, Minsu Kim,..., Jinkyoo Park |
13 |
2023-02-09 |
link |
On Sampling with Approximate Transport Maps |
Louis Grenioux, Alain Durmus,..., Marylou Gabrié |
13 |
2022-10-28 |
link |
Differential Privacy has Bounded Impact on Fairness in Classification |
Paul Mangold, Michaël Perrot,..., Marc Tommasi |
13 |
2023-05-23 |
link |
Dual Focal Loss for Calibration |
Linwei Tao, Minjing Dong, Chang Xu |
13 |
None |
link |
A Unified Optimization Framework of ANN-SNN Conversion: Towards Optimal Mapping from Activation Values to Firing Rates |
Haiyan Jiang, Srinivas Anumasa,..., Bin Gu |
13 |
None |
link |
Stable Estimation of Heterogeneous Treatment Effects |
Anpeng Wu, Kun Kuang,..., Fei Wu |
13 |
2022-02-01 |
link |
Few-Bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction |
Georgii Sergeevich Novikov, Daniel Bershatsky,..., Ivan Oseledets |
13 |
2023-01-27 |
link |
Algorithmic Stability of Heavy-Tailed SGD with General Loss Functions |
Anant Raj, Lingjiong Zhu,..., Umut Simsekli |
13 |
2022-09-15 |
link |
Omnipredictors for Constrained Optimization |
Lunjia Hu, Inbal Rachel Livni Navon,..., Chutong Yang |
12 |
2023-02-18 |
link |
Data-Efficient Contrastive Self-supervised Learning: Most Beneficial Examples for Supervised Learning Contribute the Least |
Siddharth Joshi, Baharan Mirzasoleiman |
12 |
2023-06-05 |
link |
Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning |
Sam Lobel, Akhil Bagaria, George Konidaris |
12 |
2023-06-07 |
link |
Accounting For Informative Sampling When Learning to Forecast Treatment Outcomes Over Time |
Toon Vanderschueren, Alicia Curth,..., Mihaela van der Schaar |
12 |
2023-01-27 |
link |
Understanding Int4 Quantization for Language Models: Latency Speedup, Composability, and Failure Cases |
Xiaoxia Wu, Cheng Li,..., Yuxiong He |
12 |
2023-05-30 |
link |
Approximation and Estimation Ability of Transformers for Sequence-to-Sequence Functions with Infinite Dimensional Input |
Shokichi Takakura, Taiji Suzuki |
12 |
2022-10-06 |
link |
Paging with Succinct Predictions |
Antonios Antoniadis, Joan Boyar,..., Bertrand Simon |
12 |
2022-10-05 |
link |
Atari-5: Distilling the Arcade Learning Environment down to Five Games |
Matthew Aitchison, Penny Sweetser, Marcus Hutter |
12 |
2022-10-05 |
link |
Efficient Learning of Mesh-Based Physical Simulation with Bi-Stride Multi-Scale Graph Neural Network |
Yadi Cao, Menglei Chai,..., Chenfanfu Jiang |
12 |
2023-06-02 |
link |
Calibrating Multimodal Learning |
Huan Ma, Qingyang Zhang,..., Qinghua Hu |
12 |
2023-02-13 |
link |
One-Shot Federated Conformal Prediction |
Pierre Humbert, Batiste Le bars,..., Sylvain Arlot |
12 |
2022-12-09 |
link |
Cyclic Block Coordinate Descent With Variance Reduction for Composite Nonconvex Optimization |
Xufeng Cai, Chaobing Song,..., Jelena Diakonikolas |
12 |
2023-04-17 |
link |
Attributing Image Generative Models using Latent Fingerprints |
Guangyu Nie, Changhoon Kim,..., Yi Ren |
12 |
None |
link |
A Critical Revisit of Adversarial Robustness in 3D Point Cloud Recognition with Diffusion-Driven Purification |
Jiachen Sun, Jiongxiao Wang,..., Chaowei Xiao |
12 |
2023-04-13 |
link |
Efficient Sequence Transduction by Jointly Predicting Tokens and Durations |
Hainan Xu, Fei Jia,..., Boris Ginsburg |
12 |
2023-03-14 |
link |
Fast Rates for Maximum Entropy Exploration |
Daniil Tiapkin, Denis Belomestny,..., Pierre MENARD |
12 |
2023-02-03 |
link |
LazyGNN: Large-Scale Graph Neural Networks via Lazy Propagation |
Rui Xue, Haoyu Han,..., Xiaorui Liu |
12 |
2023-02-15 |
link |
Deep Anomaly Detection under Labeling Budget Constraints |
Aodong Li, Chen Qiu,..., Maja Rudolph |
12 |
2023-02-21 |
link |
Differentiable Multi-Target Causal Bayesian Experimental Design |
Panagiotis Tigas, Yashas Annadani,..., Stefan Bauer |
12 |
2023-01-23 |
link |
Sampling-based Nyström Approximation and Kernel Quadrature |
Satoshi Hayakawa, Harald Oberhauser, Terry Lyons |
12 |
2022-12-16 |
link |
Brauer's Group Equivariant Neural Networks |
Edward Pearce-Crump |
12 |
None |
link |
Optimal No-Regret Learning for One-Sided Lipschitz Functions |
Paul Duetting, Guru Guruganesh,..., Joshua Ruizhi Wang |
12 |
2023-02-05 |
link |
Improving Fair Training under Correlation Shifts |
Yuji Roh, Kangwook Lee,..., Changho Suh |
12 |
2023-02-18 |
link |
Best of Both Worlds Policy Optimization |
Christoph Dann, Chen-Yu Wei, Julian Zimmert |
12 |
2023-04-29 |
link |
Leveraging Label Non-Uniformity for Node Classification in Graph Neural Networks |
Feng Ji, See Hian Lee,..., Wee Peng Tay |
12 |
2023-02-12 |
link |
Vector Quantized Wasserstein Auto-Encoder |
Long Tung Vuong, Trung Le,..., Dinh Phung |
12 |
2023-03-07 |
link |
Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models? |
Boris Knyazev, DOHA HWANG, Simon Lacoste-Julien |
12 |
2023-01-19 |
link |
An SDE for Modeling SAM: Theory and Insights |
Enea Monzio Compagnoni, Luca Biggio,..., Aurelien Lucchi |
12 |
2023-02-06 |
link |
Toward Large Kernel Models |
Amirhesam Abedsoltan, Mikhail Belkin, Parthe Pandit |
12 |
2023-03-30 |
link |
Contextual Combinatorial Bandits with Probabilistically Triggered Arms |
Xutong Liu, Jinhang Zuo,..., Wei Chen |
12 |
None |
link |
Patch-level Contrastive Learning via Positional Query for Visual Pre-training |
Shaofeng Zhang, Qiang Zhou,..., Junchi Yan |