Last updated: 2024-12-09 08:31:21. Maintained by Weisen Jiang.

citation date review title (pdf) authors
1300 2023-07-04 link SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis Dustin Podell, Zion English,..., Robin Rombach
485 2023-07-10 link AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning Yuwei Guo, Ceyuan Yang,..., Bo Dai
438 2023-07-31 link ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs Yujia Qin, Shihao Liang,..., Maosong Sun
426 2023-09-28 link DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation Jiaxiang Tang, Jiawei Ren,..., Gang Zeng
377 2023-10-17 link Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection Akari Asai, Zeqiu Wu,..., Hannaneh Hajishirzi
360 2023-10-05 link Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! Xiangyu Qi, Yi Zeng,..., Peter Henderson
295 2023-09-07 link SyncDreamer: Generating Multiview-consistent Images from a Single-view Image Yuan Liu, Cheng Lin,..., Wenping Wang
286 2023-10-03 link MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts Pan Lu, Hritik Bansal,..., Jianfeng Gao
274 2023-09-11 link MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning Xiang Yue, Xingwei Qu,..., Wenhu Chen
230 2023-11-08 link LRM: Large Reconstruction Model for Single Image to 3D Yicong Hong, Kai Zhang,..., Hao Tan
220 2023-10-10 link iTransformer: Inverted Transformers Are Effective for Time Series Forecasting Yong Liu, Tengge Hu,..., Mingsheng Long
219 2023-09-21 link MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models Longhui Yu, Weisen Jiang,..., Weiyang Liu
215 2023-10-12 link Ferret: Refer and Ground Anything Anywhere at Any Granularity Haoxuan You, Haotian Zhang,..., Yinfei Yang
214 2023-10-17 link Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting Melanie Sclar, Yejin Choi,..., Alane Suhr
214 2023-10-10 link SWE-bench: Can Language Models Resolve Real-World GitHub Issues? Carlos E Jimenez, John Yang,..., Karthik R Narasimhan
205 2023-10-11 link Catastrophic Jailbreak of Open-source LLMs via Exploiting Generation Yangsibo Huang, Samyak Gupta,..., Danqi Chen
199 2023-05-22 link Training Diffusion Models with Reinforcement Learning Kevin Black, Michael Janner,..., Sergey Levine
199 2023-10-16 link Llemma: An Open Language Model For Mathematics Zhangir Azerbayev, Hailey Schoelkopf,..., Sean Welleck
194 2023-10-19 link Eureka: Human-Level Reward Design via Coding Large Language Models Yecheng Jason Ma, William Liang,..., Anima Anandkumar
191 2023-10-19 link Safe RLHF: Safe Reinforcement Learning from Human Feedback Josef Dai, Xuehai Pan,..., Yaodong Yang
184 2023-09-28 link Vision Transformers Need Registers Timothée Darcet, Maxime Oquab,..., Piotr Bojanowski
183 2023-08-07 link AgentBench: Evaluating LLMs as Agents Xiao Liu, Hao Yu,..., Jie Tang
182 2023-09-21 link The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A" Lukas Berglund, Meg Tong,..., Owain Evans
172 2023-04-18 link NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers Kai Shen, Zeqian Ju,..., Jiang Bian
172 2023-10-03 link AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models Xiaogeng Liu, Nan Xu,..., Chaowei Xiao
171 2023-02-14 link Universal Guidance for Diffusion Models Arpit Bansal, Hong-Min Chu,..., Tom Goldstein
169 2023-08-16 link Stochastic Controlled Averaging for Federated Learning with Communication Compression Xinmeng Huang, Ping Li, Xiaoyun Li
168 2023-06-08 link PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization Yidong Wang, Zhuohao Yu,..., Yue Zhang
167 2023-02-07 link Effective Data Augmentation With Diffusion Models Brandon Trabucco, Kyle Doherty,..., Ruslan Salakhutdinov
164 2023-06-26 link Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning Fuxiao Liu, Kevin Lin,..., Lijuan Wang
153 2023-07-13 link InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation Yi Wang, Yinan He,..., Yu Qiao
143 2023-12-25 link What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning Wei Liu, Weihao Zeng,..., Junxian He
141 2023-09-07 link Large Language Models Are Not Robust Multiple Choice Selectors Chujie Zheng, Hao Zhou,..., Minlie Huang
138 2023-10-09 link Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation Lijun Yu, Jose Lezama,..., Lu Jiang
137 2023-10-12 link Prometheus: Inducing Fine-grained Evaluation Capability in Language Models Seungone Kim, Jamin Shin,..., Minjoon Seo
131 2023-07-24 link A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis Izzeddin Gur, Hiroki Furuta,..., Aleksandra Faust
126 2023-11-14 link Fine-tuning Language Models for Factuality Katherine Tian, Eric Mitchell,..., Chelsea Finn
124 2023-10-11 link Evaluating Large Language Models at Evaluating Instruction Following Zhiyuan Zeng, Jiatong Yu,..., Danqi Chen
120 2023-10-02 link Making Retrieval-Augmented Language Models Robust to Irrelevant Context Ori Yoran, Tomer Wolfson,..., Jonathan Berant
118 2023-10-03 link Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs Suyu Ge, Yunan Zhang,..., Jianfeng Gao
116 2023-08-25 link OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models Wenqi Shao, Mengzhao Chen,..., Ping Luo
115 2023-09-21 link LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models Yukang Chen, Shengju Qian,..., Jiaya Jia
115 2023-09-25 link Can LLM-Generated Misinformation Be Detected? Canyu Chen, Kai Shu
112 2023-09-20 link DreamLLM: Synergistic Multimodal Comprehension and Creation Runpei Dong, Chunrui Han,..., Li Yi
112 2023-10-25 link Detecting Pretraining Data from Large Language Models Weijia Shi, Anirudh Ajith,..., Luke Zettlemoyer
112 2023-07-05 link Building Cooperative Embodied Agents Modularly with Large Language Models Hongxin Zhang, Weihua Du,..., Chuang Gan
107 2023-02-06 link Chain of Hindsight Aligns Language Models with Feedback Hao Liu, Carmelo Sferrazza, Pieter Abbeel
106 2023-11-15 link DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model Yinghao Xu, Hao Tan,..., Kai Zhang
104 None link WizardLM: Empowering Large Pre-Trained Language Models to Follow Complex Instructions Can Xu, Qingfeng Sun,..., Daxin Jiang
101 2023-07-05 link DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models Chong Mou, Xintao Wang,..., Jian Zhang
101 2023-09-21 link LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset Lianmin Zheng, Wei-Lin Chiang,..., Hao Zhang
101 2023-05-23 link Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training Hong Liu, Zhiyuan Li,..., Tengyu Ma
100 2023-10-22 link Improved Techniques for Training Consistency Models Yang Song, Prafulla Dhariwal
99 2023-10-03 link Language Models Represent Space and Time Wes Gurnee, Max Tegmark
98 2023-09-25 link Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision Haoning Wu, Zicheng Zhang,..., Weisi Lin
96 2023-10-26 link Proving Test Set Contamination in Black Box Language Models Yonatan Oren, Nicole Meister,..., Tatsunori Hashimoto
95 2023-08-14 link OctoPack: Instruction Tuning Code Large Language Models Niklas Muennighoff, Qian Liu,..., Shayne Longpre
94 2023-06-05 link RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems Tianyang Liu, Canwen Xu, Julian McAuley
90 2023-09-29 link Directly Fine-Tuning Diffusion Models on Differentiable Rewards Kevin Clark, Paul Vicol,..., David J. Fleet
89 2024-05-02 link WildChat: 1M ChatGPT Interaction Logs in the Wild Wenting Zhao, Xiang Ren,..., Yuntian Deng
88 2023-10-02 link RA-DIT: Retrieval-Augmented Dual Instruction Tuning Xi Victoria Lin, Xilun Chen,..., Wen-tau Yih
88 2023-08-11 link Self-Alignment with Instruction Backtranslation Xian Li, Ping Yu,..., Mike Lewis
87 2023-05-07 link A Variational Perspective on Solving Inverse Problems with Diffusion Models Morteza Mardani, Jiaming Song,..., Arash Vahdat
86 2023-10-12 link LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models Yixiao Li, Yifan Yu,..., Tuo Zhao
86 2023-05-25 link Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation Niels Mündler, Jingxuan He,..., Martin Vechev
84 2023-06-09 link Can Large Language Models Infer Causation from Correlation? Zhijing Jin, Jiarui Liu,..., Bernhard Schölkopf
83 2023-10-10 link Multilingual Jailbreak Challenges in Large Language Models Yue Deng, Wenxuan Zhang,..., Lidong Bing
80 2023-07-26 link Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models Erfan Shayegani, Yue Dong, Nael Abu-Ghazaleh
79 2023-10-24 link What Algorithms can Transformers Learn? A Study in Length Generalization Hattie Zhou, Arwen Bradley,..., Preetum Nakkiran
78 2023-10-04 link Reward Model Ensembles Help Mitigate Overoptimization Thomas Coste, Usman Anwar,..., David Krueger
75 2023-09-28 link Demystifying CLIP Data Hu Xu, Saining Xie,..., Christoph Feichtenhofer
75 2023-10-31 link SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction Xinyuan Chen, Yaohui Wang,..., Ziwei Liu
75 2023-08-02 link From Sparse to Soft Mixtures of Experts Joan Puigcerver, Carlos Riquelme Ruiz,..., Neil Houlsby
74 2023-10-25 link PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization Xinyuan Wang, Chenxi Li,..., Zhiting Hu
73 2023-08-16 link Time Travel in LLMs: Tracing Data Contamination in Large Language Models Shahriar Golchin, Mihai Surdeanu
73 2023-05-31 link MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training Yizhi LI, Ruibin Yuan,..., Jie Fu
72 2023-07-20 link FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets Seonghyeon Ye, Doyoung Kim,..., Minjoon Seo
72 2023-11-02 link Vision-Language Foundation Models as Effective Robot Imitators Xinghang Li, Minghuan Liu,..., Tao Kong
71 2023-10-01 link BooookScore: A systematic exploration of book-length summarization in the era of LLMs Yapei Chang, Kyle Lo,..., Mohit Iyyer
71 2024-02-27 link When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method Biao Zhang, Zhongtao Liu,..., Orhan Firat
70 2023-10-08 link TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting Defu Cao, Furong Jia,..., Yan Liu
70 2023-10-18 link SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents Xuhui Zhou, Hao Zhu,..., Maarten Sap
69 2023-11-20 link PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction Peng Wang, Hao Tan,..., Kai Zhang
69 2023-07-07 link One Step of Gradient Descent is Provably the Optimal In-Context Learner with One Layer of Linear Self-Attention Arvind V. Mahankali, Tatsunori Hashimoto, Tengyu Ma
69 2024-01-31 link RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval Parth Sarthi, Salman Abdullah,..., Christopher D Manning
68 2023-09-29 link Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks Vaidehi Patil, Peter Hase, Mohit Bansal
68 2023-09-29 link One for All: Towards Training One Graph Model for All Classification Tasks Hao Liu, Jiarui Feng,..., Muhan Zhang
68 2023-08-07 link Nearly d-Linear Convergence Bounds for Diffusion Models via Stochastic Localization Joe Benton, Valentin De Bortoli,..., George Deligiannidis
68 2024-04-19 link SaProt: Protein Language Modeling with Structure-aware Vocabulary Jin Su, Chenchen Han,..., Fajie Yuan
67 2023-08-17 link Linearity of Relation Decoding in Transformer Language Models Evan Hernandez, Arnab Sen Sharma,..., David Bau
66 2023-10-25 link TD-MPC2: Scalable, Robust World Models for Continuous Control Nicklas Hansen, Hao Su, Xiaolong Wang
65 2023-05-22 link Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources Xingxuan Li, Ruochen Zhao,..., Lidong Bing
63 2023-10-19 link SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation Chongyu Fan, Jiancheng Liu,..., Sijia Liu
63 2023-02-15 link Learning Performance-Improving Code Edits Alexander G Shypula, Aman Madaan,..., Amir Yazdanbakhsh
63 2023-07-16 link Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency Bowen Song, Soo Min Kwon,..., Liyue Shen
62 2023-09-25 link Identifying the Risks of LM Agents with an LM-Emulated Sandbox Yangjun Ruan, Honghua Dong,..., Tatsunori Hashimoto
61 2023-10-31 link What's In My Big Data? Yanai Elazar, Akshita Bhagia,..., Jesse Dodge
60 2023-11-08 link Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs Shashank Gupta, Vaishnavi Shrivastava,..., Tushar Khot
59 2023-11-02 link Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game Sam Toyer, Olivia Watkins,..., Stuart Russell
57 2023-09-28 link Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints Chaoqi Wang, Yibo Jiang,..., Yuxin Chen
57 2023-10-09 link NEFTune: Noisy Embeddings Improve Instruction Finetuning Neel Jain, Ping-yeh Chiang,..., Tom Goldstein
56 2023-10-04 link Retrieval meets Long Context Large Language Models Peng Xu, Wei Ping,..., Bryan Catanzaro
56 2023-10-29 link AnomalyCLIP: Object-agnostic Prompt Learning for Zero-shot Anomaly Detection Qihang Zhou, Guansong Pang,..., Jiming Chen
56 2023-10-11 link Beyond Memorization: Violating Privacy Via Inference with Large Language Models Robin Staab, Mark Vero,..., Martin Vechev
54 2023-06-15 link KoLA: Carefully Benchmarking World Knowledge of Large Language Models Jifan Yu, Xiaozhi Wang,..., Juanzi Li
54 2023-09-25 link Small-scale proxies for large-scale Transformer training instabilities Mitchell Wortsman, Peter J Liu,..., Simon Kornblith
54 2024-02-20 link Chain of Thought Empowers Transformers to Solve Inherently Serial Problems Zhiyuan Li, Hong Liu,..., Tengyu Ma
54 2023-10-10 link Uni3D: Exploring Unified 3D Representation at Scale Junsheng Zhou, Jinsheng Wang,..., Xinlong Wang
52 2023-08-04 link Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization Weiran Yao, Shelby Heinecke,..., Silvio Savarese
52 2023-09-29 link Guiding Instruction-based Image Editing via Multimodal Large Language Models Tsu-Jui Fu, Wenze Hu,..., Zhe Gan
51 2023-10-12 link Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement Linlu Qiu, Liwei Jiang,..., Xiang Ren
51 2023-07-13 link In-context Autoencoder for Context Compression in a Large Language Model Tao Ge, Hu Jing,..., Furu Wei
50 2023-10-27 link Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation Jaemin Cho, Yushi Hu,..., Su Wang
50 2023-10-10 link Lemur: Harmonizing Natural Language and Code for Language Agents Yiheng Xu, Hongjin SU,..., Tao Yu
50 2023-10-05 link GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction Oscar Sainz, Iker García-Ferrero,..., Eneko Agirre
49 2023-10-17 link Zipformer: A faster and better encoder for automatic speech recognition Zengwei Yao, Liyong Guo,..., Daniel Povey
48 2023-07-06 link FITS: Modeling Time Series with 10k Parameters Zhijian Xu, Ailing Zeng, Qiang Xu
48 2023-08-08 link SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore Sewon Min, Suchin Gururangan,..., Luke Zettlemoyer
48 2023-09-13 link Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs Angelica Chen, Ravid Shwartz-Ziv,..., Naomi Saphra
48 2023-06-01 link Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis Hubert Siuzdak
47 2023-05-31 link Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning Xiaoxin He, Xavier Bresson,..., Bryan Hooi
47 2023-10-09 link Generative Judge for Evaluating Alignment Junlong Li, Shichao Sun,..., Pengfei Liu
45 2023-11-21 link Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks Samyak Jain, Robert Kirk,..., David Krueger
45 2023-10-12 link ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models Yingqing He, Shaoshu Yang,..., Ying Shan
45 2023-10-03 link SE(3)-Stochastic Flow Matching for Protein Backbone Generation Joey Bose, Tara Akhound-Sadegh,..., Alexander Tong
45 2023-08-08 link Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions Juncheng Li, Kaihang Pan,..., Yueting Zhuang
45 2023-10-09 link Interpreting CLIP's Image Representation via Text-Based Decomposition Yossi Gandelsman, Alexei A Efros, Jacob Steinhardt
44 2023-10-02 link GenSim: Generating Robotic Simulation Tasks via Large Language Models Lirui Wang, Yiyang Ling,..., Xiaolong Wang
43 2023-10-12 link Circuit Component Reuse Across Tasks in Transformer Language Models Jack Merullo, Carsten Eickhoff, Ellie Pavlick
43 2024-04-22 link Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing Dujian Ding, Ankur Mallick,..., Ahmed Hassan Awadallah
43 2023-12-08 link Zoology: Measuring and Improving Recall in Efficient Language Models Simran Arora, Sabri Eyuboglu,..., Christopher Re
42 2023-11-06 link AnyText: Multilingual Visual Text Generation And Editing Yuxiang Tuo, Wangmeng Xiang,..., Xuansong Xie
41 None link RECOMP: Improving Retrieval-Augmented LMs with Context Compression and Selective Augmentation Fangyuan Xu, Weijia Shi, Eunsol Choi
41 2023-10-04 link Generalization in diffusion models arises from geometry-adaptive harmonic representation Zahra Kadkhodaie, Florentin Guth,..., Stéphane Mallat
41 2023-10-02 link CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction Size Wu, Wenwei Zhang,..., Chen Change Loy
40 2023-10-16 link In-Context Pretraining: Language Modeling Beyond Document Boundaries Weijia Shi, Sewon Min,..., Mike Lewis
40 2023-02-12 link Single Motion Diffusion Sigal Raab, Inbal Leibovitch,..., Daniel Cohen-Or
40 2023-09-28 link At Which Training Stage Does Code Data Help LLMs Reasoning? YINGWEI MA, Yue Liu,..., Shanshan Li
40 2023-10-19 link An Emulator for Fine-Tuning Large Language Models using Small Language Models Eric Mitchell, Rafael Rafailov,..., Christopher D Manning
40 2023-10-14 link Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space Hengrui Zhang, Jiani Zhang,..., George Karypis
39 2023-07-18 link Overthinking the Truth: Understanding how Language Models Process False Demonstrations Danny Halawi, Jean-Stanislas Denain, Jacob Steinhardt
39 2023-02-07 link Flow Matching on General Geometries Ricky T. Q. Chen, Yaron Lipman
39 2023-10-02 link DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models Yongchan Kwon, Eric Wu,..., James Zou
39 None link ModernTCN: A Modern Pure Convolution Structure for General Time Series Analysis Luo donghao, wang xue
38 2023-12-03 link The mechanistic basis of data dependence and abrupt learning in an in-context classification task Gautam Reddy
38 2023-11-03 link RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches Jiayuan Gu, Sean Kirmani,..., Ted Xiao
38 2023-10-12 link How Many Pretraining Tasks Are Needed for In-Context Learning of Linear Regression? Jingfeng Wu, Difan Zou,..., Peter Bartlett
38 2023-09-25 link LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models Ahmad Faiz, Sotaro Kaneda,..., Lei Jiang
38 2023-10-06 link ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models Seyed Iman Mirzadeh, Keivan Alizadeh-Vahid,..., Mehrdad Farajtabar
37 2023-03-08 link InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning Ziheng Qin, Kai Wang,..., Yang You
37 2023-08-23 link Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages Jinyi Hu, Yuan Yao,..., Maosong Sun
37 2023-09-28 link A Benchmark for Learning to Translate a New Language from One Grammar Book Garrett Tanzer, Mirac Suzgun,..., Luke Melas-Kyriazi
36 2023-06-20 link Evaluating the Zero-shot Robustness of Instruction-tuned Language Models Jiuding Sun, Chantal Shaib, Byron C Wallace
36 2023-11-20 link LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning Han Guo, Philip Greengard,..., Yoon Kim
36 2023-09-18 link Understanding Catastrophic Forgetting in Language Models via Implicit Inference Suhas Kotha, Jacob Mitchell Springer, Aditi Raghunathan
36 2023-10-20 link ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search Yuchen Zhuang, Xiang Chen,..., Chao Zhang
35 2023-10-06 link Amortizing intractable inference in large language models Edward J Hu, Moksh Jain,..., Nikolay Malkin
35 2023-10-06 link Confronting Reward Model Overoptimization with Constrained RLHF Ted Moskovitz, Aaditya K Singh,..., Stephen Marcus McAleer
34 2023-10-04 link Kosmos-G: Generating Images in Context with Multimodal Large Language Models Xichen Pan, Li Dong,..., Furu Wei
33 2023-05-29 link Multiscale Positive-Unlabeled Detection of AI-Generated Texts Yuchuan Tian, Hanting Chen,..., Yunhe Wang
32 2023-05-18 link Deep Temporal Graph Clustering Meng Liu, Yue Liu,..., Xinwang Liu
32 2023-06-30 link Learning Delays in Spiking Neural Networks using Dilated Convolutions with Learnable Spacings Ilyass Hammouamri, Ismail Khalfaoui-Hassani, Timothée Masquelier
32 2023-10-01 link LEGO-Prover: Neural Theorem Proving with Growing Libraries Haiming Wang, Huajian Xin,..., Xiaodan Liang
31 2023-11-11 link Finetuning Text-to-Image Diffusion Models for Fairness Xudong Shen, Chao Du,..., Mohan Kankanhalli
30 2024-07-31 link Detecting, Explaining, and Mitigating Memorization in Diffusion Models Yuxin Wen, Yuchen Liu,..., Lingjuan Lyu
30 2023-10-04 link Understanding In-Context Learning in Transformers and LLMs by Learning to Learn Discrete Functions Satwik Bhattamishra, Arkil Patel,..., Varun Kanade
30 2023-09-22 link Unbiased Watermark for Large Language Models Zhengmian Hu, Lichang Chen,..., Heng Huang
30 2023-12-13 link Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF Anand Siththaranjan, Cassidy Laidlaw, Dylan Hadfield-Menell
29 None link DSPy: Compiling Declarative Language Model Calls into State-of-the-Art Pipelines Omar Khattab, Arnav Singhvi,..., Christopher Potts
29 2023-10-10 link Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models Fei Shen, Hu Ye,..., Yang Wei
29 2023-06-01 link TorchRL: A data-driven decision-making library for PyTorch Albert Bou, Matteo Bettini,..., Vincent Moens
29 2023-09-30 link On the Stability of Iterative Retraining of Generative Models on their own Data Quentin Bertrand, Joey Bose,..., Gauthier Gidel
28 2024-02-06 link The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry Michael Zhang, Kush Bhatia,..., Christopher Re
27 2023-02-04 link Multi-Source Diffusion Models for Simultaneous Music Generation and Separation Giorgio Mariani, Irene Tallini,..., Emanuele Rodolà
27 2023-06-08 link In-Context Learning through the Bayesian Prism Madhur Panwar, Kabir Ahuja, Navin Goyal
26 2024-05-29 link Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI Weibang Jiang, Liming Zhao, Bao-liang Lu
26 2023-10-26 link How do Language Models Bind Entities in Context? Jiahai Feng, Jacob Steinhardt
25 2022-11-17 link How to Fine-Tune Vision Models with SGD Ananya Kumar, Ruoqi Shen,..., Suriya Gunasekar
25 2024-02-06 link Large Language Models to Enhance Bayesian Optimization Tennison Liu, Nicolás Astorga,..., Mihaela van der Schaar
25 2023-10-04 link Local Search GFlowNets Minsu Kim, Taeyoung Yun,..., Jinkyoo Park
25 2024-02-22 link Cameras as Rays: Pose Estimation via Ray Diffusion Jason Y. Zhang, Amy Lin,..., Shubham Tulsiani
25 2023-10-26 link CADS: Unleashing the Diversity of Diffusion Models through Condition-Annealed Sampling Seyedmorteza Sadat, Jakob Buhmann,..., Romann M. Weber
24 2024-02-14 link MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data Yinya Huang, Xiaohan Lin,..., Xiaodan Liang
24 2023-10-26 link Skill-Mix: a Flexible and Expandable Family of Evaluations for AI models Dingli Yu, Simran Kaur,..., Sanjeev Arora
24 2019-02-14 link CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity Aditya Bhatt, Daniel Palenicek,..., Jan Peters
24 2024-03-04 link Making Pre-trained Language Models Great on Tabular Prediction Jiahuan Yan, Bo Zheng,..., Jintai Chen
24 2023-02-06 link One-shot Empirical Privacy Estimation for Federated Learning Galen Andrew, Peter Kairouz,..., Vinith Menon Suriyakumar
24 2023-05-19 link LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation Suhyeon Lee, Won Jun Kim,..., Jong Chul Ye
24 2024-05-03 link What does the Knowledge Neuron Thesis Have to do with Knowledge? Jingcheng Niu, Andrew Liu,..., Gerald Penn
23 2023-12-07 link On the Learnability of Watermarks for Language Models Chenchen Gu, Xiang Lisa Li,..., Tatsunori Hashimoto
23 2024-01-13 link BrainLM: A foundation model for brain activity recordings Josue Ortega Caro, Antonio Henrique de Oliveira Fonseca,..., David van Dijk
23 2023-05-24 link Alleviating Exposure Bias in Diffusion Models through Sampling with Shifted Time Steps Mingxiao Li, Tingyu Qu,..., Marie-Francine Moens
23 2022-08-10 link A Sublinear Adversarial Training Algorithm Yeqi Gao, Lianke Qin,..., Yitan Wang
23 2023-11-30 link Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking Kaifeng Lyu, Jikai Jin,..., Wei Hu
23 2023-09-04 link Relay Diffusion: Unifying diffusion process across resolutions for image synthesis Jiayan Teng, Wendi Zheng,..., Jie Tang
23 2024-01-20 link BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models Zhen Xiang, Fengqing Jiang,..., Bo Li
23 2024-04-03 link CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech Jaehyeon Kim, Keon Lee,..., Jaewoong Cho
23 2023-09-28 link Intriguing properties of generative classifiers Priyank Jaini, Kevin Clark, Robert Geirhos
23 2023-11-24 link Controlled Text Generation via Language Model Arithmetic Jasper Dekoninck, Marc Fischer,..., Martin Vechev
23 2023-11-10 link FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores Daniel Y Fu, Hermann Kumbong,..., Christopher Re
22 2023-07-17 link COLLIE: Systematic Construction of Constrained Text Generation Tasks Shunyu Yao, Howard Chen,..., Karthik R Narasimhan
22 2023-11-08 link Massive Editing for Large Language Models via Meta Learning Chenmien Tan, Ge Zhang, Jie Fu
22 2023-05-17 link Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models Shangbin Feng, Weijia Shi,..., Yulia Tsvetkov
22 2023-10-05 link EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models Yefei He, Jing Liu,..., Bohan Zhuang
22 2024-02-16 link Robust agents learn causal world models Jonathan Richens, Tom Everitt
22 2023-10-13 link METRA: Scalable Unsupervised RL with Metric-Aware Abstraction Seohong Park, Oleh Rybkin, Sergey Levine
21 2022-01-07 link Fair and efficient contribution valuation for vertical federated learning Zhenan Fan, Huang Fang,..., Yong Zhang
21 2023-10-12 link GROOT: Learning to Follow Instructions by Watching Gameplay Videos Shaofei Cai, Bowei Zhang,..., Yitao Liang
21 2023-12-12 link Remote Sensing Vision-Language Foundation Models without Annotations via Ground Remote Alignment Utkarsh Mall, Cheng Perng Phoo,..., Kavita Bala
21 None link On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs Jen-tse Huang, Wenxuan Wang,..., Michael Lyu
21 2024-04-17 link SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs Jaehyung Kim, Jaehyun Nam,..., Jinwoo Shin
21 2023-12-05 link MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following Renze Lou, Kai Zhang,..., Wenpeng Yin
21 2024-01-16 link Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis Zhenhui Ye, Tianyun Zhong,..., Zhou Zhao
20 2023-05-24 link Differentially Private Synthetic Data via Foundation Model APIs 1: Images Zinan Lin, Sivakanth Gopi,..., Sergey Yekhanin
20 None link Würstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models Pablo Pernias, Dominic Rampas,..., Marc Aubreville
20 2023-10-30 link DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization Guowei Xu, Ruijie Zheng,..., Huazhe Xu
20 2023-11-27 link DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer Junyuan Hong, Jiachen T. Wang,..., Zhangyang Wang
20 2023-10-06 link How to Capture Higher-order Correlations? Generalizing Matrix Softmax Attention to Kronecker Computation Josh Alman, Zhao Song
19 2024-03-18 link Graph Neural Networks for Learning Equivariant Representations of Neural Networks Miltiadis Kofinas, Boris Knyazev,..., David W. Zhang
19 2023-10-24 link MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning Zayne Rea Sprague, Xi Ye,..., Greg Durrett
19 2023-09-20 link Text2Reward: Reward Shaping with Language Models for Reinforcement Learning Tianbao Xie, Siheng Zhao,..., Tao Yu
19 2023-09-04 link On Penalty Methods for Nonconvex Bilevel Optimization and First-Order Stochastic Approximation Jeongyeol Kwon, Dohyun Kwon,..., Robert D Nowak
19 2022-11-01 link Two-stage LLM Fine-tuning with Less Specialization and More Generalization Yihan Wang, Si Si,..., Sanjiv Kumar
19 2023-11-07 link Beyond Imitation: Leveraging Fine-grained Quality Signals for Alignment Geyang Guo, Ranchi Zhao,..., Ji-Rong Wen
19 2023-12-07 link Graph Metanetworks for Processing Diverse Neural Architectures Derek Lim, Haggai Maron,..., James Lucas
19 2023-12-18 link Tuning LayerNorm in Attention: Towards Efficient Multi-Modal LLM Finetuning Bingchen Zhao, Haoqin Tu,..., Cihang Xie
19 2023-09-15 link Scaling Laws for Sparsely-Connected Foundation Models Elias Frantar, Carlos Riquelme Ruiz,..., Utku Evci
19 2023-06-08 link Protein Discovery with Discrete Walk-Jump Sampling Nathan C. Frey, Dan Berenberg,..., Saeed Saremi
19 2023-10-04 link Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised Learning with Masked Unit Prediction Jiatong Shi, Hirofumi Inaguma,..., Anna Sun
19 2023-08-03 link PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback Souradip Chakraborty, Amrit Bedi,..., Furong Huang
18 2023-10-10 link Geographic Location Encoding with Spherical Harmonics and Sinusoidal Representation Networks Marc Rußwurm, Konstantin Klemmer,..., Devis Tuia
18 2023-10-03 link Tensor Programs VI: Feature Learning in Infinite-Depth Neural Networks Greg Yang, Dingli Yu,..., Soufiane Hayou
18 2023-10-06 link Universal Humanoid Motion Representations for Physics-Based Control Zhengyi Luo, Jinkun Cao,..., Weipeng Xu
18 2023-10-24 link On the Foundations of Shortcut Learning Katherine Hermann, Hossein Mobahi,..., Michael Curtis Mozer
17 2023-10-19 link Frozen Transformers in Language Models Are Effective Visual Encoder Layers Ziqi Pang, Ziyang Xie,..., Yu-Xiong Wang
17 2023-11-07 link Multi-View Causal Representation Learning with Partial Observability Dingling Yao, Danru Xu,..., Francesco Locatello
17 2023-07-05 link Reverse Diffusion Monte Carlo Xunpeng Huang, Hanze Dong,..., Tong Zhang
17 2023-06-05 link PolyVoice: Language Models for Speech to Speech Translation Qian qian Dong, Zhiying Huang,..., Yuxuan Wang
17 2023-10-02 link Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy Pingzhi Li, Zhenyu Zhang,..., Tianlong Chen
17 2023-01-22 link Learning to Reject with a Fixed Predictor: Application to Decontextualization Christopher Mohri, Daniel Andor,..., Yutao Zhong
17 2024-03-29 link Negative Label Guided OOD Detection with Pretrained Vision-Language Models Xue Jiang, Feng Liu,..., Bo Han
17 None link An Image Is Worth 1000 Lies: Transferability of Adversarial
Images across Prompts on Vision-Language Models
Haochen Luo, Jindong Gu,..., Philip Torr
17 2023-10-04 link Never Train from Scratch: Fair Comparison of Long-Sequence Models
Requires Data-Driven Priors
Ido Amos, Jonathan Berant, Ankit Gupta
16 2023-09-30 link InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists Yulu Gan, Sungwoo Park,..., Ahmed Alaa
16 2024-04-15 link ClimODE: Climate and Weather Forecasting with Physics-informed Neural ODEs Yogesh Verma, Markus Heinonen, Vikas Garg
16 2024-03-07 link Mastering Memory Tasks with World Models Mohammad Reza Samsami, Artem Zholus,..., Sarath Chandar
16 2024-01-16 link Beyond Weisfeiler-Lehman: A Quantitative Framework for GNN Expressiveness Bohang Zhang, Jingchu Gai,..., Liwei Wang
16 2023-10-17 link Group Preference Optimization: Few-Shot Alignment of Large Language Models Siyan Zhao, John Dang, Aditya Grover
16 2023-05-25 link Implicit bias of SGD in L2-regularized linear DNNs: One-way
jumps from high to low rank
Zihan Wang, Arthur Jacot
16 2023-09-30 link AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ Jonas Belouadi, Anne Lauscher, Steffen Eger
16 2024-03-12 link Entropy is not Enough for Test-Time Adaptation: From the
Perspective of Disentangled Factors
Jonghyun Lee, Dahuin Jung,..., Sungroh Yoon
16 2023-09-29 link DyVal: Dynamic Evaluation of Large Language Models for Reasoning
Tasks
Kaijie Zhu, Jiaao Chen,..., Xing Xie
16 2023-11-01 link Plug-and-Play Policy Planner for Large Language Model Powered Dialogue
Agents
Yang Deng, Wenxuan Zhang,..., Tat-Seng Chua
16 2023-10-20 link Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in
Open Worlds
Sipeng Zheng, Jiazheng Liu,..., Zongqing Lu
16 None link DREAM: Dual Structured Exploration with Mixup for Open-set Graph
Domain Adaption
Nan Yin, Mengzhu Wang,..., Xiao Luo
15 2024-01-18 link Enabling Efficient Equivariant Operations in the Fourier Basis via
Gaunt Tensor Products
Shengjie Luo, Tianlang Chen, Aditi S. Krishnapriyan
15 2024-03-26 link Don't Trust: Verify - Grounding LLM Quantitative Reasoning with
Autoformalization
Jin Peng Zhou, Charles E Staats,..., Yuhuai Wu
15 2023-11-21 link Looped Transformers are Better at Learning Learning Algorithms Liu Yang, Kangwook Lee,..., Dimitris Papailiopoulos
15 2023-10-14 link Does CLIP's Generalization Performance Mainly Stem from High Train-Test
Similarity?
Prasanna Mayilvahanan, Thaddäus Wiedemer,..., Wieland Brendel
15 2023-12-17 link Learning to Act without Actions Dominik Schmidt, Minqi Jiang
15 2023-12-09 link Batched Low-Rank Adaptation of Foundation Models Yeming Wen, Swarat Chaudhuri
15 2023-10-09 link Sentence-level Prompts Benefit Composed Image Retrieval Yang Bai, Xinxing Xu,..., Chun-Mei Feng
15 2023-09-29 link Consistency Models as a Rich and Efficient Policy Class
for Reinforcement Learning
Zihan Ding, Chi Jin
14 2023-06-05 link Neuron Activation Coverage: Rethinking Out-of-distribution Detection and Generalization Yibing Liu, Chris Xing Tian,..., Shiqi Wang
14 2023-10-23 link Making RL with Preference-based Feedback Efficient via Randomization Runzhe Wu, Wen Sun
14 2023-10-02 link Controlling Vision-Language Models for Multi-Task Image Restoration Ziwei Luo, Fredrik K. Gustafsson,..., Thomas B. Schön
14 2023-02-10 link Forward Learning with Top-Down Feedback: Empirical and Analytical Characterization Ravi Francesco Srinivasan, Francesca Mignacco,..., Giorgia Dellaferrera
14 2024-02-07 link InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph
Prior
Chenguo Lin, Yadong Mu
14 None link Towards Reliable and Efficient Backdoor Trigger Inversion via Decoupling
Benign Features
Xiong Xu, Kunzhe Huang,..., Kui Ren
14 2024-01-03 link Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival
Prediction
Yilan Zhang, Yingxue Xu,..., Hao Chen
14 2024-02-22 link MAPE-PPI: Towards Effective and Efficient Protein-Protein Interaction Prediction via
Microenvironment-Aware Protein Embedding
Lirong Wu, Yijun Tian,..., Stan Z. Li
14 2023-10-29 link Bespoke Solvers for Generative Flow Models Neta Shaul, Juan Perez,..., Yaron Lipman
14 2024-01-19 link Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model Yinan Zheng, Jianxiong Li,..., Jingjing Liu
14 2024-03-19 link Do Generated Data Always Help Contrastive Learning? Yifei Wang, Jizhe Zhang, Yisen Wang
14 2023-11-28 link Is This the Subspace You Are Looking for? An
Interpretability Illusion for Subspace Activation Patching
Aleksandar Makelov, Georg Lange,..., Neel Nanda
14 2023-10-12 link CompA: Addressing the Gap in Compositional Reasoning in Audio-Language
Models
Sreyan Ghosh, Ashish Seth,..., Dinesh Manocha
14 2023-06-03 link Memorization Capacity of Multi-Head Attention in Transformers Sadegh Mahdavi, Renjie Liao, Christos Thrampoulidis
14 2022-09-19 link Topological data analysis on noisy quantum computers Ismail Yunus Akhalwaya, Shashanka Ubaru,..., Lior Horesh
14 2023-05-29 link On Diffusion Modeling for Anomaly Detection Victor Livernoche, Vineet Jain,..., Siamak Ravanbakhsh
14 2023-05-24 link Provable Offline Preference-Based Reinforcement Learning Wenhao Zhan, Masatoshi Uehara,..., Wen Sun
14 2023-02-16 link Dual RL: Unification and New Methods for Reinforcement and
Imitation Learning
Harshit Sikchi, Qinqing Zheng,..., Scott Niekum
14 2024-01-27 link Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement Learning Chenyu Zhang, Han Wang,..., James Anderson
14 2023-10-04 link Scaling Laws for Associative Memories Vivien Cabannes, Elvis Dohmatob, Alberto Bietti
14 2023-09-29 link Understanding and Mitigating the Label Noise in Pre-training on
Downstream Tasks
Hao Chen, Jindong Wang,..., Bhiksha Raj
14 2023-10-13 link The Consensus Game: Language Model Generation via Equilibrium Search Athul Paul Jacob, Yikang Shen,..., Jacob Andreas
13 2023-09-03 link Implicit regularization of deep residual networks towards neural ODEs Pierre Marion, Yu-Han Wu,..., Gérard Biau
13 2023-09-25 link Statistical Perspective of Top-K Sparse Softmax Gating Mixture of
Experts
Huy Nguyen, Pedram Akbarian,..., Nhat Ho
13 2023-03-02 link On the Provable Advantage of Unsupervised Pretraining Jiawei Ge, Shange Tang,..., Chi Jin
13 2024-01-09 link Evaluating Language Model Agency through Negotiations Tim Ruben Davidson, Veniamin Veselovsky,..., Robert West
13 2023-07-14 link SafeDreamer: Safe Reinforcement Learning with World Models Weidong Huang, Jiaming Ji,..., Yaodong Yang
13 2023-09-06 link ResFields: Residual Neural Fields for Spatiotemporal Signals Marko Mihajlovic, Sergey Prokudin,..., Siyu Tang
13 2023-04-14 link 3D Feature Prediction for Masked-AutoEncoder-Based Point Cloud Pretraining Siming Yan, Yuqi Yang,..., Qixing Huang
13 2024-01-16 link ValUES: A Framework for Systematic Validation of Uncertainty Estimation
in Semantic Segmentation
Kim-Celine Kahl, Carsten T. Lüth,..., Paul F Jaeger
13 2024-03-22 link DreamFlow: High-Quality Text-to-3D Generation by Approximating Probability Flow Kyungmin Lee, Kihyuk Sohn, Jinwoo Shin
13 None link Monte Carlo guided Denoising Diffusion models for Bayesian linear
inverse problems
Gabriel Cardoso, Yazid Janati el idrissi,..., Eric Moulines
13 2023-10-07 link Lemur: Integrating Large Language Models in Automated Program Verification Haoze Wu, Clark Barrett, Nina Narodytska
13 2024-01-19 link Large Language Models are Efficient Learners of Noise-Robust Speech
Recognition
Yuchen Hu, Chen Chen,..., EngSiong Chng
13 2023-05-22 link GraphCare: Enhancing Healthcare Predictions with Personalized Knowledge Graphs Pengcheng Jiang, Cao Xiao,..., Jimeng Sun
13 2023-10-03 link SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified
Pre-training
Kazem Meidani, Parshin Shojaee,..., Amir Barati Farimani
13 2024-02-28 link Deep Confident Steps to New Pockets: Strategies for Docking
Generalization
Gabriele Corso, Arthur Deng,..., Tommi S. Jaakkola
12 2023-10-11 link Score Regularized Policy Optimization through Diffusion Behavior Huayu Chen, Cheng Lu,..., Jun Zhu
12 2024-03-31 link Addressing Loss of Plasticity and Catastrophic Forgetting in Continual
Learning
Mohamed Elsayed, A. Rupam Mahmood
12 2023-10-09 link Provable Compositional Generalization for Object-Centric Learning Thaddäus Wiedemer, Jack Brady,..., Wieland Brendel
12 2023-10-04 link Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for
Decision Making
Jeonghye Kim, Suyoung Lee,..., Youngchul Sung
12 2021-02-18 link Adaptive Rational Activations to Boost Deep Reinforcement Learning Quentin Delfosse, Patrick Schramowski,..., Kristian Kersting
12 2024-02-16 link GIM: Learning Generalizable Image Matcher From Internet Videos Xuelun Shen, Zhipeng Cai,..., Cheng Wang
12 2024-02-05 link Curriculum reinforcement learning for quantum architecture search under hardware
errors
Yash J. Patel, Akash Kundu,..., Onur Danaci
12 2023-10-19 link Understanding Addition in Transformers Philip Quirke, Fazl Barez
12 2024-01-12 link Few-Shot Detection of Machine-Generated Text using Style Representations Rafael Alberto Rivera Soto, Kailin Koch,..., Nicholas Andrews
12 2023-10-06 link Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task
Datasets
Dominique Beaini, Shenyang Huang,..., Dominic Masters
12 2024-04-11 link PINNACLE: PINN Adaptive ColLocation and Experimental points selection Gregory Kang Ruey Lau, Apivich Hemachandra,..., Bryan Kian Hsiang Low
12 2023-06-08 link Yet Another ICU Benchmark: A Flexible Multi-Center Framework for
Clinical ML
Robin van de Water, Hendrik Nils Aurel Schmidt,..., Patrick Rockenschaub
12 2023-10-09 link ODEFormer: Symbolic Regression of Dynamical Systems with Transformers Stéphane d'Ascoli, Sören Becker,..., Niki Kilbertus
12 2023-10-10 link Correlated Noise Provably Beats Independent Noise for Differentially Private
Learning
Christopher A. Choquette-Choo, Krishnamurthy Dj Dvijotham,..., Abhradeep Guha Thakurta
11 2023-10-02 link Closing the Curious Case of Neural Text Degeneration Matthew Finlayson, John Hewitt,..., Ashish Sabharwal
11 2023-02-01 link Analyzing Feed-Forward Blocks in Transformers through the Lens of
Attention Maps
Goro Kobayashi, Tatsuki Kuribayashi,..., Kentaro Inui
11 2024-07-07 link PTaRL: Prototype-based Tabular Representation Learning via Space Calibration Hangting Ye, Wei Fan,..., Yi Chang
11 2023-10-12 link Is attention required for ICL? Exploring the Relationship Between
Model Architecture and In-Context Learning Ability
Ivan Lee, Nan Jiang, Taylor Berg-Kirkpatrick
11 2023-12-26 link Supervised Knowledge Makes Large Language Models Better In-context Learners Linyi Yang, Shuibai Zhang,..., Yue Zhang
11 2023-10-04 link High-dimensional SGD aligns with emerging outlier eigenspaces Gerard Ben Arous, Reza Gheissari,..., Aukosh Jagannath
11 None link Contrastive Preference Learning: Learning from Human Feedback without Reinforcement
Learning
Joey Hejna, Rafael Rafailov,..., Dorsa Sadigh
11 2023-11-25 link Unbalancedness in Neural Monge Maps Improves Unpaired Domain Translation Luca Eyring, Dominik Klein,..., Fabian J Theis
11 2023-10-18 link Improving Generalization of Alignment with Human Preferences through Group
Invariant Learning
Rui Zheng, Wei Shen,..., Xuanjing Huang
11 2023-07-26 link Are Transformers with One Layer Self-Attention Using Low-Rank Weight
Matrices Universal Approximators?
Tokio Kajitsuka, Issei Sato
11 2024-01-05 link Simple Hierarchical Planning with Diffusion Chang Chen, Fei Deng,..., Sungjin Ahn
11 2023-10-01 link Subtractive Mixture Models via Squaring: Representation and Learning Lorenzo Loconte, Aleksanteri Mikulus Sladek,..., Antonio Vergari
11 2023-09-28 link Transformer-VQ: Linear-Time Transformers via Vector Quantization Lucas Dax Lingle
11 None link Prompt Gradient Projection for Continual Learning Jingyang Qiao, Zhizhong Zhang,..., Yuan Xie
11 2024-04-17 link Variational Bayesian Last Layers James Harrison, John Willes, Jasper Snoek
11 2023-10-23 link Ghost on the Shell: An Expressive Representation of General
3D Shapes
Zhen Liu, Yao Feng,..., Bernhard Schölkopf
11 2023-10-19 link Towards Robust Offline Reinforcement Learning under Diverse Data Corruption Rui Yang, Han Zhong,..., Tong Zhang
11 2023-09-25 link Towards a statistical theory of data selection under weak
supervision
Germain Kolossov, Andrea Montanari, Pulkit Tandon
11 2023-06-17 link Understanding Certified Training with Interval Bound Propagation Yuhao Mao, Mark Niklas Mueller,..., Martin Vechev
11 2023-08-08 link Sample-Efficient Linear Representation Learning from Non-IID Non-Isotropic Data Thomas TCK Zhang, Leonardo Felipe Toso,..., Nikolai Matni
11 2024-01-19 link Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning Yiwei Li, Peiwen Yuan,..., Kan Li
11 2024-03-24 link VCR-Graphormer: A Mini-batch Graph Transformer via Virtual Connections Dongqi Fu, Zhigang Hua,..., Bo Long
11 2023-11-19 link Unmasking and Improving Data Credibility: A Study with Datasets
for Training Harmless Language Models
Zhaowei Zhu, Jialu Wang,..., Yang Liu
11 None link The Devil is in the Neurons: Interpreting and Mitigating
Social Biases in Language Models
Yan Liu, Yu Liu,..., Tsung-Yi Ho
11 2023-09-29 link Navigating the Design Space of Equivariant Diffusion-Based Generative Models
for De Novo 3D Molecule Generation
Tuan Le, Julian Cremer,..., Kristof T Schütt
10 2023-10-25 link From Molecules to Materials: Pre-training Large Generalizable Models for
Atomic Property Prediction
Nima Shoghi, Adeesh Kolluru,..., Brandon M Wood
10 2023-11-08 link GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing
ModulEs
Zhenfang Chen, Rui Sun,..., Chuang Gan
10 2023-05-30 link Exploring the Promise and Limits of Real-Time Recurrent Learning Kazuki Irie, Anand Gopalakrishnan, Jürgen Schmidhuber
10 2023-10-04 link Generative Modeling of Regular and Irregular Time Series Data
via Koopman VAEs
Ilan Naiman, N. Benjamin Erichson,..., Omri Azencot
10 2023-09-03 link Traveling Waves Encode the Recent Past and Enhance Sequence
Learning
T. Anderson Keller, Lyle Muller,..., Max Welling
10 2023-07-18 link UniTabE: A Universal Pretraining Protocol for Tabular Foundation Model
in Data Science
Yazheng Yang, Yuqi Wang,..., Qi Liu
10 2023-10-05 link Logical Languages Accepted by Transformer Encoders with Hard Attention Pablo Barcelo, Alexander Kozachinskiy,..., Vladimir Podolskii
10 2023-11-07 link Selective Visual Representations Improve Convergence and Generalization for Embodied
AI
Ainaz Eftekhar, Kuo-Hao Zeng,..., Ranjay Krishna
10 2023-11-13 link Feature emergence via margin maximization: case studies in algebraic
tasks
Depen Morwani, Benjamin L. Edelman,..., Sham M. Kakade
10 2023-10-11 link Denoising Task Routing for Diffusion Models Byeongjun Park, Sangmin Woo,..., Changick Kim
10 2023-06-06 link Quick-Tune: Quickly Learning Which Pretrained Model to Finetune and
How
Sebastian Pineda Arango, Fabio Ferreira,..., Josif Grabocka
10 None link Conversational Drug Editing Using Retrieval and Domain Feedback Shengchao Liu, Jiongxiao Wang,..., Chaowei Xiao
10 2023-06-14 link Diffusion in Diffusion: Cyclic One-Way Diffusion for Text-Vision-Conditioned Generation Ruoyu Wang, Yongqi Yang,..., Yu Wu
9 None link The False Promise of Imitating Proprietary Language Models Arnav Gudibande, Eric Wallace,..., Dawn Song
9 2023-05-23 link Proximal Policy Gradient Arborescence for Quality Diversity Reinforcement Learning Sumeet Batra, Bryon Tjanaka,..., Gaurav S. Sukhatme
9 2023-10-01 link Learning to Make Adherence-Aware Advice Guanting Chen, Xiaocheng Li,..., Hanzhao Wang
9 2023-11-30 link Initializing Models with Larger Ones Zhiqiu Xu, Yanjie Chen,..., Zhuang Liu
9 None link PolyGCL: Graph Contrastive Learning via Learnable Spectral Polynomial Filters Jingyu Chen, Runlin Lei, Zhewei Wei
9 2023-09-06 link SEAL: A Framework for Systematic Evaluation of Real-World Super-Resolution Wenlong Zhang, Xiaohui Li,..., Chao Dong
9 2023-10-11 link Data Distillation Can Be Like Vodka: Distilling More Times
For Better Quality
Xuxi Chen, Yu Yang,..., Baharan Mirzasoleiman
9 2024-01-19 link CivRealm: A Learning and Reasoning Odyssey in Civilization for
Decision-Making Agents
Siyuan Qi, Shuo Chen,..., Song-Chun Zhu
9 2023-08-25 link SEGNO: Generalizing Equivariant Graph Neural Networks with Physical Inductive
Biases
Yang Liu, Jiashun Cheng,..., Yu Rong
9 2024-01-22 link EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models Koichi Namekata, Amirmojtaba Sabour,..., Seung Wook Kim
9 2023-09-21 link Quasi-Monte Carlo for 3D Sliced Wasserstein Khai Nguyen, Nicola Bariletto, Nhat Ho
9 2023-12-19 link Adversarial AutoMixup Huafeng Qin, Xin Jin,..., Xinbo Gao
9 2023-02-28 link An Efficient Tester-Learner for Halfspaces Aravind Gollakota, Adam Klivans,..., Arsen Vasilyan
9 2023-07-25 link Submodular Reinforcement Learning Manish Prajapat, Mojmir Mutny,..., Andreas Krause
9 2022-10-04 link Neural-Symbolic Recursive Machine for Systematic Generalization Qing Li, Yixin Zhu,..., Siyuan Huang
9 2023-06-12 link Unprocessing Seven Years of Algorithmic Fairness André Cruz, Moritz Hardt
9 2022-12-06 link Image Inpainting via Iteratively Decoupled Probabilistic Modeling Wenbo Li, Xin Yu,..., Zhe Lin
9 2023-10-02 link Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits Qiwei Di, Tao Jin,..., Quanquan Gu
9 2023-03-10 link MovingParts: Motion-based 3D Part Discovery in Dynamic Radiance Field Kaizhi Yang, Xiaoshuai Zhang,..., Hao Su
9 2023-10-21 link Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models
with IdentityChain
Marcus J. Min, Yangruibo Ding,..., Baishakhi Ray
9 2024-03-13 link DIFFTACTILE: A Physics-based Differentiable Tactile Simulator for Contact-rich Robotic
Manipulation
Zilin Si, Gu Zhang,..., Chuang Gan
9 2024-04-30 link MetaCoCo: A New Few-Shot Classification Benchmark with Spurious Correlation Min Zhang, Haoxuan Li,..., Kun Kuang
9 2023-10-06 link Identifying Representations for Intervention Extrapolation Sorawit Saengkyongam, Elan Rosenfeld,..., Jonas Peters
9 2023-10-11 link SpikePoint: An Efficient Point-based Spiking Neural Network for Event
Cameras Action Recognition
Hongwei Ren, Yue Zhou,..., Bojun Cheng
9 2023-10-03 link Stack Attention: Improving the Ability of Transformers to Model
Hierarchical Patterns
Brian DuSell, David Chiang
9 2023-10-11 link Generative Modeling with Phase Stochastic Bridges Tianrong Chen, Jiatao Gu,..., Shuangfei Zhai
9 2023-10-09 link An operator preconditioning perspective on training in physics-informed machine
learning
Tim De Ryck, Florent Bonnet,..., Emmanuel de Bezenac
9 2024-01-30 link Multi-granularity Correspondence Learning from Long-term Noisy Videos Yijie Lin, Jie Zhang,..., Xi Peng
9 2023-11-06 link Tailoring Self-Rationalizers with Multi-Reward Distillation Sahana Ramnath, Brihi Joshi,..., Xiang Ren
9 2024-02-07 link LLMs Meet VLMs: Boost Open Vocabulary Object Detection with
Fine-grained Descriptors
Sheng Jin, Xueying Jiang,..., Shijian Lu
8 2024-05-16 link Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion
Models
Ziyu Wang, Lejun Min, Gus Xia
8 2023-07-26 link ExeDec: Execution Decomposition for Compositional Generalization in Neural Program
Synthesis
Kensen Shi, Joey Hong,..., Charles Sutton
8 None link Generative Learning for Financial Time Series with Irregular and
Scale-Invariant Patterns
Hongbin Huang, Minghua Chen, Xiao Qiao
8 2023-10-12 link Is ImageNet worth 1 video? Learning strong image encoders
from 1 long unlabelled video
Shashanka Venkataramanan, Mamshad Nayeem Rizve,..., Yannis Avrithis
8 2023-10-04 link USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields Moyang Li, Peng Wang,..., Peidong Liu
8 None link Flow to Better: Offline Preference-based Reinforcement Learning via Preferred
Trajectory Generation
Zhilong Zhang, Yihao Sun,..., Yang Yu
8 None link TabR: Tabular Deep Learning Meets Nearest Neighbors Yury Gorishniy, Ivan Rubachev,..., Artem Babenko
8 None link Towards Energy Efficient Spiking Neural Networks: An Unstructured Pruning
Framework
Xinyu Shi, Jianhao Ding,..., Zhaofei Yu
8 2023-05-28 link Repeated Random Sampling for Minimizing the Time-to-Accuracy of Learning Patrik Okanovic, Roger Waleffe,..., Theodoros Rekatsinas
8 2023-06-01 link Understanding Augmentation-based Self-Supervised Representation Learning via RKHS Approximation Runtian Zhai, Bingbin Liu,..., Pradeep Kumar Ravikumar
8 2024-03-31 link Label-Agnostic Forgetting: A Supervision-Free Unlearning in Deep Models Shaofei Shen, Chenhao Zhang,..., Miao Xu
8 2024-01-24 link Navigating Dataset Documentations in AI: A Large-Scale Analysis of
Dataset Cards on Hugging Face
Xinyu Yang, Weixin Liang, James Zou
8 2024-04-16 link Hierarchical Context Merging: Better Long Context Understanding for Pre-trained
LLMs
Woomin Song, Seunghyuk Oh,..., Jinwoo Shin
8 2023-07-11 link Benchmarking Algorithms for Federated Domain Generalization Ruqi Bai, Saurabh Bagchi, David I. Inouye
8 2024-02-17 link Boosting of Thoughts: Trial-and-Error Problem Solving with Large Language
Models
Sijia Chen, Baochun Li, Di Niu
8 2024-01-23 link Energy-based Automated Model Evaluation Ru Peng, Heming Zou,..., Junbo Zhao
8 None link Causal Modelling Agents: Causal Graph Discovery through Synergising Metadata-
and Data-driven Reasoning
Ahmed Abdulaal, Adamos Hadjivasiliou,..., Daniel C. Alexander
8 2023-11-01 link Instructive Decoding: Instruction-Tuned Large Language Models are Self-Refiner from
Noisy Instructions
Taehyeon Kim, Joonkee Kim,..., Se-Young Yun
8 2024-01-03 link On the hardness of learning under symmetries Bobak Kiani, Thien Le,..., Melanie Weber
8 2023-09-26 link SGD Finds then Tunes Features in Two-Layer Neural Networks
with near-Optimal Sample Complexity: A Case Study in the XOR problem
Margalit Glasgow
8 2023-08-30 link RetroBridge: Modeling Retrosynthesis with Markov Bridges Ilia Igashov, Arne Schneuing,..., Bruno Correia
8 2023-11-07 link A Simple Interpretable Transformer for Fine-Grained Image Classification and
Analysis
Dipanjyoti Paul, Arpita Chowdhury,..., Wei-Lun Chao
8 2023-10-02 link L2MAC: Large Language Model Automatic Computer for Extensive Code
Generation
Samuel Holt, Max Ruiz Luyten, Mihaela van der Schaar
8 2023-10-05 link Pre-Training and Fine-Tuning Generative Flow Networks Ling Pan, Moksh Jain,..., Yoshua Bengio
8 2023-10-03 link GNNX-BENCH: Unravelling the Utility of Perturbation-based GNN Explainers through
In-depth Benchmarking
Mert Kosan, Samidha Verma,..., Sayan Ranu
8 2024-01-28 link Neural Network-Based Score Estimation in Diffusion Models: Optimization and
Generalization
Yinbin Han, Meisam Razaviyayn, Renyuan Xu
8 None link Meta Continual Learning Revisited: Implicitly Enhancing Online Hessian Approximation
via Variance Reduction
Yichen Wu, Long-Kai Huang,..., Ying Wei
8 None link Improving Non-Transferable Representation Learning by Harnessing Content and Style Ziming Hong, Zhenyi Wang,..., Tongliang Liu
8 2023-11-06 link CoVLM: Composing Visual Entities and Relationships in Large Language
Models Via Communicative Decoding
Junyan Li, Delin Chen,..., Chuang Gan
8 2023-11-19 link Bounds on Representation-Induced Confounding Bias for Treatment Effect Estimation Valentyn Melnychuk, Dennis Frauen, Stefan Feuerriegel
8 2024-01-17 link Idempotence and Perceptual Image Compression Tongda Xu, Ziran Zhu,..., Ya-Qin Zhang
8 2023-12-22 link Federated Q-Learning: Linear Regret Speedup with Low Communication Cost Zhong Zheng, Fengyu Gao,..., Jing Yang
8 2023-10-02 link From Bricks to Bridges: Product of Invariances to Enhance
Latent Space Communication
Irene Cannistraci, Luca Moschella,..., Emanuele Rodolà
8 2023-10-02 link Tool-Augmented Reward Modeling Lei Li, Yekun Chai,..., Hua Wu
7 2023-12-26 link Social-Transmotion: Promptable Human Trajectory Prediction Saeed Saadatnejad, Yang Gao,..., Alexandre Alahi
7 2023-10-25 link Frequency-Aware Transformer for Learned Image Compression Han Li, Shaohui Li,..., Hongkai Xiong
7 2023-11-27 link Maximum Likelihood Estimation is All You Need for Well-Specified
Covariate Shift
Jiawei Ge, Shange Tang,..., Chi Jin
7 2023-10-03 link Towards Training Without Depth Limits: Batch Normalization Without Gradient
Explosion
Alexandru Meterez, Amir Joudaki,..., Hadi Daneshmand
7 None link Dissecting learning and forgetting in language model finetuning Xiao Zhang, Ji Wu
7 None link Test-time Adaptation against Multi-modal Reliability Bias Mouxing Yang, Yunfan Li,..., Xi Peng
7 2023-10-14 link Mirage: Model-Agnostic Graph Distillation for Graph Classification Mridul Gupta, Sahil Manchanda,..., Sayan Ranu
7 None link CircuitNet 2.0: An Advanced Dataset for Promoting Machine Learning
Innovations in Realistic Chip Design Environment
Xun Jiang, Zhuomin Chai,..., Ru Huang
7 2023-09-01 link Large Content And Behavior Models To Understand, Simulate, And
Optimize Content And Behavior
Ashmit Khandelwal, Aditya Agrawal,..., Balaji Krishnamurthy
7 None link Pre-training Sequence, Structure, and Surface Features for Comprehensive Protein
Representation Learning
Youhan Lee, Hasun Yu,..., Jaehoon Kim
7 2024-01-16 link Explaining Time Series via Contrastive and Locally Sparse Perturbations Zichuan Liu, Yingying Zhang,..., Qingsong Wen
7 2023-12-02 link Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems Juno Kim, Kakei Yamamoto,..., Taiji Suzuki
7 2023-05-23 link Expressive Losses for Verified Robustness via Convex Combinations Alessandro De Palma, Rudy R Bunel,..., Alessio Lomuscio
7 2023-07-28 link Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation Xuefei Ning, Zinan Lin,..., Yu Wang
7 2023-10-17 link Seeking Neural Nuggets: Knowledge Transfer in Large Language Models
from a Parametric Perspective
Ming Zhong, Chenxin An,..., Pengcheng He
7 None link Sparse MoE with Language Guided Routing for Multilingual Machine
Translation
Xinyu Zhao, Xuxi Chen,..., Tianlong Chen
7 None link Concept Bottleneck Generative Models Aya Abdelsalam Ismail, Julius Adebayo,..., Kyunghyun Cho
7 2023-04-04 link On the Variance of Neural Network Training with respect
to Test Sets and Distributions
Keller Jordan
7 2022-11-16 link A Stable, Fast, and Fully Automatic Learning Algorithm for
Predictive Coding Networks
Tommaso Salvatori, Yuhang Song,..., Thomas Lukasiewicz
7 2023-10-27 link Image Clustering Conditioned on Text Criteria Sehyun Kwon, Jaeseung Park,..., Kangwook Lee
7 2023-10-10 link Let Models Speak Ciphers: Multiagent Debate through Embeddings Chau Pham, Boyi Liu,..., Hongxia Yang
7 None link Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning Haoqi Yuan, Zhancun Mu,..., Zongqing Lu
7 2023-10-18 link De novo protein design using geometric vector field networks Weian Mao, Muzhi Zhu,..., Chunhua Shen
7 2023-05-30 link Inverse Approximation Theory for Nonlinear Recurrent Neural Networks Shida Wang, Zhong Li, Qianxiao Li
7 2023-09-10 link Learning Energy-Based Models by Cooperative Diffusion Recovery Likelihood Yaxuan Zhu, Jianwen Xie,..., Ruiqi Gao
7 None link Language Model Detectors Are Easily Optimized Against Charlotte Nicks, Eric Mitchell,..., Stefano Ermon
7 2023-12-13 link Revisiting the Last-Iterate Convergence of Stochastic Gradient Methods Zijian Liu, Zhengyuan Zhou
7 2023-05-22 link Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting Xinlu Zhang, Shiyang Li,..., Linda Ruth Petzold
7 2024-02-14 link CLIP-MUSED: CLIP-Guided Multi-Subject Visual Neural Information Semantic Decoding Qiongyi Zhou, Changde Du,..., Huiguang He
7 2024-02-20 link Scaling physics-informed hard constraints with mixture-of-experts Nithin Chalapathi, Yiheng Du, Aditi S. Krishnapriyan
7 2024-02-22 link Stable Neural Stochastic Differential Equations in Analyzing Irregular Time
Series Data
YongKyung Oh, Dongyoung Lim, Sungil Kim
7 2024-02-01 link ODICE: Revealing the Mystery of Distribution Correction Estimation via
Orthogonal-gradient Update
Liyuan Mao, Haoran Xu,..., Xianyuan Zhan
7 None link Tackling the Data Heterogeneity in Asynchronous Federated Learning with
Cached Update Calibration
Yujia Wang, Yuanpu Cao,..., Jinghui Chen
7 2023-12-27 link Soft Contrastive Learning for Time Series Seunghan Lee, Taeyoung Park, Kibok Lee
6 None link What Makes a Good Prune? Maximal Unstructured Pruning for
Maximal Cosine Similarity
Gabryel Mason-Williams, Fredrik Dahlqvist
6 2023-06-08 link SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking Chris Cundy, Stefano Ermon
6 2023-10-17 link Context-Aware Meta-Learning Christopher Fifty, Dennis Duan,..., Sebastian Thrun
6 2022-06-14 link Toward Student-oriented Teacher Network Training for Knowledge Distillation Chengyu Dong, Liyuan Liu, Jingbo Shang
6 2023-10-31 link Vanishing Gradients in Reinforcement Finetuning of Language Models Noam Razin, Hattie Zhou,..., Etai Littwin
6 2023-05-22 link Improving Convergence and Generalization Using Parameter Symmetries Bo Zhao, Robert M. Gower,..., Rose Yu
6 2023-10-26 link Fantastic Gains and Where to Find Them: On the
Existence and Prospect of General Knowledge Transfer between Any Pretrained Model
Karsten Roth, Lukas Thede,..., Zeynep Akata
6 2024-03-18 link Investigating the Benefits of Projection Head for Representation Learning Yihao Xue, Eric Gan,..., Baharan Mirzasoleiman
6 2023-10-09 link Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language
Models
Archiki Prasad, Elias Stengel-Eskin, Mohit Bansal
6 None link How I Warped Your Noise: a Temporally-Correlated Noise Prior
for Diffusion Models
Pascal Chang, Jingwei Tang,..., Vinicius C. Azevedo
6 None link "What Data Benefits My Classifier?" Enhancing Model Performance and
Interpretability through Influence-Based Data Selection
Anshuman Chhabra, Peizhao Li,..., Hongfu Liu
6 2023-11-24 link Large Language Models as Automated Aligners for benchmarking Vision-Language
Models
Yuanfeng Ji, Chongjian Ge,..., Ping Luo
6 2023-09-29 link CrossLoco: Human Motion Driven Control of Legged Robots via
Guided Unsupervised Reinforcement Learning
Tianyu Li, Hyunyoung Jung,..., Sehoon Ha
6 None link Sample-Efficient Quality-Diversity by Cooperative Coevolution Ke Xue, Ren-Jian Wang,..., Chao Qian
6 2023-02-13 link Generative Adversarial Equilibrium Solvers Denizalp Goktas, David C. Parkes,..., Andrea Tacchetti
6 2023-10-30 link LitCab: Lightweight Language Model Calibration over Short- and Long-form
Responses
Xin Liu, Muhammad Khalifa, Lu Wang
6 2024-01-24 link Task structure and nonlinearity jointly determine learned representational geometry Matteo Alleman, Jack Lindsey, Stefano Fusi
6 2023-10-02 link Robustifying State-space Models for Long Sequences via Approximate Diagonalization Annan Yu, Arnur Nigmetov,..., N. Benjamin Erichson
6 2024-01-18 link Harnessing Density Ratios for Online Reinforcement Learning Philip Amortila, Dylan J Foster,..., Tengyang Xie
6 2023-10-06 link Asymptotically Free Sketched Ridge Ensembles: Risks, Cross-Validation, and Tuning Pratik Patil, Daniel LeJeune
6 None link Graph Transformers on EHRs: Better Representation Improves Downstream Performance Raphael Poulain, Rahmatollah Beheshti
6 2023-10-17 link Lie Group Decompositions for Equivariant Neural Networks Mircea Mironenco, Patrick Forré
6 2024-03-07 link Dissecting Sample Hardness: A Fine-Grained Analysis of Hardness Characterization
Methods for Data-Centric AI
Nabeel Seedat, Fergus Imrie, Mihaela van der Schaar
6 2023-10-03 link How Over-Parameterization Slows Down Gradient Descent in Matrix Sensing:
The Curses of Symmetry and Initialization
Nuoya Xiong, Lijun Ding, Simon Shaolei Du
6 2023-10-10 link Approximating Nash Equilibria in Normal-Form Games via Stochastic Optimization Ian Gemp, Luke Marris, Georgios Piliouras
6 2023-05-26 link Exploring Weight Balancing on Long-Tailed Recognition Problem Naoya Hasegawa, Issei Sato
6 2024-04-30 link Debiased Collaborative Filtering with Kernel-Based Causal Balancing Haoxuan Li, Chunyuan Zheng,..., Peng Cui
6 2024-03-07 link On the Markov Property of Neural Algorithmic Reasoning: Analyses
and Methods
Montgomery Bohde, Meng Liu,..., Shuiwang Ji
6 2023-10-12 link Visual Data-Type Understanding does not emerge from Scaling Vision-Language
Models
Vishaal Udandarao, Max F Burg,..., Matthias Bethge
6 2023-06-03 link DOS: Diverse Outlier Sampling for Out-of-Distribution Detection Wenyu Jiang, Hao Cheng,..., Hongxin Wei
6 None link Learning Hierarchical World Models with Adaptive Temporal Abstractions from
Discrete Latent Dynamics
Christian Gumbsch, Noor Sajid,..., Martin V. Butz
6 None link Training-free Multi-objective Diffusion Model for 3D Molecule Generation Xu Han, Caihua Shan,..., Dongsheng Li
6 2023-02-06 link Improving Domain Generalization with Domain Relations Huaxiu Yao, Xinyu Yang,..., Chelsea Finn
6 2023-05-23 link Point2SSM: Learning Morphological Variations of Anatomies from Point Cloud Jadie Adams, Shireen Elhabian
6 2024-02-07 link Towards Aligned Layout Generation via Diffusion Model with Aesthetic
Constraints
Jian Chen, Ruiyi Zhang,..., Changyou Chen
6 2024-02-11 link Open-ended VQA benchmarking of Vision-Language models by exploiting Classification
datasets and their semantic hierarchy
Simon Ging, Maria Alejandra Bravo, Thomas Brox
6 2024-03-05 link TESTAM: A Time-Enhanced Spatio-Temporal Attention Model with Mixture of
Experts
Hyunwook Lee, Sungahn Ko
6 None link NuwaDynamics: Discovering and Updating in Causal Spatio-Temporal Modeling Kun Wang, Hao Wu,..., Yang Wang
6 2023-11-24 link A General Framework for User-Guided Bayesian Optimization Carl Hvarfner, Frank Hutter, Luigi Nardi
6 2023-10-24 link Privacy Amplification for Matrix Mechanisms Christopher A. Choquette-Choo, Arun Ganesh,..., Abhradeep Guha Thakurta
6 2024-01-31 link Rethinking Channel Dependence for Multivariate Time Series Forecasting: Learning
from Leading Indicators
Lifan Zhao, Yanyan Shen
6 2023-10-05 link Multimarginal generative modeling with stochastic interpolants Michael Samuel Albergo, Nicholas Matthew Boffi,..., Eric Vanden-Eijnden
6 2023-10-10 link CoT3DRef: Chain-of-Thoughts Data-Efficient 3D Visual Grounding Eslam Mohamed Bakr, Mohamed Ayman Mohamed,..., Mohamed Elhoseiny
6 2023-10-04 link ED-NeRF: Efficient Text-Guided Editing of 3D Scene With Latent
Space NeRF
JangHo Park, Gihyun Kwon, Jong Chul Ye
6 None link Consistency Training with Learnable Data Augmentation for Graph Anomaly
Detection with Limited Supervision
Nan Chen, Zemin Liu,..., Jia Chen
6 2024-03-10 link Multisize Dataset Condensation Yang He, Lingao Xiao,..., Ivor Tsang
6 2023-12-06 link Customizable Combination of Parameter-Efficient Modules for Multi-Task Learning Haowen Wang, Tao Sun,..., Cong Fan
6 2024-11-16 link Partitioning Message Passing for Graph Fraud Detection Wei Zhuo, Zemin Liu,..., Jia Chen
5 None link BarLeRIa: An Efficient Tuning Framework for Referring Image Segmentation Yaoming Wang, Jin Li,..., Qi Tian
5 None link EQA-MX: Embodied Question Answering using Multimodal Expression Md Mofijul Islam, Alexi Gladstone,..., Tariq Iqbal
5 2024-01-16 link Bayes Conditional Distribution Estimation for Knowledge Distillation Based on
Conditional Mutual Information
Linfeng Ye, Shayan Mohajer Hamidi,..., En-Hui Yang
5 2023-03-21 link Influencer Backdoor Attack on Semantic Segmentation Haoheng Lan, Jindong Gu,..., Hengshuang Zhao
5 2024-05-01 link Are Models Biased on Text without Gender-related Language? Catarina G Belém, Preethi Seshadri,..., Sameer Singh
5 2023-10-28 link Pre-training with Random Orthogonal Projection Image Modeling Maryam Haghighat, Peyman Moghadam,..., Piotr Koniusz
5 2023-02-21 link Some Fundamental Aspects about Lipschitz Continuity of Neural Networks Grigory Khromov, Sidak Pal Singh
5 2023-10-09 link Predictive auxiliary objectives in deep RL mimic learning in
the brain
Ching Fang, Kim Stachenfeld
5 2023-10-09 link DyST: Towards Dynamic Neural Scene Representations on Real-World Videos Maximilian Seitzer, Sjoerd van Steenkiste,..., Mehdi S. M. Sajjadi
5 2023-10-31 link Stochastic Gradient Descent for Gaussian Processes Done Right Jihao Andreas Lin, Shreyas Padhy,..., David Janz
5 None link GAFormer: Enhancing Timeseries Transformers Through Group-Aware Embeddings Jingyun Xiao, Ran Liu, Eva L Dyer
5 2023-04-01 link Abstractors and relational cross-attention: An inductive bias for explicit
relational reasoning in Transformers
Awni Altabaa, Taylor Whittington Webb,..., John Lafferty
5 2023-05-23 link Faithful and Efficient Explanations for Neural Networks via Neural
Tangent Kernel Surrogate Models
Andrew William Engel, Zhichao Wang,..., Tony Chiang
5 2024-01-17 link Bilevel Optimization under Unbounded Smoothness: A New Algorithm and
Convergence Analysis
Jie Hao, Xiaochuan Gong, Mingrui Liu
5 2020-08-09 link Treatment Effects Estimation By Uniform Transformer Ruoqi Yu, Shulei Wang
5 2024-02-19 link Graph-based Virtual Sensing from Sparse and Partial Multivariate Observations Giovanni De Felice, Andrea Cini,..., Cesare Alippi
5 2023-11-25 link Coordinate-Aware Modulation for Neural Fields Joo Chan Lee, Daniel Rho,..., Eunbyung Park
5 2023-10-13 link Jointly-Learned Exit and Inference for a Dynamic Neural Network: JEI-DNN
Florence Regol, Joud Chataoui, Mark Coates
5 2024-03-19 link Predictive, scalable and interpretable knowledge tracing on structured domains Hanqi Zhou, Robert Bamler,..., Álvaro Tejero-Cantero
5 2023-10-02 link BTR: Binary Token Representations for Efficient Retrieval Augmented Language
Models
Qingqing Cao, Sewon Min,..., Hannaneh Hajishirzi
5 2024-03-06 link Sparse Spiking Neural Network: Exploiting Heterogeneity in Timescales for
Pruning Recurrent SNN
Biswadeep Chakraborty, Beomseok Kang,..., Saibal Mukhopadhyay
5 2024-04-02 link Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models Kyuyoung Kim, Jongheon Jeong,..., Kimin Lee
5 2023-07-18 link Grounded Object-Centric Learning Avinash Kori, Francesco Locatello,..., Ben Glocker
5 None link Fast Imitation via Behavior Foundation Models Matteo Pirotta, Andrea Tirinzoni,..., Yann Ollivier
5 2024-01-22 link Retrieval-Guided Reinforcement Learning for Boolean Circuit Minimization Animesh Basak Chowdhury, Marco Romanelli,..., Siddharth Garg
5 2023-10-19 link To grok or not to grok: Disentangling generalization and
memorization on corrupted algorithmic datasets
Darshil Doshi, Aritra Das,..., Andrey Gromov
5 2023-05-27 link Diffeomorphic Mesh Deformation via Efficient Optimal Transport for Cortical
Surface Reconstruction
Thanh Tung Le, Khai Nguyen,..., Xiaohui Xie
5 None link GNNCert: Deterministic Certification of Graph Neural Networks against Adversarial
Perturbations
Zaishuo Xia, Han Yang,..., Jinyuan Jia
5 2023-05-24 link Sharpness-Aware Data Poisoning Attack Pengfei He, Han Xu,..., Jiliang Tang
5 None link SaNN: Simple Yet Powerful Simplicial-aware Neural Networks Sravanthi Gurugubelli, Sundeep Prabhakar Chepuri
5 2023-11-22 link Prompt Risk Control: A Rigorous Framework for Responsible Deployment
of Large Language Models
Thomas P Zollo, Todd Morrill,..., Richard Zemel
5 2023-06-01 link Improving Offline RL by Blending Heuristics Sinong Geng, Aldo Pacchiano,..., Ching-An Cheng
5 2024-02-29 link Masks, Signs, And Learning Rate Rewinding Advait Harshal Gadhikar, Rebekka Burkholz
5 2024-03-15 link Backdoor Secrets Unveiled: Identifying Backdoor Data with Optimized Scaled
Prediction Consistency
Soumyadeep Pal, Yuguang Yao,..., Sijia Liu
5 2024-03-19 link Non-negative Contrastive Learning Yifei Wang, Qi Zhang,..., Yisen Wang
5 2023-10-03 link Blending Imitation and Reinforcement Learning for Robust Policy Improvement Xuefeng Liu, Takuma Yoneda,..., Yuxin Chen
5 2024-01-23 link Locality Sensitive Sparse Encoding for Learning World Models Online Zichen Liu, Chao Du,..., Min Lin
5 None link A Good Learner can Teach Better: Teacher-Student Collaborative Knowledge
Distillation
Ayan Sengupta, Shantanu Dixit,..., Tanmoy Chakraborty
5 2024-03-25 link Grounding Language Plans in Demonstrations Through Counterfactual Perturbations Yanwei Wang, Tsun-Hsuan Wang,..., Julie Shah
5 2023-12-08 link Neural Spectral Methods: Self-supervised learning in the spectral domain Yiheng Du, Nithin Chalapathi, Aditi S. Krishnapriyan
5 None link Towards Understanding Factual Knowledge of Large Language Models Xuming Hu, Junzhe Chen,..., Zhijiang Guo
5 2024-03-16 link CORN: Contact-based Object Representation for Nonprehensile Manipulation of General
Unseen Objects
Yoonyoung Cho, Junhyek Han,..., Beomjoon Kim
5 2023-10-11 link What Matters to You? Towards Visual Representation Alignment for
Robot Learning
Thomas Tian, Chenfeng Xu,..., Andrea Bajcsy
5 2023-05-27 link Query-Policy Misalignment in Preference-Based Reinforcement Learning Xiao Hu, Jianxiong Li,..., Ya-Qin Zhang
5 2023-05-24 link Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models Ashutosh Baheti, Ximing Lu,..., Mark Riedl
5 2024-03-17 link COLEP: Certifiably Robust Learning-Reasoning Conformal Prediction via Probabilistic Circuits Mintong Kang, Nezihe Merve Gürel,..., Bo Li
5 2023-05-18 link Massively Scalable Inverse Reinforcement Learning in Google Maps Matt Barnes, Matthew Abueg,..., Shawn O'Banion
5 2024-02-13 link H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object
Surface Fields
Minyoung Park, Mirae Do,..., Chul Lee
5 None link Unified Generative Modeling of 3D Molecules with Bayesian Flow
Networks
Yuxuan Song, Jingjing Gong,..., Wei-Ying Ma
5 2023-10-31 link Contrastive Difference Predictive Coding Chongyi Zheng, Ruslan Salakhutdinov, Benjamin Eysenbach
5 None link On the Scalability and Memory Efficiency of Semidefinite Programs
for Lipschitz Constant Estimation of Neural Networks
Zi Wang, Bin Hu,..., Somesh Jha
5 2023-07-16 link Tangent Transformers for Composition, Privacy and Removal Tian Yu Liu, Aditya Golatkar, Stefano Soatto
5 2024-10-16 link Reclaiming the Source of Programmatic Policies: Programmatic versus Latent
Spaces
Tales Henrique Carvalho, Kenneth Tjhia, Levi Lelis
5 2024-04-20 link Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks Ben Eisner, Yi Yang,..., David Held
5 2023-11-13 link ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding
in Video-Language Models
Ilker Kesen, Andrea Pedrotti,..., Erkut Erdem
4 2023-10-04 link CoLiDE: Concomitant Linear DAG Estimation Seyed Saman Saboksayr, Gonzalo Mateos, Mariano Tepper
4 2023-05-30 link Diffusion Model for Dense Matching Jisu Nam, Gyuseong Lee,..., Seungryong Kim
4 2023-10-16 link Equivariant Matrix Function Neural Networks Ilyes Batatia, Lars Leon Schaaf,..., Felix Andreas Faber
4 None link Addressing Signal Delay in Deep Reinforcement Learning Wei Wang, Dongqi Han,..., Dongsheng Li
4 2022-07-20 link Illusory Attacks: Information-theoretic detectability matters in adversarial attacks Tim Franzmeyer, Stephen Marcus McAleer,..., Christian Schroeder de Witt
4 2023-06-07 link On the Joint Interaction of Models, Data, and Features Yiding Jiang, Christina Baek, J Zico Kolter
4 None link Whittle Index with Multiple Actions and State Constraint for
Inventory Management
Chuheng Zhang, Xiangsen Wang,..., Jiang Bian
4 2023-11-09 link Generating Pragmatic Examples to Train Neural Program Synthesizers Saujas Vaduguru, Daniel Fried, Yewen Pu
4 2023-10-13 link Goodhart's Law in Reinforcement Learning Jacek Karwowski, Oliver Hayman,..., Joar Max Viktor Skalse
4 2024-04-18 link ASID: Active Exploration for System Identification in Robotic Manipulation Marius Memmel, Andrew Wagenmaker,..., Abhishek Gupta
4 None link MCM: Masked Cell Modeling for Anomaly Detection in Tabular
Data
Jiaxin Yin, Yuanyuan Qiao,..., Jie Yang
4 2023-01-09 link MOTOR: A Time-to-Event Foundation Model For Structured Medical Records Ethan Steinberg, Jason Alan Fries,..., Nigam Shah
4 2024-07-12 link On the Role of Discrete Tokenization in Visual Representation
Learning
Tianqi Du, Yifei Wang, Yisen Wang
4 2023-05-29 link Provable Reward-Agnostic Preference-Based Reinforcement Learning Wenhao Zhan, Masatoshi Uehara,..., Jason D. Lee
4 2023-05-26 link Physics-Regulated Deep Reinforcement Learning: Invariant Embeddings Hongpeng Cao, Yanbing Mao,..., Marco Caccamo
4 2024-04-01 link Lipsum-FT: Robust Fine-Tuning of Zero-Shot Models Using Random Text
Guidance
Giung Nam, Byeongho Heo, Juho Lee
4 None link Deep Geodesic Canonical Correlation Analysis for Covariance-Based Neuroimaging Data Ce Ju, Reinmar J Kobler,..., Motoaki Kawanabe
4 2024-02-26 link REFACTOR: Learning to Extract Theorems from Proofs Jin Peng Zhou, Yuhuai Wu,..., Roger Baker Grosse
4 2023-10-08 link Improved Active Learning via Dependent Leverage Score Sampling Atsushi Shimizu, Xiaoou Cheng,..., Jonathan Weare
4 None link On Bias-Variance Alignment in Deep Models Lin Chen, Michal Lukasik,..., Sanjiv Kumar
4 2023-07-06 link Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation Yu Chen, Yihan Du,..., Longbo Huang
4 2023-12-07 link LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL
Architectures
Vimal Thilak, Chen Huang,..., Etai Littwin
4 2024-01-23 link DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit
Neural Representations
Dogyun Park, Sihyeon Kim,..., Hyunwoo J. Kim
4 2023-10-08 link Understanding the Robustness of Multi-modal Contrastive Learning to Distribution
Shift
Yihao Xue, Siddharth Joshi,..., Baharan Mirzasoleiman
4 2023-10-03 link Ensemble Distillation for Unsupervised Constituency Parsing Behzad Shayegh, Yanshuai Cao,..., Lili Mou
4 2023-10-31 link Offline RL with Observation Histories: Analyzing and Improving Sample
Complexity
Joey Hong, Anca Dragan, Sergey Levine