1300 |
2023-07-04 |
link |
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis |
Dustin Podell, Zion English,..., Robin Rombach |
485 |
2023-07-10 |
link |
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning |
Yuwei Guo, Ceyuan Yang,..., Bo Dai |
438 |
2023-07-31 |
link |
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs |
Yujia Qin, Shihao Liang,..., Maosong Sun |
426 |
2023-09-28 |
link |
DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation |
Jiaxiang Tang, Jiawei Ren,..., Gang Zeng |
377 |
2023-10-17 |
link |
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection |
Akari Asai, Zeqiu Wu,..., Hannaneh Hajishirzi |
360 |
2023-10-05 |
link |
Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! |
Xiangyu Qi, Yi Zeng,..., Peter Henderson |
295 |
2023-09-07 |
link |
SyncDreamer: Generating Multiview-consistent Images from a Single-view Image |
Yuan Liu, Cheng Lin,..., Wenping Wang |
286 |
2023-10-03 |
link |
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts |
Pan Lu, Hritik Bansal,..., Jianfeng Gao |
274 |
2023-09-11 |
link |
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning |
Xiang Yue, Xingwei Qu,..., Wenhu Chen |
230 |
2023-11-08 |
link |
LRM: Large Reconstruction Model for Single Image to 3D |
Yicong Hong, Kai Zhang,..., Hao Tan |
220 |
2023-10-10 |
link |
iTransformer: Inverted Transformers Are Effective for Time Series Forecasting |
Yong Liu, Tengge Hu,..., Mingsheng Long |
219 |
2023-09-21 |
link |
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models |
Longhui Yu, Weisen Jiang,..., Weiyang Liu |
215 |
2023-10-12 |
link |
Ferret: Refer and Ground Anything Anywhere at Any Granularity |
Haoxuan You, Haotian Zhang,..., Yinfei Yang |
214 |
2023-10-17 |
link |
Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting |
Melanie Sclar, Yejin Choi,..., Alane Suhr |
214 |
2023-10-10 |
link |
SWE-bench: Can Language Models Resolve Real-World GitHub Issues? |
Carlos E Jimenez, John Yang,..., Karthik R Narasimhan |
205 |
2023-10-11 |
link |
Catastrophic Jailbreak of Open-source LLMs via Exploiting Generation |
Yangsibo Huang, Samyak Gupta,..., Danqi Chen |
199 |
2023-05-22 |
link |
Training Diffusion Models with Reinforcement Learning |
Kevin Black, Michael Janner,..., Sergey Levine |
199 |
2023-10-16 |
link |
Llemma: An Open Language Model For Mathematics |
Zhangir Azerbayev, Hailey Schoelkopf,..., Sean Welleck |
194 |
2023-10-19 |
link |
Eureka: Human-Level Reward Design via Coding Large Language Models |
Yecheng Jason Ma, William Liang,..., Anima Anandkumar |
191 |
2023-10-19 |
link |
Safe RLHF: Safe Reinforcement Learning from Human Feedback |
Josef Dai, Xuehai Pan,..., Yaodong Yang |
184 |
2023-09-28 |
link |
Vision Transformers Need Registers |
Timothée Darcet, Maxime Oquab,..., Piotr Bojanowski |
183 |
2023-08-07 |
link |
AgentBench: Evaluating LLMs as Agents |
Xiao Liu, Hao Yu,..., Jie Tang |
182 |
2023-09-21 |
link |
The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A" |
Lukas Berglund, Meg Tong,..., Owain Evans |
172 |
2023-04-18 |
link |
NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers |
Kai Shen, Zeqian Ju,..., Jiang Bian |
172 |
2023-10-03 |
link |
AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models |
Xiaogeng Liu, Nan Xu,..., Chaowei Xiao |
171 |
2023-02-14 |
link |
Universal Guidance for Diffusion Models |
Arpit Bansal, Hong-Min Chu,..., Tom Goldstein |
169 |
2023-08-16 |
link |
Stochastic Controlled Averaging for Federated Learning with Communication Compression |
Xinmeng Huang, Ping Li, Xiaoyun Li |
168 |
2023-06-08 |
link |
PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization |
Yidong Wang, Zhuohao Yu,..., Yue Zhang |
167 |
2023-02-07 |
link |
Effective Data Augmentation With Diffusion Models |
Brandon Trabucco, Kyle Doherty,..., Ruslan Salakhutdinov |
164 |
2023-06-26 |
link |
Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning |
Fuxiao Liu, Kevin Lin,..., Lijuan Wang |
153 |
2023-07-13 |
link |
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation |
Yi Wang, Yinan He,..., Yu Qiao |
143 |
2023-12-25 |
link |
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning |
Wei Liu, Weihao Zeng,..., Junxian He |
141 |
2023-09-07 |
link |
Large Language Models Are Not Robust Multiple Choice Selectors |
Chujie Zheng, Hao Zhou,..., Minlie Huang |
138 |
2023-10-09 |
link |
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation |
Lijun Yu, Jose Lezama,..., Lu Jiang |
137 |
2023-10-12 |
link |
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models |
Seungone Kim, Jamin Shin,..., Minjoon Seo |
131 |
2023-07-24 |
link |
A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis |
Izzeddin Gur, Hiroki Furuta,..., Aleksandra Faust |
126 |
2023-11-14 |
link |
Fine-tuning Language Models for Factuality |
Katherine Tian, Eric Mitchell,..., Chelsea Finn |
124 |
2023-10-11 |
link |
Evaluating Large Language Models at Evaluating Instruction Following |
Zhiyuan Zeng, Jiatong Yu,..., Danqi Chen |
120 |
2023-10-02 |
link |
Making Retrieval-Augmented Language Models Robust to Irrelevant Context |
Ori Yoran, Tomer Wolfson,..., Jonathan Berant |
118 |
2023-10-03 |
link |
Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs |
Suyu Ge, Yunan Zhang,..., Jianfeng Gao |
116 |
2023-08-25 |
link |
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models |
Wenqi Shao, Mengzhao Chen,..., Ping Luo |
115 |
2023-09-21 |
link |
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models |
Yukang Chen, Shengju Qian,..., Jiaya Jia |
115 |
2023-09-25 |
link |
Can LLM-Generated Misinformation Be Detected? |
Canyu Chen, Kai Shu |
112 |
2023-09-20 |
link |
DreamLLM: Synergistic Multimodal Comprehension and Creation |
Runpei Dong, Chunrui Han,..., Li Yi |
112 |
2023-10-25 |
link |
Detecting Pretraining Data from Large Language Models |
Weijia Shi, Anirudh Ajith,..., Luke Zettlemoyer |
112 |
2023-07-05 |
link |
Building Cooperative Embodied Agents Modularly with Large Language Models |
Hongxin Zhang, Weihua Du,..., Chuang Gan |
107 |
2023-02-06 |
link |
Chain of Hindsight Aligns Language Models with Feedback |
Hao Liu, Carmelo Sferrazza, Pieter Abbeel |
106 |
2023-11-15 |
link |
DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model |
Yinghao Xu, Hao Tan,..., Kai Zhang |
104 |
None |
link |
WizardLM: Empowering Large Pre-Trained Language Models to Follow Complex Instructions |
Can Xu, Qingfeng Sun,..., Daxin Jiang |
101 |
2023-07-05 |
link |
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models |
Chong Mou, Xintao Wang,..., Jian Zhang |
101 |
2023-09-21 |
link |
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset |
Lianmin Zheng, Wei-Lin Chiang,..., Hao Zhang |
101 |
2023-05-23 |
link |
Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training |
Hong Liu, Zhiyuan Li,..., Tengyu Ma |
100 |
2023-10-22 |
link |
Improved Techniques for Training Consistency Models |
Yang Song, Prafulla Dhariwal |
99 |
2023-10-03 |
link |
Language Models Represent Space and Time |
Wes Gurnee, Max Tegmark |
98 |
2023-09-25 |
link |
Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision |
Haoning Wu, Zicheng Zhang,..., Weisi Lin |
96 |
2023-10-26 |
link |
Proving Test Set Contamination in Black Box Language Models |
Yonatan Oren, Nicole Meister,..., Tatsunori Hashimoto |
95 |
2023-08-14 |
link |
OctoPack: Instruction Tuning Code Large Language Models |
Niklas Muennighoff, Qian Liu,..., Shayne Longpre |
94 |
2023-06-05 |
link |
RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems |
Tianyang Liu, Canwen Xu, Julian McAuley |
90 |
2023-09-29 |
link |
Directly Fine-Tuning Diffusion Models on Differentiable Rewards |
Kevin Clark, Paul Vicol,..., David J. Fleet |
89 |
2024-05-02 |
link |
WildChat: 1M ChatGPT Interaction Logs in the Wild |
Wenting Zhao, Xiang Ren,..., Yuntian Deng |
88 |
2023-10-02 |
link |
RA-DIT: Retrieval-Augmented Dual Instruction Tuning |
Xi Victoria Lin, Xilun Chen,..., Wen-tau Yih |
88 |
2023-08-11 |
link |
Self-Alignment with Instruction Backtranslation |
Xian Li, Ping Yu,..., Mike Lewis |
87 |
2023-05-07 |
link |
A Variational Perspective on Solving Inverse Problems with Diffusion Models |
Morteza Mardani, Jiaming Song,..., Arash Vahdat |
86 |
2023-10-12 |
link |
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models |
Yixiao Li, Yifan Yu,..., Tuo Zhao |
86 |
2023-05-25 |
link |
Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation |
Niels Mündler, Jingxuan He,..., Martin Vechev |
84 |
2023-06-09 |
link |
Can Large Language Models Infer Causation from Correlation? |
Zhijing Jin, Jiarui Liu,..., Bernhard Schölkopf |
83 |
2023-10-10 |
link |
Multilingual Jailbreak Challenges in Large Language Models |
Yue Deng, Wenxuan Zhang,..., Lidong Bing |
80 |
2023-07-26 |
link |
Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models |
Erfan Shayegani, Yue Dong, Nael Abu-Ghazaleh |
79 |
2023-10-24 |
link |
What Algorithms can Transformers Learn? A Study in Length Generalization |
Hattie Zhou, Arwen Bradley,..., Preetum Nakkiran |
78 |
2023-10-04 |
link |
Reward Model Ensembles Help Mitigate Overoptimization |
Thomas Coste, Usman Anwar,..., David Krueger |
75 |
2023-09-28 |
link |
Demystifying CLIP Data |
Hu Xu, Saining Xie,..., Christoph Feichtenhofer |
75 |
2023-10-31 |
link |
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction |
Xinyuan Chen, Yaohui Wang,..., Ziwei Liu |
75 |
2023-08-02 |
link |
From Sparse to Soft Mixtures of Experts |
Joan Puigcerver, Carlos Riquelme Ruiz,..., Neil Houlsby |
74 |
2023-10-25 |
link |
PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization |
Xinyuan Wang, Chenxi Li,..., Zhiting Hu |
73 |
2023-08-16 |
link |
Time Travel in LLMs: Tracing Data Contamination in Large Language Models |
Shahriar Golchin, Mihai Surdeanu |
73 |
2023-05-31 |
link |
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training |
Yizhi Li, Ruibin Yuan,..., Jie Fu |
72 |
2023-07-20 |
link |
FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets |
Seonghyeon Ye, Doyoung Kim,..., Minjoon Seo |
72 |
2023-11-02 |
link |
Vision-Language Foundation Models as Effective Robot Imitators |
Xinghang Li, Minghuan Liu,..., Tao Kong |
71 |
2023-10-01 |
link |
BooookScore: A systematic exploration of book-length summarization in the era of LLMs |
Yapei Chang, Kyle Lo,..., Mohit Iyyer |
71 |
2024-02-27 |
link |
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method |
Biao Zhang, Zhongtao Liu,..., Orhan Firat |
70 |
2023-10-08 |
link |
TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting |
Defu Cao, Furong Jia,..., Yan Liu |
70 |
2023-10-18 |
link |
SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents |
Xuhui Zhou, Hao Zhu,..., Maarten Sap |
69 |
2023-11-20 |
link |
PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction |
Peng Wang, Hao Tan,..., Kai Zhang |
69 |
2023-07-07 |
link |
One Step of Gradient Descent is Provably the Optimal In-Context Learner with One Layer of Linear Self-Attention |
Arvind V. Mahankali, Tatsunori Hashimoto, Tengyu Ma |
69 |
2024-01-31 |
link |
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval |
Parth Sarthi, Salman Abdullah,..., Christopher D Manning |
68 |
2023-09-29 |
link |
Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks |
Vaidehi Patil, Peter Hase, Mohit Bansal |
68 |
2023-09-29 |
link |
One for All: Towards Training One Graph Model for All Classification Tasks |
Hao Liu, Jiarui Feng,..., Muhan Zhang |
68 |
2023-08-07 |
link |
Nearly d-Linear Convergence Bounds for Diffusion Models via Stochastic Localization |
Joe Benton, Valentin De Bortoli,..., George Deligiannidis |
68 |
2024-04-19 |
link |
SaProt: Protein Language Modeling with Structure-aware Vocabulary |
Jin Su, Chenchen Han,..., Fajie Yuan |
67 |
2023-08-17 |
link |
Linearity of Relation Decoding in Transformer Language Models |
Evan Hernandez, Arnab Sen Sharma,..., David Bau |
66 |
2023-10-25 |
link |
TD-MPC2: Scalable, Robust World Models for Continuous Control |
Nicklas Hansen, Hao Su, Xiaolong Wang |
65 |
2023-05-22 |
link |
Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources |
Xingxuan Li, Ruochen Zhao,..., Lidong Bing |
63 |
2023-10-19 |
link |
SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation |
Chongyu Fan, Jiancheng Liu,..., Sijia Liu |
63 |
2023-02-15 |
link |
Learning Performance-Improving Code Edits |
Alexander G Shypula, Aman Madaan,..., Amir Yazdanbakhsh |
63 |
2023-07-16 |
link |
Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency |
Bowen Song, Soo Min Kwon,..., Liyue Shen |
62 |
2023-09-25 |
link |
Identifying the Risks of LM Agents with an LM-Emulated Sandbox |
Yangjun Ruan, Honghua Dong,..., Tatsunori Hashimoto |
61 |
2023-10-31 |
link |
What's In My Big Data? |
Yanai Elazar, Akshita Bhagia,..., Jesse Dodge |
60 |
2023-11-08 |
link |
Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs |
Shashank Gupta, Vaishnavi Shrivastava,..., Tushar Khot |
59 |
2023-11-02 |
link |
Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game |
Sam Toyer, Olivia Watkins,..., Stuart Russell |
57 |
2023-09-28 |
link |
Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints |
Chaoqi Wang, Yibo Jiang,..., Yuxin Chen |
57 |
2023-10-09 |
link |
NEFTune: Noisy Embeddings Improve Instruction Finetuning |
Neel Jain, Ping-yeh Chiang,..., Tom Goldstein |
56 |
2023-10-04 |
link |
Retrieval meets Long Context Large Language Models |
Peng Xu, Wei Ping,..., Bryan Catanzaro |
56 |
2023-10-29 |
link |
AnomalyCLIP: Object-agnostic Prompt Learning for Zero-shot Anomaly Detection |
Qihang Zhou, Guansong Pang,..., Jiming Chen |
56 |
2023-10-11 |
link |
Beyond Memorization: Violating Privacy Via Inference with Large Language Models |
Robin Staab, Mark Vero,..., Martin Vechev |
54 |
2023-06-15 |
link |
KoLA: Carefully Benchmarking World Knowledge of Large Language Models |
Jifan Yu, Xiaozhi Wang,..., Juanzi Li |
54 |
2023-09-25 |
link |
Small-scale proxies for large-scale Transformer training instabilities |
Mitchell Wortsman, Peter J Liu,..., Simon Kornblith |
54 |
2024-02-20 |
link |
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems |
Zhiyuan Li, Hong Liu,..., Tengyu Ma |
54 |
2023-10-10 |
link |
Uni3D: Exploring Unified 3D Representation at Scale |
Junsheng Zhou, Jinsheng Wang,..., Xinlong Wang |
52 |
2023-08-04 |
link |
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization |
Weiran Yao, Shelby Heinecke,..., Silvio Savarese |
52 |
2023-09-29 |
link |
Guiding Instruction-based Image Editing via Multimodal Large Language Models |
Tsu-Jui Fu, Wenze Hu,..., Zhe Gan |
51 |
2023-10-12 |
link |
Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement |
Linlu Qiu, Liwei Jiang,..., Xiang Ren |
51 |
2023-07-13 |
link |
In-context Autoencoder for Context Compression in a Large Language Model |
Tao Ge, Hu Jing,..., Furu Wei |
50 |
2023-10-27 |
link |
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation |
Jaemin Cho, Yushi Hu,..., Su Wang |
50 |
2023-10-10 |
link |
Lemur: Harmonizing Natural Language and Code for Language Agents |
Yiheng Xu, Hongjin Su,..., Tao Yu |
50 |
2023-10-05 |
link |
GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction |
Oscar Sainz, Iker García-Ferrero,..., Eneko Agirre |
49 |
2023-10-17 |
link |
Zipformer: A faster and better encoder for automatic speech recognition |
Zengwei Yao, Liyong Guo,..., Daniel Povey |
48 |
2023-07-06 |
link |
FITS: Modeling Time Series with 10k Parameters |
Zhijian Xu, Ailing Zeng, Qiang Xu |
48 |
2023-08-08 |
link |
SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore |
Sewon Min, Suchin Gururangan,..., Luke Zettlemoyer |
48 |
2023-09-13 |
link |
Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs |
Angelica Chen, Ravid Shwartz-Ziv,..., Naomi Saphra |
48 |
2023-06-01 |
link |
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis |
Hubert Siuzdak |
47 |
2023-05-31 |
link |
Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning |
Xiaoxin He, Xavier Bresson,..., Bryan Hooi |
47 |
2023-10-09 |
link |
Generative Judge for Evaluating Alignment |
Junlong Li, Shichao Sun,..., Pengfei Liu |
45 |
2023-11-21 |
link |
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks |
Samyak Jain, Robert Kirk,..., David Krueger |
45 |
2023-10-12 |
link |
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models |
Yingqing He, Shaoshu Yang,..., Ying Shan |
45 |
2023-10-03 |
link |
SE(3)-Stochastic Flow Matching for Protein Backbone Generation |
Joey Bose, Tara Akhound-Sadegh,..., Alexander Tong |
45 |
2023-08-08 |
link |
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions |
Juncheng Li, Kaihang Pan,..., Yueting Zhuang |
45 |
2023-10-09 |
link |
Interpreting CLIP's Image Representation via Text-Based Decomposition |
Yossi Gandelsman, Alexei A Efros, Jacob Steinhardt |
44 |
2023-10-02 |
link |
GenSim: Generating Robotic Simulation Tasks via Large Language Models |
Lirui Wang, Yiyang Ling,..., Xiaolong Wang |
43 |
2023-10-12 |
link |
Circuit Component Reuse Across Tasks in Transformer Language Models |
Jack Merullo, Carsten Eickhoff, Ellie Pavlick |
43 |
2024-04-22 |
link |
Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing |
Dujian Ding, Ankur Mallick,..., Ahmed Hassan Awadallah |
43 |
2023-12-08 |
link |
Zoology: Measuring and Improving Recall in Efficient Language Models |
Simran Arora, Sabri Eyuboglu,..., Christopher Re |
42 |
2023-11-06 |
link |
AnyText: Multilingual Visual Text Generation And Editing |
Yuxiang Tuo, Wangmeng Xiang,..., Xuansong Xie |
41 |
None |
link |
RECOMP: Improving Retrieval-Augmented LMs with Context Compression and Selective Augmentation |
Fangyuan Xu, Weijia Shi, Eunsol Choi |
41 |
2023-10-04 |
link |
Generalization in diffusion models arises from geometry-adaptive harmonic representation |
Zahra Kadkhodaie, Florentin Guth,..., Stéphane Mallat |
41 |
2023-10-02 |
link |
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction |
Size Wu, Wenwei Zhang,..., Chen Change Loy |
40 |
2023-10-16 |
link |
In-Context Pretraining: Language Modeling Beyond Document Boundaries |
Weijia Shi, Sewon Min,..., Mike Lewis |
40 |
2023-02-12 |
link |
Single Motion Diffusion |
Sigal Raab, Inbal Leibovitch,..., Daniel Cohen-Or |
40 |
2023-09-28 |
link |
At Which Training Stage Does Code Data Help LLMs Reasoning? |
Yingwei Ma, Yue Liu,..., Shanshan Li |
40 |
2023-10-19 |
link |
An Emulator for Fine-Tuning Large Language Models using Small Language Models |
Eric Mitchell, Rafael Rafailov,..., Christopher D Manning |
40 |
2023-10-14 |
link |
Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space |
Hengrui Zhang, Jiani Zhang,..., George Karypis |
39 |
2023-07-18 |
link |
Overthinking the Truth: Understanding how Language Models Process False Demonstrations |
Danny Halawi, Jean-Stanislas Denain, Jacob Steinhardt |
39 |
2023-02-07 |
link |
Flow Matching on General Geometries |
Ricky T. Q. Chen, Yaron Lipman |
39 |
2023-10-02 |
link |
DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models |
Yongchan Kwon, Eric Wu,..., James Zou |
39 |
None |
link |
ModernTCN: A Modern Pure Convolution Structure for General Time Series Analysis |
Luo Donghao, Wang Xue |
38 |
2023-12-03 |
link |
The mechanistic basis of data dependence and abrupt learning in an in-context classification task |
Gautam Reddy |
38 |
2023-11-03 |
link |
RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches |
Jiayuan Gu, Sean Kirmani,..., Ted Xiao |
38 |
2023-10-12 |
link |
How Many Pretraining Tasks Are Needed for In-Context Learning of Linear Regression? |
Jingfeng Wu, Difan Zou,..., Peter Bartlett |
38 |
2023-09-25 |
link |
LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models |
Ahmad Faiz, Sotaro Kaneda,..., Lei Jiang |
38 |
2023-10-06 |
link |
ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models |
Seyed Iman Mirzadeh, Keivan Alizadeh-Vahid,..., Mehrdad Farajtabar |
37 |
2023-03-08 |
link |
InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning |
Ziheng Qin, Kai Wang,..., Yang You |
37 |
2023-08-23 |
link |
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages |
Jinyi Hu, Yuan Yao,..., Maosong Sun |
37 |
2023-09-28 |
link |
A Benchmark for Learning to Translate a New Language from One Grammar Book |
Garrett Tanzer, Mirac Suzgun,..., Luke Melas-Kyriazi |
36 |
2023-06-20 |
link |
Evaluating the Zero-shot Robustness of Instruction-tuned Language Models |
Jiuding Sun, Chantal Shaib, Byron C Wallace |
36 |
2023-11-20 |
link |
LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning |
Han Guo, Philip Greengard,..., Yoon Kim |
36 |
2023-09-18 |
link |
Understanding Catastrophic Forgetting in Language Models via Implicit Inference |
Suhas Kotha, Jacob Mitchell Springer, Aditi Raghunathan |
36 |
2023-10-20 |
link |
ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search |
Yuchen Zhuang, Xiang Chen,..., Chao Zhang |
35 |
2023-10-06 |
link |
Amortizing intractable inference in large language models |
Edward J Hu, Moksh Jain,..., Nikolay Malkin |
35 |
2023-10-06 |
link |
Confronting Reward Model Overoptimization with Constrained RLHF |
Ted Moskovitz, Aaditya K Singh,..., Stephen Marcus McAleer |
34 |
2023-10-04 |
link |
Kosmos-G: Generating Images in Context with Multimodal Large Language Models |
Xichen Pan, Li Dong,..., Furu Wei |
33 |
2023-05-29 |
link |
Multiscale Positive-Unlabeled Detection of AI-Generated Texts |
Yuchuan Tian, Hanting Chen,..., Yunhe Wang |
32 |
2023-05-18 |
link |
Deep Temporal Graph Clustering |
Meng Liu, Yue Liu,..., Xinwang Liu |
32 |
2023-06-30 |
link |
Learning Delays in Spiking Neural Networks using Dilated Convolutions with Learnable Spacings |
Ilyass Hammouamri, Ismail Khalfaoui-Hassani, Timothée Masquelier |
32 |
2023-10-01 |
link |
LEGO-Prover: Neural Theorem Proving with Growing Libraries |
Haiming Wang, Huajian Xin,..., Xiaodan Liang |
31 |
2023-11-11 |
link |
Finetuning Text-to-Image Diffusion Models for Fairness |
Xudong Shen, Chao Du,..., Mohan Kankanhalli |
30 |
2024-07-31 |
link |
Detecting, Explaining, and Mitigating Memorization in Diffusion Models |
Yuxin Wen, Yuchen Liu,..., Lingjuan Lyu |
30 |
2023-10-04 |
link |
Understanding In-Context Learning in Transformers and LLMs by Learning to Learn Discrete Functions |
Satwik Bhattamishra, Arkil Patel,..., Varun Kanade |
30 |
2023-09-22 |
link |
Unbiased Watermark for Large Language Models |
Zhengmian Hu, Lichang Chen,..., Heng Huang |
30 |
2023-12-13 |
link |
Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF |
Anand Siththaranjan, Cassidy Laidlaw, Dylan Hadfield-Menell |
29 |
None |
link |
DSPy: Compiling Declarative Language Model Calls into State-of-the-Art Pipelines |
Omar Khattab, Arnav Singhvi,..., Christopher Potts |
29 |
2023-10-10 |
link |
Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models |
Fei Shen, Hu Ye,..., Yang Wei |
29 |
2023-06-01 |
link |
TorchRL: A data-driven decision-making library for PyTorch |
Albert Bou, Matteo Bettini,..., Vincent Moens |
29 |
2023-09-30 |
link |
On the Stability of Iterative Retraining of Generative Models on their own Data |
Quentin Bertrand, Joey Bose,..., Gauthier Gidel |
28 |
2024-02-06 |
link |
The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry |
Michael Zhang, Kush Bhatia,..., Christopher Re |
27 |
2023-02-04 |
link |
Multi-Source Diffusion Models for Simultaneous Music Generation and Separation |
Giorgio Mariani, Irene Tallini,..., Emanuele Rodolà |
27 |
2023-06-08 |
link |
In-Context Learning through the Bayesian Prism |
Madhur Panwar, Kabir Ahuja, Navin Goyal |
26 |
2024-05-29 |
link |
Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI |
Weibang Jiang, Liming Zhao, Bao-liang Lu |
26 |
2023-10-26 |
link |
How do Language Models Bind Entities in Context? |
Jiahai Feng, Jacob Steinhardt |
25 |
2022-11-17 |
link |
How to Fine-Tune Vision Models with SGD |
Ananya Kumar, Ruoqi Shen,..., Suriya Gunasekar |
25 |
2024-02-06 |
link |
Large Language Models to Enhance Bayesian Optimization |
Tennison Liu, Nicolás Astorga,..., Mihaela van der Schaar |
25 |
2023-10-04 |
link |
Local Search GFlowNets |
Minsu Kim, Taeyoung Yun,..., Jinkyoo Park |
25 |
2024-02-22 |
link |
Cameras as Rays: Pose Estimation via Ray Diffusion |
Jason Y. Zhang, Amy Lin,..., Shubham Tulsiani |
25 |
2023-10-26 |
link |
CADS: Unleashing the Diversity of Diffusion Models through Condition-Annealed Sampling |
Seyedmorteza Sadat, Jakob Buhmann,..., Romann M. Weber |
24 |
2024-02-14 |
link |
MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data |
Yinya Huang, Xiaohan Lin,..., Xiaodan Liang |
24 |
2023-10-26 |
link |
Skill-Mix: a Flexible and Expandable Family of Evaluations for AI models |
Dingli Yu, Simran Kaur,..., Sanjeev Arora |
24 |
2019-02-14 |
link |
CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity |
Aditya Bhatt, Daniel Palenicek,..., Jan Peters |
24 |
2024-03-04 |
link |
Making Pre-trained Language Models Great on Tabular Prediction |
Jiahuan Yan, Bo Zheng,..., Jintai Chen |
24 |
2023-02-06 |
link |
One-shot Empirical Privacy Estimation for Federated Learning |
Galen Andrew, Peter Kairouz,..., Vinith Menon Suriyakumar |
24 |
2023-05-19 |
link |
LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation |
Suhyeon Lee, Won Jun Kim,..., Jong Chul Ye |
24 |
2024-05-03 |
link |
What does the Knowledge Neuron Thesis Have to do with Knowledge? |
Jingcheng Niu, Andrew Liu,..., Gerald Penn |
23 |
2023-12-07 |
link |
On the Learnability of Watermarks for Language Models |
Chenchen Gu, Xiang Lisa Li,..., Tatsunori Hashimoto |
23 |
2024-01-13 |
link |
BrainLM: A foundation model for brain activity recordings |
Josue Ortega Caro, Antonio Henrique de Oliveira Fonseca,..., David van Dijk |
23 |
2023-05-24 |
link |
Alleviating Exposure Bias in Diffusion Models through Sampling with Shifted Time Steps |
Mingxiao Li, Tingyu Qu,..., Marie-Francine Moens |
23 |
2022-08-10 |
link |
A Sublinear Adversarial Training Algorithm |
Yeqi Gao, Lianke Qin,..., Yitan Wang |
23 |
2023-11-30 |
link |
Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking |
Kaifeng Lyu, Jikai Jin,..., Wei Hu |
23 |
2023-09-04 |
link |
Relay Diffusion: Unifying diffusion process across resolutions for image synthesis |
Jiayan Teng, Wendi Zheng,..., Jie Tang |
23 |
2024-01-20 |
link |
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models |
Zhen Xiang, Fengqing Jiang,..., Bo Li |
23 |
2024-04-03 |
link |
CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech |
Jaehyeon Kim, Keon Lee,..., Jaewoong Cho |
23 |
2023-09-28 |
link |
Intriguing properties of generative classifiers |
Priyank Jaini, Kevin Clark, Robert Geirhos |
23 |
2023-11-24 |
link |
Controlled Text Generation via Language Model Arithmetic |
Jasper Dekoninck, Marc Fischer,..., Martin Vechev |
23 |
2023-11-10 |
link |
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores |
Daniel Y Fu, Hermann Kumbong,..., Christopher Re |
22 |
2023-07-17 |
link |
COLLIE: Systematic Construction of Constrained Text Generation Tasks |
Shunyu Yao, Howard Chen,..., Karthik R Narasimhan |
22 |
2023-11-08 |
link |
Massive Editing for Large Language Models via Meta Learning |
Chenmien Tan, Ge Zhang, Jie Fu |
22 |
2023-05-17 |
link |
Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models |
Shangbin Feng, Weijia Shi,..., Yulia Tsvetkov |
22 |
2023-10-05 |
link |
EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models |
Yefei He, Jing Liu,..., Bohan Zhuang |
22 |
2024-02-16 |
link |
Robust agents learn causal world models |
Jonathan Richens, Tom Everitt |
22 |
2023-10-13 |
link |
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction |
Seohong Park, Oleh Rybkin, Sergey Levine |
21 |
2022-01-07 |
link |
Fair and efficient contribution valuation for vertical federated learning |
Zhenan Fan, Huang Fang,..., Yong Zhang |
21 |
2023-10-12 |
link |
GROOT: Learning to Follow Instructions by Watching Gameplay Videos |
Shaofei Cai, Bowei Zhang,..., Yitao Liang |
21 |
2023-12-12 |
link |
Remote Sensing Vision-Language Foundation Models without Annotations via Ground Remote Alignment |
Utkarsh Mall, Cheng Perng Phoo,..., Kavita Bala |
21 |
None |
link |
On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs |
Jen-tse Huang, Wenxuan Wang,..., Michael Lyu |
21 |
2024-04-17 |
link |
SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs |
Jaehyung Kim, Jaehyun Nam,..., Jinwoo Shin |
21 |
2023-12-05 |
link |
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following |
Renze Lou, Kai Zhang,..., Wenpeng Yin |
21 |
2024-01-16 |
link |
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis |
Zhenhui Ye, Tianyun Zhong,..., Zhou Zhao |
20 |
2023-05-24 |
link |
Differentially Private Synthetic Data via Foundation Model APIs 1: Images |
Zinan Lin, Sivakanth Gopi,..., Sergey Yekhanin |
20 |
None |
link |
Würstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models |
Pablo Pernias, Dominic Rampas,..., Marc Aubreville |
20 |
2023-10-30 |
link |
DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization |
Guowei Xu, Ruijie Zheng,..., Huazhe Xu |
20 |
2023-11-27 |
link |
DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer |
Junyuan Hong, Jiachen T. Wang,..., Zhangyang Wang |
20 |
2023-10-06 |
link |
How to Capture Higher-order Correlations? Generalizing Matrix Softmax Attention to Kronecker Computation |
Josh Alman, Zhao Song |
19 |
2024-03-18 |
link |
Graph Neural Networks for Learning Equivariant Representations of Neural Networks |
Miltiadis Kofinas, Boris Knyazev,..., David W. Zhang |
19 |
2023-10-24 |
link |
MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning |
Zayne Rea Sprague, Xi Ye,..., Greg Durrett |
19 |
2023-09-20 |
link |
Text2Reward: Reward Shaping with Language Models for Reinforcement Learning |
Tianbao Xie, Siheng Zhao,..., Tao Yu |
19 |
2023-09-04 |
link |
On Penalty Methods for Nonconvex Bilevel Optimization and First-Order Stochastic Approximation |
Jeongyeol Kwon, Dohyun Kwon,..., Robert D Nowak |
19 |
2022-11-01 |
link |
Two-stage LLM Fine-tuning with Less Specialization and More Generalization |
Yihan Wang, Si Si,..., Sanjiv Kumar |
19 |
2023-11-07 |
link |
Beyond Imitation: Leveraging Fine-grained Quality Signals for Alignment |
Geyang Guo, Ranchi Zhao,..., Ji-Rong Wen |
19 |
2023-12-07 |
link |
Graph Metanetworks for Processing Diverse Neural Architectures |
Derek Lim, Haggai Maron,..., James Lucas |
19 |
2023-12-18 |
link |
Tuning LayerNorm in Attention: Towards Efficient Multi-Modal LLM Finetuning |
Bingchen Zhao, Haoqin Tu,..., Cihang Xie |
19 |
2023-09-15 |
link |
Scaling Laws for Sparsely-Connected Foundation Models |
Elias Frantar, Carlos Riquelme Ruiz,..., Utku Evci |
19 |
2023-06-08 |
link |
Protein Discovery with Discrete Walk-Jump Sampling |
Nathan C. Frey, Dan Berenberg,..., Saeed Saremi |
19 |
2023-10-04 |
link |
Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised Learning with Masked Unit Prediction |
Jiatong Shi, Hirofumi Inaguma,..., Anna Sun |
19 |
2023-08-03 |
link |
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback |
Souradip Chakraborty, Amrit Bedi,..., Furong Huang |
18 |
2023-10-10 |
link |
Geographic Location Encoding with Spherical Harmonics and Sinusoidal Representation Networks |
Marc Rußwurm, Konstantin Klemmer,..., Devis Tuia |
18 |
2023-10-03 |
link |
Tensor Programs VI: Feature Learning in Infinite-Depth Neural Networks |
Greg Yang, Dingli Yu,..., Soufiane Hayou |
18 |
2023-10-06 |
link |
Universal Humanoid Motion Representations for Physics-Based Control |
Zhengyi Luo, Jinkun Cao,..., Weipeng Xu |
18 |
2023-10-24 |
link |
On the Foundations of Shortcut Learning |
Katherine Hermann, Hossein Mobahi,..., Michael Curtis Mozer |
17 |
2023-10-19 |
link |
Frozen Transformers in Language Models Are Effective Visual Encoder Layers |
Ziqi Pang, Ziyang Xie,..., Yu-Xiong Wang |
17 |
2023-11-07 |
link |
Multi-View Causal Representation Learning with Partial Observability |
Dingling Yao, Danru Xu,..., Francesco Locatello |
17 |
2023-07-05 |
link |
Reverse Diffusion Monte Carlo |
Xunpeng Huang, Hanze Dong,..., Tong Zhang |
17 |
2023-06-05 |
link |
PolyVoice: Language Models for Speech to Speech Translation |
Qianqian Dong, Zhiying Huang,..., Yuxuan Wang |
17 |
2023-10-02 |
link |
Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy |
Pingzhi Li, Zhenyu Zhang,..., Tianlong Chen |
17 |
2023-01-22 |
link |
Learning to Reject with a Fixed Predictor: Application to Decontextualization |
Christopher Mohri, Daniel Andor,..., Yutao Zhong |
17 |
2024-03-29 |
link |
Negative Label Guided OOD Detection with Pretrained Vision-Language Models |
Xue Jiang, Feng Liu,..., Bo Han |
17 |
None |
link |
An Image Is Worth 1000 Lies: Transferability of Adversarial Images across Prompts on Vision-Language Models |
Haochen Luo, Jindong Gu,..., Philip Torr |
17 |
2023-10-04 |
link |
Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors |
Ido Amos, Jonathan Berant, Ankit Gupta |
16 |
2023-09-30 |
link |
InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists |
Yulu Gan, Sungwoo Park,..., Ahmed Alaa |
16 |
2024-04-15 |
link |
ClimODE: Climate and Weather Forecasting with Physics-informed Neural ODEs |
Yogesh Verma, Markus Heinonen, Vikas Garg |
16 |
2024-03-07 |
link |
Mastering Memory Tasks with World Models |
Mohammad Reza Samsami, Artem Zholus,..., Sarath Chandar |
16 |
2024-01-16 |
link |
Beyond Weisfeiler-Lehman: A Quantitative Framework for GNN Expressiveness |
Bohang Zhang, Jingchu Gai,..., Liwei Wang |
16 |
2023-10-17 |
link |
Group Preference Optimization: Few-Shot Alignment of Large Language Models |
Siyan Zhao, John Dang, Aditya Grover |
16 |
2023-05-25 |
link |
Implicit bias of SGD in L2-regularized linear DNNs: One-way jumps from high to low rank |
Zihan Wang, Arthur Jacot |
16 |
2023-09-30 |
link |
AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ |
Jonas Belouadi, Anne Lauscher, Steffen Eger |
16 |
2024-03-12 |
link |
Entropy is not Enough for Test-Time Adaptation: From the Perspective of Disentangled Factors |
Jonghyun Lee, Dahuin Jung,..., Sungroh Yoon |
16 |
2023-09-29 |
link |
DyVal: Dynamic Evaluation of Large Language Models for Reasoning Tasks |
Kaijie Zhu, Jiaao Chen,..., Xing Xie |
16 |
2023-11-01 |
link |
Plug-and-Play Policy Planner for Large Language Model Powered Dialogue Agents |
Yang Deng, Wenxuan Zhang,..., Tat-Seng Chua |
16 |
2023-10-20 |
link |
Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds |
Sipeng Zheng, Jiazheng Liu,..., Zongqing Lu |
16 |
None |
link |
DREAM: Dual Structured Exploration with Mixup for Open-set Graph Domain Adaption |
Nan Yin, Mengzhu Wang,..., Xiao Luo |
15 |
2024-01-18 |
link |
Enabling Efficient Equivariant Operations in the Fourier Basis via Gaunt Tensor Products |
Shengjie Luo, Tianlang Chen, Aditi S. Krishnapriyan |
15 |
2024-03-26 |
link |
Don't Trust: Verify - Grounding LLM Quantitative Reasoning with Autoformalization |
Jin Peng Zhou, Charles E Staats,..., Yuhuai Wu |
15 |
2023-11-21 |
link |
Looped Transformers are Better at Learning Learning Algorithms |
Liu Yang, Kangwook Lee,..., Dimitris Papailiopoulos |
15 |
2023-10-14 |
link |
Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity? |
Prasanna Mayilvahanan, Thaddäus Wiedemer,..., Wieland Brendel |
15 |
2023-12-17 |
link |
Learning to Act without Actions |
Dominik Schmidt, Minqi Jiang |
15 |
2023-12-09 |
link |
Batched Low-Rank Adaptation of Foundation Models |
Yeming Wen, Swarat Chaudhuri |
15 |
2023-10-09 |
link |
Sentence-level Prompts Benefit Composed Image Retrieval |
Yang Bai, Xinxing Xu,..., Chun-Mei Feng |
15 |
2023-09-29 |
link |
Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning |
Zihan Ding, Chi Jin |
14 |
2023-06-05 |
link |
Neuron Activation Coverage: Rethinking Out-of-distribution Detection and Generalization |
Yibing Liu, Chris Xing Tian,..., Shiqi Wang |
14 |
2023-10-23 |
link |
Making RL with Preference-based Feedback Efficient via Randomization |
Runzhe Wu, Wen Sun |
14 |
2023-10-02 |
link |
Controlling Vision-Language Models for Multi-Task Image Restoration |
Ziwei Luo, Fredrik K. Gustafsson,..., Thomas B. Schön |
14 |
2023-02-10 |
link |
Forward Learning with Top-Down Feedback: Empirical and Analytical Characterization |
Ravi Francesco Srinivasan, Francesca Mignacco,..., Giorgia Dellaferrera |
14 |
2024-02-07 |
link |
InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior |
Chenguo Lin, Yadong Mu |
14 |
None |
link |
Towards Reliable and Efficient Backdoor Trigger Inversion via Decoupling Benign Features |
Xiong Xu, Kunzhe Huang,..., Kui Ren |
14 |
2024-01-03 |
link |
Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction |
Yilan Zhang, Yingxue Xu,..., Hao Chen |
14 |
2024-02-22 |
link |
MAPE-PPI: Towards Effective and Efficient Protein-Protein Interaction Prediction via Microenvironment-Aware Protein Embedding |
Lirong Wu, Yijun Tian,..., Stan Z. Li |
14 |
2023-10-29 |
link |
Bespoke Solvers for Generative Flow Models |
Neta Shaul, Juan Perez,..., Yaron Lipman |
14 |
2024-01-19 |
link |
Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model |
Yinan Zheng, Jianxiong Li,..., Jingjing Liu |
14 |
2024-03-19 |
link |
Do Generated Data Always Help Contrastive Learning? |
Yifei Wang, Jizhe Zhang, Yisen Wang |
14 |
2023-11-28 |
link |
Is This the Subspace You Are Looking for? An Interpretability Illusion for Subspace Activation Patching |
Aleksandar Makelov, Georg Lange,..., Neel Nanda |
14 |
2023-10-12 |
link |
CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models |
Sreyan Ghosh, Ashish Seth,..., Dinesh Manocha |
14 |
2023-06-03 |
link |
Memorization Capacity of Multi-Head Attention in Transformers |
Sadegh Mahdavi, Renjie Liao, Christos Thrampoulidis |
14 |
2022-09-19 |
link |
Topological data analysis on noisy quantum computers |
Ismail Yunus Akhalwaya, Shashanka Ubaru,..., Lior Horesh |
14 |
2023-05-29 |
link |
On Diffusion Modeling for Anomaly Detection |
Victor Livernoche, Vineet Jain,..., Siamak Ravanbakhsh |
14 |
2023-05-24 |
link |
Provable Offline Preference-Based Reinforcement Learning |
Wenhao Zhan, Masatoshi Uehara,..., Wen Sun |
14 |
2023-02-16 |
link |
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning |
Harshit Sikchi, Qinqing Zheng,..., Scott Niekum |
14 |
2024-01-27 |
link |
Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement Learning |
Chenyu Zhang, Han Wang,..., James Anderson |
14 |
2023-10-04 |
link |
Scaling Laws for Associative Memories |
Vivien Cabannes, Elvis Dohmatob, Alberto Bietti |
14 |
2023-09-29 |
link |
Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks |
Hao Chen, Jindong Wang,..., Bhiksha Raj |
14 |
2023-10-13 |
link |
The Consensus Game: Language Model Generation via Equilibrium Search |
Athul Paul Jacob, Yikang Shen,..., Jacob Andreas |
13 |
2023-09-03 |
link |
Implicit regularization of deep residual networks towards neural ODEs |
Pierre Marion, Yu-Han Wu,..., Gérard Biau |
13 |
2023-09-25 |
link |
Statistical Perspective of Top-K Sparse Softmax Gating Mixture of Experts |
Huy Nguyen, Pedram Akbarian,..., Nhat Ho |
13 |
2023-03-02 |
link |
On the Provable Advantage of Unsupervised Pretraining |
Jiawei Ge, Shange Tang,..., Chi Jin |
13 |
2024-01-09 |
link |
Evaluating Language Model Agency through Negotiations |
Tim Ruben Davidson, Veniamin Veselovsky,..., Robert West |
13 |
2023-07-14 |
link |
SafeDreamer: Safe Reinforcement Learning with World Models |
Weidong Huang, Jiaming Ji,..., Yaodong Yang |
13 |
2023-09-06 |
link |
ResFields: Residual Neural Fields for Spatiotemporal Signals |
Marko Mihajlovic, Sergey Prokudin,..., Siyu Tang |
13 |
2023-04-14 |
link |
3D Feature Prediction for Masked-AutoEncoder-Based Point Cloud Pretraining |
Siming Yan, Yuqi Yang,..., Qixing Huang |
13 |
2024-01-16 |
link |
ValUES: A Framework for Systematic Validation of Uncertainty Estimation in Semantic Segmentation |
Kim-Celine Kahl, Carsten T. Lüth,..., Paul F Jaeger |
13 |
2024-03-22 |
link |
DreamFlow: High-Quality Text-to-3D Generation by Approximating Probability Flow |
Kyungmin Lee, Kihyuk Sohn, Jinwoo Shin |
13 |
None |
link |
Monte Carlo guided Denoising Diffusion models for Bayesian linear inverse problems |
Gabriel Cardoso, Yazid Janati el idrissi,..., Eric Moulines |
13 |
2023-10-07 |
link |
Lemur: Integrating Large Language Models in Automated Program Verification |
Haoze Wu, Clark Barrett, Nina Narodytska |
13 |
2024-01-19 |
link |
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition |
Yuchen Hu, Chen Chen,..., EngSiong Chng |
13 |
2023-05-22 |
link |
GraphCare: Enhancing Healthcare Predictions with Personalized Knowledge Graphs |
Pengcheng Jiang, Cao Xiao,..., Jimeng Sun |
13 |
2023-10-03 |
link |
SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified Pre-training |
Kazem Meidani, Parshin Shojaee,..., Amir Barati Farimani |
13 |
2024-02-28 |
link |
Deep Confident Steps to New Pockets: Strategies for Docking Generalization |
Gabriele Corso, Arthur Deng,..., Tommi S. Jaakkola |
12 |
2023-10-11 |
link |
Score Regularized Policy Optimization through Diffusion Behavior |
Huayu Chen, Cheng Lu,..., Jun Zhu |
12 |
2024-03-31 |
link |
Addressing Loss of Plasticity and Catastrophic Forgetting in Continual Learning |
Mohamed Elsayed, A. Rupam Mahmood |
12 |
2023-10-09 |
link |
Provable Compositional Generalization for Object-Centric Learning |
Thaddäus Wiedemer, Jack Brady,..., Wieland Brendel |
12 |
2023-10-04 |
link |
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making |
Jeonghye Kim, Suyoung Lee,..., Youngchul Sung |
12 |
2021-02-18 |
link |
Adaptive Rational Activations to Boost Deep Reinforcement Learning |
Quentin Delfosse, Patrick Schramowski,..., Kristian Kersting |
12 |
2024-02-16 |
link |
GIM: Learning Generalizable Image Matcher From Internet Videos |
Xuelun Shen, Zhipeng Cai,..., Cheng Wang |
12 |
2024-02-05 |
link |
Curriculum reinforcement learning for quantum architecture search under hardware errors |
Yash J. Patel, Akash Kundu,..., Onur Danaci |
12 |
2023-10-19 |
link |
Understanding Addition in Transformers |
Philip Quirke, Fazl Barez |
12 |
2024-01-12 |
link |
Few-Shot Detection of Machine-Generated Text using Style Representations |
Rafael Alberto Rivera Soto, Kailin Koch,..., Nicholas Andrews |
12 |
2023-10-06 |
link |
Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets |
Dominique Beaini, Shenyang Huang,..., Dominic Masters |
12 |
2024-04-11 |
link |
PINNACLE: PINN Adaptive ColLocation and Experimental points selection |
Gregory Kang Ruey Lau, Apivich Hemachandra,..., Bryan Kian Hsiang Low |
12 |
2023-06-08 |
link |
Yet Another ICU Benchmark: A Flexible Multi-Center Framework for Clinical ML |
Robin van de Water, Hendrik Nils Aurel Schmidt,..., Patrick Rockenschaub |
12 |
2023-10-09 |
link |
ODEFormer: Symbolic Regression of Dynamical Systems with Transformers |
Stéphane d'Ascoli, Sören Becker,..., Niki Kilbertus |
12 |
2023-10-10 |
link |
Correlated Noise Provably Beats Independent Noise for Differentially Private Learning |
Christopher A. Choquette-Choo, Krishnamurthy Dj Dvijotham,..., Abhradeep Guha Thakurta |
11 |
2023-10-02 |
link |
Closing the Curious Case of Neural Text Degeneration |
Matthew Finlayson, John Hewitt,..., Ashish Sabharwal |
11 |
2023-02-01 |
link |
Analyzing Feed-Forward Blocks in Transformers through the Lens of Attention Maps |
Goro Kobayashi, Tatsuki Kuribayashi,..., Kentaro Inui |
11 |
2024-07-07 |
link |
PTaRL: Prototype-based Tabular Representation Learning via Space Calibration |
Hangting Ye, Wei Fan,..., Yi Chang |
11 |
2023-10-12 |
link |
Is attention required for ICL? Exploring the Relationship Between Model Architecture and In-Context Learning Ability |
Ivan Lee, Nan Jiang, Taylor Berg-Kirkpatrick |
11 |
2023-12-26 |
link |
Supervised Knowledge Makes Large Language Models Better In-context Learners |
Linyi Yang, Shuibai Zhang,..., Yue Zhang |
11 |
2023-10-04 |
link |
High-dimensional SGD aligns with emerging outlier eigenspaces |
Gerard Ben Arous, Reza Gheissari,..., Aukosh Jagannath |
11 |
None |
link |
Contrastive Preference Learning: Learning from Human Feedback without Reinforcement Learning |
Joey Hejna, Rafael Rafailov,..., Dorsa Sadigh |
11 |
2023-11-25 |
link |
Unbalancedness in Neural Monge Maps Improves Unpaired Domain Translation |
Luca Eyring, Dominik Klein,..., Fabian J Theis |
11 |
2023-10-18 |
link |
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning |
Rui Zheng, Wei Shen,..., Xuanjing Huang |
11 |
2023-07-26 |
link |
Are Transformers with One Layer Self-Attention Using Low-Rank Weight Matrices Universal Approximators? |
Tokio Kajitsuka, Issei Sato |
11 |
2024-01-05 |
link |
Simple Hierarchical Planning with Diffusion |
Chang Chen, Fei Deng,..., Sungjin Ahn |
11 |
2023-10-01 |
link |
Subtractive Mixture Models via Squaring: Representation and Learning |
Lorenzo Loconte, Aleksanteri Mikulus Sladek,..., Antonio Vergari |
11 |
2023-09-28 |
link |
Transformer-VQ: Linear-Time Transformers via Vector Quantization |
Lucas Dax Lingle |
11 |
None |
link |
Prompt Gradient Projection for Continual Learning |
Jingyang Qiao, Zhizhong Zhang,..., Yuan Xie |
11 |
2024-04-17 |
link |
Variational Bayesian Last Layers |
James Harrison, John Willes, Jasper Snoek |
11 |
2023-10-23 |
link |
Ghost on the Shell: An Expressive Representation of General 3D Shapes |
Zhen Liu, Yao Feng,..., Bernhard Schölkopf |
11 |
2023-10-19 |
link |
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption |
Rui Yang, Han Zhong,..., Tong Zhang |
11 |
2023-09-25 |
link |
Towards a statistical theory of data selection under weak supervision |
Germain Kolossov, Andrea Montanari, Pulkit Tandon |
11 |
2023-06-17 |
link |
Understanding Certified Training with Interval Bound Propagation |
Yuhao Mao, Mark Niklas Mueller,..., Martin Vechev |
11 |
2023-08-08 |
link |
Sample-Efficient Linear Representation Learning from Non-IID Non-Isotropic Data |
Thomas TCK Zhang, Leonardo Felipe Toso,..., Nikolai Matni |
11 |
2024-01-19 |
link |
Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning |
Yiwei Li, Peiwen Yuan,..., Kan Li |
11 |
2024-03-24 |
link |
VCR-Graphormer: A Mini-batch Graph Transformer via Virtual Connections |
Dongqi Fu, Zhigang Hua,..., Bo Long |
11 |
2023-11-19 |
link |
Unmasking and Improving Data Credibility: A Study with Datasets for Training Harmless Language Models |
Zhaowei Zhu, Jialu Wang,..., Yang Liu |
11 |
None |
link |
The Devil is in the Neurons: Interpreting and Mitigating Social Biases in Language Models |
Yan Liu, Yu Liu,..., Tsung-Yi Ho |
11 |
2023-09-29 |
link |
Navigating the Design Space of Equivariant Diffusion-Based Generative Models for De Novo 3D Molecule Generation |
Tuan Le, Julian Cremer,..., Kristof T Schütt |
10 |
2023-10-25 |
link |
From Molecules to Materials: Pre-training Large Generalizable Models for Atomic Property Prediction |
Nima Shoghi, Adeesh Kolluru,..., Brandon M Wood |
10 |
2023-11-08 |
link |
GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs |
Zhenfang Chen, Rui Sun,..., Chuang Gan |
10 |
2023-05-30 |
link |
Exploring the Promise and Limits of Real-Time Recurrent Learning |
Kazuki Irie, Anand Gopalakrishnan, Jürgen Schmidhuber |
10 |
2023-10-04 |
link |
Generative Modeling of Regular and Irregular Time Series Data via Koopman VAEs |
Ilan Naiman, N. Benjamin Erichson,..., Omri Azencot |
10 |
2023-09-03 |
link |
Traveling Waves Encode the Recent Past and Enhance Sequence Learning |
T. Anderson Keller, Lyle Muller,..., Max Welling |
10 |
2023-07-18 |
link |
UniTabE: A Universal Pretraining Protocol for Tabular Foundation Model in Data Science |
Yazheng Yang, Yuqi Wang,..., Qi Liu |
10 |
2023-10-05 |
link |
Logical Languages Accepted by Transformer Encoders with Hard Attention |
Pablo Barcelo, Alexander Kozachinskiy,..., Vladimir Podolskii |
10 |
2023-11-07 |
link |
Selective Visual Representations Improve Convergence and Generalization for Embodied AI |
Ainaz Eftekhar, Kuo-Hao Zeng,..., Ranjay Krishna |
10 |
2023-11-13 |
link |
Feature emergence via margin maximization: case studies in algebraic tasks |
Depen Morwani, Benjamin L. Edelman,..., Sham M. Kakade |
10 |
2023-10-11 |
link |
Denoising Task Routing for Diffusion Models |
Byeongjun Park, Sangmin Woo,..., Changick Kim |
10 |
2023-06-06 |
link |
Quick-Tune: Quickly Learning Which Pretrained Model to Finetune and How |
Sebastian Pineda Arango, Fabio Ferreira,..., Josif Grabocka |
10 |
None |
link |
Conversational Drug Editing Using Retrieval and Domain Feedback |
Shengchao Liu, Jiongxiao Wang,..., Chaowei Xiao |
10 |
2023-06-14 |
link |
Diffusion in Diffusion: Cyclic One-Way Diffusion for Text-Vision-Conditioned Generation |
Ruoyu Wang, Yongqi Yang,..., Yu Wu |
9 |
None |
link |
The False Promise of Imitating Proprietary Language Models |
Arnav Gudibande, Eric Wallace,..., Dawn Song |
9 |
2023-05-23 |
link |
Proximal Policy Gradient Arborescence for Quality Diversity Reinforcement Learning |
Sumeet Batra, Bryon Tjanaka,..., Gaurav S. Sukhatme |
9 |
2023-10-01 |
link |
Learning to Make Adherence-Aware Advice |
Guanting Chen, Xiaocheng Li,..., Hanzhao Wang |
9 |
2023-11-30 |
link |
Initializing Models with Larger Ones |
Zhiqiu Xu, Yanjie Chen,..., Zhuang Liu |
9 |
None |
link |
PolyGCL: Graph Contrastive Learning via Learnable Spectral Polynomial Filters |
Jingyu Chen, Runlin Lei, Zhewei Wei |
9 |
2023-09-06 |
link |
SEAL: A Framework for Systematic Evaluation of Real-World Super-Resolution |
Wenlong Zhang, Xiaohui Li,..., Chao Dong |
9 |
2023-10-11 |
link |
Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality |
Xuxi Chen, Yu Yang,..., Baharan Mirzasoleiman |
9 |
2024-01-19 |
link |
CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents |
Siyuan Qi, Shuo Chen,..., Song-Chun Zhu |
9 |
2023-08-25 |
link |
SEGNO: Generalizing Equivariant Graph Neural Networks with Physical Inductive Biases |
Yang Liu, Jiashun Cheng,..., Yu Rong |
9 |
2024-01-22 |
link |
EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models |
Koichi Namekata, Amirmojtaba Sabour,..., Seung Wook Kim |
9 |
2023-09-21 |
link |
Quasi-Monte Carlo for 3D Sliced Wasserstein |
Khai Nguyen, Nicola Bariletto, Nhat Ho |
9 |
2023-12-19 |
link |
Adversarial AutoMixup |
Huafeng Qin, Xin Jin,..., Xinbo Gao |
9 |
2023-02-28 |
link |
An Efficient Tester-Learner for Halfspaces |
Aravind Gollakota, Adam Klivans,..., Arsen Vasilyan |
9 |
2023-07-25 |
link |
Submodular Reinforcement Learning |
Manish Prajapat, Mojmir Mutny,..., Andreas Krause |
9 |
2022-10-04 |
link |
Neural-Symbolic Recursive Machine for Systematic Generalization |
Qing Li, Yixin Zhu,..., Siyuan Huang |
9 |
2023-06-12 |
link |
Unprocessing Seven Years of Algorithmic Fairness |
André Cruz, Moritz Hardt |
9 |
2022-12-06 |
link |
Image Inpainting via Iteratively Decoupled Probabilistic Modeling |
Wenbo Li, Xin Yu,..., Zhe Lin |
9 |
2023-10-02 |
link |
Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits |
Qiwei Di, Tao Jin,..., Quanquan Gu |
9 |
2023-03-10 |
link |
MovingParts: Motion-based 3D Part Discovery in Dynamic Radiance Field |
Kaizhi Yang, Xiaoshuai Zhang,..., Hao Su |
9 |
2023-10-21 |
link |
Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain |
Marcus J. Min, Yangruibo Ding,..., Baishakhi Ray |
9 |
2024-03-13 |
link |
DIFFTACTILE: A Physics-based Differentiable Tactile Simulator for Contact-rich Robotic Manipulation |
Zilin Si, Gu Zhang,..., Chuang Gan |
9 |
2024-04-30 |
link |
MetaCoCo: A New Few-Shot Classification Benchmark with Spurious Correlation |
Min Zhang, Haoxuan Li,..., Kun Kuang |
9 |
2023-10-06 |
link |
Identifying Representations for Intervention Extrapolation |
Sorawit Saengkyongam, Elan Rosenfeld,..., Jonas Peters |
9 |
2023-10-11 |
link |
SpikePoint: An Efficient Point-based Spiking Neural Network for Event Cameras Action Recognition |
Hongwei Ren, Yue Zhou,..., Bojun Cheng |
9 |
2023-10-03 |
link |
Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns |
Brian DuSell, David Chiang |
9 |
2023-10-11 |
link |
Generative Modeling with Phase Stochastic Bridges |
Tianrong Chen, Jiatao Gu,..., Shuangfei Zhai |
9 |
2023-10-09 |
link |
An operator preconditioning perspective on training in physics-informed machine learning |
Tim De Ryck, Florent Bonnet,..., Emmanuel de Bezenac |
9 |
2024-01-30 |
link |
Multi-granularity Correspondence Learning from Long-term Noisy Videos |
Yijie Lin, Jie Zhang,..., Xi Peng |
9 |
2023-11-06 |
link |
Tailoring Self-Rationalizers with Multi-Reward Distillation |
Sahana Ramnath, Brihi Joshi,..., Xiang Ren |
9 |
2024-02-07 |
link |
LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors |
Sheng Jin, Xueying Jiang,..., Shijian Lu |
8 |
2024-05-16 |
link |
Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models |
Ziyu Wang, Lejun Min, Gus Xia |
8 |
2023-07-26 |
link |
ExeDec: Execution Decomposition for Compositional Generalization in Neural Program Synthesis |
Kensen Shi, Joey Hong,..., Charles Sutton |
8 |
None |
link |
Generative Learning for Financial Time Series with Irregular and Scale-Invariant Patterns |
Hongbin Huang, Minghua Chen, Xiao Qiao |
8 |
2023-10-12 |
link |
Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video |
Shashanka Venkataramanan, Mamshad Nayeem Rizve,..., Yannis Avrithis |
8 |
2023-10-04 |
link |
USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields |
Moyang Li, Peng Wang,..., Peidong Liu |
8 |
None |
link |
Flow to Better: Offline Preference-based Reinforcement Learning via Preferred Trajectory Generation |
Zhilong Zhang, Yihao Sun,..., Yang Yu |
8 |
None |
link |
TabR: Tabular Deep Learning Meets Nearest Neighbors |
Yury Gorishniy, Ivan Rubachev,..., Artem Babenko |
8 |
None |
link |
Towards Energy Efficient Spiking Neural Networks: An Unstructured Pruning Framework |
Xinyu Shi, Jianhao Ding,..., Zhaofei Yu |
8 |
2023-05-28 |
link |
Repeated Random Sampling for Minimizing the Time-to-Accuracy of Learning |
Patrik Okanovic, Roger Waleffe,..., Theodoros Rekatsinas |
8 |
2023-06-01 |
link |
Understanding Augmentation-based Self-Supervised Representation Learning via RKHS Approximation |
Runtian Zhai, Bingbin Liu,..., Pradeep Kumar Ravikumar |
8 |
2024-03-31 |
link |
Label-Agnostic Forgetting: A Supervision-Free Unlearning in Deep Models |
Shaofei Shen, Chenhao Zhang,..., Miao Xu |
8 |
2024-01-24 |
link |
Navigating Dataset Documentations in AI: A Large-Scale Analysis of Dataset Cards on Hugging Face |
Xinyu Yang, Weixin Liang, James Zou |
8 |
2024-04-16 |
link |
Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs |
Woomin Song, Seunghyuk Oh,..., Jinwoo Shin |
8 |
2023-07-11 |
link |
Benchmarking Algorithms for Federated Domain Generalization |
Ruqi Bai, Saurabh Bagchi, David I. Inouye |
8 |
2024-02-17 |
link |
Boosting of Thoughts: Trial-and-Error Problem Solving with Large Language Models |
Sijia Chen, Baochun Li, Di Niu |
8 |
2024-01-23 |
link |
Energy-based Automated Model Evaluation |
Ru Peng, Heming Zou,..., Junbo Zhao |
8 |
None |
link |
Causal Modelling Agents: Causal Graph Discovery through Synergising Metadata- and Data-driven Reasoning |
Ahmed Abdulaal, Adamos Hadjivasiliou,..., Daniel C. Alexander |
8 |
2023-11-01 |
link |
Instructive Decoding: Instruction-Tuned Large Language Models are Self-Refiner from Noisy Instructions |
Taehyeon Kim, Joonkee Kim,..., Se-Young Yun |
8 |
2024-01-03 |
link |
On the hardness of learning under symmetries |
Bobak Kiani, Thien Le,..., Melanie Weber |
8 |
2023-09-26 |
link |
SGD Finds then Tunes Features in Two-Layer Neural Networks with near-Optimal Sample Complexity: A Case Study in the XOR problem |
Margalit Glasgow |
8 |
2023-08-30 |
link |
RetroBridge: Modeling Retrosynthesis with Markov Bridges |
Ilia Igashov, Arne Schneuing,..., Bruno Correia |
8 |
2023-11-07 |
link |
A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis |
Dipanjyoti Paul, Arpita Chowdhury,..., Wei-Lun Chao |
8 |
2023-10-02 |
link |
L2MAC: Large Language Model Automatic Computer for Extensive Code Generation |
Samuel Holt, Max Ruiz Luyten, Mihaela van der Schaar |
8 |
2023-10-05 |
link |
Pre-Training and Fine-Tuning Generative Flow Networks |
Ling Pan, Moksh Jain,..., Yoshua Bengio |
8 |
2023-10-03 |
link |
GNNX-BENCH: Unravelling the Utility of Perturbation-based GNN Explainers through In-depth Benchmarking |
Mert Kosan, Samidha Verma,..., Sayan Ranu |
8 |
2024-01-28 |
link |
Neural Network-Based Score Estimation in Diffusion Models: Optimization and Generalization |
Yinbin Han, Meisam Razaviyayn, Renyuan Xu |
8 |
None |
link |
Meta Continual Learning Revisited: Implicitly Enhancing Online Hessian Approximation via Variance Reduction |
Yichen Wu, Long-Kai Huang,..., Ying Wei |
8 |
None |
link |
Improving Non-Transferable Representation Learning by Harnessing Content and Style |
Ziming Hong, Zhenyi Wang,..., Tongliang Liu |
8 |
2023-11-06 |
link |
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding |
Junyan Li, Delin Chen,..., Chuang Gan |
8 |
2023-11-19 |
link |
Bounds on Representation-Induced Confounding Bias for Treatment Effect Estimation |
Valentyn Melnychuk, Dennis Frauen, Stefan Feuerriegel |
8 |
2024-01-17 |
link |
Idempotence and Perceptual Image Compression |
Tongda Xu, Ziran Zhu,..., Ya-Qin Zhang |
8 |
2023-12-22 |
link |
Federated Q-Learning: Linear Regret Speedup with Low Communication Cost |
Zhong Zheng, Fengyu Gao,..., Jing Yang |
8 |
2023-10-02 |
link |
From Bricks to Bridges: Product of Invariances to Enhance Latent Space Communication |
Irene Cannistraci, Luca Moschella,..., Emanuele Rodolà |
8 |
2023-10-02 |
link |
Tool-Augmented Reward Modeling |
Lei Li, Yekun Chai,..., Hua Wu |
7 |
2023-12-26 |
link |
Social-Transmotion: Promptable Human Trajectory Prediction |
Saeed Saadatnejad, Yang Gao,..., Alexandre Alahi |
7 |
2023-10-25 |
link |
Frequency-Aware Transformer for Learned Image Compression |
Han Li, Shaohui Li,..., Hongkai Xiong |
7 |
2023-11-27 |
link |
Maximum Likelihood Estimation is All You Need for Well-Specified Covariate Shift |
Jiawei Ge, Shange Tang,..., Chi Jin |
7 |
2023-10-03 |
link |
Towards Training Without Depth Limits: Batch Normalization Without Gradient Explosion |
Alexandru Meterez, Amir Joudaki,..., Hadi Daneshmand |
7 |
None |
link |
Dissecting learning and forgetting in language model finetuning |
Xiao Zhang, Ji Wu |
7 |
None |
link |
Test-time Adaptation against Multi-modal Reliability Bias |
Mouxing Yang, Yunfan Li,..., Xi Peng |
7 |
2023-10-14 |
link |
Mirage: Model-Agnostic Graph Distillation for Graph Classification |
Mridul Gupta, Sahil Manchanda,..., Sayan Ranu |
7 |
None |
link |
CircuitNet 2.0: An Advanced Dataset for Promoting Machine Learning Innovations in Realistic Chip Design Environment |
Xun Jiang, Zhuomin Chai,..., Ru Huang |
7 |
2023-09-01 |
link |
Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior |
Ashmit Khandelwal, Aditya Agrawal,..., Balaji Krishnamurthy |
7 |
None |
link |
Pre-training Sequence, Structure, and Surface Features for Comprehensive Protein Representation Learning |
Youhan Lee, Hasun Yu,..., Jaehoon Kim |
7 |
2024-01-16 |
link |
Explaining Time Series via Contrastive and Locally Sparse Perturbations |
Zichuan Liu, Yingying Zhang,..., Qingsong Wen |
7 |
2023-12-02 |
link |
Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems |
Juno Kim, Kakei Yamamoto,..., Taiji Suzuki |
7 |
2023-05-23 |
link |
Expressive Losses for Verified Robustness via Convex Combinations |
Alessandro De Palma, Rudy R Bunel,..., Alessio Lomuscio |
7 |
2023-07-28 |
link |
Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation |
Xuefei Ning, Zinan Lin,..., Yu Wang |
7 |
2023-10-17 |
link |
Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective |
Ming Zhong, Chenxin An,..., Pengcheng He |
7 |
None |
link |
Sparse MoE with Language Guided Routing for Multilingual Machine Translation |
Xinyu Zhao, Xuxi Chen,..., Tianlong Chen |
7 |
None |
link |
Concept Bottleneck Generative Models |
Aya Abdelsalam Ismail, Julius Adebayo,..., Kyunghyun Cho |
7 |
2023-04-04 |
link |
On the Variance of Neural Network Training with respect to Test Sets and Distributions |
Keller Jordan |
7 |
2022-11-16 |
link |
A Stable, Fast, and Fully Automatic Learning Algorithm for Predictive Coding Networks |
Tommaso Salvatori, Yuhang Song,..., Thomas Lukasiewicz |
7 |
2023-10-27 |
link |
Image Clustering Conditioned on Text Criteria |
Sehyun Kwon, Jaeseung Park,..., Kangwook Lee |
7 |
2023-10-10 |
link |
Let Models Speak Ciphers: Multiagent Debate through Embeddings |
Chau Pham, Boyi Liu,..., Hongxia Yang |
7 |
None |
link |
Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning |
Haoqi Yuan, Zhancun Mu,..., Zongqing Lu |
7 |
2023-10-18 |
link |
De novo protein design using geometric vector field networks |
Weian Mao, Muzhi Zhu,..., Chunhua Shen |
7 |
2023-05-30 |
link |
Inverse Approximation Theory for Nonlinear Recurrent Neural Networks |
Shida Wang, Zhong Li, Qianxiao Li |
7 |
2023-09-10 |
link |
Learning Energy-Based Models by Cooperative Diffusion Recovery Likelihood |
Yaxuan Zhu, Jianwen Xie,..., Ruiqi Gao |
7 |
None |
link |
Language Model Detectors Are Easily Optimized Against |
Charlotte Nicks, Eric Mitchell,..., Stefano Ermon |
7 |
2023-12-13 |
link |
Revisiting the Last-Iterate Convergence of Stochastic Gradient Methods |
Zijian Liu, Zhengyuan Zhou |
7 |
2023-05-22 |
link |
Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting |
Xinlu Zhang, Shiyang Li,..., Linda Ruth Petzold |
7 |
2024-02-14 |
link |
CLIP-MUSED: CLIP-Guided Multi-Subject Visual Neural Information Semantic Decoding |
Qiongyi Zhou, Changde Du,..., Huiguang He |
7 |
2024-02-20 |
link |
Scaling physics-informed hard constraints with mixture-of-experts |
Nithin Chalapathi, Yiheng Du, Aditi S. Krishnapriyan |
7 |
2024-02-22 |
link |
Stable Neural Stochastic Differential Equations in Analyzing Irregular Time Series Data |
YongKyung Oh, Dongyoung Lim, Sungil Kim |
7 |
2024-02-01 |
link |
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update |
Liyuan Mao, Haoran Xu,..., Xianyuan Zhan |
7 |
None |
link |
Tackling the Data Heterogeneity in Asynchronous Federated Learning with Cached Update Calibration |
Yujia Wang, Yuanpu Cao,..., Jinghui Chen |
7 |
2023-12-27 |
link |
Soft Contrastive Learning for Time Series |
Seunghan Lee, Taeyoung Park, Kibok Lee |
6 |
None |
link |
What Makes a Good Prune? Maximal Unstructured Pruning for Maximal Cosine Similarity |
Gabryel Mason-Williams, Fredrik Dahlqvist |
6 |
2023-06-08 |
link |
SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking |
Chris Cundy, Stefano Ermon |
6 |
2023-10-17 |
link |
Context-Aware Meta-Learning |
Christopher Fifty, Dennis Duan,..., Sebastian Thrun |
6 |
2022-06-14 |
link |
Toward Student-oriented Teacher Network Training for Knowledge Distillation |
Chengyu Dong, Liyuan Liu, Jingbo Shang |
6 |
2023-10-31 |
link |
Vanishing Gradients in Reinforcement Finetuning of Language Models |
Noam Razin, Hattie Zhou,..., Etai Littwin |
6 |
2023-05-22 |
link |
Improving Convergence and Generalization Using Parameter Symmetries |
Bo Zhao, Robert M. Gower,..., Rose Yu |
6 |
2023-10-26 |
link |
Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model |
Karsten Roth, Lukas Thede,..., Zeynep Akata |
6 |
2024-03-18 |
link |
Investigating the Benefits of Projection Head for Representation Learning |
Yihao Xue, Eric Gan,..., Baharan Mirzasoleiman |
6 |
2023-10-09 |
link |
Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models |
Archiki Prasad, Elias Stengel-Eskin, Mohit Bansal |
6 |
None |
link |
How I Warped Your Noise: a Temporally-Correlated Noise Prior for Diffusion Models |
Pascal Chang, Jingwei Tang,..., Vinicius C. Azevedo |
6 |
None |
link |
"What Data Benefits My Classifier?" Enhancing Model Performance and Interpretability through Influence-Based Data Selection |
Anshuman Chhabra, Peizhao Li,..., Hongfu Liu |
6 |
2023-11-24 |
link |
Large Language Models as Automated Aligners for benchmarking Vision-Language Models |
Yuanfeng Ji, Chongjian Ge,..., Ping Luo |
6 |
2023-09-29 |
link |
CrossLoco: Human Motion Driven Control of Legged Robots via Guided Unsupervised Reinforcement Learning |
Tianyu Li, Hyunyoung Jung,..., Sehoon Ha |
6 |
None |
link |
Sample-Efficient Quality-Diversity by Cooperative Coevolution |
Ke Xue, Ren-Jian Wang,..., Chao Qian |
6 |
2023-02-13 |
link |
Generative Adversarial Equilibrium Solvers |
Denizalp Goktas, David C. Parkes,..., Andrea Tacchetti |
6 |
2023-10-30 |
link |
LitCab: Lightweight Language Model Calibration over Short- and Long-form Responses |
Xin Liu, Muhammad Khalifa, Lu Wang |
6 |
2024-01-24 |
link |
Task structure and nonlinearity jointly determine learned representational geometry |
Matteo Alleman, Jack Lindsey, Stefano Fusi |
6 |
2023-10-02 |
link |
Robustifying State-space Models for Long Sequences via Approximate Diagonalization |
Annan Yu, Arnur Nigmetov,..., N. Benjamin Erichson |
6 |
2024-01-18 |
link |
Harnessing Density Ratios for Online Reinforcement Learning |
Philip Amortila, Dylan J Foster,..., Tengyang Xie |
6 |
2023-10-06 |
link |
Asymptotically Free Sketched Ridge Ensembles: Risks, Cross-Validation, and Tuning |
Pratik Patil, Daniel LeJeune |
6 |
None |
link |
Graph Transformers on EHRs: Better Representation Improves Downstream Performance |
Raphael Poulain, Rahmatollah Beheshti |
6 |
2023-10-17 |
link |
Lie Group Decompositions for Equivariant Neural Networks |
Mircea Mironenco, Patrick Forré |
6 |
2024-03-07 |
link |
Dissecting Sample Hardness: A Fine-Grained Analysis of Hardness Characterization Methods for Data-Centric AI |
Nabeel Seedat, Fergus Imrie, Mihaela van der Schaar |
6 |
2023-10-03 |
link |
How Over-Parameterization Slows Down Gradient Descent in Matrix Sensing: The Curses of Symmetry and Initialization |
Nuoya Xiong, Lijun Ding, Simon Shaolei Du |
6 |
2023-10-10 |
link |
Approximating Nash Equilibria in Normal-Form Games via Stochastic Optimization |
Ian Gemp, Luke Marris, Georgios Piliouras |
6 |
2023-05-26 |
link |
Exploring Weight Balancing on Long-Tailed Recognition Problem |
Naoya Hasegawa, Issei Sato |
6 |
2024-04-30 |
link |
Debiased Collaborative Filtering with Kernel-Based Causal Balancing |
Haoxuan Li, Chunyuan Zheng,..., Peng Cui |
6 |
2024-03-07 |
link |
On the Markov Property of Neural Algorithmic Reasoning: Analyses and Methods |
Montgomery Bohde, Meng Liu,..., Shuiwang Ji |
6 |
2023-10-12 |
link |
Visual Data-Type Understanding does not emerge from Scaling Vision-Language Models |
Vishaal Udandarao, Max F Burg,..., Matthias Bethge |
6 |
2023-06-03 |
link |
DOS: Diverse Outlier Sampling for Out-of-Distribution Detection |
Wenyu Jiang, Hao Cheng,..., Hongxin Wei |
6 |
None |
link |
Learning Hierarchical World Models with Adaptive Temporal Abstractions from Discrete Latent Dynamics |
Christian Gumbsch, Noor Sajid,..., Martin V. Butz |
6 |
None |
link |
Training-free Multi-objective Diffusion Model for 3D Molecule Generation |
Xu Han, Caihua Shan,..., Dongsheng Li |
6 |
2023-02-06 |
link |
Improving Domain Generalization with Domain Relations |
Huaxiu Yao, Xinyu Yang,..., Chelsea Finn |
6 |
2023-05-23 |
link |
Point2SSM: Learning Morphological Variations of Anatomies from Point Cloud |
Jadie Adams, Shireen Elhabian |
6 |
2024-02-07 |
link |
Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints |
Jian Chen, Ruiyi Zhang,..., Changyou Chen |
6 |
2024-02-11 |
link |
Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy |
Simon Ging, Maria Alejandra Bravo, Thomas Brox |
6 |
2024-03-05 |
link |
TESTAM: A Time-Enhanced Spatio-Temporal Attention Model with Mixture of Experts |
Hyunwook Lee, Sungahn Ko |
6 |
None |
link |
NuwaDynamics: Discovering and Updating in Causal Spatio-Temporal Modeling |
Kun Wang, Hao Wu,..., Yang Wang |
6 |
2023-11-24 |
link |
A General Framework for User-Guided Bayesian Optimization |
Carl Hvarfner, Frank Hutter, Luigi Nardi |
6 |
2023-10-24 |
link |
Privacy Amplification for Matrix Mechanisms |
Christopher A. Choquette-Choo, Arun Ganesh,..., Abhradeep Guha Thakurta |
6 |
2024-01-31 |
link |
Rethinking Channel Dependence for Multivariate Time Series Forecasting: Learning from Leading Indicators |
Lifan Zhao, Yanyan Shen |
6 |
2023-10-05 |
link |
Multimarginal generative modeling with stochastic interpolants |
Michael Samuel Albergo, Nicholas Matthew Boffi,..., Eric Vanden-Eijnden |
6 |
2023-10-10 |
link |
CoT3DRef: Chain-of-Thoughts Data-Efficient 3D Visual Grounding |
Eslam Mohamed Bakr, Mohamed Ayman Mohamed,..., Mohamed Elhoseiny |
6 |
2023-10-04 |
link |
ED-NeRF: Efficient Text-Guided Editing of 3D Scene With Latent Space NeRF |
JangHo Park, Gihyun Kwon, Jong Chul Ye |
6 |
None |
link |
Consistency Training with Learnable Data Augmentation for Graph Anomaly Detection with Limited Supervision |
Nan Chen, Zemin Liu,..., Jia Chen |
6 |
2024-03-10 |
link |
Multisize Dataset Condensation |
Yang He, Lingao Xiao,..., Ivor Tsang |
6 |
2023-12-06 |
link |
Customizable Combination of Parameter-Efficient Modules for Multi-Task Learning |
Haowen Wang, Tao Sun,..., Cong Fan |
6 |
2024-11-16 |
link |
Partitioning Message Passing for Graph Fraud Detection |
Wei Zhuo, Zemin Liu,..., Jia Chen |
5 |
None |
link |
BarLeRIa: An Efficient Tuning Framework for Referring Image Segmentation |
Yaoming Wang, Jin Li,..., Qi Tian |
5 |
None |
link |
EQA-MX: Embodied Question Answering using Multimodal Expression |
Md Mofijul Islam, Alexi Gladstone,..., Tariq Iqbal |
5 |
2024-01-16 |
link |
Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information |
Linfeng Ye, Shayan Mohajer Hamidi,..., En-Hui Yang |
5 |
2023-03-21 |
link |
Influencer Backdoor Attack on Semantic Segmentation |
Haoheng Lan, Jindong Gu,..., Hengshuang Zhao |
5 |
2024-05-01 |
link |
Are Models Biased on Text without Gender-related Language? |
Catarina G Belém, Preethi Seshadri,..., Sameer Singh |
5 |
2023-10-28 |
link |
Pre-training with Random Orthogonal Projection Image Modeling |
Maryam Haghighat, Peyman Moghadam,..., Piotr Koniusz |
5 |
2023-02-21 |
link |
Some Fundamental Aspects about Lipschitz Continuity of Neural Networks |
Grigory Khromov, Sidak Pal Singh |
5 |
2023-10-09 |
link |
Predictive auxiliary objectives in deep RL mimic learning in the brain |
Ching Fang, Kim Stachenfeld |
5 |
2023-10-09 |
link |
DyST: Towards Dynamic Neural Scene Representations on Real-World Videos |
Maximilian Seitzer, Sjoerd van Steenkiste,..., Mehdi S. M. Sajjadi |
5 |
2023-10-31 |
link |
Stochastic Gradient Descent for Gaussian Processes Done Right |
Jihao Andreas Lin, Shreyas Padhy,..., David Janz |
5 |
None |
link |
GAFormer: Enhancing Timeseries Transformers Through Group-Aware Embeddings |
Jingyun Xiao, Ran Liu, Eva L Dyer |
5 |
2023-04-01 |
link |
Abstractors and relational cross-attention: An inductive bias for explicit relational reasoning in Transformers |
Awni Altabaa, Taylor Whittington Webb,..., John Lafferty |
5 |
2023-05-23 |
link |
Faithful and Efficient Explanations for Neural Networks via Neural Tangent Kernel Surrogate Models |
Andrew William Engel, Zhichao Wang,..., Tony Chiang |
5 |
2024-01-17 |
link |
Bilevel Optimization under Unbounded Smoothness: A New Algorithm and Convergence Analysis |
Jie Hao, Xiaochuan Gong, Mingrui Liu |
5 |
2020-08-09 |
link |
Treatment Effects Estimation By Uniform Transformer |
Ruoqi Yu, Shulei Wang |
5 |
2024-02-19 |
link |
Graph-based Virtual Sensing from Sparse and Partial Multivariate Observations |
Giovanni De Felice, Andrea Cini,..., Cesare Alippi |
5 |
2023-11-25 |
link |
Coordinate-Aware Modulation for Neural Fields |
Joo Chan Lee, Daniel Rho,..., Eunbyung Park |
5 |
2023-10-13 |
link |
Jointly-Learned Exit and Inference for a Dynamic Neural Network: JEI-DNN |
Florence Regol, Joud Chataoui, Mark Coates |
5 |
2024-03-19 |
link |
Predictive, scalable and interpretable knowledge tracing on structured domains |
Hanqi Zhou, Robert Bamler,..., Álvaro Tejero-Cantero |
5 |
2023-10-02 |
link |
BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models |
Qingqing Cao, Sewon Min,..., Hannaneh Hajishirzi |
5 |
2024-03-06 |
link |
Sparse Spiking Neural Network: Exploiting Heterogeneity in Timescales for Pruning Recurrent SNN |
Biswadeep Chakraborty, Beomseok Kang,..., Saibal Mukhopadhyay |
5 |
2024-04-02 |
link |
Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models |
Kyuyoung Kim, Jongheon Jeong,..., Kimin Lee |
5 |
2023-07-18 |
link |
Grounded Object-Centric Learning |
Avinash Kori, Francesco Locatello,..., Ben Glocker |
5 |
None |
link |
Fast Imitation via Behavior Foundation Models |
Matteo Pirotta, Andrea Tirinzoni,..., Yann Ollivier |
5 |
2024-01-22 |
link |
Retrieval-Guided Reinforcement Learning for Boolean Circuit Minimization |
Animesh Basak Chowdhury, Marco Romanelli,..., Siddharth Garg |
5 |
2023-10-19 |
link |
To grok or not to grok: Disentangling generalization and memorization on corrupted algorithmic datasets |
Darshil Doshi, Aritra Das,..., Andrey Gromov |
5 |
2023-05-27 |
link |
Diffeomorphic Mesh Deformation via Efficient Optimal Transport for Cortical Surface Reconstruction |
Thanh Tung Le, Khai Nguyen,..., Xiaohui Xie |
5 |
None |
link |
GNNCert: Deterministic Certification of Graph Neural Networks against Adversarial Perturbations |
Zaishuo Xia, Han Yang,..., Jinyuan Jia |
5 |
2023-05-24 |
link |
Sharpness-Aware Data Poisoning Attack |
Pengfei He, Han Xu,..., Jiliang Tang |
5 |
None |
link |
SaNN: Simple Yet Powerful Simplicial-aware Neural Networks |
Sravanthi Gurugubelli, Sundeep Prabhakar Chepuri |
5 |
2023-11-22 |
link |
Prompt Risk Control: A Rigorous Framework for Responsible Deployment of Large Language Models |
Thomas P Zollo, Todd Morrill,..., Richard Zemel |
5 |
2023-06-01 |
link |
Improving Offline RL by Blending Heuristics |
Sinong Geng, Aldo Pacchiano,..., Ching-An Cheng |
5 |
2024-02-29 |
link |
Masks, Signs, And Learning Rate Rewinding |
Advait Harshal Gadhikar, Rebekka Burkholz |
5 |
2024-03-15 |
link |
Backdoor Secrets Unveiled: Identifying Backdoor Data with Optimized Scaled Prediction Consistency |
Soumyadeep Pal, Yuguang Yao,..., Sijia Liu |
5 |
2024-03-19 |
link |
Non-negative Contrastive Learning |
Yifei Wang, Qi Zhang,..., Yisen Wang |
5 |
2023-10-03 |
link |
Blending Imitation and Reinforcement Learning for Robust Policy Improvement |
Xuefeng Liu, Takuma Yoneda,..., Yuxin Chen |
5 |
2024-01-23 |
link |
Locality Sensitive Sparse Encoding for Learning World Models Online |
Zichen Liu, Chao Du,..., Min Lin |
5 |
None |
link |
A Good Learner can Teach Better: Teacher-Student Collaborative Knowledge Distillation |
Ayan Sengupta, Shantanu Dixit,..., Tanmoy Chakraborty |
5 |
2024-03-25 |
link |
Grounding Language Plans in Demonstrations Through Counterfactual Perturbations |
Yanwei Wang, Tsun-Hsuan Wang,..., Julie Shah |
5 |
2023-12-08 |
link |
Neural Spectral Methods: Self-supervised learning in the spectral domain |
Yiheng Du, Nithin Chalapathi, Aditi S. Krishnapriyan |
5 |
None |
link |
Towards Understanding Factual Knowledge of Large Language Models |
Xuming Hu, Junzhe Chen,..., Zhijiang Guo |
5 |
2024-03-16 |
link |
CORN: Contact-based Object Representation for Nonprehensile Manipulation of General Unseen Objects |
Yoonyoung Cho, Junhyek Han,..., Beomjoon Kim |
5 |
2023-10-11 |
link |
What Matters to You? Towards Visual Representation Alignment for Robot Learning |
Thomas Tian, Chenfeng Xu,..., Andrea Bajcsy |
5 |
2023-05-27 |
link |
Query-Policy Misalignment in Preference-Based Reinforcement Learning |
Xiao Hu, Jianxiong Li,..., Ya-Qin Zhang |
5 |
2023-05-24 |
link |
Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models |
Ashutosh Baheti, Ximing Lu,..., Mark Riedl |
5 |
2024-03-17 |
link |
COLEP: Certifiably Robust Learning-Reasoning Conformal Prediction via Probabilistic Circuits |
Mintong Kang, Nezihe Merve Gürel,..., Bo Li |
5 |
2023-05-18 |
link |
Massively Scalable Inverse Reinforcement Learning in Google Maps |
Matt Barnes, Matthew Abueg,..., Shawn O'Banion |
5 |
2024-02-13 |
link |
H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields |
Minyoung Park, Mirae Do,..., Chul Lee |
5 |
None |
link |
Unified Generative Modeling of 3D Molecules with Bayesian Flow Networks |
Yuxuan Song, Jingjing Gong,..., Wei-Ying Ma |
5 |
2023-10-31 |
link |
Contrastive Difference Predictive Coding |
Chongyi Zheng, Ruslan Salakhutdinov, Benjamin Eysenbach |
5 |
None |
link |
On the Scalability and Memory Efficiency of Semidefinite Programs for Lipschitz Constant Estimation of Neural Networks |
Zi Wang, Bin Hu,..., Somesh Jha |
5 |
2023-07-16 |
link |
Tangent Transformers for Composition, Privacy and Removal |
Tian Yu Liu, Aditya Golatkar, Stefano Soatto |
5 |
2024-10-16 |
link |
Reclaiming the Source of Programmatic Policies: Programmatic versus Latent Spaces |
Tales Henrique Carvalho, Kenneth Tjhia, Levi Lelis |
5 |
2024-04-20 |
link |
Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks |
Ben Eisner, Yi Yang,..., David Held |
5 |
2023-11-13 |
link |
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models |
Ilker Kesen, Andrea Pedrotti,..., Erkut Erdem |
4 |
2023-10-04 |
link |
CoLiDE: Concomitant Linear DAG Estimation |
Seyed Saman Saboksayr, Gonzalo Mateos, Mariano Tepper |
4 |
2023-05-30 |
link |
Diffusion Model for Dense Matching |
Jisu Nam, Gyuseong Lee,..., Seungryong Kim |
4 |
2023-10-16 |
link |
Equivariant Matrix Function Neural Networks |
Ilyes Batatia, Lars Leon Schaaf,..., Felix Andreas Faber |
4 |
None |
link |
Addressing Signal Delay in Deep Reinforcement Learning |
Wei Wang, Dongqi Han,..., Dongsheng Li |
4 |
2022-07-20 |
link |
Illusory Attacks: Information-theoretic detectability matters in adversarial attacks |
Tim Franzmeyer, Stephen Marcus McAleer,..., Christian Schroeder de Witt |
4 |
2023-06-07 |
link |
On the Joint Interaction of Models, Data, and Features |
Yiding Jiang, Christina Baek, J Zico Kolter |
4 |
None |
link |
Whittle Index with Multiple Actions and State Constraint for Inventory Management |
Chuheng Zhang, Xiangsen Wang,..., Jiang Bian |
4 |
2023-11-09 |
link |
Generating Pragmatic Examples to Train Neural Program Synthesizers |
Saujas Vaduguru, Daniel Fried, Yewen Pu |
4 |
2023-10-13 |
link |
Goodhart's Law in Reinforcement Learning |
Jacek Karwowski, Oliver Hayman,..., Joar Max Viktor Skalse |
4 |
2024-04-18 |
link |
ASID: Active Exploration for System Identification in Robotic Manipulation |
Marius Memmel, Andrew Wagenmaker,..., Abhishek Gupta |
4 |
None |
link |
MCM: Masked Cell Modeling for Anomaly Detection in Tabular Data |
Jiaxin Yin, Yuanyuan Qiao,..., Jie Yang |
4 |
2023-01-09 |
link |
MOTOR: A Time-to-Event Foundation Model For Structured Medical Records |
Ethan Steinberg, Jason Alan Fries,..., Nigam Shah |
4 |
2024-07-12 |
link |
On the Role of Discrete Tokenization in Visual Representation Learning |
Tianqi Du, Yifei Wang, Yisen Wang |
4 |
2023-05-29 |
link |
Provable Reward-Agnostic Preference-Based Reinforcement Learning |
Wenhao Zhan, Masatoshi Uehara,..., Jason D. Lee |
4 |
2023-05-26 |
link |
Physics-Regulated Deep Reinforcement Learning: Invariant Embeddings |
Hongpeng Cao, Yanbing Mao,..., Marco Caccamo |
4 |
2024-04-01 |
link |
Lipsum-FT: Robust Fine-Tuning of Zero-Shot Models Using Random Text Guidance |
Giung Nam, Byeongho Heo, Juho Lee |
4 |
None |
link |
Deep Geodesic Canonical Correlation Analysis for Covariance-Based Neuroimaging Data |
Ce Ju, Reinmar J Kobler,..., Motoaki Kawanabe |
4 |
2024-02-26 |
link |
REFACTOR: Learning to Extract Theorems from Proofs |
Jin Peng Zhou, Yuhuai Wu,..., Roger Baker Grosse |
4 |
2023-10-08 |
link |
Improved Active Learning via Dependent Leverage Score Sampling |
Atsushi Shimizu, Xiaoou Cheng,..., Jonathan Weare |
4 |
None |
link |
On Bias-Variance Alignment in Deep Models |
Lin Chen, Michal Lukasik,..., Sanjiv Kumar |
4 |
2023-07-06 |
link |
Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation |
Yu Chen, Yihan Du,..., Longbo Huang |
4 |
2023-12-07 |
link |
LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL Architectures |
Vimal Thilak, Chen Huang,..., Etai Littwin |
4 |
2024-01-23 |
link |
DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations |
Dogyun Park, Sihyeon Kim,..., Hyunwoo J. Kim |
4 |
2023-10-08 |
link |
Understanding the Robustness of Multi-modal Contrastive Learning to Distribution Shift |
Yihao Xue, Siddharth Joshi,..., Baharan Mirzasoleiman |
4 |
2023-10-03 |
link |
Ensemble Distillation for Unsupervised Constituency Parsing |
Behzad Shayegh, Yanshuai Cao,..., Lili Mou |
4 |
2023-10-31 |
link |
Offline RL with Observation Histories: Analyzing and Improving Sample Complexity |
Joey Hong, Anca Dragan, Sergey Levine |