673 |
2024-05-23 |
YOLOv10: Real-Time End-to-End Object Detection |
link |
Ao Wang, Hui Chen,..., Guiguang Ding |
528 |
2024-01-18 |
VMamba: Visual State Space Model |
link |
Yue Liu, Yunjie Tian,..., Yunfan Liu |
464 |
2023-05-24 |
Gorilla: Large Language Model Connected with Massive APIs |
link |
Shishir G Patil, Tianjun Zhang,..., Joseph E. Gonzalez |
426 |
2023-11-06 |
CogVLM: Visual Expert for Pretrained Language Models |
link |
Weihan Wang, Qingsong Lv,..., Jie Tang |
305 |
2024-05-23 |
SimPO: Simple Preference Optimization with a Reference-Free Reward |
link |
Yu Meng, Mengzhou Xia, Danqi Chen |
253 |
2024-06-13 |
Depth Anything V2 |
link |
Lihe Yang, Bingyi Kang,..., Hengshuang Zhao |
246 |
2024-06-24 |
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs |
link |
Shengbang Tong, Ellis L Brown II,..., Saining Xie |
216 |
2024-04-03 |
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction |
link |
Keyu Tian, Yi Jiang,..., Liwei Wang |
203 |
2024-03-29 |
Are We on the Right Way for Evaluating Large Vision-Language Models? |
link |
Lin Chen, Jinsong Li,..., Feng Zhao |
185 |
2023-12-04 |
Tree of Attacks: Jailbreaking Black-Box LLMs Automatically |
link |
Anay Mehrotra, Manolis Zampetakis,..., Amin Karbasi |
157 |
2023-11-28 |
LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS |
link |
Zhiwen Fan, Kevin Wang,..., Zhangyang Wang |
152 |
2024-01-31 |
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization |
link |
Coleman Richard Charles Hooper, Sehoon Kim,..., Amir Gholami |
152 |
2024-05-06 |
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering |
link |
John Yang, Carlos E Jimenez,..., Ofir Press |
151 |
2024-06-17 |
Autoregressive Image Generation without Vector Quantization |
link |
Tianhong Li, Yonglong Tian,..., Kaiming He |
147 |
2024-05-03 |
What matters when building vision-language models? |
link |
Hugo Laurençon, Leo Tronchon,..., Victor Sanh |
135 |
2024-05-07 |
xLSTM: Extended Long Short-Term Memory |
link |
Maximilian Beck, Korbinian Pöppel,..., Sepp Hochreiter |
133 |
2024-05-16 |
CAT3D: Create Anything in 3D with Multi-View Diffusion Models |
link |
Ruiqi Gao, Aleksander Holynski,..., Ben Poole |
133 |
2024-04-22 |
SnapKV: LLM Knows What You are Looking for Before Generation |
link |
Yuhong Li, Yingbing Huang,..., Deming Chen |
131 |
2024-04-15 |
LLM Evaluators Recognize and Favor Their Own Generations |
link |
Arjun Panickssery, Samuel R. Bowman, Shi Feng |
123 |
2024-06-17 |
Refusal in Language Models Is Mediated by a Single Direction |
link |
Andy Arditi, Oscar Balcells Obeso,..., Neel Nanda |
117 |
2024-06-06 |
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search |
link |
Dan Zhang, Sining Zhoubian,..., Jie Tang |
107 |
2024-03-30 |
QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs |
link |
Saleh Ashkboos, Amirkeivan Mohtashami,..., James Hensman |
105 |
2024-04-09 |
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD |
link |
Xiaoyi Dong, Pan Zhang,..., Jiaqi Wang |
97 |
2023-10-14 |
Large Language Model Unlearning |
link |
Yuanshun Yao, Xiaojun Xu, Yang Liu |
96 |
2024-04-30 |
Iterative Reasoning Preference Optimization |
link |
Richard Yuanzhe Pang, Weizhe Yuan,..., Jason E Weston |
96 |
None |
Many-shot Jailbreaking |
link |
Cem Anil, Esin DURMUS,..., David Duvenaud |
92 |
2024-07-11 |
FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision |
link |
Jay Shah, Ganesh Bikshandi,..., Tri Dao |
90 |
2023-12-12 |
SGLang: Efficient Execution of Structured Language Model Programs |
link |
Lianmin Zheng, Liangsheng Yin,..., Ying Sheng |
90 |
2024-02-15 |
Chain-of-Thought Reasoning Without Prompting |
link |
Xuezhi Wang, Denny Zhou |
86 |
2024-04-17 |
Many-Shot In-Context Learning |
link |
Rishabh Agarwal, Avi Singh,..., Hugo Larochelle |
84 |
2024-02-16 |
PointMamba: A Simple State Space Model for Point Cloud Analysis |
link |
Dingkang Liang, Xin Zhou,..., Xiang Bai |
81 |
2024-05-23 |
Improved Distribution Matching Distillation for Fast Image Synthesis |
link |
Tianwei Yin, Michaël Gharbi,..., William T. Freeman |
77 |
2024-05-02 |
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation |
link |
Yupeng Zhou, Daquan Zhou,..., Qibin Hou |
77 |
2024-05-06 |
MAmmoTH2: Scaling Instructions from the Web |
link |
Xiang Yue, Tianyu Zheng,..., Wenhu Chen |
76 |
2024-05-06 |
AlphaMath Almost Zero: Process Supervision without Process |
link |
Guoxin Chen, Minpeng Liao,..., Kai Fan |
74 |
2024-04-16 |
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time |
link |
Sicheng Xu, Guojun Chen,..., Baining Guo |
74 |
2024-06-11 |
An Image is Worth 32 Tokens for Reconstruction and Generation |
link |
Qihang Yu, Mark Weber,..., Liang-Chieh Chen |
68 |
2024-07-02 |
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention |
link |
Huiqiang Jiang, YUCHENG LI,..., Lili Qiu |
68 |
2024-01-30 |
Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks |
link |
Andy Zhou, Bo Li, Haohan Wang |
64 |
2024-05-27 |
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability |
link |
Shenyuan Gao, Jiazhi Yang,..., Hongyang Li |
62 |
2024-06-11 |
Simple and Effective Masked Diffusion Language Models |
link |
Subham Sekhar Sahoo, Marianne Arriola,..., Volodymyr Kuleshov |
61 |
2024-04-03 |
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models |
link |
Fanxu Meng, Zhaohui Wang, Muhan Zhang |
61 |
2024-06-06 |
Improving Alignment and Robustness with Circuit Breakers |
link |
Andy Zou, Long Phan,..., Dan Hendrycks |
61 |
2024-07-01 |
Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion |
link |
Boyuan Chen, Diego Martí Monsó,..., Vincent Sitzmann |
59 |
2024-02-12 |
G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering |
link |
Xiaoxin He, Yijun Tian,..., Bryan Hooi |
58 |
2024-02-26 |
Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts |
link |
Mikayel Samvelyan, Sharath Chandra Raparthy,..., Roberta Raileanu |
57 |
2024-02-29 |
Humanoid Locomotion as Next Token Prediction |
link |
Ilija Radosavovic, Bike Zhang,..., Jitendra Malik |
57 |
2024-04-18 |
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing |
link |
Ye Tian, Baolin Peng,..., Dong Yu |
56 |
2024-04-04 |
ReFT: Representation Finetuning for Language Models |
link |
Zhengxuan Wu, Aryaman Arora,..., Christopher Potts |
56 |
2024-06-06 |
Simplified and Generalized Masked Diffusion for Discrete Data |
link |
Jiaxin Shi, Kehang Han,..., Michalis Titsias |
56 |
2024-04-21 |
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis |
link |
Yuxi Ren, Xin Xia,..., Xuefeng Xiao |
55 |
2024-04-11 |
Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models |
link |
Tuomas Kynkäänniemi, Miika Aittala,..., Jaakko Lehtinen |
54 |
2023-12-06 |
Scaling transformer neural networks for skillful and reliable medium-range weather forecasting |
link |
Tung Nguyen, Rohan Shah,..., Aditya Grover |
50 |
2024-06-04 |
Guiding a Diffusion Model with a Bad Version of Itself |
link |
Tero Karras, Miika Aittala,..., Samuli Laine |
50 |
2024-02-17 |
Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents |
link |
Wenkai Yang, Xiaohan Bi,..., Xu Sun |
50 |
2024-02-07 |
Can Large Language Model Agents Simulate Human Trust Behavior? |
link |
Chengxing Xie, Canyu Chen,..., Guohao Li |
49 |
2024-04-24 |
PuLID: Pure and Lightning ID Customization via Contrastive Alignment |
link |
Zinan Guo, Yanze Wu,..., Qian HE |
48 |
2024-05-08 |
You Only Cache Once: Decoder-Decoder Architectures for Language Models |
link |
Yutao Sun, Li Dong,..., Furu Wei |
48 |
2024-07-22 |
Discrete Flow Matching |
link |
Itai Gat, Tal Remez,..., Yaron Lipman |
47 |
2024-07-25 |
Recursive Introspection: Teaching Language Model Agents How to Self-Improve |
link |
Yuxiao Qu, Tianjun Zhang,..., Aviral Kumar |
47 |
2024-03-14 |
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision |
link |
Zhiqing Sun, Longhui Yu,..., Chuang Gan |
47 |
2024-04-25 |
Make Your LLM Fully Utilize the Context |
link |
Shengnan An, Zexiong Ma,..., Weizhu Chen |
46 |
2024-02-29 |
How do Large Language Models Handle Multilingualism? |
link |
Yiran Zhao, Wenxuan Zhang,..., Lidong Bing |
46 |
2024-06-10 |
Parallelizing Linear Transformers with the Delta Rule over Sequence Length |
link |
Songlin Yang, Bailin Wang,..., Yoon Kim |
45 |
2024-02-06 |
SELF-DISCOVER: Large Language Models Self-Compose Reasoning Structures |
link |
Pei Zhou, Jay Pujara,..., Steven Zheng |
45 |
2023-06-02 |
Invisible Image Watermarks Are Provably Removable Using Generative AI |
link |
Xuandong Zhao, Kexun Zhang,..., Lei Li |
45 |
2024-06-05 |
Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms |
link |
Rafael Rafailov, Yaswanth Chittepu,..., Scott Niekum |
45 |
2024-06-17 |
Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging |
link |
Zhenyi Lu, Chenghao Fan,..., Yu Cheng |
44 |
2024-05-16 |
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning |
link |
Yuexiang Zhai, Hao Bai,..., Sergey Levine |
44 |
2024-05-24 |
Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models |
link |
Yimeng Zhang, Xin Chen,..., Sijia Liu |
43 |
2023-12-18 |
Cascade Speculative Drafting for Even Faster LLM Inference |
link |
Ziyi Chen, Xiaocong Yang,..., Jie Huang |
42 |
2024-05-24 |
The Road Less Scheduled |
link |
Aaron Defazio, Xingyu Alice Yang,..., Ashok Cutkosky |
42 |
2024-03-23 |
Understanding Emergent Abilities of Language Models from the Loss Perspective |
link |
Zhengxiao Du, Aohan Zeng,..., Jie Tang |
42 |
2024-06-12 |
One-Step Effective Diffusion Network for Real-World Image Super-Resolution |
link |
Rongyuan Wu, Lingchen Sun,..., Lei Zhang |
41 |
2024-05-24 |
Efficient Adversarial Training in LLMs with Continuous Attacks |
link |
Sophie Xhonneux, Alessandro Sordoni,..., Leo Schwinn |
41 |
2024-02-28 |
Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates |
link |
Kaifeng Lyu, Haoyu Zhao,..., Sanjeev Arora |
41 |
2024-07-02 |
RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs |
link |
Yue Yu, Wei Ping,..., Bryan Catanzaro |
41 |
2023-12-19 |
Large Language Models Play StarCraft II:Benchmarks and A Chain of Summarization Approach |
link |
Weiyu Ma, Qirui Mi,..., Haifeng Zhang |
41 |
2024-05-23 |
Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer |
link |
Shuang Wu, Youtian Lin,..., Yao Yao |
41 |
2024-06-27 |
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding |
link |
Tao Zhang, Xiangtai Li,..., Shuicheng YAN |
41 |
2024-05-30 |
Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image |
link |
Kailu Wu, Fangfu Liu,..., Kaisheng Ma |
41 |
2024-03-27 |
Long-form factuality in large language models |
link |
Jerry Wei, Chengrun Yang,..., Quoc V Le |
40 |
2024-04-15 |
3D Gaussian Splatting as Markov Chain Monte Carlo |
link |
Shakiba Kheradmand, Daniel Rebain,..., Kwang Moo Yi |
40 |
2024-04-04 |
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance |
link |
Vishaal Udandarao, Ameya Prabhu,..., Matthias Bethge |
40 |
2024-06-12 |
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks |
link |
Jiannan Wu, Muyan Zhong,..., Jifeng Dai |
39 |
2024-05-13 |
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator |
link |
Hanshu Yan, Xingchao Liu,..., Jiashi Feng |
39 |
2024-06-05 |
Lumina-Next : Making Lumina-T2X Stronger and Faster with Next-DiT |
link |
Le Zhuo, Ruoyi Du,..., Peng Gao |
38 |
2024-06-22 |
Are Language Models Actually Useful for Time Series Forecasting? |
link |
Mingtian Tan, Mike A Merrill,..., Thomas Hartvigsen |
38 |
2024-06-14 |
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs |
link |
Rui Yang, Ruomeng Ding,..., Tong Zhang |
38 |
2024-05-26 |
Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer |
link |
Zhihan Liu, Miao Lu,..., Zhaoran Wang |
38 |
2024-05-26 |
Demystify Mamba in Vision: A Linear Attention Perspective |
link |
Dongchen Han, Ziyi Wang,..., Gao Huang |
38 |
2024-06-03 |
SpatialRGPT: Grounded Spatial Reasoning in Vision-Language Models |
link |
An-Chieh Cheng, Hongxu Yin,..., Sifei Liu |
37 |
2024-07-17 |
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases |
link |
Zhaorun Chen, Zhen Xiang,..., Bo Li |
37 |
2024-06-26 |
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models |
link |
Liwei Jiang, Kavel Rao,..., Nouha Dziri |
37 |
2024-02-16 |
The Evolution of Statistical Induction Heads: In-Context Learning Markov Chains |
link |
Ezra Edelman, Nikolaos Tsilivis,..., Surbhi Goel |
37 |
2024-02-02 |
ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution |
link |
Haoran Ye, Jiarui Wang,..., Guojie Song |
37 |
2023-05-23 |
Decoupled Kullback-Leibler Divergence Loss |
link |
Jiequan Cui, Zhuotao Tian,..., Hanwang Zhang |
37 |
2024-07-18 |
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies |
link |
Chaofan Tao, Qian Liu,..., Ngai Wong |
37 |
2024-07-11 |
WildGaussians: 3D Gaussian Splatting In the Wild |
link |
Jonas Kulhanek, Songyou Peng,..., Torsten Sattler |
36 |
2024-05-21 |
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention |
link |
William Brandon, Mayank Mishra,..., Jonathan Ragan-Kelley |
36 |
2024-06-20 |
RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold |
link |
Amrith Setlur, Saurabh Garg,..., Aviral Kumar |
36 |
2024-02-14 |
Soft Prompt Threats: Attacking Safety Alignment and Unlearning in Open-Source LLMs through the Embedding Space |
link |
Leo Schwinn, David Dobre,..., Stephan Günnemann |
36 |
2024-02-26 |
Why Transformers Need Adam: A Hessian Perspective |
link |
Yushun Zhang, Congliang Chen,..., Zhi-Quan Luo |
36 |
2024-02-29 |
TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous Variables |
link |
Yuxuan Wang, Haixu Wu,..., Mingsheng Long |
36 |
2023-12-13 |
Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers |
link |
Haifeng Huang, Yilun Chen,..., Zhou Zhao |
35 |
2024-06-25 |
MotionBooth: Motion-Aware Customized Text-to-Video Generation |
link |
Jianzong Wu, Xiangtai Li,..., Kai Chen |
35 |
2023-11-22 |
SegVol: Universal and Interactive Volumetric Medical Image Segmentation |
link |
Yuxin Du, Fan BAI,..., Bo Zhao |
35 |
2024-06-13 |
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback |
link |
Hamish Ivison, Yizhong Wang,..., Hannaneh Hajishirzi |
35 |
2024-06-14 |
L4GM: Large 4D Gaussian Reconstruction Model |
link |
Jiawei Ren, Kevin Xie,..., Huan Ling |
34 |
2024-02-17 |
OneBit: Towards Extremely Low-bit Large Language Models |
link |
Yuzhuang Xu, Xu Han,..., Wanxiang Che |
34 |
2023-04-26 |
The Closeness of In-Context Learning and Weight Shifting for Softmax Regression |
link |
Shuai Li, Zhao Song,..., Tianyi Zhou |
34 |
2024-01-18 |
ChatQA: Surpassing GPT-4 on Conversational QA and RAG |
link |
Zihan Liu, Wei Ping,..., Bryan Catanzaro |
34 |
2024-05-25 |
Streaming Long Video Understanding with Large Language Models |
link |
Rui Qian, Xiaoyi Dong,..., Jiaqi Wang |
34 |
2024-05-19 |
Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention |
link |
Peng Li, Yuan Liu,..., Yike Guo |
33 |
2024-11-02 |
Rule Based Rewards for Language Model Safety |
link |
Tong Mu, Alec Helyar,..., Lilian Weng |
33 |
2024-05-08 |
Chain of Thoughtlessness? An Analysis of CoT in Planning |
link |
Kaya Stechly, Karthik Valmeekam, Subbarao Kambhampati |
33 |
2024-06-21 |
Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models |
link |
Jiayu Wang, Yifei Ming,..., Neel Joshi |
33 |
2024-02-08 |
Noise Contrastive Alignment of Language Models with Explicit Rewards |
link |
Huayu Chen, Guande He,..., Jun Zhu |
33 |
2024-04-23 |
Rethinking LLM Memorization through the Lens of Adversarial Compression |
link |
Avi Schwarzschild, Zhili Feng,..., J Zico Kolter |
33 |
2024-04-19 |
MoVA: Adapting Mixture of Vision Experts to Multimodal Context |
link |
Zhuofan Zong, Bingqi Ma,..., Yu Liu |
33 |
2024-05-26 |
Diffusion4D: Fast Spatial-temporal Consistent 4D generation via Video Diffusion Models |
link |
HANWEN LIANG, Yuyang Yin,..., Yunchao Wei |
33 |
2024-02-24 |
Spec-Gaussian: Anisotropic View-Dependent Appearance for 3D Gaussian Splatting |
link |
Ziyi Yang, Xinyu Gao,..., Xiaogang Jin |
32 |
2024-05-28 |
Aligning to Thousands of Preferences via System Message Generalization |
link |
Seongyun Lee, Sue Hyun Park,..., Minjoon Seo |
32 |
2024-06-03 |
Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration |
link |
Junyang Wang, Haiyang Xu,..., Jitao Sang |
32 |
2024-06-03 |
MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures |
link |
Jinjie Ni, Fuzhao Xue,..., Yang You |
32 |
2024-02-12 |
Model Collapse Demystified: The Case of Regression |
link |
Elvis Dohmatob, Yunzhen Feng, Julia Kempe |
32 |
2024-10-08 |
Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing |
link |
Hao Fei, Shengqiong Wu,..., Shuicheng YAN |
31 |
2024-05-28 |
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations |
link |
Alexander Hägele, Elie Bakouch,..., Martin Jaggi |
31 |
2024-02-16 |
Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE) |
link |
Usha Bhalla, Alex Oesterling,..., Himabindu Lakkaraju |
31 |
2024-03-26 |
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning |
link |
Rui Pan, Xiang Liu,..., Tong Zhang |
31 |
2024-06-13 |
Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models |
link |
Yushi Hu, Weijia Shi,..., Ranjay Krishna |
31 |
2024-05-30 |
Enhancing Large Vision Language Models with Self-Training on Image Comprehension |
link |
Yihe Deng, Pan Lu,..., Wei Wang |
30 |
2024-03-11 |
SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection |
link |
Yuxuan Li, Xiang Li,..., Jian Yang |
30 |
2024-05-17 |
Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning |
link |
Dan Braun, Jordan Taylor,..., Lee Sharkey |
30 |
2024-03-26 |
MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution |
link |
Wei Tao, Yucheng Zhou,..., Yu Cheng |
30 |
2023-11-29 |
Elo Uncovered: Robustness and Best Practices in Language Model Evaluation |
link |
Meriem Boubdir, Edward Kim,..., Marzieh Fadaee |
30 |
2024-06-14 |
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning |
link |
Hao Bai, Yifei Zhou,..., Aviral Kumar |
30 |
2024-05-27 |
Safe LoRA: The Silver Lining of Reducing Safety Risks when Finetuning Large Language Models |
link |
Chia-Yi Hsu, Yu-Lin Tsai,..., Chun-Ying Huang |
30 |
2024-06-13 |
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation |
link |
Junke Wang, Yi Jiang,..., Yu-Gang Jiang |
29 |
2023-06-13 |
Questioning the Survey Responses of Large Language Models |
link |
Ricardo Dominguez-Olmedo, Moritz Hardt, Celestine Mendler-Dünner |
29 |
2024-04-30 |
HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning |
link |
Chunlin Tian, Zhan Shi,..., Cheng-zhong Xu |
29 |
2024-05-20 |
Diffusion for World Modeling: Visual Details Matter in Atari |
link |
Eloi Alonso, Adam Jelley,..., François Fleuret |
29 |
2024-07-19 |
Compact Language Models via Pruning and Knowledge Distillation |
link |
Saurav Muralidharan, Sharath Turuvekere Sreenivas,..., Pavlo Molchanov |
29 |
2024-06-02 |
BoNBoN Alignment for Large Language Models and the Sweetness of Best-of-n Sampling |
link |
Lin Gui, Cristina Garbacea, Victor Veitch |
29 |
2024-04-25 |
REBEL: Reinforcement Learning via Regressing Relative Rewards |
link |
Zhaolin Gao, Jonathan Daniel Chang,..., Wen Sun |
29 |
2024-06-13 |
Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs |
link |
Xuan Zhang, Chao Du,..., Min Lin |
29 |
2024-02-07 |
InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory |
link |
Chaojun Xiao, Pengle Zhang,..., Maosong Sun |
29 |
2024-05-23 |
MiniCache: KV Cache Compression in Depth Dimension for Large Language Models |
link |
Akide Liu, Jing Liu,..., Bohan Zhuang |
28 |
2024-05-23 |
EMR-Merging: Tuning-Free High-Performance Model Merging |
link |
Chenyu Huang, Peng Ye,..., Wanli Ouyang |
28 |
2024-05-27 |
Transformers Can Do Arithmetic with the Right Embeddings |
link |
Sean Michael McLeish, Arpit Bansal,..., Tom Goldstein |
28 |
2023-05-27 |
MADiff: Offline Multi-agent Learning with Diffusion Models |
link |
Zhengbang Zhu, Minghuan Liu,..., Weinan Zhang |
28 |
2024-05-23 |
Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model |
link |
Yuheng Shi, Minjing Dong, Chang Xu |
28 |
2024-02-28 |
Approaching Human-Level Forecasting with Language Models |
link |
Danny Halawi, Fred Zhang,..., Jacob Steinhardt |
28 |
2024-05-31 |
MeshXL: Neural Coordinate Field for Generative 3D Foundation Models |
link |
Sijin Chen, Xin Chen,..., Tao Chen |
27 |
2023-12-12 |
Alignment for Honesty |
link |
Yuqing Yang, Ethan Chern,..., Pengfei Liu |
27 |
2024-06-17 |
How Do Large Language Models Acquire Factual Knowledge During Pretraining? |
link |
Hoyeon Chang, Jinho Park,..., Minjoon Seo |
27 |
2024-04-09 |
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection |
link |
Haoyang He, Yuhu Bai,..., Lei Xie |
27 |
2023-10-26 |
Transformers Learn to Achieve Second-Order Convergence Rates for In-Context Linear Regression |
link |
Deqing Fu, Tian-qi Chen,..., Vatsal Sharan |
26 |
2024-09-30 |
Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers |
link |
Lirui Wang, Xinlei Chen,..., Kaiming He |
26 |
2024-06-14 |
Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs |
link |
Abhimanyu Hans, John Kirchenbauer,..., Tom Goldstein |
26 |
2024-03-05 |
Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding |
link |
Zhenyu Zhang, Runjin Chen,..., Zhangyang Wang |
26 |
2024-05-29 |
T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback |
link |
Jiachen Li, Weixi Feng,..., William Yang Wang |
26 |
2024-07-02 |
Meta 3D AssetGen: Text-to-Mesh Generation with High-Quality Geometry, Texture, and PBR Materials |
link |
Yawar Siddiqui, Tom Monnier,..., David Novotny |
26 |
2024-06-03 |
Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses |
link |
Xiaosen Zheng, Tianyu Pang,..., Min Lin |
26 |
2024-05-09 |
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts |
link |
Jiachen Li, Xinyao Wang,..., Longyin Wen |
25 |
2024-02-04 |
Aligner: Efficient Alignment by Learning to Correct |
link |
Jiaming Ji, Boyuan Chen,..., Yaodong Yang |
25 |
2024-08-19 |
Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning |
link |
Sriyash Poddar, Yanming Wan,..., Natasha Jaques |
25 |
2024-06-17 |
Unveiling Encoder-Free Vision-Language Models |
link |
Haiwen Diao, Yufeng Cui,..., Xinlong Wang |
25 |
2024-06-06 |
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models |
link |
Ling Yang, Zhaochen Yu,..., Bin CUI |
25 |
2024-02-18 |
Federated Fine-tuning of Large Language Models under Heterogeneous Tasks and Client Resources |
link |
Jiamu Bai, Daoyuan Chen,..., Yaliang Li |
25 |
2024-05-28 |
Understanding Transformer Reasoning Capabilities via Graph Algorithms |
link |
Clayton Sanford, Bahare Fatemi,..., Vahab Mirrokni |
25 |
2024-05-31 |
4Diffusion: Multi-view Video Diffusion Model for 4D Generation |
link |
Haiyu Zhang, Xinyuan Chen,..., Yu Qiao |
25 |
2024-02-03 |
Panacea: Pareto Alignment via Preference Adaptation for LLMs |
link |
Yifan Zhong, Chengdong Ma,..., Yaodong Yang |
25 |
2024-05-22 |
xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token |
link |
Xin Cheng, Xun Wang,..., Dongyan Zhao |
25 |
2024-06-18 |
DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving |
link |
Yuxuan Tong, Xiwen Zhang,..., Junxian He |
25 |
2024-04-12 |
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length |
link |
Xuezhe Ma, Xiaomeng Yang,..., Chunting Zhou |
25 |
2024-07-01 |
On Statistical Rates and Provably Efficient Criteria of Latent Diffusion Transformers (DiTs) |
link |
Jerry Yao-Chieh Hu, Weimin Wu,..., Han Liu |
25 |
2023-05-15 |
PLIP: Language-Image Pre-training for Person Representation Learning |
link |
Jialong Zuo, Jiahao Hong,..., Jingdong Wang |
25 |
2024-05-28 |
Why are Visually-Grounded Language Models Bad at Image Classification? |
link |
Yuhui Zhang, Alyssa Unell,..., Serena Yeung-Levy |
24 |
2024-06-06 |
Evaluating the World Model Implicit in a Generative Model |
link |
Keyon Vafa, Justin Y. Chen,..., Sendhil Mullainathan |
24 |
2024-07-31 |
Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models |
link |
Adam Karvonen, Benjamin Wright,..., Samuel Marks |
24 |
2024-02-19 |
Query-Based Adversarial Prompt Generation |
link |
Jonathan Hayase, Ema Borevković,..., Milad Nasr |
24 |
2024-03-28 |
InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction |
link |
Sirui Xu, Ziyin Wang,..., Liangyan Gui |
24 |
2024-02-22 |
Large Language Models as Urban Residents: An LLM Agent Framework for Personal Mobility Generation |
link |
Jiawei Wang, Renhe Jiang,..., Chuan Xiao |
24 |
2023-10-06 |
Why Do We Need Weight Decay in Modern Deep Learning? |
link |
Francesco D'Angelo, Maksym Andriushchenko,..., Nicolas Flammarion |
24 |
2024-04-06 |
Aligning Diffusion Models by Optimizing Human Utility |
link |
Shufan Li, Konstantinos Kallidromitis,..., Kazuki Kozuka |
24 |
2024-03-01 |
Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes |
link |
Xiaomeng Hu, Pin-Yu Chen, Tsung-Yi Ho |
24 |
2023-12-06 |
OctreeOcc: Efficient and Multi-Granularity Occupancy Prediction Using Octree Queries |
link |
Yuhang Lu, Xinge ZHU,..., Yuexin Ma |
24 |
2024-08-27 |
The Mamba in the Llama: Distilling and Accelerating Hybrid Models |
link |
Junxiong Wang, Daniele Paliotta,..., Tri Dao |
24 |
2024-05-03 |
DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos |
link |
Wen-Hsuan Chu, Lei Ke, Katerina Fragkiadaki |
24 |
2024-06-04 |
OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding |
link |
Yanmin Wu, Jiarui Meng,..., Jian Zhang |
24 |
2024-05-27 |
Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control |
link |
Zhengfei Kuang, Shengqu Cai,..., Gordon Wetzstein |
24 |
2024-06-10 |
LLM Dataset Inference: Did you train on my dataset? |
link |
Pratyush Maini, Hengrui Jia,..., Adam Dziedzic |
23 |
2024-02-29 |
Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models |
link |
Frederik Kunstner, Alan Milligan,..., Alberto Bietti |
23 |
2024-06-04 |
Chain of Agents: Large Language Models Collaborating on Long-Context Tasks |
link |
Yusen Zhang, Ruoxi Sun,..., Sercan O Arik |
23 |
2024-05-23 |
JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models |
link |
Kun Zhou, Beichen Zhang,..., Ji-Rong Wen |
23 |
2024-02-09 |
Fight Back Against Jailbreaking via Prompt Adversarial Tuning |
link |
Yichuan Mo, Yuji Wang,..., Yisen Wang |
23 |
2024-06-12 |
Large Language Model Unlearning via Embedding-Corrupted Prompts |
link |
Chris Yuhao Liu, Yaxuan Wang,..., Yang Liu |
23 |
2024-02-29 |
Theoretical Foundations of Deep Selective State-Space Models |
link |
Nicola Muca Cirone, Antonio Orvieto,..., Terry Lyons |
23 |
2024-06-11 |
4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models |
link |
Heng Yu, Chaoyang Wang,..., Hsin-Ying Lee |
23 |
2024-06-27 |
Decoding-Time Language Model Alignment with Multiple Objectives |
link |
Ruizhe Shi, Yifang Chen,..., Simon Shaolei Du |
23 |
2024-07-05 |
On scalable oversight with weak LLMs judging strong LLMs |
link |
Zachary Kenton, Noah Yamamoto Siegel,..., Rohin Shah |
23 |
2024-03-14 |
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models |
link |
Zunnan Xu, Yukang Lin,..., Xiu Li |
23 |
2024-05-23 |
ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification |
link |
Yefei He, Luoming Zhang,..., Bohan Zhuang |
23 |
2024-05-23 |
Calibrated Self-Rewarding Vision Language Models |
link |
Yiyang Zhou, Zhiyuan Fan,..., Huaxiu Yao |
23 |
2024-05-31 |
Amortizing intractable inference in diffusion models for vision, language, and control |
link |
Siddarth Venkatraman, Moksh Jain,..., Nikolay Malkin |
23 |
2024-07-29 |
FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention |
link |
Yu Lu, Yuanzhi Liang,..., Yi Yang |
22 |
2024-06-03 |
DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs |
link |
Haokun Lin, Haobo Xu,..., Ying Wei |
22 |
2024-04-23 |
Aligning LLM Agents by Learning Latent Preference from User Edits |
link |
Ge Gao, Alexey Taymanov,..., Dipendra Misra |
22 |
2024-06-06 |
Transformers need glasses! Information over-squashing in language tasks |
link |
Federico Barbero, Andrea Banino,..., Petar Veličković |
22 |
2024-07-06 |
LoRA-GA: Low-Rank Adaptation with Gradient Approximation |
link |
Shaowen Wang, Linxi Yu, Jian Li |
22 |
2024-05-07 |
KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization |
link |
Tianyi Zhang, Jonah Wonkyu Yi,..., Anshumali Shrivastava |
22 |
2024-05-27 |
EM Distillation for One-step Diffusion Models |
link |
Sirui Xie, Zhisheng Xiao,..., Ruiqi Gao |
22 |
2024-01-11 |
A Closer Look at AUROC and AUPRC under Class Imbalance |
link |
Matthew B.A. McDermott, Haoran Zhang,..., Jack Gallifant |
22 |
2024-04-16 |
Self-playing Adversarial Language Game Enhances LLM Reasoning |
link |
Pengyu Cheng, Tianhao Hu,..., Xiaolong Li |
22 |
2024-06-11 |
BAKU: An Efficient Transformer for Multi-Task Policy Learning |
link |
Siddhant Haldar, Zhuoran Peng, Lerrel Pinto |
21 |
2024-02-19 |
A Critical Evaluation of AI Feedback for Aligning Large Language Models |
link |
Archit Sharma, Sedrick Keh,..., Thomas Kollar |
21 |
2024-06-06 |
Multistep Distillation of Diffusion Models via Moment Matching |
link |
Tim Salimans, Thomas Mensink,..., Emiel Hoogeboom |
21 |
2024-05-30 |
Transfer Q-star : Principled Decoding for LLM Alignment |
link |
Souradip Chakraborty, Soumya Suvra Ghosal,..., Furong Huang |
21 |
2024-02-19 |
WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the Environment |
link |
Hao Tang, Darren Yan Key, Kevin Ellis |
21 |
2022-08-22 |
Efficiency of the First-Price Auction in the Autobidding World |
link |
Yuan Deng, Jieming Mao,..., Song Zuo |
21 |
2024-05-23 |
HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models |
link |
Bernal Jimenez Gutierrez, Yiheng Shu,..., Yu Su |
21 |
2023-10-11 |
MatFormer: Nested Transformer for Elastic Inference |
link |
Fnu Devvrit, Sneha Kudugunta,..., Prateek Jain |
21 |
2024-05-31 |
ContextGS : Compact 3D Gaussian Splatting with Anchor Level Context Model |
link |
Yufei Wang, Zhihao Li,..., Bihan Wen |
20 |
2024-02-15 |
Self-Play Fine-tuning of Diffusion Models for Text-to-image Generation |
link |
Huizhuo Yuan, Zixiang Chen,..., Quanquan Gu |
20 |
2024-06-10 |
Aligning Large Language Models with Representation Editing: A Control Perspective |
link |
Lingkai Kong, Haorui Wang,..., Chao Zhang |
20 |
2024-06-03 |
Neural network learns low-dimensional polynomials with SGD near the information-theoretic limit |
link |
Jason D. Lee, Kazusato Oko,..., Denny Wu |
20 |
2024-05-29 |
Poseidon: Efficient Foundation Models for PDEs |
link |
Maximilian Herde, Bogdan Raonic,..., Siddhartha Mishra |
20 |
2024-05-19 |
FIFO-Diffusion: Generating Infinite Videos from Text without Training |
link |
Jihwan Kim, Junoh Kang,..., Bohyung Han |
20 |
None |
IMAGPose: A Unified Conditional Framework for Pose-Guided Person Generation |
link |
Fei Shen, Jinhui Tang |
19 |
2023-12-06 |
Return of Unconditional Generation: A Self-supervised Representation Generation Method |
link |
Tianhong Li, Dina Katabi, Kaiming He |
19 |
2024-07-08 |
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing |
link |
Zhenyu Wang, Aoxue Li,..., Xihui Liu |
19 |
2024-06-10 |
MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models |
link |
Zichun Yu, Spandan Das, Chenyan Xiong |
19 |
2024-05-30 |
Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities |
link |
Alexander V Nikitin, Jannik Kossen,..., Pekka Marttinen |
19 |
2024-05-27 |
Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language Models |
link |
ShengYun Peng, Pin-Yu Chen,..., Duen Horng Chau |
19 |
2024-05-29 |
Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models |
link |
Zhanhui Zhou, Zhixuan Liu,..., Yu Qiao |
19 |
2024-05-30 |
CV-VAE: A Compatible Video VAE for Latent Generative Video Models |
link |
Sijie Zhao, Yong Zhang,..., Ying Shan |
19 |
2024-05-27 |
PromptFix: You Prompt and We Fix the Photo |
link |
Yongsheng Yu, Ziyun Zeng,..., Jiebo Luo |
19 |
2024-05-22 |
ReVideo: Remake a Video with Motion and Content Control |
link |
Chong Mou, Mingdeng Cao,..., Jian Zhang |
18 |
2024-06-14 |
Large language model validity via enhanced conformal prediction methods |
link |
John Cherian, Isaac Gibbs, Emmanuel Candes |
18 |
2023-08-04 |
Adaptive Proximal Gradient Method for Convex Optimization |
link |
Yura Malitsky, Konstantin Mishchenko |
18 |
2024-01-29 |
Contracting with a Learning Agent |
link |
Guru Guruganesh, Yoav Kolumbus,..., S. Matthew Weinberg |
18 |
2024-06-18 |
SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code Agents |
link |
Niels Mündler, Mark Niklas Mueller,..., Martin Vechev |
18 |
2024-04-23 |
Gradient Guidance for Diffusion Models: An Optimization Perspective |
link |
Yingqing Guo, Hui Yuan,..., Mengdi Wang |
18 |
None |
Are More LLM Calls All You Need? Towards the Scaling Properties of Compound AI Systems |
link |
Lingjiao Chen, Jared Quincy Davis,..., James Zou |
18 |
2024-10-26 |
Fast Best-of-N Decoding via Speculative Rejection |
link |
Hanshi Sun, Momin Haider,..., Andrea Zanette |
18 |
2024-08-19 |
Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models |
link |
Aviv Bick, Kevin Li,..., Albert Gu |
18 |
2024-05-29 |
Preference Learning Algorithms Do Not Learn Preference Rankings |
link |
Angelica Chen, Sadhika Malladi,..., Kyunghyun Cho |
18 |
2024-09-09 |
FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank Adaptations |
link |
Ziyao Wang, Zheyu Shen,..., Ang Li |
18 |
2024-01-11 |
Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents |
link |
Quentin Delfosse, Sebastian Sztwiertnia,..., Kristian Kersting |
18 |
2024-03-25 |
Provably Robust Score-Based Diffusion Posterior Sampling for Plug-and-Play Image Reconstruction |
link |
Xingyu Xu, Yuejie Chi |
18 |
2024-06-12 |
DiTFastAttn: Attention Compression for Diffusion Transformer Models |
link |
Zhihang Yuan, Hanling Zhang,..., Yu Wang |
18 |
2024-06-17 |
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning |
link |
Zebang Cheng, Zhi-Qi Cheng,..., Alexander G Hauptmann |
18 |
2024-02-15 |
BitDelta: Your Fine-Tune May Only Be Worth One Bit |
link |
James Liu, Guangxuan Xiao,..., Tianle Cai |
18 |
2024-05-17 |
ProSST: Protein Language Modeling with Quantized Structure and Disentangled Attention |
link |
Mingchen Li, Yang Tan,..., Liang Hong |
18 |
2024-04-04 |
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching |
link |
Dongzhi Jiang, Guanglu Song,..., Hongsheng Li |
18 |
2024-05-25 |
Theoretical Analysis of Weak-to-Strong Generalization |
link |
Hunter Lang, David Sontag, Aravindan Vijayaraghavan |
18 |
2024-06-12 |
Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference |
link |
Jiabao Ji, Yujian Liu,..., Shiyu Chang |
18 |
2024-05-28 |
FreeSplat: Generalizable 3D Gaussian Splatting Towards Free View Synthesis of Indoor Scenes |
link |
Yunsong Wang, Tianxin Huang,..., Gim Hee Lee |
18 |
2024-05-24 |
iVideoGPT: Interactive VideoGPTs are Scalable World Models |
link |
Jialong Wu, Shaofeng Yin,..., Mingsheng Long |
17 |
2024-08-19 |
MeshFormer : High-Quality Mesh Generation with 3D-Guided Reconstruction Model |
link |
Minghua Liu, Chong Zeng,..., Hao Su |
17 |
2024-04-19 |
Ensemble Learning for Heterogeneous Large Language Models with Deep Parallel Collaboration |
link |
Yichong Huang, Xiaocheng Feng,..., Bing Qin |
17 |
2024-10-21 |
3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors |
link |
Xi Liu, Chaoyi Zhou, Siyu Huang |
17 |
2024-06-15 |
Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection |
link |
Guowen Zhang, Lue Fan,..., Lei Zhang |
17 |
2024-07-09 |
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore |
link |
Rulin Shao, Jacqueline He,..., Pang Wei Koh |
17 |
2024-06-13 |
On Softmax Direct Preference Optimization for Recommendation |
link |
Yuxin Chen, Junfei Tan,..., Tat-Seng Chua |
17 |
2024-03-25 |
Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization |
link |
Xiangxin Zhou, Dongyu Xue,..., Quanquan Gu |
17 |
2024-05-28 |
Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization |
link |
Yuanpu Cao, Tianrong Zhang,..., Jinghui Chen |
17 |
2024-05-24 |
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models |
link |
Byung-Kwan Lee, Chae Won Kim,..., Yong Man Ro |
17 |
2024-06-13 |
Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models and Time-Dependent Layer Normalization |
link |
Qihao Liu, Zhanpeng Zeng,..., Liang-Chieh Chen |
17 |
2024-03-22 |
Can large language models explore in-context? |
link |
Akshay Krishnamurthy, Keegan Harris,..., Aleksandrs Slivkins |
17 |
2024-03-12 |
Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion |
link |
Dongyang Li, Chen Wei,..., Quanying Liu |
17 |
2024-05-28 |
Knowledge Circuits in Pretrained Transformers |
link |
Yunzhi Yao, Ningyu Zhang,..., Huajun Chen |
17 |
2024-07-23 |
Harmonizing Visual Text Comprehension and Generation |
link |
Zhen Zhao, Jingqun Tang,..., Yuan Xie |
17 |
2024-09-29 |
One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos |
link |
Zechen Bai, Tong He,..., Mike Zheng Shou |
17 |
2024-05-25 |
PTQ4DiT: Post-training Quantization for Diffusion Transformers |
link |
Junyi Wu, Haoxuan Wang,..., Yan Yan |
17 |
2024-02-04 |
AutoTimes: Autoregressive Time Series Forecasters via Large Language Models |
link |
Yong Liu, Guo Qin,..., Mingsheng Long |
17 |
2024-06-17 |
Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models |
link |
Bingqi Ma, Zhuofan Zong,..., Yu Liu |
17 |
2024-06-21 |
GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation |
link |
Chubin Zhang, Hongliang Song,..., Yansong Tang |
16 |
2024-06-27 |
Resolving Discrepancies in Compute-Optimal Scaling of Language Models |
link |
Tomer Porian, Mitchell Wortsman,..., Yair Carmon |
16 |
2024-09-26 |
HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection |
link |
Xuefeng Du, Chaowei Xiao, Yixuan Li |
16 |
2024-07-05 |
Better by default: Strong pre-tuned MLPs and boosted trees on tabular data |
link |
David Holzmüller, Leo Grinsztajn, Ingo Steinwart |
16 |
2024-03-09 |
Algorithmic progress in language models |
link |
Anson Ho, Tamay Besiroglu,..., Jaime Sevilla |
16 |
2024-06-17 |
Transcoders find interpretable LLM feature circuits |
link |
Jacob Dunefsky, Philippe Chlenski, Neel Nanda |
16 |
2024-05-27 |
GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping |
link |
Junyoung Seo, Kazumi Fukuda,..., Yuki Mitsufuji |
16 |
2024-02-09 |
CultureLLM: Incorporating Cultural Differences into Large Language Models |
link |
CHENG LI, Mengzhuo Chen,..., Xing Xie |
16 |
2024-05-23 |
Base of RoPE Bounds Context Length |
link |
Mingyu Xu, Xin Men,..., weipeng chen |
16 |
2024-04-22 |
SOFTS: Efficient Multivariate Time Series Forecasting with Series-Core Fusion |
link |
Lu Han, Xu-Yang Chen,..., De-Chuan Zhan |
16 |
2023-12-20 |
UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections |
link |
Fangjinhua Wang, Marie-Julie Rakotosaona,..., Federico Tombari |
16 |
2024-06-20 |
MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs |
link |
Zhongshen Zeng, Yinhong Liu,..., Jiaya Jia |
16 |
2024-05-16 |
Conformal Alignment: Knowing When to Trust Foundation Models with Guarantees |
link |
Yu Gui, Ying Jin, Zhimei Ren |
16 |
2024-03-07 |
Online Adaptation of Language Models with a Memory of Amortized Contexts |
link |
Jihoon Tack, Jaehyung Kim,..., Jonathan Richard Schwarz |
16 |
2024-08-07 |
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks |
link |
Zaijing Li, Yuquan Xie,..., Liqiang Nie |
16 |
2024-05-23 |
Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion Models |
link |
Gen Li, Yuling Yan |
16 |
2024-02-05 |
FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion |
link |
Xing Han, Huy Nguyen,..., Suchi Saria |
16 |
2024-03-12 |
Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models |
link |
Yang Jiao, Shaoxiang Chen,..., Yu-Gang Jiang |
16 |
2024-06-11 |
Zero-shot Image Editing with Reference Imitation |
link |
Xi Chen, Yutong Feng,..., Hengshuang Zhao |
16 |
2024-06-18 |
HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors |
link |
Panwang Pan, Zhuo Su,..., Yebin Liu |
16 |
2024-07-02 |
UltraPixel: Advancing Ultra High-Resolution Image Synthesis to New Peaks |
link |
Jingjing Ren, Wenbo Li,..., Lei Zhu |
16 |
2024-05-30 |
Group Robust Preference Optimization in Reward-free RLHF |
link |
Shyam Sundhar Ramesh, Yifan Hu,..., Ilija Bogunovic |
16 |
2024-05-22 |
DOGS: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus |
link |
Yu Chen, Gim Hee Lee |
16 |
2024-06-14 |
Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections |
link |
Jiacong Xu, Yiqun Mei, Vishal M. Patel |
16 |
2024-06-13 |
COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing |
link |
Jiangshan Wang, Yue Ma,..., Xiu Li |
16 |
2024-03-19 |
Pretraining Codomain Attention Neural Operators for Solving Multiphysics PDEs |
link |
Md Ashiqur Rahman, Robert Joseph George,..., Anima Anandkumar |
15 |
2023-10-19 |
AutoMix: Automatically Mixing Language Models |
link |
Pranjal Aggarwal, Aman Madaan,..., Mausam . |
15 |
2024-06-13 |
Understanding Hallucinations in Diffusion Models through Mode Interpolation |
link |
Sumukh K Aithal, Pratyush Maini,..., J Zico Kolter |
15 |
2024-02-02 |
AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback |
link |
Jian Guan, Wei Wu,..., Minlie Huang |
15 |
2024-06-10 |
AutoSurvey: Large Language Models Can Automatically Write Surveys |
link |
Yidong Wang, Qi Guo,..., Yue Zhang |
15 |
2024-09-01 |
ContextCite: Attributing Model Generation to Context |
link |
Benjamin Cohen-Wang, Harshay Shah,..., Aleksander Madry |
15 |
2024-05-27 |
BehaviorGPT: Smart Agent Simulation for Autonomous Driving with Next-Patch Prediction |
link |
Zikang Zhou, Haibo HU,..., Chun Jason Xue |
15 |
2024-05-23 |
PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher |
link |
Dongjun Kim, Chieh-Hsin Lai,..., Stefano Ermon |
15 |
2024-05-24 |
Quantifying the Gain in Weak-to-Strong Generalization |
link |
Moses Charikar, Chirag Pabbaraju, Kirankumar Shiragur |
15 |
2024-05-30 |
Improving the Training of Rectified Flows |
link |
Sangyun Lee, Zinan Lin, Giulia Fanti |
15 |
2024-03-28 |
Dual-Personalizing Adapter for Federated Foundation Models |
link |
yiyuan yang, Guodong Long,..., Michael Blumenstein |
15 |
2024-01-18 |
Cross-Modality Perturbation Synergy Attack for Person Re-identification |
link |
Yunpeng Gong, Zhun Zhong,..., Min Jiang |
15 |
2024-06-20 |
Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs |
link |
Yuxuan Qiao, Haodong Duan,..., Kai Chen |
15 |
2024-09-05 |
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding |
link |
Yunze Man, Shuhong Zheng,..., Yu-Xiong Wang |
15 |
2024-10-31 |
SelfCodeAlign: Self-Alignment for Code Generation |
link |
Yuxiang Wei, Federico Cassano,..., LINGMING ZHANG |
15 |
2024-07-16 |
Animate3D: Animating Any 3D Model with Multi-view Video Diffusion |
link |
Yanqin Jiang, Chaohui Yu,..., Jin Gao |
15 |
2024-01-08 |
Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time Series |
link |
Vijay Ekambaram, Arindam Jati,..., Jayant Kalagnanam |
15 |
2024-06-06 |
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model |
link |
Yang Sui, Yanyu Li,..., Jian Ren |
15 |
2024-05-15 |
Spectral Editing of Activations for Large Language Model Alignment |
link |
Yifu QIU, Zheng Zhao,..., Shay B Cohen |
15 |
2024-05-23 |
Video Diffusion Models are Training-free Motion Interpreter and Controller |
link |
Zeqi Xiao, Yifan Zhou,..., Xingang Pan |
15 |
2024-05-23 |
Instruction Tuning With Loss Over Instructions |
link |
Zhengyan Shi, Adam X. Yang,..., Aldo Lipani |
15 |
2024-06-14 |
UniAudio 1.5: Large Language Model-Driven Audio Codec is A Few-Shot Audio Task Learner |
link |
Dongchao Yang, Haohan Guo,..., Helen M. Meng |
15 |
2024-05-23 |
Representation Noising: A Defence Mechanism Against Harmful Finetuning |
link |
Domenic Rosati, Jan Wehner,..., Frank Rudzicz |
15 |
2024-10-25 |
DiffGS: Functional Gaussian Splatting Diffusion |
link |
Junsheng Zhou, Weiqi Zhang, Yu-Shen Liu |
15 |
2024-05-04 |
U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers |
link |
Yuchuan Tian, Zhijun Tu,..., Yunhe Wang |
15 |
2024-05-23 |
WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models |
link |
Peng Wang, Zexi Li,..., Huajun Chen |
14 |
2024-05-23 |
PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression |
link |
Vladimir Malinovskii, Denis Mazur,..., Peter Richtárik |
14 |
2024-05-25 |
Bigger, Regularized, Optimistic: scaling for compute and sample efficient continuous control |
link |
Michal Nauman, Mateusz Ostaszewski,..., Marek Cygan |
14 |
2024-06-10 |
Get rich quick: exact solutions reveal how unbalanced initializations promote rapid feature learning |
link |
Daniel Kunin, Allan Raventos,..., Surya Ganguli |
14 |
2024-05-29 |
Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare |
link |
Hanwei Zhu, Haoning Wu,..., Shiqi Wang |
14 |
2024-06-05 |
Dynamic 3D Gaussian Fields for Urban Areas |
link |
Tobias Fischer, Jonas Kulhanek,..., Peter Kontschieder |
14 |
2024-06-03 |
What makes unlearning hard and what to do about it |
link |
Kairan Zhao, Meghdad Kurmanji,..., Peter Triantafillou |
14 |
2023-10-10 |
A General Protocol to Probe Large Vision Models for 3D Physical Understanding |
link |
Guanqi Zhan, Chuanxia Zheng,..., Andrew Zisserman |
14 |
2024-05-20 |
Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving |
link |
Aniket Rajiv Didolkar, Anirudh Goyal,..., Sanjeev Arora |
14 |
2024-06-06 |
VideoTetris: Towards Compositional Text-to-Video Generation |
link |
Ye Tian, Ling Yang,..., Bin CUI |
14 |
2024-04-22 |
Protecting Your LLMs with Information Bottleneck |
link |
Zichuan Liu, Zefan Wang,..., Jiang Bian |
14 |
2024-05-29 |
Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors |
link |
Zihui Wu, Yu Sun,..., Katherine Bouman |
14 |
2024-05-23 |
Fisher Flow Matching for Generative Modeling over Discrete Data |
link |
Oscar Davis, Samuel Kessler,..., Joey Bose |
14 |
2024-06-12 |
A Concept-Based Explainability Framework for Large Multimodal Models |
link |
Jayneel Parekh, Pegah KHAYATAN,..., Matthieu Cord |
14 |
2024-06-06 |
ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization |
link |
Luca Eyring, Shyamgopal Karthik,..., Zeynep Akata |
14 |
2024-02-05 |
Estimating Epistemic and Aleatoric Uncertainty with a Single Model |
link |
Matthew Albert Chan, Maria J. Molina, Christopher Metzler |
14 |
2024-04-05 |
Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models |
link |
Sangwon Jang, Jaehyeong Jo,..., Sung Ju Hwang |
14 |
2024-07-08 |
Multi-Object Hallucination in Vision Language Models |
link |
Xuweiyi Chen, Ziqiao Ma,..., Joyce Chai |
14 |
2024-02-07 |
Improved off-policy training of diffusion samplers |
link |
Marcin Sendera, Minsu Kim,..., Nikolay Malkin |
14 |
2024-10-21 |
Mitigating Object Hallucination via Concentric Causal Attention |
link |
Yun Xing, Yiheng Li,..., Shijian Lu |
14 |
2024-10-18 |
Neural Signed Distance Function Inference through Splatting 3D Gaussians Pulled on Zero-Level Set |
link |
Wenyuan Zhang, Yu-Shen Liu, Zhizhong Han |
14 |
2024-02-29 |
UniTS: A Unified Multi-Task Time Series Model |
link |
Shanghua Gao, Teddy Koker,..., Marinka Zitnik |
14 |
2024-11-04 |
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution |
link |
Yang Yue, Yulin Wang,..., Gao Huang |
14 |
2024-06-13 |
LRM-Zero: Training Large Reconstruction Models with Synthesized Data |
link |
Desai Xie, Sai Bi,..., Hao Tan |
13 |
2024-02-29 |
RL-GPT: Integrating Reinforcement Learning and Code-as-policy |
link |
Shaoteng Liu, Haoqi Yuan,..., Jiaya Jia |
13 |
2024-06-24 |
Finding Transformer Circuits With Edge Pruning |
link |
Adithya Bhaskar, Alexander Wettig,..., Danqi Chen |
13 |
2024-05-23 |
Axioms for AI Alignment from Human Feedback |
link |
Luise Ge, Daniel Halpern,..., Junlin Wu |
13 |
2024-06-13 |
4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities |
link |
Roman Bachmann, Oğuzhan Fatih Kar,..., Amir Zamir |
13 |
2023-10-21 |
Masked Hard-Attention Transformers Recognize Exactly the Star-Free Languages |
link |
Andy Yang, David Chiang, Dana Angluin |
13 |
2024-05-29 |
Grasp as You Say: Language-guided Dexterous Grasp Generation |
link |
Yi-Lin Wei, Jian-Jian Jiang,..., Wei-Shi Zheng |
13 |
2024-01-24 |
Beyond Concept Bottleneck Models: How to Make Black Boxes Intervenable? |
link |
Sonia Laguna, Ričards Marcinkevičs,..., Julia E Vogt |
13 |
2024-02-07 |
Amortized Planning with Large-Scale Transformers: A Case Study on Chess |
link |
Anian Ruoss, Gregoire Deletang,..., Tim Genewein |
13 |
2024-02-05 |
Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models |
link |
Yuancheng Xu, Jiarui Yao,..., Furong Huang |
13 |
2024-06-03 |
SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning |
link |
Yangruibo Ding, Jinjun Peng,..., Baishakhi Ray |
13 |
2024-06-12 |
Scaling Laws in Linear Regression: Compute, Parameters, and Data |
link |
Licong Lin, Jingfeng Wu,..., Jason D. Lee |
13 |
2024-06-25 |
DiffusionPDE: Generative PDE-Solving under Partial Observation |
link |
Jiahe Huang, Guandao Yang,..., Jeong Joon Park |
13 |
2024-06-12 |
Discovering Preference Optimization Algorithms with and for Large Language Models |
link |
Chris Lu, Samuel Holt,..., Robert Tjarko Lange |
13 |
2024-10-22 |
One-Step Diffusion Distillation through Score Implicit Matching |
link |
Weijian Luo, Zemin Huang,..., Guo-Jun Qi |
13 |
2024-01-27 |
DiffuserLite: Towards Real-time Diffusion Planning |
link |
Zibin Dong, Jianye HAO,..., YAN ZHENG |
13 |
2024-05-24 |
Medformer: A Multi-Granularity Patching Transformer for Medical Time-Series Classification |
link |
Yihe Wang, Nan Huang,..., Xiang Zhang |
13 |
2024-06-17 |
AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning |
link |
Shirley Wu, Shiyu Zhao,..., James Zou |
13 |
2024-08-28 |
Efficient LLM Scheduling by Learning to Rank |
link |
Yichao Fu, Siqi Zhu,..., Hao Zhang |
13 |
2024-05-23 |
Lorentz-Equivariant Geometric Algebra Transformers for High-Energy Physics |
link |
Jonas Spinner, Victor Breso Pla,..., Johann Brehmer |
13 |
2024-04-04 |
Mind's Eye of LLMs: Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models |
link |
Wenshan Wu, Shaoguang Mao,..., Furu Wei |
13 |
2024-02-28 |
Implicit Optimization Bias of Next-token Prediction in Linear Models |
link |
Christos Thrampoulidis |
13 |
2024-05-27 |
Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels |
link |
Yikai Wang, Xinzhou Wang,..., Jun Zhu |
13 |
2024-05-22 |
Dense Connector for MLLMs |
link |
Huanjin Yao, Wenhao Wu,..., Jingdong Wang |
13 |
2024-11-07 |
MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views |
link |
Yuedong Chen, Chuanxia Zheng,..., Jianfei Cai |
13 |
2024-05-02 |
FLAME : Factuality-Aware Alignment for Large Language Models |
link |
Sheng-Chieh Lin, Luyu Gao,..., Xilun Chen |
13 |
2024-06-12 |
Vivid-ZOO: Multi-View Video Generation with Diffusion Model |
link |
Bing Li, Cheng Zheng,..., Bernard Ghanem |
13 |
2024-03-03 |
GuardT2I: Defending Text-to-Image Models from Adversarial Prompts |
link |
Yijun Yang, Ruiyuan Gao,..., Qiang Xu |
12 |
None |
Not All Tokens Are What You Need for Pretraining |
link |
Zhenghao Lin, Zhibin Gou,..., Weizhu Chen |
12 |
2024-04-22 |
MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making |
link |
Yubin Kim, Chanwoo Park,..., Hae Won Park |
12 |
2024-03-25 |
QKFormer: Hierarchical Spiking Transformer using Q-K Attention |
link |
Chenlin Zhou, Han Zhang,..., Yonghong Tian |
12 |
2024-05-23 |
4+3 Phases of Compute-Optimal Neural Scaling Laws |
link |
Elliot Paquette, Courtney Paquette,..., Jeffrey Pennington |
12 |
2024-05-22 |
Context and Geometry Aware Voxel Transformer for Semantic Scene Completion |
link |
Zhu Yu, Runmin Zhang,..., Hui-liang Shen |
12 |
2024-02-21 |
Linear Transformers are Versatile In-Context Learners |
link |
Max Vladymyrov, Johannes Von Oswald,..., Rong Ge |
12 |
2024-01-22 |
Self-Labeling the Job Shop Scheduling Problem |
link |
Andrea Corsini, Angelo Porrello,..., Mauro Dell'Amico |
12 |
2024-04-08 |
SpeechAlign: Aligning Speech Generation to Human Preferences |
link |
Dong Zhang, Zhaowei Li,..., Xipeng Qiu |
12 |
2024-06-05 |
HYDRA: Model Factorization Framework for Black-Box LLM Personalization |
link |
Yuchen Zhuang, Haotian Sun,..., Bo Dai |
12 |
2024-10-10 |
Generalizable and Animatable Gaussian Head Avatar |
link |
Xuangeng Chu, Tatsuya Harada |
12 |
2024-02-22 |
In-Context Learning of a Linear Transformer Block: Benefits of the MLP Component and One-Step GD Initialization |
link |
Ruiqi Zhang, Jingfeng Wu, Peter Bartlett |
12 |
2024-02-26 |
SelectIT: Selective Instruction Tuning for LLMs via Uncertainty-Aware Self-Reflection |
link |
Liangxin Liu, Xuebo Liu,..., Min Zhang |
12 |
2024-06-27 |
Length Optimization in Conformal Prediction |
link |
Shayan Kiyani, George J. Pappas, Hamed Hassani |
12 |
2024-05-30 |
Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models |
link |
Masatoshi Uehara, Yulai Zhao,..., Tommaso Biancalani |
12 |
2024-05-26 |
Code Repair with LLMs gives an Exploration-Exploitation Tradeoff |
link |
Hao Tang, Keya Hu,..., Kevin Ellis |
12 |
2024-05-28 |
A Theoretical Understanding of Self-Correction through In-context Alignment |
link |
Yifei Wang, Yuyang Wu,..., Yisen Wang |
12 |
2024-05-25 |
Breaking the False Sense of Security in Backdoor Defense through Re-Activation Attack |
link |
Mingli Zhu, Siyuan Liang, Baoyuan Wu |
12 |
2024-05-29 |
Stress-Testing Capability Elicitation With Password-Locked Models |
link |
Ryan Greenblatt, Fabien Roger,..., David Krueger |
12 |
2024-06-03 |
The Importance of Online Data: Understanding Preference Fine-tuning via Coverage |
link |
Yuda Song, Gokul Swamy,..., Wen Sun |
12 |
2024-09-26 |
From News to Forecast: Integrating Event Analysis in LLM-Based Time Series Forecasting with Reflection |
link |
Xinlei Wang, Maike Feng,..., Junhua Zhao |
12 |
2024-07-22 |
QueST: Self-Supervised Skill Abstractions for Learning Continuous Control |
link |
Atharva Mete, Haotian Xue,..., Animesh Garg |
12 |
2024-05-24 |
Score Distillation via Reparametrized DDIM |
link |
Artem Lukoianov, Haitz Sáez de Ocáriz Borde,..., Justin Solomon |
12 |
2024-06-17 |
Large Scale Transfer Learning for Tabular Data via Language Modeling |
link |
Joshua P Gardner, Juan Carlos Perdomo, Ludwig Schmidt |
12 |
2024-06-21 |
Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning |
link |
Brandon Huang, Chancharik Mitra,..., Roei Herzig |
12 |
2024-10-24 |
Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View Synthesis |
link |
Liang Han, Junsheng Zhou,..., Zhizhong Han |
12 |
2024-06-03 |
TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy |
link |
Weichao Zhao, Hao Feng,..., Can Huang |
12 |
2024-06-10 |
MVGamba: Unify 3D Content Generation as State Space Sequence Modeling |
link |
Xuanyu Yi, Zike Wu,..., Hanwang Zhang |
12 |
2024-07-01 |
Evaluation of Text-to-Video Generation Models: A Dynamics Perspective |
link |
Mingxiang Liao, Hannan Lu,..., Xinyu Zhang |
12 |
2024-06-13 |
Yo'LLaVA: Your Personalized Language and Vision Assistant |
link |
Thao Nguyen, Haotian Liu,..., Yong Jae Lee |
12 |
2024-05-24 |
GS-Hider: Hiding Messages into 3D Gaussian Splatting |
link |
Xuanyu Zhang, Jiarui Meng,..., Jian Zhang |
12 |
2024-02-04 |
Diffusion Models are Certifiably Robust Classifiers |
link |
Huanran Chen, Yinpeng Dong,..., Jun Zhu |
11 |
2024-10-08 |
Unlocking the Capabilities of Thought: A Reasoning Boundary Framework to Quantify and Optimize Chain-of-Thought |
link |
Qiguang Chen, Libo Qin,..., Wanxiang Che |
11 |
2024-06-12 |
Self-Consuming Generative Models with Curated Data Provably Optimize Human Preferences |
link |
Damien Ferbach, Quentin Bertrand,..., Gauthier Gidel |
11 |
2024-06-09 |
Training Compute-Optimal Protein Language Models |
link |
Xingyi Cheng, Bo Chen,..., Le Song |
11 |
2024-06-20 |
CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics |
link |
Jiawei Gao, Ziqin Wang,..., Jiangmiao Pang |
11 |
2024-02-09 |
Learn To be Efficient: Build Structured Sparsity in Large Language Models |
link |
Haizhong Zheng, Xiaoyan Bai,..., Atul Prakash |
11 |
2023-10-29 |
Optimal Algorithms for Online Convex Optimization with Adversarial Constraints |
link |
Abhishek Sinha, Rahul Vaze |
11 |
2024-02-06 |
A Phase Transition between Positional and Semantic Learning in a Solvable Model of Dot-Product Attention |
link |
Hugo Cui, Freya Behrens,..., Lenka Zdeborova |
11 |
2024-06-23 |
Trace is the Next AutoDiff: Generative Optimization with Rich Feedback, Execution Traces, and LLMs |
link |
Ching-An Cheng, Allen Nie, Adith Swaminathan |
11 |
2024-05-28 |
Linguistic Collapse: Neural Collapse in (Large) Language Models |
link |
Robert Wu, Vardan Papyan |
11 |
2024-09-26 |
Generative Modeling of Molecular Dynamics Trajectories |
link |
Bowen Jing, Hannes Stark,..., Bonnie Berger |
11 |
2024-10-10 |
Global Lyapunov functions: a long-standing open problem in mathematics, with symbolic transformers |
link |
Alberto Alfarano, Francois Charton, Amaury Hayat |
11 |
2024-06-01 |
RGFN: Synthesizable Molecular Generation Using GFlowNets |
link |
Michał Koziarski, Andrei Rekesh,..., Robert A. Batey |
11 |
2024-06-09 |
Distributional Preference Alignment of LLMs via Optimal Transport |
link |
Igor Melnyk, Youssef Mroueh,..., Jarret Ross |
11 |
2024-02-06 |
Scaling laws for learning with real and surrogate data |
link |
Ayush Jain, Andrea Montanari, Eren Sasoglu |
11 |
2023-12-13 |
SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention |
link |
Róbert Csordás, Piotr Piękos,..., Jürgen Schmidhuber |
11 |
2024-05-07 |
Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics |
link |
Hanlin Zhu, Baihe Huang,..., Stuart Russell |
11 |
2024-11-04 |
Can Language Models Learn to Skip Steps? |
link |
Tengxiao Liu, Qipeng Guo,..., Zheng Zhang |
11 |
2023-11-03 |
Towards Calibrated Robust Fine-Tuning of Vision-Language Models |
link |
Changdae Oh, Hyesu Lim,..., Kyungwoo Song |
11 |
2024-09-11 |
Gated Slot Attention for Efficient Linear-Time Sequence Modeling |
link |
Yu Zhang, Songlin Yang,..., Guohong Fu |
11 |
2024-05-28 |
Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment |
link |
Jiaxiang Li, Siliang Zeng,..., Mingyi Hong |
11 |
2024-06-04 |
Loki: Low-rank Keys for Efficient Sparse Attention |
link |
Prajwal Singhania, Siddharth Singh,..., Abhinav Bhatele |
11 |
2023-05-22 |
Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations |
link |
Hao Chen, Ankit Shah,..., Bhiksha Raj |
11 |
2024-03-02 |
Accelerating Greedy Coordinate Gradient and General Prompt Optimization via Probe Sampling |
link |
Yiran Zhao, Wenyue Zheng,..., Michael Shieh |
11 |
2024-12-19 |
Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment |
link |
Teng Xiao, Yige Yuan,..., Vasant G Honavar |
11 |
2024-05-23 |
Nearly Tight Black-Box Auditing of Differentially Private Machine Learning |
link |
Meenatchi Sundaram Muthu Selva Annamalai, Emiliano De Cristofaro |
11 |
2024-06-03 |
LoFiT: Localized Fine-tuning on LLM Representations |
link |
Fangcong Yin, Xi Ye, Greg Durrett |
11 |
2024-03-12 |
SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models |
link |
Yu Yang, Siddhartha Mishra,..., Baharan Mirzasoleiman |
11 |
2024-02-07 |
QGFN: Controllable Greediness with Action Values |
link |
Elaine Lau, Stephen Zhewen Lu,..., Emmanuel Bengio |
11 |
2024-05-21 |
Single Image Unlearning: Efficient Machine Unlearning in Multimodal Large Language Models |
link |
Jiaqi Li, Qianshan Wei,..., Fan Liu |
11 |
2024-06-24 |
Confidence Regulation Neurons in Language Models |
link |
Alessandro Stolfo, Ben Peng Wu,..., Neel Nanda |
11 |
2024-02-21 |
Average gradient outer product as a mechanism for deep neural collapse |
link |
Daniel Beaglehole, Peter Súkeník,..., Mikhail Belkin |
11 |
2024-05-23 |
Metric Flow Matching for Smooth Interpolations on the Data Manifold |
link |
Kacper Kapusniak, Peter Potaptchik,..., Francesco Di Giovanni |
11 |
2024-07-17 |
Direct Unlearning Optimization for Robust and Safe Text-to-Image Models |
link |
Yong-Hyun Park, Sangdoo Yun,..., Gayoung Lee |
11 |
2024-03-25 |
MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models |
link |
Kailai Yang, Zhiwei Liu,..., Sophia Ananiadou |
11 |
2024-03-18 |
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation |
link |
Wangbo Zhao, Jiasheng Tang,..., Yang You |
11 |
2024-06-11 |
Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance |
link |
Kuan Heng Lin, Sicheng Mo,..., Bolei Zhou |
11 |
2023-05-21 |
DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated Text Detection |
link |
Xiao Yu, Yuang Qi,..., Nenghai Yu |
11 |
2024-05-23 |
Agent Planning with World Knowledge Model |
link |
Shuofei Qiao, Runnan Fang,..., Huajun Chen |
11 |
2024-02-07 |
The Fine-Grained Complexity of Gradient Computation for Training Large Language Models |
link |
Josh Alman, Zhao Song |
11 |
2024-06-11 |
Neural Gaffer: Relighting Any Object via Diffusion |
link |
Haian Jin, Yuan Li,..., Noah Snavely |
11 |
2024-04-23 |
Multi-Head Mixture-of-Experts |
link |
Xun Wu, Shaohan Huang,..., Furu Wei |
11 |
2024-06-09 |
VCR-GauS: View Consistent Depth-Normal Regularizer for Gaussian Surface Reconstruction |
link |
Hanlin Chen, Fangyin Wei,..., Gim Hee Lee |
11 |
2024-07-25 |
LION: Linear Group RNN for 3D Object Detection in Point Clouds |
link |
Zhe Liu, Jinghua Hou,..., Xiang Bai |
11 |
2024-06-12 |
Large Language Models Must Be Taught to Know What They Don’t Know |
link |
Sanyam Kapoor, Nate Gruver,..., Andrew Gordon Wilson |
11 |
2024-02-17 |
TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks |
link |
Benjamin Feuer, Robin Tibor Schirrmeister,..., Colin White |
10 |
2024-02-18 |
In-Context Learning with Transformers: Softmax Attention Adapts to Function Lipschitzness |
link |
Liam Collins, Advait U Parulekar,..., Sanjay Shakkottai |
10 |
2024-06-17 |
Transcendence: Generative Models Can Outperform The Experts That Train Them |
link |
Edwin Zhang, Vincent Zhu,..., eran malach |
10 |
2024-02-16 |
Conformalized Credal Set Predictors |
link |
Alireza Javanmardi, David Stutz, Eyke Hüllermeier |
10 |
2024-10-30 |
FlowLLM: Flow Matching for Material Generation with Large Language Models as Base Distributions |
link |
Anuroop Sriram, Benjamin Kurt Miller,..., Brandon M Wood |
10 |
2024-06-27 |
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents |
link |
Zihao Wang, Shaofei Cai,..., Yitao Liang |
10 |
2024-02-07 |
Universal Neural Functionals |
link |
Allan Zhou, Chelsea Finn, James Harrison |
10 |
2024-06-03 |
MediQ: Question-Asking LLMs and a Benchmark for Reliable Interactive Clinical Reasoning |
link |
Shuyue Stella Li, Vidhisha Balachandran,..., Yulia Tsvetkov |
10 |
2024-06-15 |
A Label is Worth A Thousand Images in Dataset Distillation |
link |
Tian Qin, Zhiwei Deng, David Alvarez-Melis |
10 |
2024-03-06 |
WaterMax: breaking the LLM watermark detectability-robustness-quality trade-off |
link |
Eva Giboulot, Teddy Furon |
10 |
2024-06-10 |
How Far Can Transformers Reason? The Globality Barrier and Inductive Scratchpad |
link |
Emmanuel Abbe, Samy Bengio,..., Omid Saremi |
10 |
2024-05-24 |
Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization |
link |
Xinyu Lyu, Beitao Chen,..., Jingkuan Song |
10 |
2024-06-13 |
Talking Heads: Understanding Inter-Layer Communication in Transformer Language Models |
link |
Jack Merullo, Carsten Eickhoff, Ellie Pavlick |
10 |
2024-05-29 |
Nearest Neighbor Speculative Decoding for LLM Generation and Attribution |
link |
Minghan Li, Xilun Chen,..., Xi Victoria Lin |
10 |
2024-09-30 |
Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function |
link |
Chenyi Zhuang, Ying Hu, Pan Gao |
10 |
2024-05-31 |
Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling |
link |
Jiatao Gu, Ying Shen,..., Joshua M. Susskind |
10 |
2024-02-21 |
Full-Atom Peptide Design with Geometric Latent Diffusion |
link |
Xiangzhe Kong, Yinjun Jia,..., Yang Liu |
10 |
2024-06-13 |
Rethinking Score Distillation as a Bridge Between Image Distributions |
link |
David McAllister, Songwei Ge,..., Angjoo Kanazawa |
10 |
2024-05-24 |
VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks |
link |
Yang Li, Shaobo Han, Shihao Ji |
10 |
2024-06-04 |
SpecExec: Massively Parallel Speculative Decoding For Interactive LLM Inference on Consumer Devices |
link |
Ruslan Svirschevski, Avner May,..., Max Ryabinin |
10 |
2024-10-03 |
Parameter Competition Balancing for Model Merging |
link |
Guodong DU, Junlin Lee,..., Min Zhang |
10 |
2024-02-22 |
Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless Video Pre-Training |
link |
Haoran He, Chenjia Bai,..., Xuelong Li |
10 |
2024-03-19 |
Optimal Flow Matching: Learning Straight Trajectories in Just One Step |
link |
Nikita Maksimovich Kornilov, Petr Mokrov,..., Alexander Korotin |
10 |
2024-05-23 |
ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation |
link |
Jingnan Zheng, Han Wang,..., Tat-Seng Chua |
10 |
2024-06-02 |
Evidence of Learned Look-Ahead in a Chess-Playing Neural Network |
link |
Erik Jenner, Shreyas Kapur,..., Stuart Russell |
10 |
2024-06-03 |
D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models |
link |
Haoran Que, Jiaheng Liu,..., Bo Zheng |
10 |
2024-05-30 |
Jailbreaking Large Language Models Against Moderation Guardrails via Cipher Characters |
link |
Haibo Jin, Andy Zhou,..., Haohan Wang |
10 |
2024-03-18 |
A Sober Look at the Robustness of CLIPs to Spurious Features |
link |
Qizhou Wang, Yong Lin,..., Tong Zhang |
10 |
2024-05-25 |
M$^3$GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation |
link |
Mingshuang Luo, RuiBing Hou,..., Shiguang Shan |
10 |
2024-09-11 |
NVRC: Neural Video Representation Compression |
link |
Ho Man Kwan, Ge Gao,..., David Bull |
10 |
2023-11-01 |
Learning Cooperative Trajectory Representations for Motion Forecasting |
link |
Hongzhi Ruan, Haibao Yu,..., Zaiqing Nie |
10 |
2024-05-31 |
R$^2$-Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction |
link |
Ruyi Zha, Tao Jun Lin,..., Hongdong Li |
10 |
2024-05-24 |
HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting |
link |
Yuanhao Cai, Zihao Xiao,..., Alan Yuille |
10 |
2024-06-04 |
Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models |
link |
Dominik Hintersdorf, Lukas Struppek,..., Franziska Boenisch |
10 |
2024-07-13 |
Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers |
link |
Sukjun Hwang, Aakash Lahoti,..., Albert Gu |
10 |
2024-05-28 |
Phased Consistency Models |
link |
Fu-Yun Wang, Zhaoyang Huang,..., Hongsheng Li |
10 |
2024-06-13 |
SimGen: Simulator-conditioned Driving Scene Generation |
link |
Yunsong Zhou, Michael Simon,..., Bolei Zhou |
10 |
2024-05-24 |
ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users |
link |
Guanlin Li, Kangjie Chen,..., Tianwei Zhang |
10 |
2024-02-02 |
Segment Any Change |
link |
Zhuo Zheng, Yanfei Zhong,..., Stefano Ermon |
10 |
2024-04-25 |
Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents |
link |
Giorgio Piatti, Zhijing Jin,..., Rada Mihalcea |
10 |
2024-05-22 |
RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging Radar |
link |
Fangqiang Ding, Xiangyu Wen,..., Chris Xiaoxuan Lu |
9 |
2024-06-04 |
Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks |
link |
Tianyu He, Darshil Doshi,..., Andrey Gromov |
9 |
2024-05-31 |
LLM-ESR: Large Language Models Enhancement for Long-tailed Sequential Recommendation |
link |
Qidong Liu, Xian Wu,..., Xiangyu Zhao |
9 |
2024-09-14 |
Schrodinger Bridge Flow for Unpaired Data Translation |
link |
Valentin De Bortoli, Iryna Korshunova,..., Arnaud Doucet |
9 |
2024-06-05 |
A Geometric View of Data Complexity: Efficient Local Intrinsic Dimension Estimation with Diffusion Models |
link |
Hamidreza Kamkari, Brendan Leigh Ross,..., Gabriel Loaiza-Ganem |
9 |
2024-09-24 |
TFG: Unified Training-Free Guidance for Diffusion Models |
link |
Haotian Ye, Haowei Lin,..., Stefano Ermon |
9 |
2024-05-24 |
Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time Complexity |
link |
Haoxuan Chen, Yinuo Ren,..., Grant M. Rotskoff |
9 |
2024-05-27 |
MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities |
link |
Hao Dong, Yue Zhao,..., Olga Fink |
9 |
2024-10-31 |
Understanding the Limits of Vision Language Models Through the Lens of the Binding Problem |
link |
Declan Iain Campbell, Sunayana Rane,..., Taylor Whittington Webb |
9 |
2024-07-14 |
What Makes and Breaks Safety Fine-tuning? A Mechanistic Study |
link |
Samyak Jain, Ekdeep Singh Lubana,..., Puneet K. Dokania |
9 |
2024-06-07 |
The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More |
link |
Ouail Kitouni, Niklas Nolte,..., Mark Ibrahim |
9 |
2024-07-15 |
LLM Circuit Analyses Are Consistent Across Training and Scale |
link |
Curt Tigges, Michael Hanna,..., Stella Biderman |
9 |
2024-10-16 |
Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective |
link |
Yongxin Zhu, Bocheng Li,..., Lidong Bing |
9 |
2024-02-05 |
Constrained Synthesis with Projected Diffusion Models |
link |
Jacob K Christopher, Stephen Baek, Ferdinando Fioretto |
9 |
2024-05-24 |
MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence |
link |
Ionut-Vlad Modoranu, Mher Safaryan,..., Dan Alistarh |
9 |
2024-10-24 |
Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing |
link |
Haonan Lin, Yan Chen,..., QianYing Wang |
9 |
2024-07-19 |
Towards a "Universal Translator" for Neural Dynamics at Single-Cell, Single-Spike Resolution |
link |
Yizi Zhang, Yanchen Wang,..., Cole Lincoln Hurwitz |
9 |
2024-02-06 |
On Convergence of Adam for Stochastic Optimization under Relaxed Assumptions |
link |
Yusu Hong, Junhong Lin |
9 |
2024-10-31 |
The Importance of Being Scalable: Improving the Speed and Accuracy of Neural Network Interatomic Potentials Across Chemical Domains |
link |
Eric Qu, Aditi S. Krishnapriyan |
9 |
2024-12-05 |
SceneDiffuser: Efficient and Controllable Driving Simulation Initialization and Rollout |
link |
Chiyu Max Jiang, Yijing Bai,..., Dragomir Anguelov |
9 |
2024-02-25 |
No Free Lunch in LLM Watermarking: Trade-offs in Watermarking Design Choices |
link |
Qi Pang, Shengyuan Hu,..., Virginia Smith |
9 |
2024-05-27 |
ARC: A Generalist Graph Anomaly Detector with In-Context Learning |
link |
Yixin Liu, Shiyuan Li,..., Shirui Pan |
9 |
2024-05-27 |
DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion Models |
link |
Hengkang Wang, Xu Zhang,..., Ju Sun |
9 |
2024-05-29 |
On the Role of Attention Masks and LayerNorm in Transformers |
link |
Xinyi Wu, Amir Ajorlou,..., Ali Jadbabaie |
9 |
2024-05-31 |
Grammar-Aligned Decoding |
link |
Kanghee Park, Jiayu Wang,..., Loris D'Antoni |
9 |
2024-06-29 |
UDC: A Unified Neural Divide-and-Conquer Framework for Large-Scale Combinatorial Optimization Problems |
link |
Zhi Zheng, Changliang Zhou,..., Zhenkun Wang |
9 |
2024-04-23 |
SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-Tuning |
link |
Yexiao He, Ziyao Wang,..., Ang Li |
9 |
2024-06-06 |
Understanding Information Storage and Transfer in Multi-Modal Large Language Models |
link |
Samyadeep Basu, Martin Grayson,..., Daniela Massiceti |
9 |
2024-04-05 |
Dynamic Conditional Optimal Transport through Simulation-Free Flows |
link |
Gavin Kerrigan, Giosue Migliorini, Padhraic Smyth |
9 |
2024-02-24 |
Data-Efficient Operator Learning via Unsupervised Pretraining and In-Context Learning |
link |
Wuyang Chen, Jialin Song,..., Michael W. Mahoney |
9 |
2024-04-25 |
PhyRecon: Physically Plausible Neural Scene Reconstruction |
link |
Junfeng Ni, Yixin Chen,..., Siyuan Huang |
9 |
2024-05-28 |
FASTopic: Pretrained Transformer is a Fast, Adaptive, Stable, and Transferable Topic Model |
link |
Xiaobao Wu, Thong Thanh Nguyen,..., Anh Tuan Luu |
9 |
2024-06-12 |
The Impact of Initialization on LoRA Finetuning Dynamics |
link |
Soufiane Hayou, Nikhil Ghosh, Bin Yu |
9 |
2024-02-14 |
InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling |
link |
Yuchun Miao, Sen Zhang,..., Dacheng Tao |
9 |
2023-10-27 |
Proportional Fairness in Clustering: A Social Choice Perspective |
link |
Leon Kellerhals, Jannik Peters |
9 |
2024-05-27 |
Entity Alignment with Noisy Annotations from Large Language Models |
link |
Shengyuan Chen, Qinggang Zhang,..., Xiao Huang |
9 |
2024-04-01 |
Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained Models |
link |
Yuxin Wen, Leo Marchyok,..., Nicholas Carlini |
9 |
2024-02-06 |
Discovery of the Hidden World with Large Language Models |
link |
Chenxi Liu, Yongqiang Chen,..., Kun Zhang |
9 |
2024-04-22 |
Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels |
link |
Jan-Philipp Fränken, Eric Zelikman,..., Noah Goodman |
9 |
2024-10-24 |
Large Spatial Model: End-to-end Unposed Images to Semantic 3D |
link |
Zhiwen Fan, Jian Zhang,..., Yue Wang |
9 |
2024-06-11 |
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation |
link |
Yuanhao Zhai, Kevin Lin,..., Lijuan Wang |
9 |
2024-05-23 |
Unchosen Experts Can Contribute Too: Unleashing MoE Models’ Power by Self-Contrast |
link |
Chufan Shi, Cheng Yang,..., Yu Meng |
9 |
2024-02-22 |
A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health |
link |
Nikhil Behari, Edwin Zhang,..., Milind Tambe |
9 |
2024-05-21 |
LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language |
link |
James Requeima, John F Bronskill,..., David Duvenaud |
9 |
2024-05-26 |
Categorical Flow Matching on Statistical Manifolds |
link |
Chaoran Cheng, Jiahan Li,..., Ge Liu |
9 |
2024-10-23 |
How to Continually Adapt Text-to-Image Diffusion Models for Flexible Customization? |
link |
Jiahua Dong, Wenqi Liang,..., Fahad Khan |
9 |
2024-10-30 |
Provably Optimal Memory Capacity for Modern Hopfield Models: Transformer-Compatible Dense Associative Memories as Spherical Codes |
link |
Jerry Yao-Chieh Hu, Dennis Wu, Han Liu |
9 |
2024-09-13 |
Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation |
link |
Qingwen Bu, Jia Zeng,..., Hongyang Li |
9 |
2024-03-21 |
SyncTweedies: A General Generative Framework Based on Synchronized Diffusions |
link |
Jaihoon Kim, Juil Koo,..., Minhyuk Sung |
9 |
2024-05-20 |
Images that Sound: Composing Images and Sounds on a Single Canvas |
link |
Ziyang Chen, Daniel Geng, Andrew Owens |
8 |
2024-10-25 |
NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction |
link |
Zixuan Gong, Guangyin Bao,..., Yu Zhang |
8 |
2024-06-05 |
Reparameterization invariance in approximate Bayesian inference |
link |
Hrittik Roy, Marco Miani,..., Søren Hauberg |
8 |
2024-07-20 |
Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning |
link |
Dylan J Foster, Adam Block, Dipendra Misra |
8 |
2024-02-22 |
Watermarking Makes Language Models Radioactive |
link |
Tom Sander, Pierre Fernandez,..., Teddy Furon |
8 |
2024-06-07 |
Probabilistic Weather Forecasting with Hierarchical Graph Neural Networks |
link |
Joel Oskarsson, Tomas Landelius,..., Fredrik Lindsten |
8 |
2024-09-27 |
CycleNet: Enhancing Time Series Forecasting through Modeling Periodic Patterns |
link |
Shengsheng Lin, Weiwei Lin,..., Haocheng Zhong |
8 |
2024-04-05 |
Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2) |
link |
Michael Saxon, Fatima Jahara,..., William Yang Wang |
8 |
2024-03-25 |
Is Your LiDAR Placement Optimized for 3D Scene Understanding? |
link |
Ye Li, Lingdong Kong,..., Xiaonan Huang |
8 |
2024-06-13 |
Interpreting the Weight Space of Customized Diffusion Models |
link |
Amil Dravid, Yossi Gandelsman,..., Kfir Aberman |
8 |
2024-09-16 |
Causal language modeling can elicit search and reasoning capabilities on logic puzzles |
link |
Kulin Shah, Nishanth Dikkala,..., Rina Panigrahy |
8 |
2024-05-27 |
Mixed Dynamics In Linear Networks: Unifying the Lazy and Active Regimes |
link |
Zhenfeng Tu, Santiago Aranguri, Arthur Jacot |
8 |
2024-07-26 |
SLIM: Style-Linguistics Mismatch Model for Generalized Audio Deepfake Detection |
link |
Yi Zhu, Surya Koppisetti,..., Gaurav Bharaj |
8 |
2024-06-22 |
Teach Better or Show Smarter? On Instructions and Exemplars in Automatic Prompt Optimization |
link |
Xingchen Wan, Ruoxi Sun,..., Sercan O Arik |
8 |
2024-06-12 |
Grounding Multimodal Large Language Models in Actions |
link |
Andrew Szot, Bogdan Mazoure,..., Alexander T Toshev |
8 |
2024-06-13 |
Separations in the Representational Capabilities of Transformers and Recurrent Architectures |
link |
Satwik Bhattamishra, Michael Hahn,..., Varun Kanade |
8 |
2023-12-09 |
Consistency Models for Scalable and Fast Simulation-Based Inference |
link |
Marvin Schmitt, Valentin Pratz,..., Stefan T. Radev |
8 |
2024-09-26 |
DarkSAM: Fooling Segment Anything Model to Segment Nothing |
link |
Ziqi Zhou, Yufei Song,..., Hai Jin |
8 |
2024-06-07 |
Variational Flow Matching for Graph Generation |
link |
Floor Eijkelboom, Grigory Bartosh,..., Jan-Willem van de Meent |
8 |
2024-06-20 |
Transferable Boltzmann Generators |
link |
Leon Klein, Frank Noe |
8 |
2024-02-12 |
Policy Improvement using Language Feedback Models |
link |
Victor Zhong, Dipendra Misra,..., Marc-Alexandre Côté |
8 |
2024-02-01 |
Understanding the Expressive Power and Mechanisms of Transformer for Sequence Modeling |
link |
Mingze Wang, Weinan E |
8 |
2024-07-08 |
B'MOJO: Hybrid State Space Realizations of Foundation Models with Eidetic and Fading Memory |
link |
Luca Zancato, Arjun Seshadri,..., Stefano Soatto |
8 |
2024-05-28 |
Exploiting LLM Quantization |
link |
Kazuki Egashira, Mark Vero,..., Martin Vechev |
8 |
2024-06-12 |
Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning |
link |
Jaehyun Nam, Kyuyoung Kim,..., Jinwoo Shin |
8 |
2024-05-29 |
A Full-duplex Speech Dialogue Scheme Based On Large Language Model |
link |
Peng Wang, Songshuo Lu,..., Yuanjun Xiong |
8 |
2024-05-22 |
Spectral Adapter: Fine-Tuning in Spectral Space |
link |
Fangzhao Zhang, Mert Pilanci |
8 |
2024-03-31 |
From Similarity to Superiority: Channel Clustering for Time Series Forecasting |
link |
Jialin Chen, Jan Eric Lenssen,..., Rex Ying |
8 |
2023-11-26 |
A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation |
link |
Heyang Zhao, Jiafan He, Quanquan Gu |
8 |
2024-04-06 |
MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems |
link |
Bin Lei, Yi Zhang,..., Caiwen Ding |
8 |
2024-10-07 |
TableRAG: Million-Token Table Understanding with Language Models |
link |
Si-An Chen, Lesly Miculicich,..., Tomas Pfister |
8 |
2024-05-22 |
A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation |
link |
Gwanghyun Kim, Alonso Martinez,..., Krishna Somandepalli |
8 |
2024-02-06 |
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies |
link |
Xixi Hu, qiang liu,..., Bo Liu |
8 |
2024-04-18 |
Thought of Search: Planning with Language Models Through The Lens of Efficiency |
link |
Michael Katz, Harsha Kokel,..., Shirin Sohrabi |
8 |
2024-05-23 |
Scalable Optimization in the Modular Norm |
link |
Tim Large, Yang Liu,..., Jeremy Bernstein |
8 |
2024-04-17 |
On the Scalability of GNNs for Molecular Graphs |
link |
Maciej Sypetkowski, Frederik Wenkel,..., Dominique Beaini |
8 |
2024-06-12 |
Is Programming by Example Solved by LLMs? |
link |
Wen-Ding Li, Kevin Ellis |
8 |
2024-05-29 |
Matryoshka Query Transformer for Large Vision-Language Models |
link |
Wenbo Hu, Zi-Yi Dou,..., Kai-Wei Chang |
8 |
2024-06-26 |
IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons |
link |
Dan Shi, Renren Jin,..., Deyi Xiong |
8 |
2024-09-14 |
Symbolic Regression with a Learned Concept Library |
link |
Arya Grayeli, Atharva Sehgal,..., Swarat Chaudhuri |
8 |
2024-06-01 |
Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching |
link |
Yongqi Wang, Wenxiang Guo,..., Zhou Zhao |
8 |
2024-02-03 |
GITA: Graph to Visual and Textual Integration for Vision-Language Graph Reasoning |
link |
Yanbin Wei, Shuai Fu,..., Yu Zhang |
8 |
2024-02-11 |
Online Iterative Reinforcement Learning from Human Feedback with General Preference Model |
link |
Chenlu Ye, Wei Xiong,..., Tong Zhang |
8 |
2024-08-29 |
VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation |
link |
Shiwei Wu, Joya Chen,..., Mike Zheng Shou |
8 |
2024-08-22 |
Transformers are Minimax Optimal Nonparametric In-Context Learners |
link |
Juno Kim, Tai Nakamaki, Taiji Suzuki |