374 |
2023-05-24 |
link |
Gorilla: Large Language Model Connected with Massive APIs |
Shishir G Patil, Tianjun Zhang,..., Joseph E. Gonzalez |
332 |
2024-01-18 |
link |
VMamba: Visual State Space Model |
Yue Liu, Yunjie Tian,..., Yunfan Liu |
325 |
2023-11-06 |
link |
CogVLM: Visual Expert for Pretrained Language Models |
Weihan Wang, Qingsong Lv,..., Jie Tang |
238 |
2024-05-23 |
link |
YOLOv10: Real-Time End-to-End Object Detection |
Ao Wang, Hui Chen,..., Guiguang Ding |
163 |
2024-05-23 |
link |
SimPO: Simple Preference Optimization with a Reference-Free Reward |
Yu Meng, Mengzhou Xia, Danqi Chen |
135 |
2023-12-04 |
link |
Tree of Attacks: Jailbreaking Black-Box LLMs Automatically |
Anay Mehrotra, Manolis Zampetakis,..., Amin Karbasi |
102 |
2024-03-29 |
link |
Are We on the Right Way for Evaluating Large Vision-Language Models? |
Lin Chen, Jinsong Li,..., Feng Zhao |
96 |
2024-06-24 |
link |
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs |
Shengbang Tong, Ellis L Brown II,..., Saining Xie |
91 |
2024-01-31 |
link |
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization |
Coleman Richard Charles Hooper, Sehoon Kim,..., Amir Gholami |
88 |
2023-11-28 |
link |
LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS |
Zhiwen Fan, Kevin Wang,..., Zhangyang Wang |
83 |
2024-05-03 |
link |
What matters when building vision-language models? |
Hugo Laurençon, Leo Tronchon,..., Victor Sanh |
82 |
2024-04-03 |
link |
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction |
Keyu Tian, Yi Jiang,..., Liwei Wang |
79 |
2024-04-09 |
link |
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD |
Xiaoyi Dong, Pan Zhang,..., Jiaqi Wang |
77 |
2024-05-06 |
link |
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering |
John Yang, Carlos E Jimenez,..., Ofir Press |
77 |
2023-10-14 |
link |
Large Language Model Unlearning |
Yuanshun Yao, Xiaojun Xu, Yang Liu |
77 |
2024-04-15 |
link |
LLM Evaluators Recognize and Favor Their Own Generations |
Arjun Panickssery, Samuel R. Bowman, Shi Feng |
70 |
None |
link |
Many-shot Jailbreaking |
Cem Anil, Esin DURMUS,..., David Duvenaud |
68 |
2024-06-13 |
link |
Depth Anything V2 |
Lihe Yang, Bingyi Kang,..., Hengshuang Zhao |
63 |
2024-05-07 |
link |
xLSTM: Extended Long Short-Term Memory |
Maximilian Beck, Korbinian Pöppel,..., Sepp Hochreiter |
58 |
2024-05-16 |
link |
CAT3D: Create Anything in 3D with Multi-View Diffusion Models |
Ruiqi Gao, Aleksander Holynski,..., Ben Poole |
56 |
2024-02-15 |
link |
Chain-of-Thought Reasoning Without Prompting |
Xuezhi Wang, Denny Zhou |
56 |
2024-03-30 |
link |
QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs |
Saleh Ashkboos, Amirkeivan Mohtashami,..., James Hensman |
55 |
2024-04-17 |
link |
Many-Shot In-Context Learning |
Rishabh Agarwal, Avi Singh,..., Hugo Larochelle |
55 |
2024-04-22 |
link |
SnapKV: LLM Knows What You are Looking for Before Generation |
Yuhong Li, Yingbing Huang,..., Deming Chen |
54 |
2024-01-30 |
link |
Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks |
Andy Zhou, Bo Li, Haohan Wang |
52 |
2024-02-16 |
link |
PointMamba: A Simple State Space Model for Point Cloud Analysis |
Dingkang Liang, Xin Zhou,..., Xiang Bai |
50 |
2024-05-06 |
link |
MAmmoTH2: Scaling Instructions from the Web |
Xiang Yue, Tianyu Zheng,..., Wenhu Chen |
48 |
2024-04-30 |
link |
Iterative Reasoning Preference Optimization |
Richard Yuanzhe Pang, Weizhe Yuan,..., Jason E Weston |
47 |
2024-06-17 |
link |
Autoregressive Image Generation without Vector Quantization |
Tianhong Li, Yonglong Tian,..., Kaiming He |
46 |
2024-06-17 |
link |
Refusal in Language Models Is Mediated by a Single Direction |
Andy Arditi, Oscar Balcells Obeso,..., Neel Nanda |
40 |
2023-12-06 |
link |
Scaling transformer neural networks for skillful and reliable medium-range weather forecasting |
Tung Nguyen, Rohan Shah,..., Aditya Grover |
38 |
2023-12-12 |
link |
SGLang: Efficient Execution of Structured Language Model Programs |
Lianmin Zheng, Liangsheng Yin,..., Ying Sheng |
38 |
2023-10-26 |
link |
Transformers Learn to Achieve Second-Order Convergence Rates for In-Context Linear Regression |
Deqing Fu, Tian-qi Chen,..., Vatsal Sharan |
37 |
2024-04-16 |
link |
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time |
Sicheng Xu, Guojun Chen,..., Baining Guo |
37 |
2024-07-11 |
link |
FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision |
Jay Shah, Ganesh Bikshandi,..., Tri Dao |
35 |
2024-02-06 |
link |
Self-Discover: Large Language Models Self-Compose Reasoning Structures |
Pei Zhou, Jay Pujara,..., Steven Zheng |
35 |
2024-05-02 |
link |
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation |
Yupeng Zhou, Daquan Zhou,..., Qibin Hou |
35 |
2024-04-21 |
link |
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis |
Yuxi Ren, Xin Xia,..., Xuefeng Xiao |
34 |
2024-04-03 |
link |
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models |
Fanxu Meng, Zhaohui Wang, Muhan Zhang |
33 |
2024-02-07 |
link |
Can Large Language Model Agents Simulate Human Trust Behaviors? |
Chengxing Xie, Canyu Chen,..., Guohao Li |
33 |
2023-12-18 |
link |
Cascade Speculative Drafting for Even Faster LLM Inference |
Ziyi Chen, Xiaocong Yang,..., Jie Huang |
33 |
2023-06-02 |
link |
Invisible Image Watermarks Are Provably Removable Using Generative AI |
Xuandong Zhao, Kexun Zhang,..., Lei Li |
33 |
2024-04-04 |
link |
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance |
Vishaal Udandarao, Ameya Prabhu,..., Matthias Bethge |
33 |
2023-04-26 |
link |
The Closeness of In-Context Learning and Weight Shifting for Softmax Regression |
Shuai Li, Zhao Song,..., Tianyi Zhou |
31 |
2024-02-12 |
link |
G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering |
Xiaoxin He, Yijun Tian,..., Bryan Hooi |
31 |
2024-05-23 |
link |
Improved Distribution Matching Distillation for Fast Image Synthesis |
Tianwei Yin, Michaël Gharbi,..., William T. Freeman |
31 |
2024-02-26 |
link |
Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts |
Mikayel Samvelyan, Sharath Chandra Raparthy,..., Roberta Raileanu |
30 |
2023-12-19 |
link |
Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach |
Weiyu Ma, Qirui Mi,..., Haifeng Zhang |
30 |
2024-04-04 |
link |
ReFT: Representation Finetuning for Language Models |
Zhengxuan Wu, Aryaman Arora,..., Christopher Potts |
30 |
2024-02-17 |
link |
Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents |
Wenkai Yang, Xiaohan Bi,..., Xu Sun |
29 |
2023-05-23 |
link |
Decoupled Kullback-Leibler Divergence Loss |
Jiequan Cui, Zhuotao Tian,..., Hanwang Zhang |
29 |
2024-04-18 |
link |
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing |
Ye Tian, Baolin Peng,..., Dong Yu |
29 |
2024-02-29 |
link |
Humanoid Locomotion as Next Token Prediction |
Ilija Radosavovic, Jathushan Rajasegaran,..., Jitendra Malik |
28 |
2024-03-27 |
link |
Long-form factuality in large language models |
Jerry Wei, Chengrun Yang,..., Quoc V Le |
28 |
2024-05-21 |
link |
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention |
William Brandon, Mayank Mishra,..., Jonathan Ragan-Kelley |
28 |
2024-06-11 |
link |
An Image is Worth 32 Tokens for Reconstruction and Generation |
Qihang Yu, Mark Weber,..., Liang-Chieh Chen |
28 |
2024-07-02 |
link |
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention |
Huiqiang Jiang, YUCHENG LI,..., Lili Qiu |
28 |
2024-03-23 |
link |
Understanding Emergent Abilities of Language Models from the Loss Perspective |
Zhengxiao Du, Aohan Zeng,..., Jie Tang |
27 |
2024-03-14 |
link |
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision |
Zhiqing Sun, Longhui Yu,..., Chuang Gan |
27 |
2024-06-06 |
link |
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search |
Dan Zhang, Sining Zhoubian,..., Jie Tang |
27 |
2024-05-08 |
link |
You Only Cache Once: Decoder-Decoder Architectures for Language Models |
Yutao Sun, Li Dong,..., Furu Wei |
26 |
2024-06-05 |
link |
Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms |
Rafael Rafailov, Yaswanth Chittepu,..., Scott Niekum |
25 |
2024-05-06 |
link |
AlphaMath Almost Zero: process Supervision without process |
Guoxin Chen, Minpeng Liao,..., Kai Fan |
25 |
2024-06-06 |
link |
Improving Alignment and Robustness with Circuit Breakers |
Andy Zou, Long Phan,..., Dan Hendrycks |
25 |
2024-04-25 |
link |
Make Your LLM Fully Utilize the Context |
Shengnan An, Zexiong Ma,..., Weizhu Chen |
25 |
2023-11-22 |
link |
SegVol: Universal and Interactive Volumetric Medical Image Segmentation |
Yuxin Du, Fan BAI,..., Bo Zhao |
25 |
2024-02-16 |
link |
The Evolution of Statistical Induction Heads: In-Context Learning Markov Chains |
Ezra Edelman, Nikolaos Tsilivis,..., Surbhi Goel |
25 |
2024-02-29 |
link |
How do Large Language Models Handle Multilingualism? |
Yiran Zhao, Wenxuan Zhang,..., Lidong Bing |
23 |
2023-11-29 |
link |
Elo Uncovered: Robustness and Best Practices in Language Model Evaluation |
Meriem Boubdir, Edward Kim,..., Marzieh Fadaee |
23 |
2024-05-27 |
link |
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability |
Shenyuan Gao, Jiazhi Yang,..., Hongyang Li |
22 |
2024-01-18 |
link |
ChatQA: Surpassing GPT-4 on Conversational QA and RAG |
Zihan Liu, Wei Ping,..., Bryan Catanzaro |
22 |
2024-06-11 |
link |
Simple and Effective Masked Diffusion Language Models |
Subham Sekhar Sahoo, Marianne Arriola,..., Volodymyr Kuleshov |
22 |
2024-03-26 |
link |
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning |
Rui Pan, Xiang Liu,..., Tong Zhang |
22 |
2023-05-27 |
link |
MADiff: Offline Multi-agent Learning with Diffusion Models |
Zhengbang Zhu, Minghuan Liu,..., Weinan Zhang |
22 |
2024-02-14 |
link |
Soft Prompt Threats: Attacking Safety Alignment and Unlearning in Open-Source LLMs through the Embedding Space |
Leo Schwinn, David Dobre,..., Stephan Günnemann |
21 |
2024-02-12 |
link |
Model Collapse Demystified: The Case of Regression |
Elvis Dohmatob, Yunzhen Feng, Julia Kempe |
21 |
2024-02-24 |
link |
Spec-Gaussian: Anisotropic View-Dependent Appearance for 3D Gaussian Splatting |
Ziyi Yang, Xinyu Gao,..., Xiaogang Jin |
21 |
2024-02-28 |
link |
Approaching Human-Level Forecasting with Language Models |
Danny Halawi, Fred Zhang,..., Jacob Steinhardt |
21 |
2024-04-19 |
link |
MoVA: Adapting Mixture of Vision Experts to Multimodal Context |
Zhuofan Zong, Bingqi Ma,..., Yu Liu |
21 |
2024-06-20 |
link |
RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold |
Amrith Setlur, Saurabh Garg,..., Aviral Kumar |
21 |
2022-08-22 |
link |
Efficiency of the First-Price Auction in the Autobidding World |
Yuan Deng, Jieming Mao,..., Song Zuo |
21 |
2023-12-12 |
link |
Alignment for Honesty |
Yuqing Yang, Ethan Chern,..., Pengfei Liu |
21 |
2024-02-28 |
link |
Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates |
Kaifeng Lyu, Haoyu Zhao,..., Sanjeev Arora |
20 |
2024-02-02 |
link |
ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution |
Haoran Ye, Jiarui Wang,..., Guojie Song |
20 |
2024-06-06 |
link |
Simplified and Generalized Masked Diffusion for Discrete Data |
Jiaxin Shi, Kehang Han,..., Michalis Titsias |
20 |
2024-02-19 |
link |
Query-Based Adversarial Prompt Generation |
Jonathan Hayase, Ema Borevković,..., Milad Nasr |
20 |
2024-07-02 |
link |
RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs |
Yue Yu, Wei Ping,..., Bryan Catanzaro |
20 |
2024-05-26 |
link |
Demystify Mamba in Vision: A Linear Attention Perspective |
Dongchen Han, Ziyi Wang,..., Gao Huang |
20 |
2024-06-03 |
link |
MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures |
Jinjie Ni, Fuzhao Xue,..., Yang You |
20 |
2024-04-11 |
link |
Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models |
Tuomas Kynkäänniemi, Miika Aittala,..., Jaakko Lehtinen |
20 |
2023-05-15 |
link |
PLIP: Language-Image Pre-training for Person Representation Learning |
Jialong Zuo, Jiahao Hong,..., Jingdong Wang |
20 |
2024-07-18 |
link |
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies |
Chaofan Tao, Qian Liu,..., Ngai Wong |
19 |
2024-05-13 |
link |
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator |
Hanshu Yan, Xingchao Liu,..., Jiashi Feng |
19 |
2024-05-30 |
link |
Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image |
Kailu Wu, Fangfu Liu,..., Kaisheng Ma |
19 |
2024-06-14 |
link |
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs |
Rui Yang, Ruomeng Ding,..., Tong Zhang |
19 |
2024-05-27 |
link |
Transformers Can Do Arithmetic with the Right Embeddings |
Sean Michael McLeish, Arpit Bansal,..., Tom Goldstein |
18 |
2023-12-13 |
link |
Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers |
Haifeng Huang, Yilun Chen,..., Zhou Zhao |
18 |
2024-04-15 |
link |
3D Gaussian Splatting as Markov Chain Monte Carlo |
Shakiba Kheradmand, Daniel Rebain,..., Kwang Moo Yi |
18 |
2024-02-26 |
link |
Why Transformers Need Adam: A Hessian Perspective |
Yushun Zhang, Congliang Chen,..., Zhi-Quan Luo |
18 |
2024-06-26 |
link |
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models |
Liwei Jiang, Kavel Rao,..., Nouha Dziri |
18 |
2024-03-26 |
link |
MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution |
Wei Tao, Yucheng Zhou,..., Yu Cheng |
18 |
2024-04-12 |
link |
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length |
Xuezhe Ma, Xiaomeng Yang,..., Chunting Zhou |
18 |
2024-07-01 |
link |
Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion |
Boyuan Chen, Diego Martí Monsó,..., Vincent Sitzmann |
18 |
2024-05-28 |
link |
Aligning to Thousands of Preferences via System Message Generalization |
Seongyun Lee, Sue Hyun Park,..., Minjoon Seo |
18 |
2024-05-26 |
link |
Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer |
Zhihan Liu, Miao Lu,..., Zhaoran Wang |
17 |
2024-05-24 |
link |
Efficient Adversarial Training in LLMs with Continuous Attacks |
Sophie Xhonneux, Alessandro Sordoni,..., Leo Schwinn |
17 |
2024-02-08 |
link |
Noise Contrastive Alignment of Language Models with Explicit Rewards |
Huayu Chen, Guande He,..., Jun Zhu |
17 |
2024-02-17 |
link |
OneBit: Towards Extremely Low-bit Large Language Models |
Yuzhuang Xu, Xu Han,..., Wanxiang Che |
17 |
2024-06-17 |
link |
Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging |
Zhenyi Lu, Chenghao Fan,..., Yu Cheng |
17 |
2024-04-23 |
link |
Rethinking LLM Memorization through the Lens of Adversarial Compression |
Avi Schwarzschild, Zhili Feng,..., J Zico Kolter |
17 |
2023-12-06 |
link |
OctreeOcc: Efficient and Multi-Granularity Occupancy Prediction Using Octree Queries |
Yuhang Lu, Xinge ZHU,..., Yuexin Ma |
17 |
2024-06-12 |
link |
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks |
Jiannan Wu, Muyan Zhong,..., Jifeng Dai |
17 |
2024-04-25 |
link |
REBEL: Reinforcement Learning via Regressing Relative Rewards |
Zhaolin Gao, Jonathan Daniel Chang,..., Wen Sun |
17 |
2024-05-24 |
link |
Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models |
Yimeng Zhang, Xin Chen,..., Sijia Liu |
17 |
2024-06-03 |
link |
SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model |
An-Chieh Cheng, Hongxu Yin,..., Sifei Liu |
17 |
2024-05-24 |
link |
The Road Less Scheduled |
Aaron Defazio, Xingyu Alice Yang,..., Ashok Cutkosky |
17 |
2024-03-05 |
link |
Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding |
Zhenyu Zhang, Runjin Chen,..., Zhangyang Wang |
17 |
2024-05-09 |
link |
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts |
Jiachen Li, Xinyao Wang,..., Longyin Wen |
16 |
2023-10-06 |
link |
Why Do We Need Weight Decay in Modern Deep Learning? |
Francesco D'Angelo, Maksym Andriushchenko,..., Nicolas Flammarion |
16 |
2024-06-04 |
link |
Guiding a Diffusion Model with a Bad Version of Itself |
Tero Karras, Miika Aittala,..., Samuli Laine |
16 |
2024-05-23 |
link |
JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models |
Kun Zhou, Beichen Zhang,..., Ji-Rong Wen |
16 |
2024-06-22 |
link |
Are Language Models Actually Useful for Time Series Forecasting? |
Mingtian Tan, Mike A Merrill,..., Thomas Hartvigsen |
16 |
2024-07-22 |
link |
Discrete Flow Matching |
Itai Gat, Tal Remez,..., Yaron Lipman |
16 |
2024-02-09 |
link |
Fight Back Against Jailbreaking via Prompt Adversarial Tuning |
Yichuan Mo, Yuji Wang,..., Yisen Wang |
16 |
2024-04-09 |
link |
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection |
Haoyang He, Yuhu Bai,..., Lei Xie |
16 |
2024-02-18 |
link |
Federated Fine-tuning of Large Language Models under Heterogeneous Language Tasks and Client Resources |
Jiamu Bai, Daoyuan Chen,..., Yaliang Li |
16 |
2024-02-19 |
link |
A Critical Evaluation of AI Feedback for Aligning Large Language Models |
Archit Sharma, Sedrick Keh,..., Thomas Kollar |
16 |
2023-10-12 |
link |
MatFormer: Nested Transformer for Elastic Inference |
Fnu Devvrit, Sneha Kudugunta,..., Prateek Jain |
16 |
2024-06-03 |
link |
Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses |
Xiaosen Zheng, Tianyu Pang,..., Min Lin |
16 |
2023-06-13 |
link |
Questioning the Survey Responses of Large Language Models |
Ricardo Dominguez-Olmedo, Moritz Hardt, Celestine Mendler-Dünner |
16 |
None |
link |
AutoGuide: Automated Generation and Selection of State-Aware Guidelines for Large Language Model Agents |
Yao Fu, Dong-Ki Kim,..., Honglak Lee |
16 |
2023-08-04 |
link |
Adaptive Proximal Gradient Method for Convex Optimization |
Yura Malitsky, Konstantin Mishchenko |
15 |
2024-06-17 |
link |
How Do Large Language Models Acquire Factual Knowledge During Pretraining? |
Hoyeon Chang, Jinho Park,..., Minjoon Seo |
15 |
2024-05-16 |
link |
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning |
Yuexiang Zhai, Hao Bai,..., Sergey Levine |
15 |
2024-05-28 |
link |
Understanding Transformer Reasoning Capabilities via Graph Algorithms |
Clayton Sanford, Bahare Fatemi,..., Vahab Mirrokni |
15 |
2024-05-17 |
link |
Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning |
Dan Braun, Jordan Taylor,..., Lee Sharkey |
15 |
2024-02-16 |
link |
Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE) |
Usha Bhalla, Alex Oesterling,..., Himabindu Lakkaraju |
15 |
2024-03-01 |
link |
Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes |
Xiaomeng Hu, Pin-Yu Chen, Tsung-Yi Ho |
15 |
2024-02-29 |
link |
TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous Variables |
Yuxuan Wang, Haixu Wu,..., Mingsheng Long |
15 |
2024-07-19 |
link |
Compact Language Models via Pruning and Knowledge Distillation |
Saurav Muralidharan, Sharath Turuvekere Sreenivas,..., Pavlo Molchanov |
15 |
2024-05-19 |
link |
Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention |
Peng Li, Yuan Liu,..., Yike Guo |
15 |
2024-05-29 |
link |
Poseidon: Efficient Foundation Models for PDEs |
Maximilian Herde, Bogdan Raonic,..., Siddhartha Mishra |
15 |
2024-02-03 |
link |
Panacea: Pareto Alignment via Preference Adaptation for LLMs |
Yifan Zhong, Chengdong Ma,..., Yaodong Yang |
14 |
2024-05-24 |
link |
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models |
Byung-Kwan Lee, Chae Won Kim,..., Yong Man Ro |
14 |
2024-06-03 |
link |
Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration |
Junyang Wang, Haiyang Xu,..., Jitao Sang |
14 |
2024-05-28 |
link |
Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations |
Alexander Hägele, Elie Bakouch,..., Martin Jaggi |
14 |
2024-06-02 |
link |
BoNBoN Alignment for Large Language Models and the Sweetness of Best-of-n Sampling |
Lin Gui, Cristina Garbacea, Victor Veitch |
14 |
2024-05-26 |
link |
Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models |
HANWEN LIANG, Yuyang Yin,..., Yunchao Wei |
14 |
2024-05-23 |
link |
Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model |
Yuheng Shi, Minjing Dong, Chang Xu |
14 |
2024-05-03 |
link |
DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos |
Wen-Hsuan Chu, Lei Ke, Katerina Fragkiadaki |
14 |
2024-05-23 |
link |
MiniCache: KV Cache Compression in Depth Dimension for Large Language Models |
Akide Liu, Jing Liu,..., Bohan Zhuang |
14 |
2024-01-11 |
link |
A Closer Look at AUROC and AUPRC under Class Imbalance |
Matthew B.A. McDermott, Haoran Zhang,..., Jack Gallifant |
14 |
2024-05-22 |
link |
xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token |
Xin Cheng, Xun Wang,..., Dongyan Zhao |
14 |
2024-05-08 |
link |
Chain of Thoughtlessness? An Analysis of CoT in Planning |
Kaya Stechly, Karthik Valmeekam, Subbarao Kambhampati |
14 |
2024-02-15 |
link |
BitDelta: Your Fine-Tune May Only Be Worth One Bit |
James Liu, Guangxuan Xiao,..., Tianle Cai |
13 |
2024-05-23 |
link |
Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion Models |
Gen Li, Yuling Yan |
13 |
2024-03-25 |
link |
Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization |
Xiangxin Zhou, Dongyu Xue,..., Quanquan Gu |
13 |
2024-03-22 |
link |
Can large language models explore in-context? |
Akshay Krishnamurthy, Keegan Harris,..., Aleksandrs Slivkins |
13 |
2024-04-24 |
link |
PuLID: Pure and Lightning ID Customization via Contrastive Alignment |
Zinan Guo, Yanze Wu,..., Qian HE |
13 |
2024-06-10 |
link |
Parallelizing Linear Transformers with the Delta Rule over Sequence Length |
Songlin Yang, Bailin Wang,..., Yoon Kim |
13 |
2024-05-29 |
link |
Preference Learning Algorithms Do Not Learn Preference Rankings |
Angelica Chen, Sadhika Malladi,..., Kyunghyun Cho |
13 |
2024-02-22 |
link |
Large Language Models as Urban Residents: An LLM Agent Framework for Personal Mobility Generation |
Jiawei Wang, Renhe Jiang,..., Chuan Xiao |
13 |
2024-05-07 |
link |
KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization |
Tianyi Zhang, Jonah Wonkyu Yi,..., Anshumali Shrivastava |
13 |
2024-03-19 |
link |
Pretraining Codomain Attention Neural Operators for Solving Multiphysics PDEs |
Md Ashiqur Rahman, Robert Joseph George,..., Anima Anandkumar |
13 |
2024-06-14 |
link |
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning |
Hao Bai, Yifei Zhou,..., Aviral Kumar |
13 |
2024-06-27 |
link |
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding |
Tao Zhang, Xiangtai Li,..., Shuicheng YAN |
13 |
2024-06-13 |
link |
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback |
Hamish Ivison, Yizhong Wang,..., Hannaneh Hajishirzi |
13 |
2024-05-27 |
link |
Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models |
Chia-Yi Hsu, Yu-Lin Tsai,..., Chun-Ying Huang |
13 |
2024-06-06 |
link |
Transformers need glasses! Information over-squashing in language tasks |
Federico Barbero, Andrea Banino,..., Petar Veličković |
13 |
2024-05-31 |
link |
Amortizing intractable inference in diffusion models for vision, language, and control |
Siddarth Venkatraman, Moksh Jain,..., Nikolay Malkin |
12 |
2024-05-23 |
link |
HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models |
Bernal Jimenez Gutierrez, Yiheng Shu,..., Yu Su |
12 |
2024-02-04 |
link |
Aligner: Efficient Alignment by Learning to Correct |
Jiaming Ji, Boyuan Chen,..., Yaodong Yang |
12 |
2024-04-23 |
link |
Aligning LLM Agents by Learning Latent Preference from User Edits |
Ge Gao, Alexey Taymanov,..., Dipendra Misra |
12 |
2024-05-30 |
link |
Enhancing Large Vision Language Models with Self-Training on Image Comprehension |
Yihe Deng, Pan Lu,..., Wei Wang |
12 |
2024-06-11 |
link |
BAKU: An Efficient Transformer for Multi-Task Policy Learning |
Siddhant Haldar, Zhuoran Peng, Lerrel Pinto |
12 |
2024-06-03 |
link |
Neural network learns low-dimensional polynomials with SGD near the information-theoretic limit |
Jason D. Lee, Kazusato Oko,..., Denny Wu |
12 |
2024-05-29 |
link |
Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models |
Zhanhui Zhou, Zhixuan Liu,..., Yu Qiao |
12 |
2024-02-28 |
link |
Implicit Optimization Bias of Next-Token Prediction in Linear Models |
Christos Thrampoulidis |
12 |
2024-06-10 |
link |
LLM Dataset Inference: Did you train on my dataset? |
Pratyush Maini, Hengrui Jia,..., Adam Dziedzic |
12 |
2024-05-23 |
link |
Representation Noising: A Defence Mechanism Against Harmful Finetuning |
Domenic Rosati, Jan Wehner,..., Frank Rudzicz |
12 |
2024-02-29 |
link |
Theoretical Foundations of Deep Selective State-Space Models |
Nicola Muca Cirone, Antonio Orvieto,..., Terry Lyons |
12 |
2024-05-27 |
link |
EM Distillation for One-step Diffusion Models |
Sirui Xie, Zhisheng Xiao,..., Ruiqi Gao |
12 |
2024-11-02 |
link |
Rule Based Rewards for Language Model Safety |
Tong Mu, Alec Helyar,..., Lilian Weng |
12 |
2024-01-18 |
link |
Cross-Modality Perturbation Synergy Attack for Person Re-identification |
Yunpeng Gong, Zhun Zhong,..., Min Jiang |
11 |
2024-06-27 |
link |
Resolving Discrepancies in Compute-Optimal Scaling of Language Models |
Tomer Porian, Mitchell Wortsman,..., Yair Carmon |
11 |
2024-02-05 |
link |
FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion |
Xing Han, Huy Nguyen,..., Suchi Saria |
11 |
2024-06-12 |
link |
Large Language Model Unlearning via Embedding-Corrupted Prompts |
Chris Yuhao Liu, Yaxuan Wang,..., Yang Liu |
11 |
2024-05-25 |
link |
Theoretical Analysis of Weak-to-Strong Generalization |
Hunter Lang, David Sontag, Aravindan Vijayaraghavan |
11 |
2024-01-29 |
link |
Contracting with a Learning Agent |
Guru Guruganesh, Yoav Kolumbus,..., S. Matthew Weinberg |
11 |
2024-03-14 |
link |
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models |
Zunnan Xu, Yukang Lin,..., Xiu Li |
11 |
2023-10-19 |
link |
AutoMix: Automatically Mixing Language Models |
Pranjal Aggarwal, Aman Madaan,..., Mausam . |
11 |
2024-05-23 |
link |
ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification |
Yefei He, Luoming Zhang,..., Bohan Zhuang |
11 |
2024-06-10 |
link |
MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models |
Zichun Yu, Spandan Das, Chenyan Xiong |
11 |
2024-07-11 |
link |
WildGaussians: 3D Gaussian Splatting in the Wild |
Jonas Kulhanek, Songyou Peng,..., Torsten Sattler |
11 |
2024-07-31 |
link |
Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models |
Adam Karvonen, Benjamin Wright,..., Samuel Marks |
11 |
2023-05-21 |
link |
DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated Text Detection |
Xiao Yu, Yuang Qi,..., Nenghai Yu |
11 |
2024-03-07 |
link |
Online Adaptation of Language Models with a Memory of Amortized Contexts |
Jihoon Tack, Jaehyung Kim,..., Jonathan Richard Schwarz |
11 |
2024-06-06 |
link |
Multistep Distillation of Diffusion Models via Moment Matching |
Tim Salimans, Thomas Mensink,..., Emiel Hoogeboom |
11 |
2024-05-27 |
link |
Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control |
Zhengfei Kuang, Shengqu Cai,..., Gordon Wetzstein |
11 |
2024-06-18 |
link |
DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving |
Yuxuan Tong, Xiwen Zhang,..., Junxian He |
11 |
2024-02-07 |
link |
InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory |
Chaojun Xiao, Pengle Zhang,..., Maosong Sun |
11 |
2024-05-25 |
link |
Streaming Long Video Understanding with Large Language Models |
Rui Qian, Xiaoyi Dong,..., Jiaqi Wang |
11 |
2024-05-23 |
link |
Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer |
Shuang Wu, Youtian Lin,..., Yao Yao |
10 |
2024-05-23 |
link |
WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models |
Peng Wang, Zexi Li,..., Huajun Chen |
10 |
2024-03-06 |
link |
WaterMax: breaking the LLM watermark detectability-robustness-quality trade-off |
Eva Giboulot, Teddy Furon |
10 |
2024-05-23 |
link |
Instruction Tuning With Loss Over Instructions |
Zhengyan Shi, Adam X. Yang,..., Aldo Lipani |
10 |
2024-04-06 |
link |
Aligning Diffusion Models by Optimizing Human Utility |
Shufan Li, Konstantinos Kallidromitis,..., Kazuki Kozuka |
10 |
2024-05-23 |
link |
EMR-Merging: Tuning-Free High-Performance Model Merging |
Chenyu Huang, Peng Ye,..., Wanli Ouyang |
10 |
2024-01-11 |
link |
Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents |
Quentin Delfosse, Sebastian Sztwiertnia,..., Kristian Kersting |
10 |
2024-02-21 |
link |
Average gradient outer product as a mechanism for deep neural collapse |
Daniel Beaglehole, Peter Súkeník,..., Mikhail Belkin |
10 |
2024-06-11 |
link |
4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models |
Heng Yu, Chaoyang Wang,..., Hsin-Ying Lee |
10 |
2024-05-27 |
link |
Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language Models |
ShengYun Peng, Pin-Yu Chen,..., Duen Horng Chau |
10 |
2024-06-27 |
link |
Decoding-Time Language Model Alignment with Multiple Objectives |
Ruizhe Shi, Yifang Chen,..., Simon Shaolei Du |
10 |
2024-07-02 |
link |
Meta 3D AssetGen: Text-to-Mesh Generation with High-Quality Geometry, Texture, and PBR Materials |
Yawar Siddiqui, Tom Monnier,..., David Novotny |
10 |
2024-06-21 |
link |
Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models |
Jiayu Wang, Yifei Ming,..., Neel Joshi |
10 |
2024-04-16 |
link |
Self-playing Adversarial Language Game Enhances LLM Reasoning |
Pengyu Cheng, Tianhao Hu,..., Xiaolong Li |
10 |
2024-02-07 |
link |
Amortized Planning with Large-Scale Transformers: A Case Study on Chess |
Anian Ruoss, Gregoire Deletang,..., Tim Genewein |
10 |
2024-05-28 |
link |
Why are Visually-Grounded Language Models Bad at Image Classification? |
Yuhui Zhang, Alyssa Unell,..., Serena Yeung-Levy |
10 |
2024-07-17 |
link |
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases |
Zhaorun Chen, Zhen Xiang,..., Bo Li |
10 |
2023-10-10 |
link |
A General Protocol to Probe Large Vision Models for 3D Physical Understanding |
Guanqi Zhan, Chuanxia Zheng,..., Andrew Zisserman |
10 |
2024-04-08 |
link |
SpeechAlign: Aligning Speech Generation to Human Preferences |
Dong Zhang, Zhaowei Li,..., Xipeng Qiu |
10 |
2024-02-22 |
link |
In-Context Learning of a Linear Transformer Block: Benefits of the MLP Component and One-Step GD Initialization |
Ruiqi Zhang, Jingfeng Wu, Peter Bartlett |
10 |
2024-02-09 |
link |
CultureLLM: Incorporating Cultural Differences into Large Language Models |
CHENG LI, Mengzhuo Chen,..., Xing Xie |
10 |
2024-03-09 |
link |
Algorithmic progress in language models |
Anson Ho, Tamay Besiroglu,..., Jaime Sevilla |
10 |
2024-02-15 |
link |
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation |
Huizhuo Yuan, Zixiang Chen,..., Quanquan Gu |
10 |
2024-02-04 |
link |
Diffusion Models are Certifiably Robust Classifiers |
Huanran Chen, Yinpeng Dong,..., Jun Zhu |
10 |
2024-02-19 |
link |
WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the Environment |
Hao Tang, Darren Yan Key, Kevin Ellis |
10 |
2024-05-23 |
link |
Calibrated Self-Rewarding Vision Language Models |
Yiyang Zhou, Zhiyuan Fan,..., Huaxiu Yao |
10 |
2024-05-17 |
link |
ProSST: Protein Language Modeling with Quantized Structure and Disentangled Attention |
Mingchen Li, Yang Tan,..., Liang Hong |
10 |
2024-03-12 |
link |
Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models |
Yang Jiao, Shaoxiang Chen,..., Yu-Gang Jiang |
10 |
2024-05-27 |
link |
Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels |
Yikai Wang, Xinzhou Wang,..., Jun Zhu |
10 |
2024-02-29 |
link |
Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models |
Frederik Kunstner, Robin Yadav,..., Alberto Bietti |
10 |
2024-06-14 |
link |
L4GM: Large 4D Gaussian Reconstruction Model |
Jiawei Ren, Kevin Xie,..., Huan Ling |
10 |
2024-04-04 |
link |
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching |
Dongzhi Jiang, Guanglu Song,..., Hongsheng Li |
9 |
2023-12-20 |
link |
UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections |
Fangjinhua Wang, Marie-Julie Rakotosaona,..., Federico Tombari |
9 |
2024-05-27 |
link |
PromptFix: You Prompt and We Fix the Photo |
Yongsheng Yu, Ziyun Zeng,..., Jiebo Luo |
9 |
2024-03-28 |
link |
InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction |
Sirui Xu, Ziyin Wang,..., Liangyan Gui |
9 |
2024-06-03 |
link |
D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models |
Haoran Que, Jiaheng Liu,..., Bo Zheng |
9 |
2024-05-31 |
link |
4Diffusion: Multi-view Video Diffusion Model for 4D Generation |
Haiyu Zhang, Xinyuan Chen,..., Yu Qiao |
9 |
2024-07-06 |
link |
LoRA-GA: Low-Rank Adaptation with Gradient Approximation |
Shaowen Wang, Linxi Yu, Jian Li |
9 |
2024-06-17 |
link |
Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models |
Bingqi Ma, Zhuofan Zong,..., Yu Liu |
9 |
2024-04-22 |
link |
Protecting Your LLMs with Information Bottleneck |
Zichuan Liu, Zefan Wang,..., Jiang Bian |
9 |
2023-12-06 |
link |
Return of Unconditional Generation: A Self-supervised Representation Generation Method |
Tianhong Li, Dina Katabi, Kaiming He |
9 |
2024-05-31 |
link |
MeshXL: Neural Coordinate Field for Generative 3D Foundation Models |
Sijin Chen, Xin Chen,..., Tao Chen |
9 |
2023-10-20 |
link |
Towards Understanding How Transformers Learn In-context Through a Representation Learning Lens |
Ruifeng Ren, Yong Liu |
9 |
2024-05-28 |
link |
Linguistic Collapse: Neural Collapse in (Large) Language Models |
Robert Wu, Vardan Papyan |
9 |
2024-05-30 |
link |
Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation |
Guillaume Huguet, James Vuckovic,..., Joey Bose |
9 |
2024-03-12 |
link |
Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion |
Dongyang Li, Chen Wei,..., Quanying Liu |
9 |
2024-06-25 |
link |
MotionBooth: Motion-Aware Customized Text-to-Video Generation |
Jianzong Wu, Xiangtai Li,..., Kai Chen |
9 |
2024-05-24 |
link |
Quantifying the Gain in Weak-to-Strong Generalization |
Moses Charikar, Chirag Pabbaraju, Kirankumar Shiragur |
9 |
2024-06-13 |
link |
Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models |
Yushi Hu, Weijia Shi,..., Ranjay Krishna |
9 |
2024-07-25 |
link |
Recursive Introspection: Teaching Language Model Agents How to Self-Improve |
Yuxiao Qu, Tianjun Zhang,..., Aviral Kumar |
9 |
2024-06-06 |
link |
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models |
Ling Yang, Zhaochen Yu,..., Bin CUI |
9 |
2024-02-06 |
link |
Scaling laws for learning with real and surrogate data |
Ayush Jain, Andrea Montanari, Eren Sasoglu |
9 |
2024-05-23 |
link |
PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher |
Dongjun Kim, Chieh-Hsin Lai,..., Stefano Ermon |
8 |
2024-09-30 |
link |
Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers |
Lirui Wang, Xinlei Chen,..., Kaiming He |
8 |
2024-04-19 |
link |
Ensemble Learning for Heterogeneous Large Language Models with Deep Parallel Collaboration |
Yichong Huang, Xiaocheng Feng,..., Bing Qin |
8 |
2023-10-27 |
link |
Proportional Fairness in Clustering: A Social Choice Perspective |
Leon Kellerhals, Jannik Peters |
8 |
2024-06-12 |
link |
One-Step Effective Diffusion Network for Real-World Image Super-Resolution |
Rongyuan Wu, Lingchen Sun,..., Lei Zhang |
8 |
2024-05-28 |
link |
A Theoretical Understanding of Self-Correction through In-context Alignment |
Yifei Wang, Yuyang Wu,..., Yisen Wang |
8 |
2024-02-06 |
link |
A phase transition between positional and semantic learning in a solvable model of dot-product attention |
Hugo Cui, Freya Behrens,..., Lenka Zdeborova |
8 |
2024-05-30 |
link |
Jailbreaking Large Language Models Against Moderation Guardrails via Cipher Characters |
Haibo Jin, Andy Zhou,..., Haohan Wang |
8 |
2024-02-07 |
link |
The Fine-Grained Complexity of Gradient Computation for Training Large Language Models |
Josh Alman, Zhao Song |
8 |
2024-05-20 |
link |
Diffusion for World Modeling: Visual Details Matter in Atari |
Eloi Alonso, Adam Jelley,..., François Fleuret |
8 |
2024-06-06 |
link |
Evaluating the World Model Implicit in a Generative Model |
Keyon Vafa, Justin Y. Chen,..., Sendhil Mullainathan |
8 |
2024-03-25 |
link |
Provably Robust Score-Based Diffusion Posterior Sampling for Plug-and-Play Image Reconstruction |
Xingyu Xu, Yuejie Chi |
8 |
2024-05-24 |
link |
iVideoGPT: Interactive VideoGPTs are Scalable World Models |
Jialong Wu, Shaofeng Yin,..., Mingsheng Long |
8 |
2024-02-07 |
link |
QGFN: Controllable Greediness with Action Values |
Elaine Lau, Stephen Zhewen Lu,..., Emmanuel Bengio |
8 |
2024-05-20 |
link |
Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving |
Aniket Rajiv Didolkar, Anirudh Goyal,..., Sanjeev Arora |
8 |
2024-02-02 |
link |
AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback |
Jian Guan, Wei Wu,..., Minlie Huang |
8 |
2024-04-22 |
link |
Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels |
Jan-Philipp Fränken, Eric Zelikman,..., Noah Goodman |
8 |
2024-06-17 |
link |
Transcendence: Generative Models Can Outperform The Experts That Train Them |
Edwin Zhang, Vincent Zhu,..., eran malach |
8 |
2024-07-29 |
link |
FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention |
Yu Lu, Yuanzhi Liang,..., Yi Yang |
8 |
2024-05-23 |
link |
Base of RoPE Bounds Context Length |
Mingyu Xu, Xin Men,..., weipeng chen |
8 |
2024-03-25 |
link |
Is Your LiDAR Placement Optimized for 3D Scene Understanding? |
Ye Li, Lingdong Kong,..., Xiaonan Huang |
8 |
2024-06-13 |
link |
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation |
Junke Wang, Yi Jiang,..., Yu-Gang Jiang |
8 |
2024-06-13 |
link |
COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing |
Jiangshan Wang, Yue Ma,..., Xiu Li |
8 |
2024-02-05 |
link |
Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models |
Yuancheng Xu, Jiarui Yao,..., Furong Huang |
8 |
2024-02-07 |
link |
Universal Neural Functionals |
Allan Zhou, Chelsea Finn, James Harrison |
8 |
2024-05-16 |
link |
Conformal Alignment: Knowing When to Trust Foundation Models with Guarantees |
Yu Gui, Ying Jin, Zhimei Ren |
8 |
2024-04-01 |
link |
Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained Models |
Yuxin Wen, Leo Marchyok,..., Nicholas Carlini |
8 |
2024-04-23 |
link |
Gradient Guidance for Diffusion Models: An Optimization Perspective |
Yingqing Guo, Hui Yuan,..., Mengdi Wang |
8 |
2024-05-23 |
link |
4+3 Phases of Compute-Optimal Neural Scaling Laws |
Elliot Paquette, Courtney Paquette,..., Jeffrey Pennington |
8 |
2024-06-13 |
link |
Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs |
Xuan Zhang, Chao Du,..., Min Lin |
8 |
2024-06-01 |
link |
RGFN: Synthesizable Molecular Generation Using GFlowNets |
Michał Koziarski, Andrei Rekesh,..., Robert A. Batey |
8 |
2024-05-22 |
link |
DOGS: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus |
Yu Chen, Gim Hee Lee |
8 |
2024-07-05 |
link |
On scalable oversight with weak LLMs judging strong LLMs |
Zachary Kenton, Noah Yamamoto Siegel,..., Rohin Shah |
8 |
2024-06-13 |
link |
LRM-Zero: Training Large Reconstruction Models with Synthesized Data |
Desai Xie, Sai Bi,..., Hao Tan |
8 |
2024-05-15 |
link |
Spectral Editing of Activations for Large Language Model Alignment |
Yifu QIU, Zheng Zhao,..., Shay B Cohen |
8 |
2024-05-25 |
link |
Breaking the False Sense of Security in Backdoor Defense through Re-Activation Attack |
Mingli Zhu, Siyuan Liang, Baoyuan Wu |
8 |
2023-05-22 |
link |
Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations |
Hao Chen, Ankit Shah,..., Bhiksha Raj |
8 |
2024-05-30 |
link |
Improving the Training of Rectified Flows |
Sangyun Lee, Zinan Lin, Giulia Fanti |
8 |
2024-01-24 |
link |
Beyond Concept Bottleneck Models: How to Make Black Boxes Intervenable? |
Sonia Laguna, Ričards Marcinkevičs,..., Julia E Vogt |
8 |
2024-03-03 |
link |
GuardT2I: Defending Text-to-Image Models from Adversarial Prompts |
Yijun Yang, Ruiyuan Gao,..., Qiang Xu |
8 |
2024-05-30 |
link |
Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models |
Masatoshi Uehara, Yulai Zhao,..., Tommaso Biancalani |
7 |
2024-06-14 |
link |
Large language model validity via enhanced conformal prediction methods |
John Cherian, Isaac Gibbs, Emmanuel Candes |
7 |
2024-06-23 |
link |
Trace is the Next AutoDiff: Generative Optimization with Rich Feedback, Execution Traces, and LLMs |
Ching-An Cheng, Allen Nie, Adith Swaminathan |
7 |
2024-06-20 |
link |
Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs |
Yuxuan Qiao, Haodong Duan,..., Kai Chen |
7 |
2024-05-29 |
link |
T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback |
Jiachen Li, Weixi Feng,..., William Yang Wang |
7 |
2024-09-09 |
link |
FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank Adaptations |
Ziyao Wang, Zheyu Shen,..., Ang Li |
7 |
2024-05-23 |
link |
Lorentz-Equivariant Geometric Algebra Transformers for High-Energy Physics |
Jonas Spinner, Victor Breso Pla,..., Johann Brehmer |
7 |
2024-05-26 |
link |
Code Repair with LLMs gives an Exploration-Exploitation Tradeoff |
Hao Tang, Keya Hu,..., Kevin Ellis |
7 |
2024-05-23 |
link |
Unchosen Experts Can Contribute Too: Unleashing MoE Models' Power by Self-Contrast |
Chufan Shi, Cheng Yang,..., Yu Meng |
7 |
2024-04-23 |
link |
Multi-Head Mixture-of-Experts |
Xun Wu, Shaohan Huang,..., Furu Wei |
7 |
2024-02-05 |
link |
Estimating Epistemic and Aleatoric Uncertainty with a Single Model |
Matthew Albert Chan, Maria J. Molina, Christopher Metzler |
7 |
2024-07-08 |
link |
Multi-Object Hallucination in Vision-Language Models |
Xuweiyi Chen, Ziqiao Ma,..., Joyce Chai |
7 |
2024-01-22 |
link |
Self-Labeling the Job Shop Scheduling Problem |
Andrea Corsini, Angelo Porrello,..., Mauro Dell'Amico |
7 |
2024-06-03 |
link |
DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs |
Haokun Lin, Haobo Xu,..., Ying Wei |
7 |
2024-06-10 |
link |
Aligning Large Language Models with Representation Editing: A Control Perspective |
Lingkai Kong, Haorui Wang,..., Chao Zhang |
7 |
2024-02-24 |
link |
Data-Efficient Operator Learning via Unsupervised Pretraining and In-Context Learning |
Wuyang Chen, Jialin Song,..., Michael W. Mahoney |
7 |
2024-06-12 |
link |
Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference |
Jiabao Ji, Yujian Liu,..., Shiyu Chang |
7 |
2023-11-03 |
link |
Towards Calibrated Robust Fine-Tuning of Vision-Language Models |
Changdae Oh, Hyesu Lim,..., Kyungwoo Song |
7 |
2024-05-23 |
link |
Metric Flow Matching for Smooth Interpolations on the Data Manifold |
Kacper Kapusniak, Peter Potaptchik,..., Francesco Di Giovanni |
7 |
2024-06-10 |
link |
MVGamba: Unify 3D Content Generation as State Space Sequence Modeling |
Xuanyu Yi, Zike Wu,..., Hanwang Zhang |
7 |
2024-06-06 |
link |
ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization |
Luca Eyring, Shyamgopal Karthik,..., Zeynep Akata |
7 |
2024-03-12 |
link |
SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models |
Yu Yang, Siddhartha Mishra,..., Baharan Mirzasoleiman |
7 |
2024-02-21 |
link |
Linear Transformers are Versatile In-Context Learners |
Max Vladymyrov, Johannes Von Oswald,..., Rong Ge |
7 |
2024-06-15 |
link |
Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection |
Guowen Zhang, Lue Fan,..., Lei Zhang |
7 |
2024-02-17 |
link |
TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks |
Benjamin Feuer, Robin Tibor Schirrmeister,..., Colin White |
7 |
2024-06-14 |
link |
Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs |
Abhimanyu Hans, John Kirchenbauer,..., Tom Goldstein |
7 |
2024-06-21 |
link |
GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation |
Chubin Zhang, Hongliang Song,..., Yansong Tang |
7 |
2024-06-17 |
link |
Transcoders Find Interpretable LLM Feature Circuits |
Jacob Dunefsky, Philippe Chlenski, Neel Nanda |
7 |
2024-07-17 |
link |
Direct Unlearning Optimization for Robust and Safe Text-to-Image Models |
Yong-Hyun Park, Sangdoo Yun,..., Gayoung Lee |
7 |
2024-02-29 |
link |
RL-GPT: Integrating Reinforcement Learning and Code-as-policy |
Shaoteng Liu, Haoqi Yuan,..., Jiaya Jia |
7 |
2024-05-30 |
link |
Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities |
Alexander V Nikitin, Jannik Kossen,..., Pekka Marttinen |
7 |
2024-02-22 |
link |
Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless Video Pre-Training |
Haoran He, Chenjia Bai,..., Xuelong Li |
7 |
2024-02-18 |
link |
In-Context Learning with Transformers: Softmax Attention Adapts to Function Lipschitzness |
Liam Collins, Advait U Parulekar,..., Sanjay Shakkottai |
7 |
2024-05-28 |
link |
A Canonicalization Perspective on Invariant and Equivariant Learning |
George Ma, Yifei Wang,..., Yisen Wang |
7 |
2024-03-11 |
link |
SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection |
Yuxuan Li, Xiang Li,..., Jian Yang |
6 |
2023-12-13 |
link |
SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention |
Róbert Csordás, Piotr Piękos,..., Jürgen Schmidhuber |
6 |
2024-06-04 |
link |
Chain of Agents: Large Language Models Collaborating on Long-Context Tasks |
Yusen Zhang, Ruoxi Sun,..., Sercan O Arik |
6 |
2024-06-03 |
link |
DEFT: Efficient Fine-Tuning of Diffusion Models by Learning the Generalised $h$-transform |
Alexander Denker, Francisco Vargas,..., Pietro Lio |
6 |
2024-03-31 |
link |
From Similarity to Superiority: Channel Clustering for Time Series Forecasting |
Jialin Chen, Jan Eric Lenssen,..., Rex Ying |
6 |
2024-06-06 |
link |
VideoTetris: Towards Compositional Text-to-Video Generation |
Ye Tian, Ling Yang,..., Bin CUI |
6 |
2024-05-29 |
link |
PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications |
Dingkang Yang, Jinjie Wei,..., Lihua Zhang |
6 |
2024-10-25 |
link |
DiffGS: Functional Gaussian Splatting Diffusion |
Junsheng Zhou, Weiqi Zhang, Yu-Shen Liu |
6 |
2024-06-28 |
link |
Mixture of In-Context Experts Enhance LLMs' Long Context Awareness |
Hongzhan Lin, Ang Lv,..., Rui Yan |
6 |
2024-06-13 |
link |
4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities |
Roman Bachmann, Oğuzhan Fatih Kar,..., Amir Zamir |
6 |
2024-02-05 |
link |
Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models |
Zhiyuan Hu, Chumin Liu,..., Bryan Hooi |
6 |
2024-05-30 |
link |
Transfer Q Star: Principled Decoding for LLM Alignment |
Souradip Chakraborty, Soumya Suvra Ghosal,..., Furong Huang |
6 |
2024-05-24 |
link |
Understanding the differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks |
Jerome Sieber, Carmen Amo Alonso,..., Antonio Orvieto |
6 |
2024-09-01 |
link |
ContextCite: Attributing Model Generation to Context |
Benjamin Cohen-Wang, Harshay Shah,..., Aleksander Madry |
6 |
2024-08-07 |
link |
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks |
Zaijing Li, Yuquan Xie,..., Liqiang Nie |
6 |
2024-05-24 |
link |
ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users |
Guanlin Li, Kangjie Chen,..., Tianwei Zhang |
6 |
2024-06-14 |
link |
Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections |
Jiacong Xu, Yiqun Mei, Vishal M. Patel |
6 |
2024-04-30 |
link |
HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning |
Chunlin Tian, Zhan Shi,..., Cheng-zhong Xu |
6 |
2024-05-07 |
link |
Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics |
Hanlin Zhu, Baihe Huang,..., Stuart Russell |
6 |
2024-06-04 |
link |
OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding |
Yanmin Wu, Jiarui Meng,..., Jian Zhang |
6 |
2024-06-10 |
link |
How Far Can Transformers Reason? The Globality Barrier and Inductive Scratchpad |
Emmanuel Abbe, Samy Bengio,..., Omid Saremi |
6 |
2024-06-12 |
link |
The Impact of Initialization on LoRA Finetuning Dynamics |
Soufiane Hayou, Nikhil Ghosh, Bin Yu |
6 |
2024-08-27 |
link |
The Mamba in the Llama: Distilling and Accelerating Hybrid Models |
Junxiong Wang, Daniele Paliotta,..., Tri Dao |
6 |
2024-05-27 |
link |
AutoPSV: Automated Process-Supervised Verifier |
Jianqiao Lu, Zhiyang Dou,..., Zhijiang Guo |
6 |
2024-05-31 |
link |
Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling |
Jiatao Gu, Ying Shen,..., Joshua M. Susskind |
6 |
2024-05-25 |
link |
PTQ4DiT: Post-training Quantization for Diffusion Transformers |
Junyi Wu, Haoxuan Wang,..., Yan Yan |
6 |
2024-01-27 |
link |
DiffuserLite: Towards Real-time Diffusion Planning |
Zibin Dong, Jianye HAO,..., YAN ZHENG |
6 |
2024-04-22 |
link |
SOFTS: Efficient Multivariate Time Series Forecasting with Series-Core Fusion |
Lu Han, Xu-Yang Chen,..., De-Chuan Zhan |
6 |
2024-06-13 |
link |
Understanding Hallucinations in Diffusion Models through Mode Interpolation |
Sumukh K Aithal, Pratyush Maini,..., J Zico Kolter |
6 |
2024-02-09 |
link |
Learn To be Efficient: Build Structured Sparsity in Large Language Models |
Haizhong Zheng, Xiaoyan Bai,..., Atul Prakash |
6 |
2024-07-14 |
link |
What Makes and Breaks Safety Fine-tuning? A Mechanistic Study |
Samyak Jain, Ekdeep Singh Lubana,..., Puneet K. Dokania |
6 |
2024-05-30 |
link |
CV-VAE: A Compatible Video VAE for Latent Generative Video Models |
Sijie Zhao, Yong Zhang,..., Ying Shan |
6 |
2024-05-29 |
link |
Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors |
Zihui Wu, Yu Sun,..., Katherine Bouman |
6 |
2024-05-24 |
link |
Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time Complexity |
Haoxuan Chen, Yinuo Ren,..., Grant M. Rotskoff |
6 |
2024-06-05 |
link |
Reparameterization invariance in approximate Bayesian inference |
Hrittik Roy, Marco Miani,..., Søren Hauberg |
6 |
2024-06-12 |
link |
Large Language Models Must Be Taught to Know What They Don't Know |
Sanyam Kapoor, Nate Gruver,..., Andrew Gordon Wilson |
6 |
2024-06-02 |
link |
Evidence of Learned Look-Ahead in a Chess-Playing Neural Network |
Erik Jenner, Shreyas Kapur,..., Stuart Russell |
6 |
2024-05-28 |
link |
Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization |
Yuanpu Cao, Tianrong Zhang,..., Jinghui Chen |
6 |
2023-05-30 |
link |
Geometry-aware training of factorized layers in tensor Tucker format |
Emanuele Zangrando, Steffen Schotthöfer,..., Francesco Tudisco |
6 |
2024-01-08 |
link |
Attack-Resilient Image Watermarking Using Stable Diffusion |
Lijun Zhang, Xiao Liu,..., Hui Guan |
6 |
2024-05-24 |
link |
VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks |
Yang Li, Shaobo Han, Shihao Ji |
6 |
2024-06-14 |
link |
UniAudio 1.5: Large Language Model-driven Audio Codec is A Few-shot Audio Task Learner |
Dongchao Yang, Haohan Guo,..., Helen M. Meng |
6 |
2024-06-12 |
link |
Self-Consuming Generative Models with Curated Data Provably Optimize Human Preferences |
Damien Ferbach, Quentin Bertrand,..., Gauthier Gidel |
6 |
2023-05-29 |
link |
Approximation Rate of the Transformer Architecture for Sequence Modeling |
Haotian Jiang, Qianxiao Li |
6 |
2023-07-15 |
link |
RegExplainer: Generating Explanations for Graph Neural Networks in Regression Task |
Jiaxing Zhang, Zhuomin Chen,..., Hua Wei |
6 |
2024-03-18 |
link |
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation |
Wangbo Zhao, Jiasheng Tang,..., Yang You |
6 |
2024-06-05 |
link |
Dynamic 3D Gaussian Fields for Urban Areas |
Tobias Fischer, Jonas Kulhanek,..., Peter Kontschieder |
6 |
2024-05-21 |
link |
Single Image Unlearning: Efficient Machine Unlearning in Multimodal Large Language Models |
Jiaqi Li, Qianshan Wei,..., Fan Liu |
6 |
2024-04-05 |
link |
Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models |
Sangwon Jang, Jaehyeong Jo,..., Sung Ju Hwang |
6 |
2024-05-28 |
link |
Knowledge Circuits in Pretrained Transformers |
Yunzhi Yao, Ningyu Zhang,..., Huajun Chen |
6 |
2024-05-27 |
link |
LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling |
Yaohua Zha, Naiqi Li,..., Shu-Tao Xia |
6 |
2024-04-17 |
link |
On the Scalability of GNNs for Molecular Graphs |
Maciej Sypetkowski, Frederik Wenkel,..., Dominique Beaini |
6 |
2024-05-27 |
link |
DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion Models |
Hengkang Wang, Xu Zhang,..., Ju Sun |
6 |
2024-06-12 |
link |
Vivid-ZOO: Multi-View Video Generation with Diffusion Model |
Bing Li, Cheng Zheng,..., Bernard Ghanem |
6 |
2024-09-04 |
link |
Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image Editing |
Siyi Chen, Huijie Zhang,..., Qing Qu |
6 |
2023-11-01 |
link |
Learning Cooperative Trajectory Representations for Motion Forecasting |
Hongzhi Ruan, Haibao Yu,..., Zaiqing Nie |
6 |
2024-05-23 |
link |
Fisher Flow Matching for Generative Modeling over Discrete Data |
Oscar Davis, Samuel Kessler,..., Joey Bose |
6 |
2024-04-05 |
link |
Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2) |
Michael Saxon, Fatima Jahara,..., William Yang Wang |
6 |
2023-05-26 |
link |
Set-based Neural Network Encoding Without Weight Tying |
Bruno Andreis, Bedionita Soro,..., Sung Ju Hwang |
6 |
2024-08-22 |
link |
Transformers are Minimax Optimal Nonparametric In-Context Learners |
Juno Kim, Tai Nakamaki, Taiji Suzuki |
6 |
2024-01-30 |
link |
Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language Models |
Lai Wei, Zhiquan Tan,..., Weiran Huang |
6 |
2024-02-18 |
link |
Attractor Memory for Long-Term Time Series Forecasting: A Chaos Perspective |
Jiaxi Hu, Yuehong HU,..., Yuxuan Liang |
6 |
2023-12-09 |
link |
Consistency Models for Scalable and Fast Simulation-Based Inference |
Marvin Schmitt, Valentin Pratz,..., Stefan T. Radev |
6 |
2024-02-22 |
link |
A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health |
Nikhil Behari, Edwin Zhang,..., Milind Tambe |
6 |
2024-02-02 |
link |
Segment Any Change |
Zhuo Zheng, Yanfei Zhong,..., Stefano Ermon |
6 |
2024-06-10 |
link |
AutoSurvey: Large Language Models Can Automatically Write Surveys |
Yidong Wang, Qi Guo,..., Yue Zhang |
5 |
2024-01-02 |
link |
PAC-Bayes-Chernoff bounds for unbounded losses |
Ioar Casado, Luis A. Ortega,..., Andres R Masegosa |
5 |
2024-06-03 |
link |
What makes unlearning hard and what to do about it |
Kairan Zhao, Meghdad Kurmanji,..., Peter Triantafillou |
5 |
2024-06-06 |
link |
DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs |
Lingchen Meng, Jianwei Yang,..., Yu-Gang Jiang |
5 |
2024-06-13 |
link |
On Softmax Direct Preference Optimization for Recommendation |
Yuxin Chen, Junfei Tan,..., Tat-Seng Chua |
5 |
2024-07-09 |
link |
End-To-End Causal Effect Estimation from Unstructured Natural Language Data |
Nikita Dhawan, Leonardo Cotta,..., Chris J. Maddison |
5 |
2024-05-23 |
link |
Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy |
Shengfang Zhai, Huanran Chen,..., Yang Liu |
5 |
2024-05-27 |
link |
MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities |
Hao Dong, Yue Zhao,..., Olga Fink |
5 |
2024-05-29 |
link |
Stress-Testing Capability Elicitation With Password-Locked Models |
Ryan Greenblatt, Fabien Roger,..., David Krueger |
5 |
2024-06-11 |
link |
Zero-shot Image Editing with Reference Imitation |
Xi Chen, Yutong Feng,..., Hengshuang Zhao |
5 |
2024-05-26 |
link |
Categorical Flow Matching on Statistical Manifolds |
Chaoran Cheng, Jiahan Li,..., Ge Liu |
5 |
2024-05-14 |
link |
Energy-based Hopfield Boosting for Out-of-Distribution Detection |
Claus Hofmann, Simon Lucas Schmid,..., Sepp Hochreiter |
5 |
2024-06-27 |
link |
Length Optimization in Conformal Prediction |
Shayan Kiyani, George J. Pappas, Hamed Hassani |
5 |
2024-05-27 |
link |
ARC: A Generalist Graph Anomaly Detector with In-Context Learning |
Yixin Liu, Shiyuan Li,..., Shirui Pan |
5 |
2024-05-31 |
link |
LLM-ESR: Large Language Models Enhancement for Long-tailed Sequential Recommendation |
Qidong Liu, Xian Wu,..., Xiangyu Zhao |
5 |
2024-02-27 |
link |
Zeroth-Order Sampling Methods for Non-Log-Concave Distributions: Alleviating Metastability by Denoising Diffusion |
Ye He, Kevin Rojas, Molei Tao |
5 |
2023-07-03 |
link |
Understanding the Transferability of Representations via Task-Relatedness |
Akshay Mehra, Yunbei Zhang, Jihun Hamm |
5 |
2023-10-21 |
link |
Masked Hard-Attention Transformers Recognize Exactly the Star-Free Languages |
Andy Yang, David Chiang, Dana Angluin |
5 |
2024-05-21 |
link |
LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language |
James Requeima, John F Bronskill,..., David Duvenaud |
5 |
2024-06-11 |
link |
Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance |
Kuan Heng Lin, Sicheng Mo,..., Bolei Zhou |
5 |
2023-07-05 |
link |
Convolutions and More as Einsum: A Tensor Network Perspective with Advances for Second-Order Methods |
Felix Dangel |
5 |
2024-05-04 |
link |
U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers |
Yuchuan Tian, Zhijun Tu,..., Yunhe Wang |
5 |
2024-06-21 |
link |
Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning |
Brandon Huang, Chancharik Mitra,..., Roei Herzig |
5 |
2024-05-22 |
link |
Context and Geometry Aware Voxel Transformer for Semantic Scene Completion |
Zhu Yu, Runmin Zhang,..., Hui-liang Shen |
5 |
2024-04-21 |
link |
Adversarial Representation Engineering: A General Model Editing Framework for Large Language Models |
Yihao Zhang, Zeming Wei,..., Meng Sun |
5 |
2024-10-22 |
link |
One-Step Diffusion Distillation through Score Implicit Matching |
Weijian Luo, Zemin Huang,..., Guo-Jun Qi |
5 |
2024-06-17 |
link |
Large Scale Transfer Learning for Tabular Data via Language Modeling |
Joshua P Gardner, Juan Carlos Perdomo, Ludwig Schmidt |
5 |
2024-03-28 |
link |
Dual-Personalizing Adapter for Federated Foundation Models |
yiyuan yang, Guodong Long,..., Michael Blumenstein |
5 |
2024-05-22 |
link |
Spectral Adapter: Fine-Tuning in Spectral Space |
Fangzhao Zhang, Mert Pilanci |
5 |
2024-05-22 |
link |
Dense Connector for MLLMs |
Huanjin Yao, Wenhao Wu,..., Jingdong Wang |
5 |
2024-10-18 |
link |
Neural Signed Distance Function Inference through Splatting 3D Gaussians Pulled on Zero-Level Set |
Wenyuan Zhang, Yu-Shen Liu, Zhizhong Han |
5 |
2024-10-24 |
link |
Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View Synthesis |
Liang Han, Junsheng Zhou,..., Zhizhong Han |
5 |
2024-05-24 |
link |
MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence |
Ionut-Vlad Modoranu, Mher Safaryan,..., Dan Alistarh |
5 |
2024-02-04 |
link |
AutoTimes: Autoregressive Time Series Forecasters via Large Language Models |
Yong Liu, Guo Qin,..., Mingsheng Long |
5 |
2024-02-04 |
link |
DenseFormer: Enhancing Information Flow in Transformers via Depth Weighted Averaging |
Matteo Pagliardini, Amirkeivan Mohtashami,..., Martin Jaggi |
5 |
2024-06-06 |
link |
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model |
Yang Sui, Yanyu Li,..., Jian Ren |
5 |
2023-11-26 |
link |
A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation |
Heyang Zhao, Jiafan He, Quanquan Gu |
5 |
2024-05-27 |
link |
DC-Gaussian: Improving 3D Gaussian Splatting for Reflective Dash Cam Videos |
Linhan Wang, Kai Cheng,..., Chang-Tien Lu |
5 |
2024-06-17 |
link |
Unveiling Encoder-Free Vision-Language Models |
Haiwen Diao, Yufeng Cui,..., Xinlong Wang |
5 |
2024-05-22 |
link |
RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging Radar |
Fangqiang Ding, Xiangyu Wen,..., Chris Xiaoxuan Lu |
5 |
2024-05-24 |
link |
Medformer: A Multi-Granularity Patching Transformer for Medical Time-Series Classification |
Yihe Wang, Nan Huang,..., Xiang Zhang |
5 |
2024-05-29 |
link |
Matryoshka Query Transformer for Large Vision-Language Models |
Wenbo Hu, Zi-Yi Dou,..., Kai-Wei Chang |
5 |
2024-06-13 |
link |
Talking Heads: Understanding Inter-layer Communication in Transformer Language Models |
Jack Merullo, Carsten Eickhoff, Ellie Pavlick |
5 |
2023-08-22 |
link |
Enhancing Graph Transformers with Hierarchical Distance Structural Encoding |
Yuankai Luo, Hongkang Li,..., Xiao-Ming Wu |
5 |
2024-02-21 |
link |
Full-Atom Peptide Design with Geometric Latent Diffusion |
Xiangzhe Kong, Yinjun Jia,..., Yang Liu |
5 |
2024-06-07 |
link |
Probabilistic Weather Forecasting with Hierarchical Graph Neural Networks |
Joel Oskarsson, Tomas Landelius,..., Fredrik Lindsten |
5 |
2024-03-06 |
link |
Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference |
Benjamin Eysenbach, Vivek Myers,..., Sergey Levine |
5 |
2024-02-16 |
link |
Conformalized Credal Set Predictors |
Alireza Javanmardi, David Stutz, Eyke Hüllermeier |
5 |
2024-06-20 |
link |
Transferable Boltzmann Generators |
Leon Klein, Frank Noe |
5 |
2024-05-18 |
link |
Automated Multi-level Preference for MLLMs |
Mengxi Zhang, Wenhao Wu,..., Yifan Sun |
5 |
2024-02-16 |
link |
Provably Safe Neural Network Controllers via Differential Dynamic Logic |
Samuel Teuber, Stefan Mitsch, Andre Platzer |
5 |
2024-05-28 |
link |
Towards a theory of how the structure of language is acquired by deep neural networks |
Francesco Cagnetta, Matthieu Wyart |
5 |
2024-05-23 |
link |
Video Diffusion Models are Training-free Motion Interpreter and Controller |
Zeqi Xiao, Yifan Zhou,..., Xingang Pan |
5 |
2024-05-28 |
link |
FASTopic: Pretrained Transformer is a Fast, Adaptive, Stable, and Transferable Topic Model |
Xiaobao Wu, Thong Thanh Nguyen,..., Anh Tuan Luu |
5 |
2024-07-09 |
link |
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore |
Rulin Shao, Jacqueline He,..., Pang Wei Koh |
5 |
2024-05-24 |
link |
GS-Hider: Hiding Messages into 3D Gaussian Splatting |
Xuanyu Zhang, Jiarui Meng,..., Jian Zhang |
5 |
2024-06-01 |
link |
Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching |
Yongqi Wang, Wenxiang Guo,..., Zhou Zhao |
5 |
2024-06-09 |
link |
Training Compute-Optimal Protein Language Models |
Xingyi Cheng, Bo Chen,..., Le Song |
5 |
2024-05-23 |
link |
PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression |
Vladimir Malinovskii, Denis Mazur,..., Peter Richtárik |
5 |
2024-05-23 |
link |
Scalable Optimization in the Modular Norm |
Tim Large, Yang Liu,..., Jeremy Bernstein |
5 |
2024-06-10 |
link |
Get rich quick: exact solutions reveal how unbalanced initializations promote rapid feature learning |
Daniel Kunin, Allan Raventos,..., Surya Ganguli |
5 |
2024-05-27 |
link |
BehaviorGPT: Smart Agent Simulation for Autonomous Driving with Next-Patch Prediction |
Zikang Zhou, Haibo HU,..., Chun Jason Xue |
5 |
2023-10-11 |
link |
Non-asymptotic Approximation Error Bounds of Parameterized Quantum Circuits |
Zhan Yu, Qiuhao Chen,..., Jerry Zhijian Yang |
5 |
2024-05-29 |
link |
Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare |
Hanwei Zhu, Haoning Wu,..., Shiqi Wang |
5 |
2024-06-12 |
link |
A Concept-Based Explainability Framework for Large Multimodal Models |
Jayneel Parekh, Pegah KHAYATAN,..., Matthieu Cord |
5 |
2024-06-12 |
link |
Scaling Laws in Linear Regression: Compute, Parameters, and Data |
Licong Lin, Jingfeng Wu,..., Jason D. Lee |
5 |
2024-06-17 |
link |
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning |
Zebang Cheng, Zhi-Qi Cheng,..., Alexander G Hauptmann |
5 |
2024-09-14 |
link |
Schrödinger Bridge Flow for Unpaired Data Translation |
Valentin De Bortoli, Iryna Korshunova,..., Arnaud Doucet |
5 |
2024-06-25 |
link |
DiffusionPDE: Generative PDE-Solving Under Partial Observation |
Jiahe Huang, Guandao Yang,..., Jeong Joon Park |
5 |
2024-05-28 |
link |
Exploiting LLM Quantization |
Kazuki Egashira, Mark Vero,..., Martin Vechev |
5 |
2024-05-23 |
link |
Axioms for AI Alignment from Human Feedback |
Luise Ge, Daniel Halpern,..., Junlin Wu |
5 |
2024-06-05 |
link |
A Geometric View of Data Complexity: Efficient Local Intrinsic Dimension Estimation with Diffusion Models |
Hamidreza Kamkari, Brendan Leigh Ross,..., Gabriel Loaiza-Ganem |
5 |
2024-05-29 |
link |
On the Role of Attention Masks and LayerNorm in Transformers |
Xinyi Wu, Amir Ajorlou,..., Ali Jadbabaie |
5 |
2024-02-01 |
link |
Understanding the Expressive Power and Mechanisms of Transformer for Sequence Modeling |
Mingze Wang, Weinan E |
5 |
2024-09-29 |
link |
One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos |
Zechen Bai, Tong He,..., Mike Zheng Shou |
5 |
2023-10-07 |
link |
Test-Time Adaptation Induces Stronger Accuracy and Agreement-on-the-Line |
Eungyeup Kim, Mingjie Sun,..., J Zico Kolter |
5 |
2024-05-27 |
link |
GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping |
Junyoung Seo, Kazumi Fukuda,..., Yuki Mitsufuji |
5 |
2024-06-24 |
link |
Confidence Regulation Neurons in Language Models |
Alessandro Stolfo, Ben Peng Wu,..., Neel Nanda |
5 |
2024-05-09 |
link |
A Universal Growth Rate for Learning with Smooth Surrogate Losses |
Anqi Mao, Mehryar Mohri, Yutao Zhong |
5 |
2024-06-17 |
link |
AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning |
Shirley Wu, Shiyu Zhao,..., James Zou |
4 |
2024-06-11 |
link |
Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees |
Sijia Chen, Yibo Wang,..., Lijun Zhang |
4 |
2024-05-20 |
link |
Images that Sound: Composing Images and Sounds on a Single Canvas |
Ziyang Chen, Daniel Geng, Andrew Owens |
4 |
2024-03-28 |
link |
CLAP4CLIP: Continual Learning with Probabilistic Finetuning for Vision-Language Models |
Saurav Jha, Dong Gong, Lina Yao |
4 |
2024-03-19 |
link |
Optimal Flow Matching: Learning Straight Trajectories in Just One Step |
Nikita Maksimovich Kornilov, Petr Mokrov,..., Alexander Korotin |
4 |
2024-06-14 |
link |
Neural Concept Binder |
Wolfgang Stammer, Antonia Wüst,..., Kristian Kersting |
4 |
2024-06-04 |
link |
Bileve: Securing Text Provenance in Large Language Models Against Spoofing with Bi-level Signature |
Tong Zhou, Xuandong Zhao,..., Shaolei Ren |
4 |
2024-07-25 |
link |
RestoreAgent: Autonomous Image Restoration Agent via Multimodal Large Language Models |
Haoyu Chen, Wenbo Li,..., Lei Zhu |
4 |
2024-02-12 |
link |
PANORAMIA: Privacy Auditing of Machine Learning Models without Retraining |
Mishaal Kazmi, Hadrien Lautraite,..., Mathias Lécuyer |
4 |
2024-05-21 |
link |
Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum |
Hadi Pouransari, Chun-Liang Li,..., Oncel Tuzel |
4 |
2024-06-24 |
link |
Inferring stochastic low-rank recurrent neural networks from neural data |
Matthijs Pals, A Erdem Sağtekin,..., Jakob H. Macke |
4 |
2024-05-22 |
link |
ReVideo: Remake a Video with Motion and Content Control |
Chong Mou, Mingdeng Cao,..., Jian Zhang |
4 |
2024-06-11 |
link |
Neural Gaffer: Relighting Any Object via Diffusion |
Haian Jin, Yuan Li,..., Noah Snavely |
4 |
2024-08-02 |
link |
Mission Impossible: A Statistical Perspective on Jailbreaking LLMs |
Jingtong Su, Julia Kempe, Karen Ullrich |
4 |
2024-06-09 |
link |
Distributional Preference Alignment of LLMs via Optimal Transport |
Igor Melnyk, Youssef Mroueh,..., Jarret Ross |
4 |
2024-05-28 |
link |
Improved Generation of Adversarial Examples Against Safety-aligned LLMs |
Qizhang Li, Yiwen Guo,..., Hao Chen |
4 |
2024-06-11 |
link |
MambaLRP: Explaining Selective State Space Sequence Models |
Farnoush Rezaei Jafari, Grégoire Montavon,..., Oliver Eberle |
4 |
2024-06-03 |
link |
SemCoder: Training Code Language Models with Comprehensive Semantics |
Yangruibo Ding, Jinjun Peng,..., Baishakhi Ray |
4 |
2024-06-12 |
link |
Discovering Preference Optimization Algorithms with and for Large Language Models |
Chris Lu, Samuel Holt,..., Robert Tjarko Lange |
4 |
2024-06-05 |
link |
HYDRA: Model Factorization Framework for Black-Box LLM Personalization |
Yuchen Zhuang, Haotian Sun,..., Bo Dai |
4 |
2024-04-23 |
link |
SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-Tuning |
Yexiao He, Ziyao Wang,..., Ang Li |
4 |
2024-05-25 |
link |
Pessimistic Backward Policy for GFlowNets |
Hyosoon Jang, Yunhui Jang,..., Sungsoo Ahn |
4 |
2024-01-21 |
link |
Language Models as Hierarchy Encoders |
Yuan He, Moy Yuan,..., Ian Horrocks |
4 |
2024-03-18 |
link |
Divide-and-Conquer Posterior Sampling for Denoising Diffusion Priors |
Yazid Janati, Badr MOUFAD,..., Jimmy Olsson |
4 |
2024-08-30 |
link |
Can We Leave Deepfake Data Behind in Training Deepfake Detector? |
Jikang Cheng, Zhiyuan Yan,..., Chen Li |
4 |
2024-05-13 |
link |
Zero-Shot Tokenizer Transfer |
Benjamin Minixhofer, Edoardo Ponti, Ivan Vulić |
4 |
2024-05-23 |
link |
Large Language Models-guided Dynamic Adaptation for Temporal Knowledge Graph Reasoning |
Jiapu Wang, Kai Sun,..., Baocai Yin |
4 |
2024-06-29 |
link |
UDC: A Unified Neural Divide-and-Conquer Framework for Large-Scale Combinatorial Optimization Problems |
Zhi Zheng, Changliang Zhou,..., Zhenkun Wang |
4 |
2024-03-14 |
link |
Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning |
Zhishuai Liu, Pan Xu |
4 |
2024-05-22 |
link |
The Power of Extrapolation in Federated Learning |
Hanmin Li, Kirill Acharya, Peter Richtárik |
4 |
2024-02-29 |
link |
UniTS: A Unified Multi-Task Time Series Model |
Shanghua Gao, Teddy Koker,..., Marinka Zitnik |
4 |
2024-05-23 |
link |
Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling |
Shuaipeng Li, Penghao Zhao,..., Di Wang |
4 |
2024-02-01 |
link |
Credal Learning Theory |
Michele Caprio, Maryam Sultana,..., Fabio Cuzzolin |
4 |
2024-03-30 |
link |
Communication Efficient Distributed Training with Distributed Lion |
Bo Liu, Lemeng Wu,..., qiang liu |
4 |
2024-05-08 |
link |
Initialization is Critical to Whether Transformers Fit Composite Functions by Inference or Memorizing |
Zhongwang Zhang, Pengxiao Lin,..., Zhi-Qin John Xu |
4 |
2024-06-12 |
link |
Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning |
Jaehyun Nam, Kyuyoung Kim,..., Jinwoo Shin |
4 |
2024-02-29 |
link |
Learning Commonality, Divergence and Variety for Unsupervised Visible-Infrared Person Re-identification |
Jiangming Shi, Xiangbo Yin,..., Yanyun Qu |
4 |
2024-09-11 |
link |
NVRC: Neural Video Representation Compression |
Ho Man Kwan, Ge Gao,..., David Bull |
4 |
2024-02-26 |
link |
Graph Diffusion Policy Optimization |
Yijing Liu, Chao Du,..., Wei Chen |
4 |
2024-07-05 |
link |
Better by Default: Strong Pre-Tuned MLPs and Boosted Trees on Tabular Data |
David Holzmüller, Leo Grinsztajn, Ingo Steinwart |
4 |
2024-07-25 |
link |
Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models |
Sanae Lotfi, Yilun Kuang,..., Andrew Gordon Wilson |
4 |
2024-05-24 |
link |
Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization |
Beitao Chen, Xinyu Lyu,..., Jingkuan Song |
4 |
2024-03-06 |
link |
Directional Smoothness and Gradient Methods: Convergence and Adaptivity |
Aaron Mishkin, Ahmed Khaled,..., Robert M. Gower |
4 |
2024-03-20 |
link |
Bridge the Modality and Capability Gaps in Vision-Language Model Selection |
Chao Yi, Yuhang He,..., Han-Jia Ye |
4 |
2024-06-17 |
link |
Probing the Decision Boundaries of In-context Learning in Large Language Models |
Siyan Zhao, Tung Nguyen, Aditya Grover |
4 |
2024-02-07 |
link |
Improved off-policy training of diffusion samplers |
Marcin Sendera, Minsu Kim,..., Nikolay Malkin |
4 |
2024-06-13 |
link |
Is Value Learning Really the Main Bottleneck in Offline RL? |
Seohong Park, Kevin Frans,..., Aviral Kumar |
4 |
2023-12-08 |
link |
HuRef: HUman-REadable Fingerprint for Large Language Models |
Boyi Zeng, Lizheng Wang,..., Zhouhan Lin |
4 |
2024-06-04 |
link |
Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models |
Dominik Hintersdorf, Lukas Struppek,..., Franziska Boenisch |
4 |
2024-02-04 |
link |
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning |
Lanqing Li, Hai Zhang,..., Pheng-Ann Heng |
4 |
2024-02-25 |
link |
No Free Lunch in LLM Watermarking: Trade-offs in Watermarking Design Choices |
Qi Pang, Shengyuan Hu,..., Virginia Smith |
4 |
2023-03-16 |
link |
Addressing bias in online selection with limited budget of comparisons |
Ziyad Benomar, Evgenii Chzhen,..., Vianney Perchet |
4 |
2023-11-19 |
link |
Large Pre-trained time series models for cross-domain Time series analysis tasks |
Harshavardhan Kamarthi, B. Aditya Prakash |
4 |
2024-05-24 |
link |
Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training |
Wenyu Du, Tongxu Luo,..., Jie Fu |
4 |
2024-02-06 |
link |
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies |
Xixi Hu, qiang liu,..., Bo Liu |
4 |
2023-12-03 |
link |
G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training |
Che Liu, Cheng Ouyang,..., Rossella Arcucci |
4 |
2024-08-19 |
link |
NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction |
Yifan Wang, Di Huang,..., Tong He |
4 |
2024-05-02 |
link |
In-and-Out: Algorithmic Diffusion for Sampling Convex Bodies |
Yunbum Kook, Santosh Vempala, Matthew Shunshi Zhang |
4 |
2024-07-01 |
link |
Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization |
Siyi Gu, Minkai Xu,..., Stefano Ermon |
4 |
2024-05-25 |
link |
Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control |
Michal Nauman, Mateusz Ostaszewski,..., Marek Cygan |
4 |
2024-06-04 |
link |
Loki: Low-Rank Keys for Efficient Sparse Attention |
Prajwal Singhania, Siddharth Singh,..., Abhinav Bhatele |
4 |
2024-05-23 |
link |
Unveiling the Tapestry of Consistency in Large Vision-Language Models |
Yuan Zhang, Fei xiao,..., Haoyuan Guo |
4 |
2024-01-19 |
link |
Neglected Hessian component explains mysteries in Sharpness regularization |
Yann Dauphin, Atish Agarwala, Hossein Mobahi |
4 |
2024-06-03 |
link |
The Importance of Online Data: Understanding Preference Fine-tuning via Coverage |
Yuda Song, Gokul Swamy,..., Wen Sun |
4 |
2024-06-13 |
link |
Interpreting the Weight Space of Customized Diffusion Models |
Amil Dravid, Yossi Gandelsman,..., Kfir Aberman |
4 |
2024-02-12 |
link |
Policy Improvement using Language Feedback Models |
Victor Zhong, Dipendra Misra,..., Marc-Alexandre Côté |
4 |
2024-06-06 |
link |
PaCE: Parsimonious Concept Engineering for Large Language Models |
Jinqi Luo, Tianjiao Ding,..., Rene Vidal |
4 |
2024-06-27 |
link |
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents |
Zihao Wang, Shaofei Cai,..., Yitao Liang |
4 |
2024-07-26 |
link |
SLIM: Style-Linguistics Mismatch Model for Generalized Audio Deepfake Detection |
Yi Zhu, Surya Koppisetti,..., Gaurav Bharaj |
4 |
2024-06-24 |
link |
Finding Transformer Circuits with Edge Pruning |
Adithya Bhaskar, Alexander Wettig,..., Danqi Chen |
4 |
2024-09-26 |
link |
Generative Modeling of Molecular Dynamics Trajectories |
Bowen Jing, Hannes Stark,..., Bonnie Berger |
4 |
2024-04-25 |
link |
PhyRecon: Physically Plausible Neural Scene Reconstruction |
Junfeng Ni, Yixin Chen,..., Siyuan Huang |
4 |
2024-06-22 |
link |
Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model |
Min Zhao, Hongzhou Zhu,..., Jun Zhu |
4 |
2024-07-12 |
link |
GAVEL: Generating Games Via Evolution and Language Models |
Graham Todd, Alexander George Padula,..., Julian Togelius |
4 |
2024-04-20 |
link |
GRANOLA: Adaptive Normalization for Graph Neural Networks |
Moshe Eliasof, Beatrice Bevilacqua,..., Haggai Maron |
4 |
2024-05-19 |
link |
FIFO-Diffusion: Generating Infinite Videos from Text without Training |
Jihwan Kim, Junoh Kang,..., Bohyung Han |
4 |
2024-02-06 |
link |
Discovery of the Hidden World with Large Language Models |
Chenxi Liu, Yongqiang Chen,..., Kun Zhang |
4 |
2024-05-29 |
link |
A Full-duplex Speech Dialogue Scheme Based On Large Language Models |
Peng Wang, Songshuo Lu,..., Yuanjun Xiong |
4 |
2024-10-10 |
link |
Global Lyapunov functions: a long-standing open problem in mathematics, with symbolic transformers |
Alberto Alfarano, Francois Charton, Amaury Hayat |
4 |
2024-07-01 |
link |
Evaluation of Text-to-Video Generation Models: A Dynamics Perspective |
Mingxiang Liao, Hannan Lu,..., Xinyu Zhang |
4 |
2024-10-03 |
link |
Parameter Competition Balancing for Model Merging |
Guodong DU, Junlin Lee,..., Min Zhang |
4 |
2024-03-02 |
link |
NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention |
Tianyi Zhang, Jonah Wonkyu Yi,..., Anshumali Shrivastava |
4 |
2024-06-12 |
link |
DiTFastAttn: Attention Compression for Diffusion Transformer Models |
Zhihang Yuan, Hanling Zhang,..., Yu Wang |
4 |
2024-02-06 |
link |
On Convergence of Adam for Stochastic Optimization under Relaxed Assumptions |
Yusu Hong, Junhong Lin |
4 |
2024-05-27 |
link |
Entity Alignment with Noisy Annotations from Large Language Models |
Shengyuan Chen, Qinggang Zhang,..., Xiao Huang |
4 |
2024-04-05 |
link |
Dynamic Conditional Optimal Transport through Simulation-Free Flows |
Gavin Kerrigan, Giosue Migliorini, Padhraic Smyth |
4 |
2024-05-23 |
link |
D-MiSo: Editing Dynamic 3D Scenes using Multi-Gaussians Soup |
Joanna Waczynska, Piotr Borycki,..., Przemysław Spurek |
4 |
2024-06-06 |
link |
Understanding Information Storage and Transfer in Multi-modal Large Language Models |
Samyadeep Basu, Martin Grayson,..., Daniela Massiceti |
4 |
2024-08-28 |
link |
Efficient LLM Scheduling by Learning to Rank |
Yichao Fu, Siqi Zhu,..., Hao Zhang |
4 |
2024-06-10 |
link |
IllumiNeRF: 3D Relighting without Inverse Rendering |
Xiaoming Zhao, Pratul P. Srinivasan,..., Philipp Henzler |
4 |
2024-03-25 |
link |
MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models |
Kailai Yang, Zhiwei Liu,..., Sophia Ananiadou |
4 |
2024-07-08 |
link |
On the Complexity of Learning Sparse Functions with Statistical and Gradient Queries |
Nirmit Joshi, Theodor Misiakiewicz, Nathan Srebro |
4 |
2024-06-23 |
link |
LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and Control |
Delin Qu, Qizhi Chen,..., Xuelong Li |
4 |
2024-05-28 |
link |
Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment |
Xin Xiao, Bohong Wu,..., Haoyuan Guo |
4 |
2024-02-22 |
link |
Watermarking Makes Language Models Radioactive |
Tom Sander, Pierre Fernandez,..., Teddy Furon |
4 |
2024-07-29 |
link |
Mixture of Nested Experts: Adaptive Processing of Visual Tokens |
Gagan Jain, Nidhi Hegde,..., Sujoy Paul |