Last updated: 2025-04-16 04:18:04. Maintained by Weisen Jiang.

citation publish date title (pdf) review authors
524 2023-11-16 Video-LLaVA: Learning United Visual Representation by Alignment Before Projection link Lin, Bin,..., Li
413 2022-12-31 A Survey on In-context Learning link Dong, Qingxiu,..., Zhifang
350 2023-05-30 Encouraging Divergent Thinking in Large Language Models through Multi-Agent
Debate
link Liang, Tian,..., Zhaopeng
178 2024-03-12 ORPO: Monolithic Preference Optimization without Reference Model link Hong, Jiwoo,..., James
141 2024-05-02 Prometheus 2: An Open Source Language Model Specialized in
Evaluating Other Language Models
link Kim, Seungone,..., Minjoon
115 2023-12-10 Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs link Ovadia, Oded,..., Oren
96 2023-11-15 Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models link Yu, Wenhao,..., Dong
88 2022-09-02 FOLIO: Natural Language Reasoning with First-Order Logic link Han, Simeng,..., Dragomir
85 2024-05-09 Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? link Gekhman, Zorik,..., Jonathan
79 2024-02-16 Humans or LLMs as the Judge? A Study on
Judgement Bias
link Chen, Guiming Hardy,..., Benyou
75 2020-12-03 GottBERT: a pure German Language Model link Scheible, Raphael,..., Martin
74 2024-03-13 Knowledge Conflicts for LLMs: A Survey link Xu, Rongwu,..., Wei
73 2023-12-28 A Simple LLM Framework for Long-Range Video Question-Answering link Zhang, Ce,..., Gedas
67 2024-04-16 MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents link Tang, Liyan,..., Greg
65 2024-06-24 LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-Training link Zhu, Tong,..., Yu
56 2023-09-12 Mitigating the Alignment Tax of RLHF link Lin, Yong,..., Tong
54 2023-08-08 FLIRT: Feedback Loop In-context Red Teaming link Mehrabi, Ninareh,..., Rahul
54 2024-01-05 MLLM-Protector: Ensuring MLLM`s Safety without Hurting Performance link Pi, Renjie,..., Tong
49 2024-02-29 Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment link Guo, Yiju,..., Maosong
46 2024-06-24 EAGLE-2: Faster Inference of Language Models with Dynamic Draft
Trees
link Li, Yuhui,..., Hongyang
46 2024-01-14 Small LLMs Are Weak Tool Learners: A Multi-LLM Agent link Shen, Weizhou,..., Fei
43 2024-02-06 Systematic Biases in LLM Simulations of Debates link Taubenfeld, Amir,..., Ariel
43 2024-02-21 Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot
LLM Assessment
link Raina, Vyas,..., Mark
40 2024-02-06 Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning link Tan, Zhaoxuan,..., Meng
39 2024-01-12 Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models link Cho, Yae Jee,..., Gauri
38 2024-02-21 Large Language Models for Data Annotation and Synthesis: A
Survey
link Tan, Zhen,..., Huan
38 2024-06-25 Leave No Document Behind: Benchmarking Long-Context LLMs with Extended
Multi-Doc QA
link Wang, Minzheng,..., Yongbin
37 2024-06-21 VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback
for Video Generation
link He, Xuan,..., Wenhu
37 2024-01-20 InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance link Wang, Pengyu,..., Xipeng
36 2024-07-01 Summary of a Haystack: A Challenge to Long-Context LLMs
and RAG Systems
link Laban, Philippe,..., Chien-Sheng
36 2023-09-29 Split and Merge: Aligning Position Biases in LLM-based Evaluators link Li, Zongjie,..., Yang
36 2024-07-15 Foundational Autoraters: Taming Large Language Models for Better Automatic
Evaluation
link Vu, Tu,..., Yun-Hsuan
34 2023-10-23 LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon
Gameplay
link Lan, Yihuai,..., Hao
33 2023-10-23 Moral Foundations of Large Language Models link Abdulhai, Marwa,..., Natasha
32 2023-09-28 LawBench: Benchmarking Legal Knowledge of Large Language Models link Fei, Zhiwei,..., Vincent
32 2024-02-27 Information Flow Routes: Automatically Interpreting Language Models at Scale link Ferrando, Javier,..., Elena
31 2024-02-10 A Thorough Examination of Decoding Methods in the Era
of LLMs
link Shi, Chufan,..., Wai
31 2024-07-01 Searching for Best Practices in Retrieval-Augmented Generation link Wang, Xiaohua,..., Xuanjing
31 2024-03-08 Is this the real life? Is this just fantasy?
The Misleading Success of Simulating Social Interactions With LLMs
link Zhou, Xuhui,..., Maarten
30 2024-04-28 SOUL: Unlocking the Power of Second-Order Optimization for LLM
Unlearning
link Jia, Jinghan,..., Sijia
30 2024-06-16 A Peek into Token Bias: Large Language Models Are
Not Yet Genuine Reasoners
link Jiang, Bowen,..., Dan
30 2024-01-11 Universal Vulnerabilities in Large Language Models: Backdoor Attacks for
In-context Learning
link Zhao, Shuai,..., Jinming
30 2024-01-09 Model Editing Harms General Abilities of Large Language Models:
Regularization to the Rescue
link Gu, Jia-Chen,..., Nanyun
29 2024-06-17 GAMA: A Large Audio-Language Model with Advanced Audio Understanding
and Complex Reasoning Abilities
link Ghosh, Sreyan,..., Dinesh
29 2024-01-11 Transformers are Multi-State RNNs link Oren, Matanel,..., Roy
27 2024-07-09 Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large
Language Models Using Only Attention Maps
link Chuang, Yung-Sung,..., James R.
27 2024-06-26 The Multilingual Alignment Prism: Aligning Global and Local Preferences
to Reduce Harm
link Aakanksha, Ahmadian,..., Sara
26 2023-10-27 Personas as a Way to Model Truthfulness in Language
Models
link Joshi, Nitish,..., He
25 2024-06-22 Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration link Feng, Shangbin,..., Yulia
25 2024-02-22 Middleware for LLMs: Tools Are Instrumental for Language Agents
in Complex Environments
link Gu, Yu,..., Yu
25 2024-07-01 Roleplay-doh: Enabling Domain-Experts to Create LLM-simulated Patients via Eliciting
and Adhering to Principles
link Louie, Ryan,..., Diyi
24 2024-02-01 Learning Planning-based Reasoning by Trajectories Collection and Process Reward
Synthesizing
link Jiao, Fangkai,..., Shafiq
24 2024-02-28 Tokenization Is More Than Compression link Schmidt, Craig W,..., Chris
24 2024-07-06 RULE: Reliable Multimodal RAG for Factuality in Medical Vision
Language Models
link Xia, Peng,..., Huaxiu
24 2024-02-17 Puzzle Solving using Reasoning of Large Language Models: A
Survey
link Giadikiaroglou, Panagiotis,..., Giorgos
23 2024-06-24 LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing link Du, Jiangshu,..., Wenpeng
23 2024-06-16 A Comprehensive Survey of Scientific Large Language Models and
Their Applications in Scientific Discovery
link Zhang, Yu,..., Jiawei
23 2024-07-22 AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? link Yoran, Ori,..., Jonathan
23 2022-12-01 Language models and brains align due to more than
next-word prediction and word-level information
link Merlin, Gabriele,..., Mariya
23 2024-01-13 EHRAgent: Code Empowers Large Language Models for Few-shot Complex
Tabular Reasoning on Electronic Health Records
link Shi, Wenqi,..., May Dongmei
22 2024-02-28 Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality
Estimation
link Ge, Yuan,..., JingBo
22 2023-12-11 Dense X Retrieval: What Retrieval Granularity Should We Use? link Chen, Tong,..., Dong
21 2024-03-28 Language Models Learn Rare Phenomena from Less Rare Phenomena:
The Case of the Missing AANNs
link Misra, Kanishka,..., Kyle
21 2024-07-15 Benchmarking Vision Language Models for Cultural Understanding link Nayak, Shravan,..., Aishwarya
21 2024-05-31 SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales link Xu, Tianyang,..., Jing
21 2024-01-19 Breaking the Curse of Multilinguality with Cross-lingual Expert Language
Models
link Blevins, Terra,..., Luke
21 2024-02-21 Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent link Yu, Xiaoyan,..., Liehuang
21 2024-07-02 RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization
for LLMs
link Dang, John,..., Sara
20 2022-12-20 Evaluating Psychological Safety of Large Language Models link Li, Xingxuan,..., Lidong
20 2023-10-13 QUIK: Towards End-to-end 4-Bit Inference on Generative Large Language
Models
link Ashkboos, Saleh,..., Dan
20 2024-04-29 PromptReps: Prompting Large Language Models to Generate Dense and
Sparse Representations for Zero-Shot Document Retrieval
link Zhuang, Shengyao,..., Guido
20 2024-03-29 LUQ: Long-text Uncertainty Quantification for LLMs link Zhang, Caiqi,..., Nigel
20 2024-06-28 Understanding and Mitigating Language Confusion in LLMs link Marchisio, Kelly,..., Sebastian
20 2023-11-14 MAgIC: Investigation of Large Language Model Powered Multi-Agent in
Cognition, Adaptability, Rationality and Collaboration
link Xu, Lin,..., Jiashi
20 2024-06-17 Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs link Opsahl-Ong, Krista,..., Omar
20 2024-05-08 Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large
Language Models
link Land, Sander,..., Max
20 2023-11-13 An Analysis and Mitigation of the Reversal Curse link Lv, Ang,..., Rui
20 2024-07-04 A Systematic Survey and Critical Review on Evaluating Large
Language Models: Challenges, Limitations, and Recommendations
link Laskar, Md Tahmid Rahman,..., Jimmy
20 2024-06-17 A Simple and Effective $L_2$ Norm-Based Strategy for KV
Cache Compression
link Devoto, Alessio,..., Pasquale
19 2023-08-17 Evaluating the Instruction-Following Robustness of Large Language Models to
Prompt Injection
link Li, Zekun,..., Xifeng
19 2024-04-18 LongEmbed: Extending Embedding Models for Long Context Retrieval link Zhu, Dawei,..., Sujian
19 2024-06-17 Watch Every Step! LLM Agent Learning via Iterative Step-level
Process Refinement
link Xiong, Weimin,..., Sujian
19 2024-06-20 Instruction Pre-Training: Language Models are Supervised Multitask Learners link Cheng, Daixuan,..., Furu
19 2024-06-16 Mixture-of-Subspaces in Low-Rank Adaptation link Wu, Taiqiang,..., Ngai
19 2024-06-13 Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination link Fleisig, Eve,..., Dan
19 2024-06-18 Hopping Too Late: Exploring the Limitations of Large Language
Models on Multi-Hop Queries
link Biran, Eden,..., Amir
19 2023-09-12 Re-Reading Improves Reasoning in Large Language Models link Xu, Xiaohan,..., Shuai
18 2024-07-25 Demystifying Verbatim Memorization in Large Language Models link Huang, Jing,..., Christopher
18 2024-06-18 Attention Score is not All You Need for Token
Importance Indicator in KV Cache Reduction: Value Also Matters
link Guo, Zhiyu,..., Taro
17 2024-05-05 ImageInWords: Unlocking Hyper-Detailed Image Descriptions link Garg, Roopal,..., Radu
17 2024-06-15 Personalized Pieces: Efficient Personalized Large Language Models through Collaborative
Efforts
link Tan, Zhaoxuan,..., Meng
17 2024-06-24 BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned
Language Models
link Zeng, Yi,..., Ruoxi
17 2024-02-18 SciAgent: Tool-augmented Language Models for Scientific Reasoning link Ma, Yubo,..., Aixin
17 2024-07-18 Are Large Language Models Capable of Generating Human-Level Narratives? link Tian, Yufei,..., Nanyun
17 2024-02-04 Factuality of Large Language Models: A Survey link Wang, Yuxia,..., Preslav
17 2024-03-11 Rebuilding ROME : Resolving Model Collapse during Sequential Model
Editing
link Gupta, Akshat,..., Gopala
17 2024-04-29 BMRetriever: Tuning Large Language Models as Better Biomedical Text
Retrievers
link Xu, Ran,..., Carl
16 2024-06-17 WPO: Enhancing RLHF with Weighted Preference Optimization link Zhou, Wenxuan,..., Chenguang
16 2024-02-23 NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data link Bogdanov, Sergei,..., Etienne P
16 2024-02-29 AKEW: Assessing Knowledge Editing in the Wild link Wu, Xiaobao,..., Anh Tuan
16 2024-05-05 MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards
Medical Reasoning
link Shi, Wenqi,..., May Dongmei
15 2024-01-16 RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of
Large Language Models in Tool Learning
link Ye, Junjie,..., Xuanjing
15 2023-05-11 Chain-of-Dictionary Prompting Elicits Translation in Large Language Models link Lu, Hongyuan,..., Furu
15 2024-05-02 Verification and Refinement of Natural Language Explanations through LLM-Symbolic
Theorem Proving
link Quan, Xin,..., Andre
15 2024-03-08 LLM4Decompile: Decompiling Binary Code with Large Language Models link Tan, Hanzhuo,..., Yuqun
15 2024-10-12 VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language
Models Alignment
link Li, Lei,..., Qi
15 2023-11-15 Social Bias Probing: Fairness Benchmarking for Language Models link Marchiori Manerba, Marta,..., Isabelle
15 2024-07-09 CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text
in Language Model Generation
link Chen, Tong,..., Pang Wei
15 2024-03-13 Distract Large Language Models for Automatic Jailbreak Attack link Xiao, Zeguan,..., Yun
15 2024-06-10 Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning
Strategies
link Wang, Junlin,..., Ben
15 2024-02-21 Knowledge Graph Enhanced Large Language Model Editing link Zhang, Mengqi,..., Zhumin
14 2024-04-04 Uncertainty in Language Models: Assessment through Rank-Calibration link Huang, Xinmeng,..., Edgar
14 2024-02-16 BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation
Blending and Knowledge Filtering
link Wang, Haoyu,..., Jing
14 2024-06-17 MetaGPT: Merging Large Language Models Using Model Exclusive Task
Arithmetic
link Zhou, Yuyan,..., Weipeng
14 2024-04-25 TinyChart: Efficient Chart Understanding with Program-of-Thoughts Learning and Visual
Token Merging
link Zhang, Liang,..., Fei
14 2024-05-27 Can Large Language Models Faithfully Express Their Intrinsic Uncertainty
in Words?
link Yona, Gal,..., Mor
14 2024-04-19 Evaluating Character Understanding of Large Language Models via Character
Profiling from Fictional Works
link Yuan, Xinfeng,..., Deqing
14 2023-10-13 User Inference Attacks on Large Language Models link Kandpal, Nikhil,..., Zheng
14 2024-02-27 Can LLM Generate Culturally Relevant Commonsense QA Data? Case
Study in Indonesian and Sundanese
link Putri, Rifki Afina,..., Alice
13 2024-03-30 NumeroLogic: Number Encoding for Enhanced LLMs' Numerical Reasoning link Schwartz, Eli,..., Assaf
13 2024-06-17 GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric
Image Generation
link Cai, Shihao,..., Bo
13 2024-06-18 SHIELD: Evaluation and Defense Strategies for Copyright Compliance in
LLM Text Generation
link Liu, Xiaoze,..., Jing
13 2024-06-16 Leading Whitespaces of Language Models' Subword Vocabulary Pose a
Confound for Calculating Word Probabilities
link Oh, Byung-Doh,..., William
13 2024-09-24 M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning link Wang, Taowen,..., Dongfang
13 2024-06-17 Unifying Multimodal Retrieval via Document Screenshot Embedding link Ma, Xueguang,..., Jimmy
13 2024-06-17 mDPO: Conditional Preference Optimization for Multimodal Large Language Models link Wang, Fei,..., Muhao
13 2024-04-05 Extract, Define, Canonicalize: An LLM-based Framework for Knowledge Graph
Construction
link Zhang, Bowen,..., Harold
13 2024-07-03 TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts link Wang, Ruida,..., Tong
13 2024-06-20 How to Compute the Probability of a Word link Pimentel, Tiago,..., Clara
13 2024-04-03 Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic
Reasoning in Language Models
link Chae, Hyungjoo,..., Jinyoung
12 2024-01-05 Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction
Tuning on General Tasks
link Wu, Haoyuan,..., Bei
12 2024-06-16 Eliminating Biased Length Reliance of Direct Preference Optimization via
Down-Sampled KL Divergence
link Lu, Junru,..., Xing
12 2024-04-18 Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual
Alignment
link Wu, Zhaofeng,..., Ahmad
12 2024-06-04 TopViewRS: Vision-Language Models as Top-View Spatial Reasoners link Li, Chengzu,..., Ivan
12 2024-02-25 ASETF: A Novel Method for Jailbreak Attack on LLMs
through Translate Suffix Embeddings
link Wang, Hao,..., Lei
12 2023-11-02 Learn to Refuse: Making Large Language Models More Controllable
and Reliable through Knowledge Scope Limitation and Refusal Mechanism
link Cao, Lang
12 2023-10-04 Discovering Knowledge-Critical Subnetworks in Pretrained Language Models link Bayazit, Deniz,..., Antoine
12 2024-05-23 Extracting Prompts by Inverting LLM Outputs link Zhang, Collin,..., Vitaly
12 2024-07-14 BiasAlert: A Plug-and-play Tool for Social Bias Detection in
LLMs
link Fan, Zhiting,..., Zuozhu
12 2024-01-30 MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language
Models
link Kwan, Wai-Chung,..., Kam-Fai
12 2024-09-23 Beyond Turn-Based Interfaces: Synchronous LLMs as Full-Duplex Dialogue Agents link Veluri, Bandhav,..., Shyamnath
11 2024-06-18 AgentReview: Exploring Peer Review Dynamics with LLM Agents link Jin, Yiqiao,..., Jindong
11 2024-06-26 Glue pizza and eat rocks - Exploiting Vulnerabilities in
Retrieval-Augmented Generative Models
link Tan, Zhen,..., Huan
11 2024-07-19 RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented
Question Answering
link Han, Rujun,..., Vittorio
11 2024-06-28 Detection and Measurement of Syntactic Templates in Generated Text link Shaib, Chantal,..., Byron C
11 2024-06-23 ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods link Xie, Roy,..., Bhuwan
11 2024-04-01 An image speaks a thousand words, but can everyone
listen? On image transcreation for cultural relevance
link Khanuja, Simran,..., Graham
11 2023-11-01 The Mystery of In-Context Learning: A Comprehensive Survey on
Interpretation and Analysis
link Zhou, Yuxiang,..., Yulan
11 2024-04-23 Generate-on-Graph: Treat LLM as both Agent and KG for
Incomplete Knowledge Graph Question Answering
link Xu, Yao,..., Kang
11 2024-02-29 TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning link Sanders, Kate,..., Benjamin
11 2024-02-20 Defending Jailbreak Prompts via In-Context Adversarial Game link Zhou, Yujun,..., Xiangliang
11 2024-04-22 Filtered Direct Preference Optimization link Morimura, Tetsuro,..., Kaito
10 2024-04-21 ChatRetriever: Adapting Large Language Models for Generalized and Robust
Conversational Dense Retrieval
link Mao, Kelong,..., Zhicheng
10 2024-04-18 Aligning Language Models to Explicitly Handle Ambiguity link Kim, Hyuhng Joon,..., Taeuk
10 2024-01-01 LogicAsker: Evaluating and Improving the Logical Reasoning Ability of
Large Language Models
link Wan, Yuxuan,..., Michael
10 2024-06-21 Direct Multi-Turn Preference Optimization for Language Agents link Shi, Wentao,..., Fuli
10 2024-10-31 Hidden Persuaders: LLMs' Political Leaning and Their Influence on
Voters
link Potter, Yujin,..., Dawn
10 2024-06-11 RWKV-CLIP: A Robust Vision-Language Representation Learner link Gu, Tiancheng,..., Jiankang
10 2024-09-21 SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved
Information
link Sun, Jiashuo,..., Yu
10 2024-10-06 DAMRO: Dive into the Attention Mechanism of LVLM to
Reduce Object Hallucination
link Gong, Xuan,..., Zhihua
10 2024-02-21 Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster
Speculative Decoding
link Zhao, Weilin,..., Maosong
10 2024-06-17 Learn Beyond The Answer: Training Language Models with Reflection
for Mathematical Reasoning
link Zhang, Zhihan,..., Meng
10 2024-02-01 LLMs learn governing principles of dynamical systems, revealing an
in-context neural scaling law
link Liu, Toni J.b.,..., Christopher
10 2024-06-19 Finding Blind Spots in Evaluator LLMs with Interpretable Checklists link Doddapaneni, Sumanth,..., Mitesh M
10 2024-04-05 FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive
Feed Forward Skipping
link Jaiswal, Ajay Kumar,..., Aditya
10 2023-11-13 Fuse to Forget: Bias Reduction and Selective Memorization through
Model Fusion
link Zaman, Kerem,..., Shashank
10 2024-07-01 Pron vs Prompt: Can Large Language Models already Challenge
a World-Class Fiction Author at Creative Text Writing?
link Marco, Guillermo,..., Ram{\'o}n Del Castillo
10 2024-02-07 ApiQ: Finetuning of 2-Bit Quantized Large Language Model link Liao, Baohao,..., Christof
10 2024-07-21 MIBench: Evaluating Multimodal Large Language Models over Multiple Images link Liu, Haowei,..., Weiming
9 2024-03-31 Scaling Properties of Speech Language Models link Cuervo, Santiago,..., Ricard
9 2024-04-22 Fine-Tuning Large Language Models to Translate: Will a Touch
of Noisy Data in Misaligned Languages Suffice?
link Zhu, Dawei,..., Dietrich
9 2024-06-17 Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments link Zhou, Han,..., Anna
9 2024-02-20 Backward Lens: Projecting Language Model Gradients into the Vocabulary
Space
link Katz, Shahar,..., Lior
9 2024-02-13 PRompt Optimization in Multi-Step Tasks (PROMST): Integrating Human Feedback
and Heuristic-based Sampling
link Chen, Yongchao,..., Chuchu
9 2024-05-13 MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data
Augmentation for Mathematical Reasoning
link Yin, Shuo,..., Jinfeng
9 2023-11-16 How Far Can We Extract Diverse Perspectives from Large
Language Models?
link Hayati, Shirley Anugrah,..., Dongyeop
9 2024-02-22 Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic link Weir, Nathaniel,..., Benjamin
9 2024-06-27 Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens
Grounding
link Fan, Yue,..., Xin Eric
9 2024-08-27 Into the Unknown Unknowns: Engaged Human Learning through Participation
in Language Model Agent Conversations
link Jiang, Yucheng,..., Monica
9 2023-05-22 Atomic Inference for NLI with Generated Facts as Atoms link Stacey, Joe,..., Marek
9 2024-01-18 Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs link Puerto, Haritz,..., Iryna
9 2024-06-22 Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex
Models
link Zhang, Xinrong,..., Zhiyuan
9 2024-06-18 Defending Against Social Engineering Attacks in the Age of
LLMs
link Ai, Lin,..., Julia
9 2024-05-06 Lifelong Knowledge Editing for LLMs with Retrieval-Augmented Continuous Prompt
Learning
link Chen, Qizhou,..., Hui
9 2024-02-18 How Susceptible are Large Language Models to Ideological Manipulation? link Chen, Kai,..., Kristina
9 2024-06-25 CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue
Coreference
link Yu, Erxin,..., Lanqing
9 2024-07-09 Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction
Using Language Model
link Zhang, Wenqi,..., Yueting
9 2024-07-09 Virtual Personas for Language Models via an Anthology of
Backstories
link Moon, Suhong,..., David
9 2024-07-12 CompAct: Compressing Retrieved Documents Actively for Question Answering link Yoon, Chanwoong,..., Jaewoo
9 2024-04-25 Evaluating Large Language Models on Time Series Feature Understanding:
A Comprehensive Taxonomy and Benchmark
link Fons, Elizabeth,..., Svitlana
9 2024-05-21 GPT-4 Jailbreaks Itself with Near-Perfect Success Using Self-Explanation link Ramesh, Govind,..., Wei
8 2024-08-22 Large Language Models as Foundations for Next-Gen Dense Retrieval:
A Comprehensive Empirical Assessment
link Luo, Kun,..., Kang
8 2023-02-24 Retrieved Sequence Augmentation for Protein Representation Learning link Ma, Chang,..., Lingpeng
8 2024-01-27 Do We Need Language-Specific Fact-Checking Models? The Case of
Chinese
link Zhang, Caiqi,..., Andreas
8 2024-05-01 Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models link Ranaldi, Leonardo,..., Andre
8 2024-06-20 African or European Swallow? Benchmarking Large Vision-Language Models for
Fine-Grained Object Classification
link Geigle, Gregor,..., Goran
8 2024-06-28 From the Least to the Most: Building a Plug-and-Play
Visual Reasoner via Data Synthesis
link Cheng, Chuanqi,..., Rui
8 2024-06-20 Investigating Mysteries of CoT-Augmented Distillation link Wadhwa, Somin,..., Byron C
8 2024-06-02 Automatic Instruction Evolving for Large Language Models link Zeng, Weihao,..., Weizhu
8 2024-06-16 Towards Understanding Jailbreak Attacks in LLMs: A Representation Space
Analysis
link Lin, Yuping,..., Jiliang
8 None Towards Injecting Medical Visual Knowledge into Multimodal LLMs at
Scale
link Chen, Junying,..., Benyou
8 2023-12-14 Towards Verifiable Text Generation with Evolving Memory and Self-Reflection link Sun, Hao,..., Dawei
8 2024-06-21 FIRST: Faster Improved Listwise Reranking with Single Token Decoding link Gangi Reddy, Revanth,..., Heng
8 2024-06-27 RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs link Taktasheva, Ekaterina,..., Vladislav
8 2024-04-22 Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global
Tuple Extraction
link Deng, Zheye,..., Yangqiu
8 2024-03-25 If CLIP Could Talk: Understanding Vision-Language Model Representations Through
Their Preferred Concept Descriptions
link Esfandiarpoor, Reza,..., Stephen
8 2024-02-25 Efficient Temporal Extrapolation of Multimodal Large Language Models with
Temporal Grounding Bridge
link Wang, Yuxuan,..., Zilong
8 2024-06-24 OmAgent: A Multi-modal Agent Framework for Complex Video Understanding
with Task Divide-and-Conquer
link Zhang, Lu,..., Kyusong
8 2024-07-15 Fine-Tuning and Prompt Optimization: Two Great Steps that Work
Better Together
link Soylu, Dilara,..., Omar
8 2024-06-24 Segment Any Text: A Universal Approach for Robust, Efficient
and Adaptable Sentence Segmentation
link Frohmann, Markus,..., Markus
8 2024-07-02 Why Does New Knowledge Create Messy Ripple Effects in
LLMs?
link Qin, Jiaxin,..., Heng
8 2024-09-11 SUPER: Evaluating Agents on Setting Up and Executing Tasks
from Research Repositories
link Bogin, Ben,..., Tushar
8 2024-05-30 PATIENT-$\psi$: Using Large Language Models to Simulate Patients for
Training Mental Health Professionals
link Wang, Ruiyi,..., Zhiyu
8 2024-02-19 AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies link Ye, Xiao,..., Daniel
8 2024-06-18 Evaluating $n$-Gram Novelty of Language Models Using Rusty-DAWG link Merrill, William,..., Yanai
8 2024-06-29 Is It Really Long Context if All You Need
Is Retrieval? Towards Genuinely Difficult Long Context NLP
link Goldman, Omer,..., Reut
8 2024-04-24 Annotator-Centric Active Learning for Subjective NLP Tasks link van der Meer, Michiel,..., Enrico
8 2024-04-10 Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation link Pan, Ruotong,..., Le
7 2024-02-17 When LLMs Meets Acoustic Landmarks: An Efficient Approach to
Integrate Speech into Large Language Models for Depression Detection
link Zhang, Xiangyu,..., Julien
7 2024-06-26 AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language
Models Fine-Tuning
link Yang, Yifan,..., Zheng
7 2024-06-16 RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained
Language Model for Knowledge Editing and Fine-tuning
link Wang, Haoyu,..., Jing
7 2024-02-22 Triad: A Framework Leveraging a Multi-Role LLM-based Agent to
Solve Knowledge Base Question Answering
link Zong, Chang,..., Yueting
7 2024-06-07 CHIQ: Contextual History Enhancement for Improving Query Rewriting in
Conversational Search
link Mo, Fengran,..., Jian-Yun
7 2024-04-03 Calibrating the Confidence of Large Language Models by Eliciting
Fidelity
link Zhang, Mozhi,..., Xipeng
7 2024-09-12 On the Role of Context in Reading Time Prediction link Opedal, Andreas,..., Ethan
7 2024-06-18 Estimating Knowledge in Large Language Models Without Generating a
Single Token
link Gottesman, Daniela,..., Mor
7 2023-11-15 When Is Multilinguality a Curse? Language Modeling for 250
High- and Low-Resource Languages
link Chang, Tyler A.,..., Ben
7 2024-06-14 SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite
for Southeast Asian Languages
link Lovenia, Holy,..., Samuel
7 2024-08-28 EPO: Hierarchical LLM Agents with Environment Preference Optimization link Zhao, Qi,..., George
7 2024-04-17 Neuron Specialization: Leveraging Intrinsic Task Modularity for Multilingual Machine
Translation
link Tan, Shaomu,..., Christof
7 2024-06-17 MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language
Model
link Huo, Jiahao,..., Xuming
7 2024-05-08 ADELIE: Aligning Large Language Models on Information Extraction link Qi, Yunjia,..., Juanzi
7 2024-06-15 MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models
for E-commerce Purchase Understanding
link Xu, Baixuan,..., Yangqiu
7 2024-06-21 PARIKSHA: A Large-Scale Investigation of Human-LLM Evaluator Agreement on
Multilingual and Multi-Cultural Data
link Watts, Ishaan,..., Sunayana
7 2024-09-25 VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language
Models
link Liu, Yifei,..., Mao
7 2024-09-20 Unlocking Memorization in Large Language Models with Dynamic Soft
Prompting
link Wang, Zhepeng,..., Yanfu
7 2024-06-18 LLMs Are Prone to Fallacies in Causal Inference link Joshi, Nitish,..., He
7 2024-03-05 Evidence-Focused Fact Summarization for Knowledge-Augmented Zero-Shot Question Answering link Ko, Sungho,..., Dongha
7 2023-05-23 ReadMe++: Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment link Naous, Tarek,..., Wei
7 2024-03-08 Consecutive Batch Model Editing with HooK Layers link Li, Shuaiyi,..., Wai
7 2024-10-19 Are LLMs Good Zero-Shot Fallacy Classifiers? link Pan, Fengjun,..., Anh Tuan
7 2024-05-28 Can Automatic Metrics Assess High-Quality Translations? link Agrawal, Sweta,..., Andre
7 2024-06-17 Small Agent Can Also Rock! Empowering Small Language Models
as Hallucination Detector
link Cheng, Xiaoxue,..., Ji-Rong
7 2024-06-24 RaTEScore: A Metric for Radiology Report Generation link Zhao, Weike,..., Weidi
7 2024-01-13 Leveraging Large Language Models for NLG Evaluation: Advances and
Challenges
link Li, Zhen,..., Shuai
7 2024-07-21 Intrinsic Self-correction for Enhanced Morality: An Analysis of Internal
Mechanisms and the Superficial Hypothesis
link Liu, Guangliang,..., Kristen
7 2024-12-18 MedCoT: Medical Chain of Thought via Hierarchical Expert link Liu, Jiaxiang,..., Zuozhu
7 2024-04-16 D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness
Detection and Evaluation
link Mostafazadeh Davani, Aida,..., Vinodkumar
7 2024-06-16 FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese
Food Culture
link Li, Wenyan,..., Desmond
7 2024-06-21 Evaluating Diversity in Automatic Poetry Generation link Chen, Yanran,..., Steffen
7 2024-06-06 ArMeme: Propagandistic Content in Arabic Memes link Alam, Firoj,..., Maram
6 None CryptoTrade: A Reflective LLM-based Agent to Guide Zero-shot Cryptocurrency
Trading
link Li, Yuan,..., Bingsheng
6 2024-06-17 Prefixing Attention Sinks can Mitigate Activation Outliers for Large
Language Model Quantization
link Son, Seungwoo,..., Jaeho
6 2024-06-28 LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model
Editing of Large Language Models
link Wang, Renzhi,..., Piji
6 2024-02-05 How do Large Language Models Learn In-Context? Query and
Key Matrices of In-Context Heads are Two Towers for Metric Learning
link Yu, Zeping,..., Sophia
6 2024-09-21 Interpreting Arithmetic Mechanism in Large Language Models through Comparative
Neuron Analysis
link Yu, Zeping,..., Sophia
6 2024-06-17 GoldCoin: Grounding Large Language Models in Privacy Laws via
Contextual Integrity Theory
link Fan, Wei,..., Yangqiu
6 2024-09-29 CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph
Question Answering
link Wu, Yike,..., Jeff Z.
6 2024-06-17 When Reasoning Meets Information Aggregation: A Case Study with
Sports Narratives
link Hu, Yebowen,..., Fei
6 2024-03-25 Outcome-Constrained Large Language Models for Countering Hate Speech link Hong, Lingzi,..., Xiaoying
6 2024-06-21 Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive
LLM Generation
link Lyu, Yuanjie,..., Enhong
6 2024-06-16 Concept-skill Transferability-based Data Selection for Large Vision-Language Models link Lee, Jaewoo,..., Sung Ju
6 2024-06-21 Safely Learning with Private Data: A Federated Learning Framework
for Large Language Model
link Zheng, Jia-Ying,..., Zhi-Ming
6 2024-02-27 REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering link Wang, Yuhao,..., Ji-Rong
6 2024-06-24 C-LLM: Learn to Check Chinese Spelling Errors Character by
Character
link Li, Kunting,..., Jie
6 2024-06-18 CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large
Language Models
link Li, Yuetai,..., Radha
6 None Enhancing Reinforcement Learning with Dense Rewards from Language Model
Critic
link Cao, Meng,..., Lei
6 2023-05-23 APPLS: Evaluating Evaluation Metrics for Plain Language Summarization link Guo, Yue,..., Lucy Lu
6 2022-12-20 Ontologically Faithful Generation of Non-Player Character Dialogues link Weir, Nathaniel,..., Harsh
6 2024-04-01 TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering link Shang, Chuyi,..., Roei
6 2024-06-18 TroL: Traversal of Layers for Large Language and Vision
Models
link Lee, Byung-Kwan,..., Yong Man
6 2024-11-06 Medical Adaptation of Large Language and Vision-Language Models: Are
We Making Progress?
link Jeong, Daniel P,..., Michael
6 2024-10-09 DA-Code: Agent Data Science Code Generation Benchmark for Large
Language Models
link Huang, Yiming,..., Kang
6 2024-10-10 AppBench: Planning of Multiple APIs from Various APPs for
Complex User Instruction
link Wang, Hongru,..., Kam-Fai
6 2024-06-03 Re-ReST: Reflection-Reinforced Self-Training for Language Agents link Dou, Zi-Yi,..., Nanyun
6 2024-06-24 Scaling Laws for Linear Complexity Language Models link Shen, Xuyang,..., Yiran
6 2024-06-17 Community-Cross-Instruct: Unsupervised Instruction Generation for Aligning Large Language Models
to Online Communities
link He, Zihao,..., Kristina
6 2024-09-15 Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning
in Large Vision-Language Models
link Liao, Yuan-Hong,..., David
6 2024-06-21 From LLMs to MLLMs: Exploring the Landscape of Multimodal
Jailbreaking
link Wang, Siyuan,..., Zhongyu
6 2024-04-11 LLoCO: Learning Long Contexts Offline link Tan, Sijun,..., Raluca Ada
6 2024-02-25 Don`t Forget Your Reward Values: Language Model Alignment via
Value-based Calibration
link Mao, Xin,..., Anh Tuan
6 2024-10-09 Is C4 Dataset Optimal for Pruning? An Investigation of
Calibration Data for LLM Pruning
link Bandari, Abhinav,..., Shiwei
6 2023-11-28 Eliciting In-Context Learning in Vision-Language Models for Videos Through
Curated Data Distributional Properties
link Yu, Keunwoo Peter,..., Joyce
6 2024-06-17 Safety Arithmetic: A Framework for Test-time Safety Alignment of
Language Models by Steering Parameters and Activations
link Hazra, Rima,..., Soujanya
6 None Do LLMs Plan Like Human Writers? Comparing Journalist Coverage
of Press Releases with LLMs
link Spangher, Alexander,..., Mark
6 2024-06-20 Holistic Evaluation for Interleaved Text-and-Image Generation link Liu, Minqian,..., Lifu
6 2024-08-07 Is Child-Directed Speech Effective Training Data for Language Models? link Feng, Steven Y.,..., Michael
6 2024-10-23 LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question
Answering
link Zhao, Qingfei,..., Jie
6 2024-01-24 Instruction Fine-Tuning: Does Prompt Loss Matter? link Huerta-Enochian, Mathew,..., Seung Yong
5 2024-02-22 A Usage-centric Take on Intent Understanding in E-Commerce link Zhou, Wendi,..., Jeff Z.
5 2024-04-17 Consolidating Ranking and Relevance Predictions of Large Language Models
through Post-Processing
link Yan, Le,..., Harrie
5 2024-02-15 EFUF: Efficient Fine-Grained Unlearning Framework for Mitigating Hallucinations in
Multimodal Large Language Models
link Xing, Shangyu,..., Xinyu
5 2024-06-17 Tracking the perspectives of interacting language models link Helm, Hayden,..., Carey
5 2024-09-21 Enhancing Advanced Visual Reasoning Ability of Large Language Models link Li, Zhiyuan,..., Weidong
5 2023-08-16 CMD: a framework for Context-aware Model self-Detoxification link Tang, Zecheng,..., Min
5 2024-07-15 By My Eyes: Grounding Multimodal Large Language Models with
Sensor Data via Visual Prompting
link Yoon, Hyungjun,..., Sung-Ju
5 2024-05-15 Word Alignment as Preference for Machine Translation link Wu, Qiyu,..., Yoshimasa
5 2024-04-22 A User-Centric Multi-Intent Benchmark for Evaluating Large Language Models link Wang, Jiayin,..., Jian-Yun
5 2024-09-21 PTD-SQL: Partitioning and Targeted Drilling with LLMs in Text-to-SQL link Luo, Ruilin,..., Yujiu
5 2024-01-30 Conditional and Modal Reasoning in Large Language Models link Holliday, Wesley H.,..., Cedegao E.
5 2024-05-31 Unveiling the Lexical Sensitivity of LLMs: Combinatorial Optimization for
Prompt Enhancement
link Zhan, Pengwei,..., Ru
5 2024-09-23 Pretraining Data Detection for Large Language Models: A Divergence-based
Calibration Method
link Zhang, Weichao,..., Xueqi
5 2024-04-19 How Does the Textual Information Affect the Retrieval of
Multimodal In-Context Learning?
link Luo, Yang,..., Yang
5 None Adaption-of-Thought: Learning Question Difficulty Improves Large Language Models for
Reasoning
link Xu, Mayi,..., Tieyun
5 2024-03-03 Right for Right Reasons: Large Language Models for Verifiable
Commonsense Knowledge Graph Question Answering
link Toroghi, Armin,..., Scott
5 None Does Large Language Model Contain Task-Specific Neurons? link Song, Ran,..., Zhengtao
5 2024-04-17 Position Engineering: Boosting Large Language Models through Positional Information
Manipulation
link He, Zhiyuan,..., Lili
5 2024-08-17 FEDKIM: Adaptive Federated Knowledge Injection into Medical Foundation Models link Wang, Xiaochen,..., Fenglong
5 2024-07-12 Stepwise Verification and Remediation of Student Reasoning Errors with
Large Language Model Tutors
link Daheim, Nico,..., Mrinmaya
5 2024-07-24 Revisiting Who`s Harry Potter: Towards Targeted Unlearning from a
Causal Intervention Perspective
link Liu, Yujian,..., Shiyu
5 2024-06-28 Token Erasure as a Footprint of Implicit Vocabulary Items
in LLMs
link Feucht, Sheridan,..., David
5 None Fine-grained Pluggable Gradient Ascent for Knowledge Unlearning in Language
Models
link Feng, XiaoHua,..., Zibin
5 2024-06-30 Towards Robust Speech Representation Learning for Thousands of Languages link Chen, William,..., Shinji
5 2024-07-23 PreAlign: Boosting Cross-Lingual Transfer by Early Establishment of Multilingual
Alignment
link Li, Jiahuan,..., Jiajun
5 2024-06-21 Decoding Matters: Addressing Amplification Bias and Homogeneity Issue in
Recommendations for Large Language Models
link Bao, Keqin,..., Fuli
5 2024-05-28 ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented
Generator
link Zhu, Junda,..., Lei
5 2024-01-10 Towards Online Continuous Sign Language Recognition and Translation link Zuo, Ronglai,..., Brian
5 2024-06-26 PrExMe! Large Scale Prompt Exploration of Open Source LLMs
for Machine Translation and Summarization Evaluation
link Leiter, Christoph,..., Steffen
5 2024-06-14 SciEx: Benchmarking Large Language Models on Scientific Exams with
Human Expert Grading and Automatic Grading
link Dinh, Tu Anh,..., Jan
5 2024-05-23 Large Language Models Can Self-Correct with Key Condition Verification link Wu, Zhenyu,..., Meng
5 2024-10-14 How to Leverage Demonstration Data in Alignment for Large
Language Model? A Self-Imitation Learning Perspective
link Xiao, Teng,..., Vasant G
5 2024-10-01 Style-Specific Neurons for Steering LLMs in Text Style Transfer link Lai, Wen,..., Alexander
5 2024-02-17 Grasping the Essentials: Tailoring Large Language Models for Zero-Shot
Relation Extraction
link Zhou, Sizhe,..., Jiawei
5 None Large Language Models Are Poor Clinical Decision-Makers: A Comprehensive
Benchmark
link Liu, Fenglin,..., David A.
5 2024-04-17 Related Work and Citation Text Generation: A Survey link Li, Xiangci,..., Jessica
5 2024-02-17 KnowTuning: Knowledge-aware Fine-tuning for Large Language Models link Lyu, Yougang,..., Zhaochun
5 2024-06-18 Bridging Local Details and Global Context in Text-Attributed Graphs link Wang, Yaoke,..., Siliang
5 2024-10-31 Commonsense Knowledge Editing Based on Free-Text in LLMs link Huang, Xiusheng,..., Kang
5 2024-06-25 GRASS: Compute Efficient Low-Memory LLM Training with Structured Sparse
Gradients
link Muhamed, Aashiq,..., Virginia
5 2024-07-10 LitSearch: A Retrieval Benchmark for Scientific Literature Search link Ajith, Anirudh,..., Tianyu
5 2024-07-19 ECCO: Can We Improve Model-Generated Code Efficiency Without Sacrificing
Functional Correctness?
link Waghjale, Siddhant,..., Daniel
5 2024-10-02 InfiniPot: Infinite Context Processing on Memory-Constrained LLMs link Kim, Minsoo,..., Simyung
5 2024-02-08 On the Robustness of Editing Large Language Models link Ma, Xinbei,..., Yulong
5 2024-10-17 Towards Cross-Cultural Machine Translation with Retrieval-Augmented Generation from Multilingual
Knowledge Graphs
link Conia, Simone,..., Yunyao
5 2024-08-24 Symbolic Working Memory Enhances Language Models for Complex Rule
Application
link Wang, Siyuan,..., Xiang
5 2024-06-19 Data Contamination Can Cross Language Barriers link Yao, Feng,..., Jingbo
5 2024-09-29 Calibrating Language Models with Adaptive Temperature Scaling link Xie, Johnathan,..., Chelsea
5 2024-07-11 On the Universal Truthfulness Hyperplane Inside LLMs link Liu, Junteng,..., Junxian
5 2024-10-03 On the Proper Treatment of Tokenization in Psycholinguistics link Giulianelli, Mario,..., Ryan
5 2024-05-16 SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation link Divekar, Abhishek,..., Greg
5 2024-08-09 DataNarrative: Automated Data-Driven Storytelling with Visualizations and Texts link Islam, Mohammed Saidul,..., Shafiq
5 2024-07-20 Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs
Falter?
link Tyagi, Nemika,..., Chitta
5 2024-11-14 Unveiling Multi-level and Multi-modal Semantic Representations in the Human
Brain using Large Language Models
link Nakagi, Yuko,..., Yu
5 2024-04-17 Advancing Social Intelligence in AI Agents: Technical Challenges and
Open Questions
link Mathur, Leena,..., Louis-Philippe
5 2024-07-09 Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules link Gong, Zhuocheng,..., Rui
5 2024-11-06 No Culture Left Behind: ArtELingo-28, a Benchmark of WikiArt
with Captions in 28 Languages
link Mohamed, Youssef,..., Mohamed
5 2024-11-09 An Empirical Analysis on Spatial Reasoning Capabilities of Large
Multimodal Models
link Shiri, Fatemeh,..., Yuan-Fang
5 2024-06-17 STAR: SocioTechnical Approach to Red Teaming Language Models link Weidinger, Laura,..., William
5 2024-05-03 Assessing and Verifying Task Utility in LLM-Powered Applications link Arabzadeh, Negar,..., Julia
5 2024-06-09 RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance
Estimator in Retrieval-Augmented Generation
link Kim, Kiseung,..., Jay-Yoon
5 2024-07-22 Improving Minimum Bayes Risk Decoding with Multi-Prompt link Heineman, David,..., Wei
5 2023-12-06 Mitigating Open-Vocabulary Caption Hallucinations link Ben-Kish, Assaf,..., Hadar
4 2024-03-11 Strength Lies in Differences! Improving Strategy Planning for Non-collaborative
Dialogues via Diversified User Simulation
link Zhang, Tong,..., Tat-Seng
4 2024-04-10 Personality-aware Student Simulation for Conversational Intelligent Tutoring Systems link Liu, Zhengyuan,..., Nancy F.
4 2024-06-18 Can Large Language Models Always Solve Easy Problems if
They Can Solve Harder Ones?
link Yang, Zhe,..., Zhifang
4 2024-10-16 Rethinking Token Reduction for State Space Models link Zhan, Zheng,..., Yanzhi
4 2024-09-30 HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning
with Vision-enhanced Penalty Decoding
link Yuan, Fan,..., Piji
4 2024-04-19 AutoScraper: A Progressive Understanding Web Agent for Web Scraper
Generation
link Huang, Wenhao,..., Zulong
4 2024-06-18 From Insights to Actions: The Impact of Interpretability and
Analysis Research on NLP
link Mosbach, Marius,..., Mor
4 2023-11-14 Toxicity Detection is NOT all you Need: Measuring the
Gaps to Supporting Volunteer Content Moderators through a User-Centric Method
link Cao, Yang Trista,..., Hal
4 2024-06-22 Teaching LLMs to Abstain across Languages via Multilingual Feedback link Feng, Shangbin,..., Yulia
4 2024-09-30 World to Code: Multi-modal Data Generation via Self-Instructed Compositional
Captioning and Filtering
link Wang, Jiacong,..., Jun
4 2024-02-24 How Do Humans Write Code? Large Models Do It
the Same Way Too
link Li, Long,..., Liang
4 2024-11-07 Bayesian Calibration of Win Rate Estimation with LLM Evaluators link Gao, Yicheng,..., Arman
4 2023-12-03 NLEBench+NorGLM: A Comprehensive Empirical Analysis and Benchmark Dataset for
Generative Language Models in Norwegian
link Liu, Peng,..., Zhirong
4 2024-06-19 Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation link Qi, Jirui,..., Arianna
4 2024-04-11 An Audit on the Perspectives and Challenges of Hallucinations
in NLP
link Narayanan Venkit, Pranav,..., Shomir
4 2024-06-18 InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding
in an Interactive Game Context
link Liu, Ziyi,..., Jieyu
4 2024-05-09 Efficient LLM Comparative Assessment: A Product of Experts Framework
for Pairwise Comparisons
link Liusie, Adian,..., Mark
4 2024-02-16 Python is Not Always the Best Choice: Embracing Multilingual
Program of Thoughts
link Luo, Xianzhen,..., Wanxiang
4 2024-08-06 Unveiling Factual Recall Behaviors of Large Language Models through
Knowledge Neurons
link Wang, Yifei,..., Daniel Dajun
4 2024-05-28 More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-Specific
LLMs
link Liu, Chengyuan,..., Fei
4 2024-06-23 Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in
Large Language Models
link Men, Tianyi,..., Jun
4 2024-07-02 Breaking Language Barriers: Cross-Lingual Continual Pre-Training at Scale link Zheng, Wenzhen,..., Ming
4 2024-10-06 Empowering Backbone Models for Visual Text Generation with Input
Granularity Control and Glyph-Aware Training
link Li, Wenbo,..., Jinsong
4 2024-10-09 CoBa: Convergence Balancer for Multitask Finetuning of Large Language
Models
link Gong, Zi,..., Jianguo
4 2024-07-10 Attribute or Abstain: Large Language Models as Long Document
Assistants
link Buchmann, Jan,..., Iryna
4 2024-07-04 Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond
Single Attributes
link Hirota, Yusuke,..., Alice
4 2024-05-13 MetaReflection: Learning Instructions for Language Agents using Past Reflections link Gupta, Priyanshu,..., Sherry
4 None Jellyfish: Instruction-Tuning Local Large Language Models for Data Preprocessing link Zhang, Haochen,..., Masafumi
4 2024-06-29 From RAG to Riches: Retrieval Interlaced with Sequence Generation link Jain, Palak,..., Tom
4 2024-06-20 PostMark: A Robust Blackbox Watermark for Large Language Models link Chang, Yapei,..., Mohit
4 2024-06-27 DiVERT: Distractor Generation with Variational Errors Represented as Text
for Math Multiple-choice Questions
link Fernandez, Nigel,..., Andrew
4 2024-02-20 Structure Guided Prompt: Instructing Large Language Model in Multi-Step
Reasoning by Exploring Graph Structure of the Text
link Cheng, Kewei,..., Yizhou
4 2024-10-21 Susu Box or Piggy Bank: Assessing Cultural Commonsense Knowledge
between Ghana and the US
link Acquaye, Christabel,..., Rachel
4 2024-06-05 Ranking Manipulation for Conversational Search Engines link Pfrommer, Samuel,..., Somayeh
4 2024-07-09 STORYSUMM: Evaluating Faithfulness in Story Summarization link Subbiah, Melanie,..., Kathleen
4 2024-06-28 Detecting Subtle Differences between Human and Model Languages Using
Spectrum of Relative Likelihood
link Xu, Yang,..., Yongyuan
4 2024-02-17 I Learn Better If You Speak My Language: Understanding
the Superior Performance of Fine-Tuning Large Language Models with LLM-Generated Responses
link Ren, Xuan,..., Lingqiao
4 2024-06-25 Encourage or Inhibit Monosemanticity? Revisit Monosemanticity from a Feature
Decorrelation Perspective
link Yan, Hanqi,..., Yulan
4 2024-02-27 AmbigNLG: Addressing Task Ambiguity in Instruction for NLG link Niwa, Ayana,..., Hayate
4 2024-06-28 Paraphrase Types Elicit Prompt Engineering Capabilities link Wahle, Jan Philip,..., Bela
4 2024-10-21 Improve Dense Passage Retrieval with Entailment Tuning link Dai, Lu,..., Hui
4 2024-06-18 What Are the Odds? Language Models Are Capable of
Probabilistic Reasoning
link Paruchuri, Akshay,..., Daniel
4 None Empowering Multi-step Reasoning across Languages via Program-Aided Language Models link Ranaldi, Leonardo,..., Alexandra
4 2024-03-11 GlossLM: A Massively Multilingual Corpus and Pretrained Model for
Interlinear Glossed Text
link Ginn, Michael,..., Lori
4 2024-07-24 Unveiling In-Context Learning: A Coordinate System to Understand Its
Working Mechanism
link Zhao, Anhao,..., Xiaoyu
4 2024-01-23 SLANG: New Concept Comprehension of Large Language Models link Mei, Lingrui,..., Xueqi
4 2024-05-21 Atomic Self-Consistency for Better Long Form Generations link Thirukovalluru, Raghuveer,..., Bhuwan
4 2024-02-17 Turn Waste into Worth: Rectifying Top-$k$ Router of MoE link Zeng, Zhiyuan,..., Xipeng
4 2024-10-01 Preserving Generalization of Language models in Few-shot Continual Relation
Extraction
link Tran, Quyen,..., Thien Huu
4 2024-10-14 MAIR: A Massive Benchmark for Evaluating Instructed Retrieval link Sun, Weiwei,..., Zhaochun
4 2024-06-27 Tools Fail: Detecting Silent Errors in Faulty Tools link Sun, Jimin,..., Yonatan
4 None More DWUGs: Extending and Evaluating Word Usage Graph Datasets
in Multiple Languages
link Schlechtweg, Dominik,..., Nina
4 2024-06-22 Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to
the Next Level
link Feng, Zhaopeng,..., Zuozhu
4 2024-07-01 DogeRM: Equipping Reward Models with Domain Knowledge through Model
Merging
link Lin, Tzu-Han,..., Yun-Nung
4 2024-10-09 Mitigating the Language Mismatch and Repetition Issues in LLM-based
Machine Translation via Model Editing
link Wang, Weichuan,..., Ying
4 2024-06-17 Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic
Prompting
link Mukherjee, Sagnik,..., Monojit
4 2024-10-03 Hate Personified: Investigating the role of LLMs in content
moderation
link Masud, Sarah,..., Tanmoy
4 2024-06-26 Themis: A Reference-free NLG Evaluation Language Model with Flexibility
and Interpretability
link Hu, Xinyu,..., Xiaojun
4 2024-11-06 WorryWords: Norms of Anxiety Association for over 44k English
Words
link Mohammad, Saif M.
4 2024-02-23 Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions link Neo, Clement,..., Fazl
4 2024-06-27 The Odyssey of Commonsense Causality: From Foundational Benchmarks to
Cutting-Edge Reasoning
link Cui, Shaobo,..., Boi
4 2024-10-05 Adaptive Question Answering: Enhancing Language Model Proficiency for Addressing
Knowledge Conflicts with Source Citations
link Shaier, Sagi,..., Philip V.
4 2024-05-03 MedReadMe: A Systematic Study for Fine-grained Sentence Readability in
Medical Domain
link Jiang, Chao,..., Wei
4 2024-09-23 MemeCLIP: Leveraging CLIP Representations for Multimodal Meme Classification link Shah, Siddhant Bikram,..., Haohan
4 2023-11-16 StorySparkQA: Expert-Annotated QA Pairs with Real-World Knowledge for Children`s
Story-Based Learning
link Chen, Jiaju,..., Yuling
4 2024-01-26 HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy link Liu, YongKang,..., Hinrich
4 2024-05-16 Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift in
Fine-tuning LLMs for Simultaneous Translation
link Raffel, Matthew,..., Lizhong
4 2024-02-20 Enhanced Hallucination Detection in Neural Machine Translation through Simple
Detector Aggregation
link Himmi, Anas,..., Nuno M
4 2024-06-06 BLSP-Emo: Towards Empathetic Large Speech-Language Models link Wang, Chen,..., Jiajun
4 2024-10-22 Altogether: Image Captioning via Re-aligning Alt-text link Xu, Hu,..., Christoph
4 2024-07-04 Investigating the Role of Instruction Variety and Task Difficulty
in Robotic Manipulation Tasks
link Parekh, Amit,..., Ioannis
4 2024-04-12 The Generation Gap: Exploring Age Bias in the Value
Systems of Large Language Models
link Liu, Siyang,..., Rada
4 2024-07-08 Perceptions to Beliefs: Exploring Precursory Inferences for Theory of
Mind in Large Language Models
link Jung, Chani,..., Hyunwoo
4 2024-02-21 SYNFAC-EDIT: Synthetic Imitation Edit Feedback for Factual Alignment in
Clinical Summarization
link Mishra, Prakamya,..., Hong
4 2024-02-23 DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be
Better Context-aware Translators
link Lyu, Xinglin,..., Min
4 2024-11-08 SpecHub: Provable Acceleration to Multi-Draft Speculative Decoding link Sun, Ryan,..., Lichao
4 2024-04-23 Bayesian Example Selection Improves In-Context Learning for Speech, Text
and Visual Modalities
link Wang, Siyin,..., Chao
4 2024-02-21 Investigating Multilingual Instruction-Tuning: Do Polyglot Models Demand for Multilingual
Instructions?
link Weber, Alexander Arno,..., Mehdi
4 2024-06-27 Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in
a supervised-friendly fashion
link Flet-Berliac, Yannis,..., Matthieu
4 None Can LLMs Learn Uncertainty on Their Own? Expressing Uncertainty
Effectively in A Self-Training Manner
link Liu, Shudong,..., Min
4 2024-06-27 T-FREE: Subword Tokenizer-Free Generative LLMs via Sparse Representations for
Memory-Efficient Embeddings
link Deiseroth, Bj{\"o}rn,..., Samuel
4 2024-04-18 Simultaneous Interpretation Corpus Construction by Large Language Models in
Distant Language Pair
link Sakai, Yusuke,..., Taro
3 2023-11-13 Prompts have evil twins link Melamed, Rimon,..., Enric
3 2023-12-21 EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer
in Speech-to-Speech Models
link de Seyssel, Maureen,..., Emmanuel
3 None On Fake News Detection with LLM Enhanced Semantics Mining link Ma, Xiaoxiao,..., Hao
3 2024-02-20 On Sensitivity of Learning with Limited Labelled Data to
the Effects of Randomness: Impact of Interactions and Systematic Choices
link Pecher, Branislav,..., Maria
3 2024-06-28 Learning Interpretable Legal Case Retrieval via Knowledge-Guided Case Reformulation link Deng, Chenlong,..., Zhicheng
3 2023-11-27 Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation link Zhang, Yuhui,..., Alexander T
3 2024-08-02 QUDSELECT: Selective Decoding for Questions Under Discussion Parsing link Suvarna, Ashima,..., Nanyun
3 2024-08-21 UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval
and Generation
link Zhao, Xiangyu,..., Xiao-Ming
3 2024-02-19 Standardize: Aligning Language Models with Expert-Defined Standards for Content
Generation
link Imperial, Joseph Marvin,..., Harish
3 2024-06-26 MatchTime: Towards Automatic Soccer Game Commentary Generation link Rao, Jiayuan,..., Weidi
3 2024-06-29 Advancing Process Verification for Large Language Models via Tree-Based
Preference Learning
link He, Mingqian,..., Weiming
3 2024-07-03 VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values link Hu, Zhe,..., Yu
3 2024-07-07 Large Language Model as an Assignment Evaluator: Insights, Feedback,
and Challenges in a 1000+ Student Course
link Chiang, Cheng-Han,..., Hung-yi
3 2024-09-08 Seemingly Plausible Distractors in Multi-Hop Reasoning: Are Large Language
Models Attentive Readers?
link Bhuiya, Neeladri,..., Stefan
3 2024-01-19 Knowledge Verification to Nip Hallucination in the Bud link Wan, Fanqi,..., Shuming
3 2024-02-29 Whispers that Shake Foundations: Analyzing and Mitigating False Premise
Hallucinations in Large Language Models
link Yuan, Hongbang,..., Jun
3 2024-02-02 KB-Plugin: A Plug-and-play Framework for Large Language Models to
Induce Programs over Low-resourced Knowledge Bases
link Zhang, Jiajie,..., Juanzi
3 2024-09-23 CUTE: Measuring LLMs' Understanding of Their Tokens link Edman, Lukas,..., Alexander
3 2024-04-11 On Training Data Influence of GPT Models link Chai, Yekun,..., Hua
3 2023-12-19 Neuron-Level Knowledge Attribution in Large Language Models link Yu, Zeping,..., Sophia
3 2024-10-04 Is Safer Better? The Impact of Guardrails on the
Argumentative Strength of LLMs in Hate Speech Countering
link Bonaldi, Helena,..., Marco
3 2024-07-10 Decompose and Compare Consistency: Measuring VLMs' Answer Reliability via
Task-Decomposition Consistency Comparison
link Yang, Qian,..., Aishwarya
3 2024-04-16 Incubating Text Classifiers Following User Instruction with Nothing but
LLM
link Peng, Letian,..., Jingbo
3 2024-10-17 Advancing Large Language Model Attribution through Self-Improving link Huang, Lei,..., Bing
3 2024-08-16 Where is the signal in tokenization space? link Geh, Renato,..., Guy
3 2024-09-28 Crafting Personalized Agents through Retrieval-Augmented Generation on Editable Memory
Graphs
link Wang, Zheng,..., Wei
3 None Modeling Nonnative Sentence Processing with L2 Language Models link Aoyama, Tatsuya,..., Nathan
3 2024-04-07 Cross-Domain Audio Deepfake Detection: Dataset and Analysis link Li, Yuang,..., Hao
3 2024-10-01 Concept Space Alignment in Multilingual LLMs link Peng, Qiwei,..., Anders
3 2024-10-02 Quantifying the Gaps Between Translation and Native Perception in
Training for Multimodal, Multilingual Retrieval
link Buettner, Kyle,..., Adriana
3 2022-10-09 Fine-Grained Detection of Solidarity for Women and Migrants in
155 Years of German Parliamentary Debates
link Kostikova, Aida,..., Steffen
3 2024-06-17 CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling link Bai, Yu,..., Jackie CK
3 2024-08-26 Focused Large Language Models are Stable Many-Shot Learners link Yuan, Peiwen,..., Kan
3 2024-07-09 ChatGPT Doesn`t Trust Chargers Fans: Guardrail Sensitivity in Context link Li, Victoria R,..., Naomi
3 2024-06-16 Optimized Speculative Sampling for GPU Hardware Accelerators link Wagner, Dominik,..., Tobias
3 2024-02-05 Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks
in Conversations with GPT Models
link Chu, Junjie,..., Yang
3 2024-02-04 Can Large Language Models Learn Independent Causal Mechanisms? link Gendron, Gael,..., Gillian
3 2024-06-18 Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning
in Large Language Models
link Mondorf, Philipp,..., Barbara
3 2024-05-09 Muting Whisper: A Universal Acoustic Adversarial Attack on Speech
Foundation Models
link Raina, Vyas,..., Mark
3 2023-11-15 XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in
LLMs
link Chen, Zichen,..., Misha
3 2024-10-08 KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from
Server
link Wang, WenHao,..., Yanfeng
3 2024-07-03 Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning link Shen, Zhili,..., Jeff Z.
3 2024-05-22 Getting More from Less: Large Language Models are Good
Spontaneous Multilingual Learners
link Zhang, Shimao,..., Shujian
3 2024-07-22 Walking in Others' Shoes: How Perspective-Taking Guides Large Language
Models in Reducing Toxicity and Bias
link Xu, Rongwu,..., Han
3 2024-06-10 Annotation alignment: Comparing LLM and human annotations of conversational
safety
link Movva, Rajiv,..., Emma
3 2024-10-04 CommonIT: Commonality-Aware Instruction Tuning for Large Language Models via
Data Partitions
link Rao, Jun,..., Min
3 2024-04-30 ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized
Transformers
link Gu, Yuzhe,..., Enmao
3 None Optimizing Language Models with Fair and Stable Reward Composition
in Reinforcement Learning
link Li, Jiahui,..., Jun
3 2024-06-19 When Parts Are Greater Than Sums: Individual LLM Components
Can Outperform Full Models
link Chang, Ting-Yun,..., Robin
3 2024-06-19 Enhancing Language Model Factuality via Activation-Based Confidence Calibration and
Guided Decoding
link Liu, Xin,..., Lu
3 2024-07-08 Data, Data Everywhere: A Guide for Pretraining Dataset Construction link Parmar, Jupinder,..., Bryan
3 2024-06-24 Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized
Drafters
link Yi, Euiin,..., Se-Young
3 2024-09-04 What is lost in Normalization? Exploring Pitfalls in Multilingual
ASR Model Evaluations
link Manohar, Kavya,..., Leena G
3 2024-02-03 CodeAgent: Autonomous Communicative Agents for Code Review link Tang, Xunzhu,..., Tegawend{\'e} F.
3 2024-01-12 Experimental Contexts Can Facilitate Robust Semantic Property Inference in
Language Models, but Inconsistently
link Misra, Kanishka,..., Kyle
3 None ABSEval: An Agent-based Framework for Script Evaluation link Liang, Sirui,..., Kang
3 2024-08-22 FIRST: Teach A Reliable Large Language Model Through Efficient
Trustworthy Distillation
link Shum, KaShun,..., Muhammad Omer
3 2024-10-02 ACE: A LLM-based Negotiation Coaching System link Shea, Ryan,..., Zhou
3 2024-08-28 CoGen: Learning from Feedback with Coupled Comprehension and Generation link Gul, Mustafa Omer,..., Yoav
3 None Null-Shot Prompting: Rethinking Prompting Large Language Models With Hallucination link Taveekitworachai, Pittawat,..., Ruck
3 2024-02-26 Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic
Supervision
link Jiang, Fan,..., Trevor
3 2024-02-19 KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning
in Students
link Shu, Matthew,..., Jordan Lee
3 2024-02-02 Distractor Generation in Multiple-Choice Tasks: A Survey of Methods,
Datasets, and Evaluation
link Alhazmi, Elaf,..., Ahoud
3 2024-10-10 Modeling User Preferences with Automatic Metrics: Creating a High-Quality
Preference Dataset for Machine Translation
link Agrawal, Sweta,..., Andre
3 2024-02-28 LLM Task Interference: An Initial Study on the Impact
of Task-Switch in Conversational History
link Gupta, Akash,..., Mario
3 None Revisiting Automated Evaluation for Long-form Table Question Answering link Wang, Yuqi,..., Yilun
3 None HalluMeasure: Fine-grained Hallucination Measurement Using Chain-of-Thought Reasoning link Akbar, Shayan Ali,..., Erwin
3 2024-09-23 FineCops-Ref: A new Dataset and Task for Fine-Grained Compositional
Referring Expression Comprehension
link Liu, Junzhuo,..., Peng
3 2024-10-01 VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models link Wang, Jiapeng,..., Lianwen
3 2024-07-24 CMR Scaling Law: Predicting Critical Mixture Ratios for Continual
Pre-training of Language Models
link Gu, Jiawei,..., Fei
3 2024-05-05 Exploring the Compositional Deficiency of Large Language Models in
Mathematical Reasoning Through Trap Problems
link Zhao, Jun,..., Xuanjing
3 None Working Memory Identifies Reasoning Limits in Language Models link Zhang, Chunhui,..., Soroush
3 2024-06-18 Measuring Psychological Depth in Language Models link Harel-Canada, Fabrice Y,..., Nanyun
3 2024-09-23 Knowledge Planning in Large Language Models for Domain-Aligned Counseling
Summarization
link Srivastava, Aseem,..., Md Shad
3 2024-06-20 From Descriptive Richness to Bias: Unveiling the Dark Side
of Generative Image Caption Enrichment
link Hirota, Yusuke,..., Yuta
3 None Game on Tree: Visual Hallucination Mitigation via Coarse-to-Fine View
Tree and Game Theory
link Zhuang, Xianwei,..., Yuexian
3 2024-10-01 What the Harm? Quantifying the Tangible Impact of Gender
Bias in Machine Translation with a Human-centered Study
link Savoldi, Beatrice,..., Luisa
3 2024-06-25 Dual-Space Knowledge Distillation for Large Language Models link Zhang, Songming,..., Jinan
3 2024-09-23 ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions
with Path Planning and Feedback
link Wu, Qinzhuo,..., Bin
3 2024-02-14 Recurrent Alignment with Hard Attention for Hierarchical Text Rating link Lin, Chenxi,..., Xiaomin
3 2024-09-02 CHESS: Optimizing LLM Inference via Channel-Wise Thresholding and Selective
Sparsification
link He, Junhui,..., Qingan
3 2024-10-21 Surprise! Uniform Information Density Isn`t the Whole Story: Predicting
Surprisal Contours in Long-form Discourse
link Tsipidi, Eleftheria,..., Alex
3 2024-10-07 Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic
Compositionality
link Oh, Youngtaek,..., Junmo
3 2024-02-25 DetoxLLM: A Framework for Detoxification with Explanations link Khondaker, Md Tawkat Islam,..., Laks V. S.
3 2024-06-22 CaT-Bench: Benchmarking Language Model Understanding of Causal and Temporal
Dependencies in Plans
link Lal, Yash Kumar,..., Ray
3 2024-10-03 CodeJudge: Evaluating Code Generation with Large Language Models link Tong, Weixi,..., Tianyi
3 2024-06-28 Self-Training Large Language and Vision Assistant for Medical Question
Answering
link Sun, Guohao,..., Zhiqiang
3 2024-06-12 Updating CLIP to Prefer Descriptions Over Captions link Zur, Amir,..., Atticus
3 2024-10-14 AlphaLoRA: Assigning LoRA Experts Based on Layer Training Quality link Qing, Peijun,..., Soroush
3 2024-06-24 Multi-LogiEval: Towards Evaluating Multi-Step Logical Reasoning Ability of Large
Language Models
link Patel, Nisarg,..., Chitta
3 None Memorize Step by Step: Efficient Long-Context Prefilling with Incremental
Memory and Decremental Chunk
link Zeng, Zhiyuan,..., Xipeng
3 2024-02-28 Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit
Gender Performance Gaps
link Attanasio, Giuseppe,..., Dirk
3 2024-08-22 Preference-Guided Reflective Sampling for Aligning Language Models link Ye, Hai,..., Hwee Tou
3 2024-06-20 xCOMET-lite: Bridging the Gap Between Efficiency and Quality in
Learned MT Evaluation Metrics
link Larionov, Daniil,..., Steffen
3 2024-10-07 The LLM Effect: Are Humans Truly Using LLMs, or
Are They Being Influenced By Them Instead?
link Choi, Alexander,..., Antonios
3 2024-09-29 Coffee-Gym: An Environment for Evaluating and Improving Natural Language
Feedback on Erroneous Code
link Chae, Hyungjoo,..., Jinyoung
3 2024-10-31 Nearest Neighbor Normalization Improves Multimodal Retrieval link Chowdhury, Neil,..., Tristan
2 2024-04-15 Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation link Choi, Juhwan,..., YoungBin
2 2024-04-17 FIZZ: Factual Inconsistency Detection by Zoom-in Summary and Zoom-out
Document
link Yang, Joonho,..., Hwanhee
2 2024-02-16 Speaking in Wavelet Domain: A Simple and Efficient Approach
to Speed up Speech Diffusion Model
link Zhang, Xiangyu,..., Lina
2 2024-05-27 HEART-felt Narratives: Tracing Empathy and Narrative Style in Personal
Stories with LLMs
link Shen, Jocelyn,..., Maarten
2 None MAR: Matching-Augmented Reasoning for Enhancing Visual-based Entity Question Answering link Zhang, Zhengxuan,..., Nan
2 2023-11-14 DA$^3$: A Distribution-Aware Adversarial Attack against Language Models link Wang, Yibo,..., Philip S.
2 2024-09-24 TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and
Multi-Level Style Control
link Zhang, Yu,..., Zhou
2 2024-06-18 FuseGen: PLM Fusion for Data-generation based Zero-shot Learning link Zou, Tianyuan,..., Ya-Qin
2 2024-07-20 I Need Help! Evaluating LLM`s Ability to Ask for
Users' Support: A Case Study on Text-to-SQL Generation
link Wu, Cheng-Kuang,..., Yun-Nung
2 2024-03-01 Surveying the Dead Minds: Historical-Psychological Text Analysis with Contextualized
Construct Representation (CCR) for Classical Chinese
link Chen, Yuqi,..., Mohammad
2 None Alignment-Enhanced Decoding: Defending Jailbreaks via Token-Level Adaptive Refining of
Probability Distributions
link Liu, Quan,..., Sen
2 None DGLF: A Dual Graph-based Learning Framework for Multi-modal Sarcasm
Detection
link Zhu, Zhihong,..., Yefeng
2 None BC-Prover: Backward Chaining Prover for Formal Theorem Proving link He, Yuhang,..., Wotao
2 2024-04-16 Autoregressive Pre-Training on Pixels and Texts link Chai, Yekun,..., Hua
2 2024-10-06 Fine-Grained Prediction of Reading Comprehension from Eye Movements link Shubi, Omer,..., Yevgeni
2 None D2R: Dual-Branch Dynamic Routing Network for Multimodal Sentiment Detection link Chen, Yifan,..., Fenghuan
2 2024-06-18 A Generic Method for Fine-grained Category Discovery in Natural
Language Texts
link Tian, Chang,..., Marie-Francine
2 None VGBench: Evaluating Large Language Models on Vector Graphics Understanding
and Generation
link Zou, Bocheng,..., Yong Jae
2 2024-11-07 Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at
Scale
link Palo, Flavio Di,..., Bilal H
2 2024-10-09 Dissecting Fine-Tuning Unlearning in Large Language Models link Hong, Yihuai,..., Haiqin
2 2024-10-05 Consistent Autoformalization for Constructing Mathematical Libraries link Zhang, Lan,..., Andre
2 2024-01-13 MiTTenS: A Dataset for Evaluating Gender Mistranslation link Robinson, Kevin,..., Jasmijn
2 2024-08-28 StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of
Style Elements
link Fisher, Jillian,..., Yejin
2 2024-07-08 VIMI: Grounding Video Generation through Multi-modal Instruction link Fang, Yuwei,..., Sergey
2 None DVD: Dynamic Contrastive Decoding for Knowledge Amplification in Multi-Document
Question Answering
link Jin, Jing,..., Zhijiang
2 2024-06-13 An Unsupervised Approach to Achieve Supervised-Level Explainability in Healthcare
Records
link Edin, Joakim,..., Tuukka
2 2024-09-20 MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression
Comprehension
link Liu, Ting,..., Quanjun
2 2024-06-27 Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for
Analyzing Knowledge Utilization
link Ko, Miyoung,..., Minjoon
2 2024-06-20 An LLM Feature-based Framework for Dialogue Constructiveness Assessment link Zhou, Lexin,..., Andreas
2 2024-07-11 Investigating LLMs as Voting Assistants via Contextual Augmentation: A
Case Study on the European Parliament Elections 2024
link Chalkidis, Ilias
2 2024-10-08 Scaling Laws Across Model Architectures: A Comparative Analysis of
Dense and MoE Models in Large Language Models
link Wang, Siqi,..., Jingang
2 None Teaching Small Language Models Reasoning through Counterfactual Distillation link Feng, Tao,..., Yin
2 None Pretraining Language Models Using Translationese link Doshi, Meet,..., Pushpak
2 2024-06-18 ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese
with Cloaking Perturbations
link Xiao, Yunze,..., Roy Ka-Wei
2 2024-06-17 Boosting Scientific Concepts Understanding: Can Analogy from Teacher Models
Empower Student Models?
link Yuan, Siyu,..., Deqing
2 2023-07-18 Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning
in CLIP
link Basu, Samyadeep,..., Soheil
2 2024-06-16 Reconsidering Sentence-Level Sign Language Translation link Tanzer, Garrett,..., David
2 2024-06-28 From Local Concepts to Universals: Evaluating the Multicultural Understanding
of Vision-Language Models
link Bhatia, Mehar,..., Vered
2 2024-02-21 Dynamic Multi-Reward Weighting for Multi-Style Controllable Generation link De Langis, Karin,..., Dongyeop
2 2024-03-14 Revealing the Parallel Multilingual Learning within Large Language Models link Mu, Yongyu,..., JingBo
2 2024-05-30 Encoding and Controlling Global Semantics for Long-form Video Question
Answering
link Nguyen, Thong Thanh,..., Anh Tuan
2 2024-10-25 Taxonomy-guided Semantic Indexing for Academic Paper Search link Kang, SeongKu,..., Hwanjo
2 2024-08-27 Advancing Adversarial Suffix Transfer Learning on Aligned Large Language
Models
link Liu, Hongfu,..., Michael
2 2024-06-20 Aligning Large Language Models with Diverse Political Viewpoints link Stammbach, Dominik,..., Elliott
2 None Rethinking the Reversal Curse of LLMs: a Prescription from
Human Knowledge Reversal
link Lu, Zhicong,..., Xunliang
2 None GENRA: Enhancing Zero-shot Retrieval with Rank Aggregation link Katsimpras, Georgios,..., Georgios
2 2024-07-02 MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring
and Utilizing Latent Space
link Tang, Yihong,..., Yuexian
2 2024-10-08 Bridging Modalities: Enhancing Cross-Modality Hate Speech Detection with Few-Shot
In-Context Learning
link Hee, Ming Shan,..., Roy Ka-Wei
2 2024-09-19 Efficient Performance Tracking: Leveraging Large Language Models for Automated
Construction of Scientific Leaderboards
link {\c{S}}ahinu{\c{c}}, Furkan,..., Iryna
2 2024-10-17 AdaSwitch: Adaptive Switching between Small and Large Agents for
Effective Cloud-Local Collaborative Learning
link Sun, Hao,..., Dawei
2 2024-10-01 EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control link Chen, Haozhe,..., Julia
2 2023-05-06 The Best Defense is Attack: Repairing Semantics in Textual
Adversarial Examples
link Yang, Heng,..., Ke
2 2024-07-22 Perceptions of Linguistic Uncertainty by Language Models and Humans link Bel{\'e}m, Catarina G,..., Padhraic
2 2024-07-09 LIONs: An Empirically Optimized Approach to Align Language Models link Yu, Xiao,..., Zhou
2 2023-10-27 MOSEL: Inference Serving Using Dynamic Modality Selection link Hu, Bodun,..., Aditya
2 2024-06-05 Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech
Recognition
link Su, Hsuan,..., Hung-yi