524 |
2023-11-16 |
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection |
link |
Lin, Bin,..., Li |
413 |
2022-12-31 |
A Survey on In-context Learning |
link |
Dong, Qingxiu,..., Zhifang |
350 |
2023-05-30 |
Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate |
link |
Liang, Tian,..., Zhaopeng |
178 |
2024-03-12 |
ORPO: Monolithic Preference Optimization without Reference Model |
link |
Hong, Jiwoo,..., James |
141 |
2024-05-02 |
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models |
link |
Kim, Seungone,..., Minjoon |
115 |
2023-12-10 |
Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs |
link |
Ovadia, Oded,..., Oren |
96 |
2023-11-15 |
Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models |
link |
Yu, Wenhao,..., Dong |
88 |
2022-09-02 |
FOLIO: Natural Language Reasoning with First-Order Logic |
link |
Han, Simeng,..., Dragomir |
85 |
2024-05-09 |
Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? |
link |
Gekhman, Zorik,..., Jonathan |
79 |
2024-02-16 |
Humans or LLMs as the Judge? A Study on Judgement Bias |
link |
Chen, Guiming Hardy,..., Benyou |
75 |
2020-12-03 |
GottBERT: a pure German Language Model |
link |
Scheible, Raphael,..., Martin |
74 |
2024-03-13 |
Knowledge Conflicts for LLMs: A Survey |
link |
Xu, Rongwu,..., Wei |
73 |
2023-12-28 |
A Simple LLM Framework for Long-Range Video Question-Answering |
link |
Zhang, Ce,..., Gedas |
67 |
2024-04-16 |
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents |
link |
Tang, Liyan,..., Greg |
65 |
2024-06-24 |
LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-Training |
link |
Zhu, Tong,..., Yu |
56 |
2023-09-12 |
Mitigating the Alignment Tax of RLHF |
link |
Lin, Yong,..., Tong |
54 |
2023-08-08 |
FLIRT: Feedback Loop In-context Red Teaming |
link |
Mehrabi, Ninareh,..., Rahul |
54 |
2024-01-05 |
MLLM-Protector: Ensuring MLLM`s Safety without Hurting Performance |
link |
Pi, Renjie,..., Tong |
49 |
2024-02-29 |
Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment |
link |
Guo, Yiju,..., Maosong |
46 |
2024-06-24 |
EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees |
link |
Li, Yuhui,..., Hongyang |
46 |
2024-01-14 |
Small LLMs Are Weak Tool Learners: A Multi-LLM Agent |
link |
Shen, Weizhou,..., Fei |
43 |
2024-02-06 |
Systematic Biases in LLM Simulations of Debates |
link |
Taubenfeld, Amir,..., Ariel |
43 |
2024-02-21 |
Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment |
link |
Raina, Vyas,..., Mark |
40 |
2024-02-06 |
Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning |
link |
Tan, Zhaoxuan,..., Meng |
39 |
2024-01-12 |
Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models |
link |
Cho, Yae Jee,..., Gauri |
38 |
2024-02-21 |
Large Language Models for Data Annotation and Synthesis: A Survey |
link |
Tan, Zhen,..., Huan |
38 |
2024-06-25 |
Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA |
link |
Wang, Minzheng,..., Yongbin |
37 |
2024-06-21 |
VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation |
link |
He, Xuan,..., Wenhu |
37 |
2024-01-20 |
InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance |
link |
Wang, Pengyu,..., Xipeng |
36 |
2024-07-01 |
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems |
link |
Laban, Philippe,..., Chien-Sheng |
36 |
2023-09-29 |
Split and Merge: Aligning Position Biases in LLM-based Evaluators |
link |
Li, Zongjie,..., Yang |
36 |
2024-07-15 |
Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation |
link |
Vu, Tu,..., Yun-Hsuan |
34 |
2023-10-23 |
LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay |
link |
Lan, Yihuai,..., Hao |
33 |
2023-10-23 |
Moral Foundations of Large Language Models |
link |
Abdulhai, Marwa,..., Natasha |
32 |
2023-09-28 |
LawBench: Benchmarking Legal Knowledge of Large Language Models |
link |
Fei, Zhiwei,..., Vincent |
32 |
2024-02-27 |
Information Flow Routes: Automatically Interpreting Language Models at Scale |
link |
Ferrando, Javier,..., Elena |
31 |
2024-02-10 |
A Thorough Examination of Decoding Methods in the Era of LLMs |
link |
Shi, Chufan,..., Wai |
31 |
2024-07-01 |
Searching for Best Practices in Retrieval-Augmented Generation |
link |
Wang, Xiaohua,..., Xuanjing |
31 |
2024-03-08 |
Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs |
link |
Zhou, Xuhui,..., Maarten |
30 |
2024-04-28 |
SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning |
link |
Jia, Jinghan,..., Sijia |
30 |
2024-06-16 |
A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners |
link |
Jiang, Bowen,..., Dan |
30 |
2024-01-11 |
Universal Vulnerabilities in Large Language Models: Backdoor Attacks for In-context Learning |
link |
Zhao, Shuai,..., Jinming |
30 |
2024-01-09 |
Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue |
link |
Gu, Jia-Chen,..., Nanyun |
29 |
2024-06-17 |
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities |
link |
Ghosh, Sreyan,..., Dinesh |
29 |
2024-01-11 |
Transformers are Multi-State RNNs |
link |
Oren, Matanel,..., Roy |
27 |
2024-07-09 |
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps |
link |
Chuang, Yung-Sung,..., James R. |
27 |
2024-06-26 |
The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm |
link |
Aakanksha, Ahmadian,..., Sara |
26 |
2023-10-27 |
Personas as a Way to Model Truthfulness in Language Models |
link |
Joshi, Nitish,..., He |
25 |
2024-06-22 |
Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration |
link |
Feng, Shangbin,..., Yulia |
25 |
2024-02-22 |
Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments |
link |
Gu, Yu,..., Yu |
25 |
2024-07-01 |
Roleplay-doh: Enabling Domain-Experts to Create LLM-simulated Patients via Eliciting and Adhering to Principles |
link |
Louie, Ryan,..., Diyi |
24 |
2024-02-01 |
Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing |
link |
Jiao, Fangkai,..., Shafiq |
24 |
2024-02-28 |
Tokenization Is More Than Compression |
link |
Schmidt, Craig W,..., Chris |
24 |
2024-07-06 |
RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models |
link |
Xia, Peng,..., Huaxiu |
24 |
2024-02-17 |
Puzzle Solving using Reasoning of Large Language Models: A Survey |
link |
Giadikiaroglou, Panagiotis,..., Giorgos |
23 |
2024-06-24 |
LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing |
link |
Du, Jiangshu,..., Wenpeng |
23 |
2024-06-16 |
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery |
link |
Zhang, Yu,..., Jiawei |
23 |
2024-07-22 |
AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? |
link |
Yoran, Ori,..., Jonathan |
23 |
2022-12-01 |
Language models and brains align due to more than next-word prediction and word-level information |
link |
Merlin, Gabriele,..., Mariya |
23 |
2024-01-13 |
EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records |
link |
Shi, Wenqi,..., May Dongmei |
22 |
2024-02-28 |
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation |
link |
Ge, Yuan,..., JingBo |
22 |
2023-12-11 |
Dense X Retrieval: What Retrieval Granularity Should We Use? |
link |
Chen, Tong,..., Dong |
21 |
2024-03-28 |
Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs |
link |
Misra, Kanishka,..., Kyle |
21 |
2024-07-15 |
Benchmarking Vision Language Models for Cultural Understanding |
link |
Nayak, Shravan,..., Aishwarya |
21 |
2024-05-31 |
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales |
link |
Xu, Tianyang,..., Jing |
21 |
2024-01-19 |
Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models |
link |
Blevins, Terra,..., Luke |
21 |
2024-02-21 |
Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent |
link |
Yu, Xiaoyan,..., Liehuang |
21 |
2024-07-02 |
RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs |
link |
Dang, John,..., Sara |
20 |
2022-12-20 |
Evaluating Psychological Safety of Large Language Models |
link |
Li, Xingxuan,..., Lidong |
20 |
2023-10-13 |
QUIK: Towards End-to-end 4-Bit Inference on Generative Large Language Models |
link |
Ashkboos, Saleh,..., Dan |
20 |
2024-04-29 |
PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval |
link |
Zhuang, Shengyao,..., Guido |
20 |
2024-03-29 |
LUQ: Long-text Uncertainty Quantification for LLMs |
link |
Zhang, Caiqi,..., Nigel |
20 |
2024-06-28 |
Understanding and Mitigating Language Confusion in LLMs |
link |
Marchisio, Kelly,..., Sebastian |
20 |
2023-11-14 |
MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration |
link |
Xu, Lin,..., Jiashi |
20 |
2024-06-17 |
Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs |
link |
Opsahl-Ong, Krista,..., Omar |
20 |
2024-05-08 |
Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models |
link |
Land, Sander,..., Max |
20 |
2023-11-13 |
An Analysis and Mitigation of the Reversal Curse |
link |
Lv, Ang,..., Rui |
20 |
2024-07-04 |
A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations |
link |
Laskar, Md Tahmid Rahman,..., Jimmy |
20 |
2024-06-17 |
A Simple and Effective $L_2$ Norm-Based Strategy for KV Cache Compression |
link |
Devoto, Alessio,..., Pasquale |
19 |
2023-08-17 |
Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection |
link |
Li, Zekun,..., Xifeng |
19 |
2024-04-18 |
LongEmbed: Extending Embedding Models for Long Context Retrieval |
link |
Zhu, Dawei,..., Sujian |
19 |
2024-06-17 |
Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement |
link |
Xiong, Weimin,..., Sujian |
19 |
2024-06-20 |
Instruction Pre-Training: Language Models are Supervised Multitask Learners |
link |
Cheng, Daixuan,..., Furu |
19 |
2024-06-16 |
Mixture-of-Subspaces in Low-Rank Adaptation |
link |
Wu, Taiqiang,..., Ngai |
19 |
2024-06-13 |
Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination |
link |
Fleisig, Eve,..., Dan |
19 |
2024-06-18 |
Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries |
link |
Biran, Eden,..., Amir |
19 |
2023-09-12 |
Re-Reading Improves Reasoning in Large Language Models |
link |
Xu, Xiaohan,..., Shuai |
18 |
2024-07-25 |
Demystifying Verbatim Memorization in Large Language Models |
link |
Huang, Jing,..., Christopher |
18 |
2024-06-18 |
Attention Score is not All You Need for Token Importance Indicator in KV Cache Reduction: Value Also Matters |
link |
Guo, Zhiyu,..., Taro |
17 |
2024-05-05 |
ImageInWords: Unlocking Hyper-Detailed Image Descriptions |
link |
Garg, Roopal,..., Radu |
17 |
2024-06-15 |
Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts |
link |
Tan, Zhaoxuan,..., Meng |
17 |
2024-06-24 |
BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models |
link |
Zeng, Yi,..., Ruoxi |
17 |
2024-02-18 |
SciAgent: Tool-augmented Language Models for Scientific Reasoning |
link |
Ma, Yubo,..., Aixin |
17 |
2024-07-18 |
Are Large Language Models Capable of Generating Human-Level Narratives? |
link |
Tian, Yufei,..., Nanyun |
17 |
2024-02-04 |
Factuality of Large Language Models: A Survey |
link |
Wang, Yuxia,..., Preslav |
17 |
2024-03-11 |
Rebuilding ROME : Resolving Model Collapse during Sequential Model Editing |
link |
Gupta, Akshat,..., Gopala |
17 |
2024-04-29 |
BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers |
link |
Xu, Ran,..., Carl |
16 |
2024-06-17 |
WPO: Enhancing RLHF with Weighted Preference Optimization |
link |
Zhou, Wenxuan,..., Chenguang |
16 |
2024-02-23 |
NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data |
link |
Bogdanov, Sergei,..., Etienne P |
16 |
2024-02-29 |
AKEW: Assessing Knowledge Editing in the Wild |
link |
Wu, Xiaobao,..., Anh Tuan |
16 |
2024-05-05 |
MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning |
link |
Shi, Wenqi,..., May Dongmei |
15 |
2024-01-16 |
RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning |
link |
Ye, Junjie,..., Xuanjing |
15 |
2023-05-11 |
Chain-of-Dictionary Prompting Elicits Translation in Large Language Models |
link |
Lu, Hongyuan,..., Furu |
15 |
2024-05-02 |
Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving |
link |
Quan, Xin,..., Andre |
15 |
2024-03-08 |
LLM4Decompile: Decompiling Binary Code with Large Language Models |
link |
Tan, Hanzhuo,..., Yuqun |
15 |
2024-10-12 |
VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment |
link |
Li, Lei,..., Qi |
15 |
2023-11-15 |
Social Bias Probing: Fairness Benchmarking for Language Models |
link |
Marchiori Manerba, Marta,..., Isabelle |
15 |
2024-07-09 |
CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation |
link |
Chen, Tong,..., Pang Wei |
15 |
2024-03-13 |
Distract Large Language Models for Automatic Jailbreak Attack |
link |
Xiao, Zeguan,..., Yun |
15 |
2024-06-10 |
Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies |
link |
Wang, Junlin,..., Ben |
15 |
2024-02-21 |
Knowledge Graph Enhanced Large Language Model Editing |
link |
Zhang, Mengqi,..., Zhumin |
14 |
2024-04-04 |
Uncertainty in Language Models: Assessment through Rank-Calibration |
link |
Huang, Xinmeng,..., Edgar |
14 |
2024-02-16 |
BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering |
link |
Wang, Haoyu,..., Jing |
14 |
2024-06-17 |
MetaGPT: Merging Large Language Models Using Model Exclusive Task Arithmetic |
link |
Zhou, Yuyan,..., Weipeng |
14 |
2024-04-25 |
TinyChart: Efficient Chart Understanding with Program-of-Thoughts Learning and Visual Token Merging |
link |
Zhang, Liang,..., Fei |
14 |
2024-05-27 |
Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words? |
link |
Yona, Gal,..., Mor |
14 |
2024-04-19 |
Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works |
link |
Yuan, Xinfeng,..., Deqing |
14 |
2023-10-13 |
User Inference Attacks on Large Language Models |
link |
Kandpal, Nikhil,..., Zheng |
14 |
2024-02-27 |
Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese |
link |
Putri, Rifki Afina,..., Alice |
13 |
2024-03-30 |
NumeroLogic: Number Encoding for Enhanced LLMs' Numerical Reasoning |
link |
Schwartz, Eli,..., Assaf |
13 |
2024-06-17 |
GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation |
link |
Cai, Shihao,..., Bo |
13 |
2024-06-18 |
SHIELD: Evaluation and Defense Strategies for Copyright Compliance in LLM Text Generation |
link |
Liu, Xiaoze,..., Jing |
13 |
2024-06-16 |
Leading Whitespaces of Language Models' Subword Vocabulary Pose a Confound for Calculating Word Probabilities |
link |
Oh, Byung-Doh,..., William |
13 |
2024-09-24 |
M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning |
link |
Wang, Taowen,..., Dongfang |
13 |
2024-06-17 |
Unifying Multimodal Retrieval via Document Screenshot Embedding |
link |
Ma, Xueguang,..., Jimmy |
13 |
2024-06-17 |
mDPO: Conditional Preference Optimization for Multimodal Large Language Models |
link |
Wang, Fei,..., Muhao |
13 |
2024-04-05 |
Extract, Define, Canonicalize: An LLM-based Framework for Knowledge Graph Construction |
link |
Zhang, Bowen,..., Harold |
13 |
2024-07-03 |
TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts |
link |
Wang, Ruida,..., Tong |
13 |
2024-06-20 |
How to Compute the Probability of a Word |
link |
Pimentel, Tiago,..., Clara |
13 |
2024-04-03 |
Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models |
link |
Chae, Hyungjoo,..., Jinyoung |
12 |
2024-01-05 |
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks |
link |
Wu, Haoyuan,..., Bei |
12 |
2024-06-16 |
Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence |
link |
Lu, Junru,..., Xing |
12 |
2024-04-18 |
Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment |
link |
Wu, Zhaofeng,..., Ahmad |
12 |
2024-06-04 |
TopViewRS: Vision-Language Models as Top-View Spatial Reasoners |
link |
Li, Chengzu,..., Ivan |
12 |
2024-02-25 |
ASETF: A Novel Method for Jailbreak Attack on LLMs through Translate Suffix Embeddings |
link |
Wang, Hao,..., Lei |
12 |
2023-11-02 |
Learn to Refuse: Making Large Language Models More Controllable and Reliable through Knowledge Scope Limitation and Refusal Mechanism |
link |
Cao, Lang |
12 |
2023-10-04 |
Discovering Knowledge-Critical Subnetworks in Pretrained Language Models |
link |
Bayazit, Deniz,..., Antoine |
12 |
2024-05-23 |
Extracting Prompts by Inverting LLM Outputs |
link |
Zhang, Collin,..., Vitaly |
12 |
2024-07-14 |
BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs |
link |
Fan, Zhiting,..., Zuozhu |
12 |
2024-01-30 |
MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models |
link |
Kwan, Wai-Chung,..., Kam-Fai |
12 |
2024-09-23 |
Beyond Turn-Based Interfaces: Synchronous LLMs as Full-Duplex Dialogue Agents |
link |
Veluri, Bandhav,..., Shyamnath |
11 |
2024-06-18 |
AgentReview: Exploring Peer Review Dynamics with LLM Agents |
link |
Jin, Yiqiao,..., Jindong |
11 |
2024-06-26 |
Glue pizza and eat rocks - Exploiting Vulnerabilities in Retrieval-Augmented Generative Models |
link |
Tan, Zhen,..., Huan |
11 |
2024-07-19 |
RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering |
link |
Han, Rujun,..., Vittorio |
11 |
2024-06-28 |
Detection and Measurement of Syntactic Templates in Generated Text |
link |
Shaib, Chantal,..., Byron C |
11 |
2024-06-23 |
ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods |
link |
Xie, Roy,..., Bhuwan |
11 |
2024-04-01 |
An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance |
link |
Khanuja, Simran,..., Graham |
11 |
2023-11-01 |
The Mystery of In-Context Learning: A Comprehensive Survey on Interpretation and Analysis |
link |
Zhou, Yuxiang,..., Yulan |
11 |
2024-04-23 |
Generate-on-Graph: Treat LLM as both Agent and KG for Incomplete Knowledge Graph Question Answering |
link |
Xu, Yao,..., Kang |
11 |
2024-02-29 |
TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning |
link |
Sanders, Kate,..., Benjamin |
11 |
2024-02-20 |
Defending Jailbreak Prompts via In-Context Adversarial Game |
link |
Zhou, Yujun,..., Xiangliang |
11 |
2024-04-22 |
Filtered Direct Preference Optimization |
link |
Morimura, Tetsuro,..., Kaito |
10 |
2024-04-21 |
ChatRetriever: Adapting Large Language Models for Generalized and Robust Conversational Dense Retrieval |
link |
Mao, Kelong,..., Zhicheng |
10 |
2024-04-18 |
Aligning Language Models to Explicitly Handle Ambiguity |
link |
Kim, Hyuhng Joon,..., Taeuk |
10 |
2024-01-01 |
LogicAsker: Evaluating and Improving the Logical Reasoning Ability of Large Language Models |
link |
Wan, Yuxuan,..., Michael |
10 |
2024-06-21 |
Direct Multi-Turn Preference Optimization for Language Agents |
link |
Shi, Wentao,..., Fuli |
10 |
2024-10-31 |
Hidden Persuaders: LLMs' Political Leaning and Their Influence on Voters |
link |
Potter, Yujin,..., Dawn |
10 |
2024-06-11 |
RWKV-CLIP: A Robust Vision-Language Representation Learner |
link |
Gu, Tiancheng,..., Jiankang |
10 |
2024-09-21 |
SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information |
link |
Sun, Jiashuo,..., Yu |
10 |
2024-10-06 |
DAMRO: Dive into the Attention Mechanism of LVLM to Reduce Object Hallucination |
link |
Gong, Xuan,..., Zhihua |
10 |
2024-02-21 |
Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding |
link |
Zhao, Weilin,..., Maosong |
10 |
2024-06-17 |
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning |
link |
Zhang, Zhihan,..., Meng |
10 |
2024-02-01 |
LLMs learn governing principles of dynamical systems, revealing an in-context neural scaling law |
link |
Liu, Toni J.b.,..., Christopher |
10 |
2024-06-19 |
Finding Blind Spots in Evaluator LLMs with Interpretable Checklists |
link |
Doddapaneni, Sumanth,..., Mitesh M |
10 |
2024-04-05 |
FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping |
link |
Jaiswal, Ajay Kumar,..., Aditya |
10 |
2023-11-13 |
Fuse to Forget: Bias Reduction and Selective Memorization through Model Fusion |
link |
Zaman, Kerem,..., Shashank |
10 |
2024-07-01 |
Pron vs Prompt: Can Large Language Models already Challenge a World-Class Fiction Author at Creative Text Writing? |
link |
Marco, Guillermo,..., Ram{\'o}n Del Castillo |
10 |
2024-02-07 |
ApiQ: Finetuning of 2-Bit Quantized Large Language Model |
link |
Liao, Baohao,..., Christof |
10 |
2024-07-21 |
MIBench: Evaluating Multimodal Large Language Models over Multiple Images |
link |
Liu, Haowei,..., Weiming |
9 |
2024-03-31 |
Scaling Properties of Speech Language Models |
link |
Cuervo, Santiago,..., Ricard |
9 |
2024-04-22 |
Fine-Tuning Large Language Models to Translate: Will a Touch of Noisy Data in Misaligned Languages Suffice? |
link |
Zhu, Dawei,..., Dietrich |
9 |
2024-06-17 |
Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments |
link |
Zhou, Han,..., Anna |
9 |
2024-02-20 |
Backward Lens: Projecting Language Model Gradients into the Vocabulary Space |
link |
Katz, Shahar,..., Lior |
9 |
2024-02-13 |
PRompt Optimization in Multi-Step Tasks (PROMST): Integrating Human Feedback and Heuristic-based Sampling |
link |
Chen, Yongchao,..., Chuchu |
9 |
2024-05-13 |
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning |
link |
Yin, Shuo,..., Jinfeng |
9 |
2023-11-16 |
How Far Can We Extract Diverse Perspectives from Large Language Models? |
link |
Hayati, Shirley Anugrah,..., Dongyeop |
9 |
2024-02-22 |
Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic |
link |
Weir, Nathaniel,..., Benjamin |
9 |
2024-06-27 |
Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding |
link |
Fan, Yue,..., Xin Eric |
9 |
2024-08-27 |
Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations |
link |
Jiang, Yucheng,..., Monica |
9 |
2023-05-22 |
Atomic Inference for NLI with Generated Facts as Atoms |
link |
Stacey, Joe,..., Marek |
9 |
2024-01-18 |
Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs |
link |
Puerto, Haritz,..., Iryna |
9 |
2024-06-22 |
Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models |
link |
Zhang, Xinrong,..., Zhiyuan |
9 |
2024-06-18 |
Defending Against Social Engineering Attacks in the Age of LLMs |
link |
Ai, Lin,..., Julia |
9 |
2024-05-06 |
Lifelong Knowledge Editing for LLMs with Retrieval-Augmented Continuous Prompt Learning |
link |
Chen, Qizhou,..., Hui |
9 |
2024-02-18 |
How Susceptible are Large Language Models to Ideological Manipulation? |
link |
Chen, Kai,..., Kristina |
9 |
2024-06-25 |
CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference |
link |
Yu, Erxin,..., Lanqing |
9 |
2024-07-09 |
Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model |
link |
Zhang, Wenqi,..., Yueting |
9 |
2024-07-09 |
Virtual Personas for Language Models via an Anthology of Backstories |
link |
Moon, Suhong,..., David |
9 |
2024-07-12 |
CompAct: Compressing Retrieved Documents Actively for Question Answering |
link |
Yoon, Chanwoong,..., Jaewoo |
9 |
2024-04-25 |
Evaluating Large Language Models on Time Series Feature Understanding: A Comprehensive Taxonomy and Benchmark |
link |
Fons, Elizabeth,..., Svitlana |
9 |
2024-05-21 |
GPT-4 Jailbreaks Itself with Near-Perfect Success Using Self-Explanation |
link |
Ramesh, Govind,..., Wei |
8 |
2024-08-22 |
Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment |
link |
Luo, Kun,..., Kang |
8 |
2023-02-24 |
Retrieved Sequence Augmentation for Protein Representation Learning |
link |
Ma, Chang,..., Lingpeng |
8 |
2024-01-27 |
Do We Need Language-Specific Fact-Checking Models? The Case of Chinese |
link |
Zhang, Caiqi,..., Andreas |
8 |
2024-05-01 |
Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models |
link |
Ranaldi, Leonardo,..., Andre |
8 |
2024-06-20 |
African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification |
link |
Geigle, Gregor,..., Goran |
8 |
2024-06-28 |
From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis |
link |
Cheng, Chuanqi,..., Rui |
8 |
2024-06-20 |
Investigating Mysteries of CoT-Augmented Distillation |
link |
Wadhwa, Somin,..., Byron C |
8 |
2024-06-02 |
Automatic Instruction Evolving for Large Language Models |
link |
Zeng, Weihao,..., Weizhu |
8 |
2024-06-16 |
Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis |
link |
Lin, Yuping,..., Jiliang |
8 |
None |
Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale |
link |
Chen, Junying,..., Benyou |
8 |
2023-12-14 |
Towards Verifiable Text Generation with Evolving Memory and Self-Reflection |
link |
Sun, Hao,..., Dawei |
8 |
2024-06-21 |
FIRST: Faster Improved Listwise Reranking with Single Token Decoding |
link |
Gangi Reddy, Revanth,..., Heng |
8 |
2024-06-27 |
RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs |
link |
Taktasheva, Ekaterina,..., Vladislav |
8 |
2024-04-22 |
Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extraction |
link |
Deng, Zheye,..., Yangqiu |
8 |
2024-03-25 |
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions |
link |
Esfandiarpoor, Reza,..., Stephen |
8 |
2024-02-25 |
Efficient Temporal Extrapolation of Multimodal Large Language Models with Temporal Grounding Bridge |
link |
Wang, Yuxuan,..., Zilong |
8 |
2024-06-24 |
OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer |
link |
Zhang, Lu,..., Kyusong |
8 |
2024-07-15 |
Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together |
link |
Soylu, Dilara,..., Omar |
8 |
2024-06-24 |
Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation |
link |
Frohmann, Markus,..., Markus |
8 |
2024-07-02 |
Why Does New Knowledge Create Messy Ripple Effects in LLMs? |
link |
Qin, Jiaxin,..., Heng |
8 |
2024-09-11 |
SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories |
link |
Bogin, Ben,..., Tushar |
8 |
2024-05-30 |
PATIENT-$\psi$: Using Large Language Models to Simulate Patients for Training Mental Health Professionals |
link |
Wang, Ruiyi,..., Zhiyu |
8 |
2024-02-19 |
AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies |
link |
Ye, Xiao,..., Daniel |
8 |
2024-06-18 |
Evaluating $n$-Gram Novelty of Language Models Using Rusty-DAWG |
link |
Merrill, William,..., Yanai |
8 |
2024-06-29 |
Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP |
link |
Goldman, Omer,..., Reut |
8 |
2024-04-24 |
Annotator-Centric Active Learning for Subjective NLP Tasks |
link |
van der Meer, Michiel,..., Enrico |
8 |
2024-04-10 |
Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation |
link |
Pan, Ruotong,..., Le |
7 |
2024-02-17 |
When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection |
link |
Zhang, Xiangyu,..., Julien |
7 |
2024-06-26 |
AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning |
link |
Yang, Yifan,..., Zheng |
7 |
2024-06-16 |
RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning |
link |
Wang, Haoyu,..., Jing |
7 |
2024-02-22 |
Triad: A Framework Leveraging a Multi-Role LLM-based Agent to Solve Knowledge Base Question Answering |
link |
Zong, Chang,..., Yueting |
7 |
2024-06-07 |
CHIQ: Contextual History Enhancement for Improving Query Rewriting in Conversational Search |
link |
Mo, Fengran,..., Jian-Yun |
7 |
2024-04-03 |
Calibrating the Confidence of Large Language Models by Eliciting Fidelity |
link |
Zhang, Mozhi,..., Xipeng |
7 |
2024-09-12 |
On the Role of Context in Reading Time Prediction |
link |
Opedal, Andreas,..., Ethan |
7 |
2024-06-18 |
Estimating Knowledge in Large Language Models Without Generating a Single Token |
link |
Gottesman, Daniela,..., Mor |
7 |
2023-11-15 |
When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages |
link |
Chang, Tyler A.,..., Ben |
7 |
2024-06-14 |
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages |
link |
Lovenia, Holy,..., Samuel |
7 |
2024-08-28 |
EPO: Hierarchical LLM Agents with Environment Preference Optimization |
link |
Zhao, Qi,..., George |
7 |
2024-04-17 |
Neuron Specialization: Leveraging Intrinsic Task Modularity for Multilingual Machine Translation |
link |
Tan, Shaomu,..., Christof |
7 |
2024-06-17 |
MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model |
link |
Huo, Jiahao,..., Xuming |
7 |
2024-05-08 |
ADELIE: Aligning Large Language Models on Information Extraction |
link |
Qi, Yunjia,..., Juanzi |
7 |
2024-06-15 |
MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding |
link |
Xu, Baixuan,..., Yangqiu |
7 |
2024-06-21 |
PARIKSHA: A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data |
link |
Watts, Ishaan,..., Sunayana |
7 |
2024-09-25 |
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models |
link |
Liu, Yifei,..., Mao |
7 |
2024-09-20 |
Unlocking Memorization in Large Language Models with Dynamic Soft Prompting |
link |
Wang, Zhepeng,..., Yanfu |
7 |
2024-06-18 |
LLMs Are Prone to Fallacies in Causal Inference |
link |
Joshi, Nitish,..., He |
7 |
2024-03-05 |
Evidence-Focused Fact Summarization for Knowledge-Augmented Zero-Shot Question Answering |
link |
Ko, Sungho,..., Dongha |
7 |
2023-05-23 |
ReadMe++: Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment |
link |
Naous, Tarek,..., Wei |
7 |
2024-03-08 |
Consecutive Batch Model Editing with HooK Layers |
link |
Li, Shuaiyi,..., Wai |
7 |
2024-10-19 |
Are LLMs Good Zero-Shot Fallacy Classifiers? |
link |
Pan, Fengjun,..., Anh Tuan |
7 |
2024-05-28 |
Can Automatic Metrics Assess High-Quality Translations? |
link |
Agrawal, Sweta,..., Andre |
7 |
2024-06-17 |
Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector |
link |
Cheng, Xiaoxue,..., Ji-Rong |
7 |
2024-06-24 |
RaTEScore: A Metric for Radiology Report Generation |
link |
Zhao, Weike,..., Weidi |
7 |
2024-01-13 |
Leveraging Large Language Models for NLG Evaluation: Advances and Challenges |
link |
Li, Zhen,..., Shuai |
7 |
2024-07-21 |
Intrinsic Self-correction for Enhanced Morality: An Analysis of Internal Mechanisms and the Superficial Hypothesis |
link |
Liu, Guangliang,..., Kristen |
7 |
2024-12-18 |
MedCoT: Medical Chain of Thought via Hierarchical Expert |
link |
Liu, Jiaxiang,..., Zuozhu |
7 |
2024-04-16 |
D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation |
link |
Mostafazadeh Davani, Aida,..., Vinodkumar |
7 |
2024-06-16 |
FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture |
link |
Li, Wenyan,..., Desmond |
7 |
2024-06-21 |
Evaluating Diversity in Automatic Poetry Generation |
link |
Chen, Yanran,..., Steffen |
7 |
2024-06-06 |
ArMeme: Propagandistic Content in Arabic Memes |
link |
Alam, Firoj,..., Maram |
6 |
None |
CryptoTrade: A Reflective LLM-based Agent to Guide Zero-shot Cryptocurrency Trading |
link |
Li, Yuan,..., Bingsheng |
6 |
2024-06-17 |
Prefixing Attention Sinks can Mitigate Activation Outliers for Large Language Model Quantization |
link |
Son, Seungwoo,..., Jaeho |
6 |
2024-06-28 |
LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models |
link |
Wang, Renzhi,..., Piji |
6 |
2024-02-05 |
How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning |
link |
Yu, Zeping,..., Sophia |
6 |
2024-09-21 |
Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis |
link |
Yu, Zeping,..., Sophia |
6 |
2024-06-17 |
GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory |
link |
Fan, Wei,..., Yangqiu |
6 |
2024-09-29 |
CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering |
link |
Wu, Yike,..., Jeff Z. |
6 |
2024-06-17 |
When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives |
link |
Hu, Yebowen,..., Fei |
6 |
2024-03-25 |
Outcome-Constrained Large Language Models for Countering Hate Speech |
link |
Hong, Lingzi,..., Xiaoying |
6 |
2024-06-21 |
Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive LLM Generation |
link |
Lyu, Yuanjie,..., Enhong |
6 |
2024-06-16 |
Concept-skill Transferability-based Data Selection for Large Vision-Language Models |
link |
Lee, Jaewoo,..., Sung Ju |
6 |
2024-06-21 |
Safely Learning with Private Data: A Federated Learning Framework for Large Language Model |
link |
Zheng, Jia-Ying,..., Zhi-Ming |
6 |
2024-02-27 |
REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering |
link |
Wang, Yuhao,..., Ji-Rong |
6 |
2024-06-24 |
C-LLM: Learn to Check Chinese Spelling Errors Character by Character |
link |
Li, Kunting,..., Jie |
6 |
2024-06-18 |
CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models |
link |
Li, Yuetai,..., Radha |
6 |
None |
Enhancing Reinforcement Learning with Dense Rewards from Language Model Critic |
link |
Cao, Meng,..., Lei |
6 |
2023-05-23 |
APPLS: Evaluating Evaluation Metrics for Plain Language Summarization |
link |
Guo, Yue,..., Lucy Lu |
6 |
2022-12-20 |
Ontologically Faithful Generation of Non-Player Character Dialogues |
link |
Weir, Nathaniel,..., Harsh |
6 |
2024-04-01 |
TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering |
link |
Shang, Chuyi,..., Roei |
6 |
2024-06-18 |
TroL: Traversal of Layers for Large Language and Vision Models |
link |
Lee, Byung-Kwan,..., Yong Man |
6 |
2024-11-06 |
Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress? |
link |
Jeong, Daniel P,..., Michael |
6 |
2024-10-09 |
DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models |
link |
Huang, Yiming,..., Kang |
6 |
2024-10-10 |
AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction |
link |
Wang, Hongru,..., Kam-Fai |
6 |
2024-06-03 |
Re-ReST: Reflection-Reinforced Self-Training for Language Agents |
link |
Dou, Zi-Yi,..., Nanyun |
6 |
2024-06-24 |
Scaling Laws for Linear Complexity Language Models |
link |
Shen, Xuyang,..., Yiran |
6 |
2024-06-17 |
Community-Cross-Instruct: Unsupervised Instruction Generation for Aligning Large Language Models to Online Communities |
link |
He, Zihao,..., Kristina |
6 |
2024-09-15 |
Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models |
link |
Liao, Yuan-Hong,..., David |
6 |
2024-06-21 |
From LLMs to MLLMs: Exploring the Landscape of Multimodal Jailbreaking |
link |
Wang, Siyuan,..., Zhongyu |
6 |
2024-04-11 |
LLoCO: Learning Long Contexts Offline |
link |
Tan, Sijun,..., Raluca Ada |
6 |
2024-02-25 |
Don`t Forget Your Reward Values: Language Model Alignment via Value-based Calibration |
link |
Mao, Xin,..., Anh Tuan |
6 |
2024-10-09 |
Is C4 Dataset Optimal for Pruning? An Investigation of Calibration Data for LLM Pruning |
link |
Bandari, Abhinav,..., Shiwei |
6 |
2023-11-28 |
Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties |
link |
Yu, Keunwoo Peter,..., Joyce |
6 |
2024-06-17 |
Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations |
link |
Hazra, Rima,..., Soujanya |
6 |
None |
Do LLMs Plan Like Human Writers? Comparing Journalist Coverage of Press Releases with LLMs |
link |
Spangher, Alexander,..., Mark |
6 |
2024-06-20 |
Holistic Evaluation for Interleaved Text-and-Image Generation |
link |
Liu, Minqian,..., Lifu |
6 |
2024-08-07 |
Is Child-Directed Speech Effective Training Data for Language Models? |
link |
Feng, Steven Y.,..., Michael |
6 |
2024-10-23 |
LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering |
link |
Zhao, Qingfei,..., Jie |
6 |
2024-01-24 |
Instruction Fine-Tuning: Does Prompt Loss Matter? |
link |
Huerta-Enochian, Mathew,..., Seung Yong |
5 |
2024-02-22 |
A Usage-centric Take on Intent Understanding in E-Commerce |
link |
Zhou, Wendi,..., Jeff Z. |
5 |
2024-04-17 |
Consolidating Ranking and Relevance Predictions of Large Language Models through Post-Processing |
link |
Yan, Le,..., Harrie |
5 |
2024-02-15 |
EFUF: Efficient Fine-Grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models |
link |
Xing, Shangyu,..., Xinyu |
5 |
2024-06-17 |
Tracking the perspectives of interacting language models |
link |
Helm, Hayden,..., Carey |
5 |
2024-09-21 |
Enhancing Advanced Visual Reasoning Ability of Large Language Models |
link |
Li, Zhiyuan,..., Weidong |
5 |
2023-08-16 |
CMD: a framework for Context-aware Model self-Detoxification |
link |
Tang, Zecheng,..., Min |
5 |
2024-07-15 |
By My Eyes: Grounding Multimodal Large Language Models with Sensor Data via Visual Prompting |
link |
Yoon, Hyungjun,..., Sung-Ju |
5 |
2024-05-15 |
Word Alignment as Preference for Machine Translation |
link |
Wu, Qiyu,..., Yoshimasa |
5 |
2024-04-22 |
A User-Centric Multi-Intent Benchmark for Evaluating Large Language Models |
link |
Wang, Jiayin,..., Jian-Yun |
5 |
2024-09-21 |
PTD-SQL: Partitioning and Targeted Drilling with LLMs in Text-to-SQL |
link |
Luo, Ruilin,..., Yujiu |
5 |
2024-01-30 |
Conditional and Modal Reasoning in Large Language Models |
link |
Holliday, Wesley H.,..., Cedegao E. |
5 |
2024-05-31 |
Unveiling the Lexical Sensitivity of LLMs: Combinatorial Optimization for Prompt Enhancement |
link |
Zhan, Pengwei,..., Ru |
5 |
2024-09-23 |
Pretraining Data Detection for Large Language Models: A Divergence-based Calibration Method |
link |
Zhang, Weichao,..., Xueqi |
5 |
2024-04-19 |
How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning? |
link |
Luo, Yang,..., Yang |
5 |
None |
Adaption-of-Thought: Learning Question Difficulty Improves Large Language Models for Reasoning |
link |
Xu, Mayi,..., Tieyun |
5 |
2024-03-03 |
Right for Right Reasons: Large Language Models for Verifiable Commonsense Knowledge Graph Question Answering |
link |
Toroghi, Armin,..., Scott |
5 |
None |
Does Large Language Model Contain Task-Specific Neurons? |
link |
Song, Ran,..., Zhengtao |
5 |
2024-04-17 |
Position Engineering: Boosting Large Language Models through Positional Information Manipulation |
link |
He, Zhiyuan,..., Lili |
5 |
2024-08-17 |
FEDKIM: Adaptive Federated Knowledge Injection into Medical Foundation Models |
link |
Wang, Xiaochen,..., Fenglong |
5 |
2024-07-12 |
Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors |
link |
Daheim, Nico,..., Mrinmaya |
5 |
2024-07-24 |
Revisiting Who`s Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective |
link |
Liu, Yujian,..., Shiyu |
5 |
2024-06-28 |
Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs |
link |
Feucht, Sheridan,..., David |
5 |
None |
Fine-grained Pluggable Gradient Ascent for Knowledge Unlearning in Language Models |
link |
Feng, XiaoHua,..., Zibin |
5 |
2024-06-30 |
Towards Robust Speech Representation Learning for Thousands of Languages |
link |
Chen, William,..., Shinji |
5 |
2024-07-23 |
PreAlign: Boosting Cross-Lingual Transfer by Early Establishment of Multilingual Alignment |
link |
Li, Jiahuan,..., Jiajun |
5 |
2024-06-21 |
Decoding Matters: Addressing Amplification Bias and Homogeneity Issue in Recommendations for Large Language Models |
link |
Bao, Keqin,..., Fuli |
5 |
2024-05-28 |
ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator |
link |
Zhu, Junda,..., Lei |
5 |
2024-01-10 |
Towards Online Continuous Sign Language Recognition and Translation |
link |
Zuo, Ronglai,..., Brian |
5 |
2024-06-26 |
PrExMe! Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation |
link |
Leiter, Christoph,..., Steffen |
5 |
2024-06-14 |
SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading |
link |
Dinh, Tu Anh,..., Jan |
5 |
2024-05-23 |
Large Language Models Can Self-Correct with Key Condition Verification |
link |
Wu, Zhenyu,..., Meng |
5 |
2024-10-14 |
How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective |
link |
Xiao, Teng,..., Vasant G |
5 |
2024-10-01 |
Style-Specific Neurons for Steering LLMs in Text Style Transfer |
link |
Lai, Wen,..., Alexander |
5 |
2024-02-17 |
Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction |
link |
Zhou, Sizhe,..., Jiawei |
5 |
None |
Large Language Models Are Poor Clinical Decision-Makers: A Comprehensive Benchmark |
link |
Liu, Fenglin,..., David A. |
5 |
2024-04-17 |
Related Work and Citation Text Generation: A Survey |
link |
Li, Xiangci,..., Jessica |
5 |
2024-02-17 |
KnowTuning: Knowledge-aware Fine-tuning for Large Language Models |
link |
Lyu, Yougang,..., Zhaochun |
5 |
2024-06-18 |
Bridging Local Details and Global Context in Text-Attributed Graphs |
link |
Wang, Yaoke,..., Siliang |
5 |
2024-10-31 |
Commonsense Knowledge Editing Based on Free-Text in LLMs |
link |
Huang, Xiusheng,..., Kang |
5 |
2024-06-25 |
GRASS: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients |
link |
Muhamed, Aashiq,..., Virginia |
5 |
2024-07-10 |
LitSearch: A Retrieval Benchmark for Scientific Literature Search |
link |
Ajith, Anirudh,..., Tianyu |
5 |
2024-07-19 |
ECCO: Can We Improve Model-Generated Code Efficiency Without Sacrificing Functional Correctness? |
link |
Waghjale, Siddhant,..., Daniel |
5 |
2024-10-02 |
InfiniPot: Infinite Context Processing on Memory-Constrained LLMs |
link |
Kim, Minsoo,..., Simyung |
5 |
2024-02-08 |
On the Robustness of Editing Large Language Models |
link |
Ma, Xinbei,..., Yulong |
5 |
2024-10-17 |
Towards Cross-Cultural Machine Translation with Retrieval-Augmented Generation from Multilingual Knowledge Graphs |
link |
Conia, Simone,..., Yunyao |
5 |
2024-08-24 |
Symbolic Working Memory Enhances Language Models for Complex Rule Application |
link |
Wang, Siyuan,..., Xiang |
5 |
2024-06-19 |
Data Contamination Can Cross Language Barriers |
link |
Yao, Feng,..., Jingbo |
5 |
2024-09-29 |
Calibrating Language Models with Adaptive Temperature Scaling |
link |
Xie, Johnathan,..., Chelsea |
5 |
2024-07-11 |
On the Universal Truthfulness Hyperplane Inside LLMs |
link |
Liu, Junteng,..., Junxian |
5 |
2024-10-03 |
On the Proper Treatment of Tokenization in Psycholinguistics |
link |
Giulianelli, Mario,..., Ryan |
5 |
2024-05-16 |
SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation |
link |
Divekar, Abhishek,..., Greg |
5 |
2024-08-09 |
DataNarrative: Automated Data-Driven Storytelling with Visualizations and Texts |
link |
Islam, Mohammed Saidul,..., Shafiq |
5 |
2024-07-20 |
Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter? |
link |
Tyagi, Nemika,..., Chitta |
5 |
2024-11-14 |
Unveiling Multi-level and Multi-modal Semantic Representations in the Human Brain using Large Language Models |
link |
Nakagi, Yuko,..., Yu |
5 |
2024-04-17 |
Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions |
link |
Mathur, Leena,..., Louis-Philippe |
5 |
2024-07-09 |
Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules |
link |
Gong, Zhuocheng,..., Rui |
5 |
2024-11-06 |
No Culture Left Behind: ArtELingo-28, a Benchmark of WikiArt with Captions in 28 Languages |
link |
Mohamed, Youssef,..., Mohamed |
5 |
2024-11-09 |
An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models |
link |
Shiri, Fatemeh,..., Yuan-Fang |
5 |
2024-06-17 |
STAR: SocioTechnical Approach to Red Teaming Language Models |
link |
Weidinger, Laura,..., William |
5 |
2024-05-03 |
Assessing and Verifying Task Utility in LLM-Powered Applications |
link |
Arabzadeh, Negar,..., Julia |
5 |
2024-06-09 |
RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation |
link |
Kim, Kiseung,..., Jay-Yoon |
5 |
2024-07-22 |
Improving Minimum Bayes Risk Decoding with Multi-Prompt |
link |
Heineman, David,..., Wei |
5 |
2023-12-06 |
Mitigating Open-Vocabulary Caption Hallucinations |
link |
Ben-Kish, Assaf,..., Hadar |
4 |
2024-03-11 |
Strength Lies in Differences! Improving Strategy Planning for Non-collaborative Dialogues via Diversified User Simulation |
link |
Zhang, Tong,..., Tat-Seng |
4 |
2024-04-10 |
Personality-aware Student Simulation for Conversational Intelligent Tutoring Systems |
link |
Liu, Zhengyuan,..., Nancy F. |
4 |
2024-06-18 |
Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones? |
link |
Yang, Zhe,..., Zhifang |
4 |
2024-10-16 |
Rethinking Token Reduction for State Space Models |
link |
Zhan, Zheng,..., Yanzhi |
4 |
2024-09-30 |
HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding |
link |
Yuan, Fan,..., Piji |
4 |
2024-04-19 |
AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation |
link |
Huang, Wenhao,..., Zulong |
4 |
2024-06-18 |
From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP |
link |
Mosbach, Marius,..., Mor |
4 |
2023-11-14 |
Toxicity Detection is NOT all you Need: Measuring the Gaps to Supporting Volunteer Content Moderators through a User-Centric Method |
link |
Cao, Yang Trista,..., Hal |
4 |
2024-06-22 |
Teaching LLMs to Abstain across Languages via Multilingual Feedback |
link |
Feng, Shangbin,..., Yulia |
4 |
2024-09-30 |
World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering |
link |
Wang, Jiacong,..., Jun |
4 |
2024-02-24 |
How Do Humans Write Code? Large Models Do It the Same Way Too |
link |
Li, Long,..., Liang |
4 |
2024-11-07 |
Bayesian Calibration of Win Rate Estimation with LLM Evaluators |
link |
Gao, Yicheng,..., Arman |
4 |
2023-12-03 |
NLEBench+NorGLM: A Comprehensive Empirical Analysis and Benchmark Dataset for Generative Language Models in Norwegian |
link |
Liu, Peng,..., Zhirong |
4 |
2024-06-19 |
Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation |
link |
Qi, Jirui,..., Arianna |
4 |
2024-04-11 |
An Audit on the Perspectives and Challenges of Hallucinations in NLP |
link |
Narayanan Venkit, Pranav,..., Shomir |
4 |
2024-06-18 |
InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding in an Interactive Game Context |
link |
Liu, Ziyi,..., Jieyu |
4 |
2024-05-09 |
Efficient LLM Comparative Assessment: A Product of Experts Framework for Pairwise Comparisons |
link |
Liusie, Adian,..., Mark |
4 |
2024-02-16 |
Python is Not Always the Best Choice: Embracing Multilingual Program of Thoughts |
link |
Luo, Xianzhen,..., Wanxiang |
4 |
2024-08-06 |
Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons |
link |
Wang, Yifei,..., Daniel Dajun |
4 |
2024-05-28 |
More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-Specific LLMs |
link |
Liu, Chengyuan,..., Fei |
4 |
2024-06-23 |
Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in Large Language Models |
link |
Men, Tianyi,..., Jun |
4 |
2024-07-02 |
Breaking Language Barriers: Cross-Lingual Continual Pre-Training at Scale |
link |
Zheng, Wenzhen,..., Ming |
4 |
2024-10-06 |
Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training |
link |
Li, Wenbo,..., Jinsong |
4 |
2024-10-09 |
CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models |
link |
Gong, Zi,..., Jianguo |
4 |
2024-07-10 |
Attribute or Abstain: Large Language Models as Long Document Assistants |
link |
Buchmann, Jan,..., Iryna |
4 |
2024-07-04 |
Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes |
link |
Hirota, Yusuke,..., Alice |
4 |
2024-05-13 |
MetaReflection: Learning Instructions for Language Agents using Past Reflections |
link |
Gupta, Priyanshu,..., Sherry |
4 |
None |
Jellyfish: Instruction-Tuning Local Large Language Models for Data Preprocessing |
link |
Zhang, Haochen,..., Masafumi |
4 |
2024-06-29 |
From RAG to Riches: Retrieval Interlaced with Sequence Generation |
link |
Jain, Palak,..., Tom |
4 |
2024-06-20 |
PostMark: A Robust Blackbox Watermark for Large Language Models |
link |
Chang, Yapei,..., Mohit |
4 |
2024-06-27 |
DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions |
link |
Fernandez, Nigel,..., Andrew |
4 |
2024-02-20 |
Structure Guided Prompt: Instructing Large Language Model in Multi-Step Reasoning by Exploring Graph Structure of the Text |
link |
Cheng, Kewei,..., Yizhou |
4 |
2024-10-21 |
Susu Box or Piggy Bank: Assessing Cultural Commonsense Knowledge between Ghana and the US |
link |
Acquaye, Christabel,..., Rachel |
4 |
2024-06-05 |
Ranking Manipulation for Conversational Search Engines |
link |
Pfrommer, Samuel,..., Somayeh |
4 |
2024-07-09 |
STORYSUMM: Evaluating Faithfulness in Story Summarization |
link |
Subbiah, Melanie,..., Kathleen |
4 |
2024-06-28 |
Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood |
link |
Xu, Yang,..., Yongyuan |
4 |
2024-02-17 |
I Learn Better If You Speak My Language: Understanding the Superior Performance of Fine-Tuning Large Language Models with LLM-Generated Responses |
link |
Ren, Xuan,..., Lingqiao |
4 |
2024-06-25 |
Encourage or Inhibit Monosemanticity? Revisit Monosemanticity from a Feature Decorrelation Perspective |
link |
Yan, Hanqi,..., Yulan |
4 |
2024-02-27 |
AmbigNLG: Addressing Task Ambiguity in Instruction for NLG |
link |
Niwa, Ayana,..., Hayate |
4 |
2024-06-28 |
Paraphrase Types Elicit Prompt Engineering Capabilities |
link |
Wahle, Jan Philip,..., Bela |
4 |
2024-10-21 |
Improve Dense Passage Retrieval with Entailment Tuning |
link |
Dai, Lu,..., Hui |
4 |
2024-06-18 |
What Are the Odds? Language Models Are Capable of Probabilistic Reasoning |
link |
Paruchuri, Akshay,..., Daniel |
4 |
None |
Empowering Multi-step Reasoning across Languages via Program-Aided Language Models |
link |
Ranaldi, Leonardo,..., Alexandra |
4 |
2024-03-11 |
GlossLM: A Massively Multilingual Corpus and Pretrained Model for Interlinear Glossed Text |
link |
Ginn, Michael,..., Lori |
4 |
2024-07-24 |
Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mechanism |
link |
Zhao, Anhao,..., Xiaoyu |
4 |
2024-01-23 |
SLANG: New Concept Comprehension of Large Language Models |
link |
Mei, Lingrui,..., Xueqi |
4 |
2024-05-21 |
Atomic Self-Consistency for Better Long Form Generations |
link |
Thirukovalluru, Raghuveer,..., Bhuwan |
4 |
2024-02-17 |
Turn Waste into Worth: Rectifying Top-$k$ Router of MoE |
link |
Zeng, Zhiyuan,..., Xipeng |
4 |
2024-10-01 |
Preserving Generalization of Language models in Few-shot Continual Relation Extraction |
link |
Tran, Quyen,..., Thien Huu |
4 |
2024-10-14 |
MAIR: A Massive Benchmark for Evaluating Instructed Retrieval |
link |
Sun, Weiwei,..., Zhaochun |
4 |
2024-06-27 |
Tools Fail: Detecting Silent Errors in Faulty Tools |
link |
Sun, Jimin,..., Yonatan |
4 |
None |
More DWUGs: Extending and Evaluating Word Usage Graph Datasets in Multiple Languages |
link |
Schlechtweg, Dominik,..., Nina |
4 |
2024-06-22 |
Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level |
link |
Feng, Zhaopeng,..., Zuozhu |
4 |
2024-07-01 |
DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging |
link |
Lin, Tzu-Han,..., Yun-Nung |
4 |
2024-10-09 |
Mitigating the Language Mismatch and Repetition Issues in LLM-based Machine Translation via Model Editing |
link |
Wang, Weichuan,..., Ying |
4 |
2024-06-17 |
Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting |
link |
Mukherjee, Sagnik,..., Monojit |
4 |
2024-10-03 |
Hate Personified: Investigating the role of LLMs in content moderation |
link |
Masud, Sarah,..., Tanmoy |
4 |
2024-06-26 |
Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretability |
link |
Hu, Xinyu,..., Xiaojun |
4 |
2024-11-06 |
WorryWords: Norms of Anxiety Association for over 44k English Words |
link |
Mohammad, Saif M. |
4 |
2024-02-23 |
Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions |
link |
Neo, Clement,..., Fazl |
4 |
2024-06-27 |
The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning |
link |
Cui, Shaobo,..., Boi |
4 |
2024-10-05 |
Adaptive Question Answering: Enhancing Language Model Proficiency for Addressing Knowledge Conflicts with Source Citations |
link |
Shaier, Sagi,..., Philip V. |
4 |
2024-05-03 |
MedReadMe: A Systematic Study for Fine-grained Sentence Readability in Medical Domain |
link |
Jiang, Chao,..., Wei |
4 |
2024-09-23 |
MemeCLIP: Leveraging CLIP Representations for Multimodal Meme Classification |
link |
Shah, Siddhant Bikram,..., Haohan |
4 |
2023-11-16 |
StorySparkQA: Expert-Annotated QA Pairs with Real-World Knowledge for Children`s Story-Based Learning |
link |
Chen, Jiaju,..., Yuling |
4 |
2024-01-26 |
HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy |
link |
Liu, YongKang,..., Hinrich |
4 |
2024-05-16 |
Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift in Fine-tuning LLMs for Simultaneous Translation |
link |
Raffel, Matthew,..., Lizhong |
4 |
2024-02-20 |
Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation |
link |
Himmi, Anas,..., Nuno M |
4 |
2024-06-06 |
BLSP-Emo: Towards Empathetic Large Speech-Language Models |
link |
Wang, Chen,..., Jiajun |
4 |
2024-10-22 |
Altogether: Image Captioning via Re-aligning Alt-text |
link |
Xu, Hu,..., Christoph |
4 |
2024-07-04 |
Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks |
link |
Parekh, Amit,..., Ioannis |
4 |
2024-04-12 |
The Generation Gap: Exploring Age Bias in the Value Systems of Large Language Models |
link |
Liu, Siyang,..., Rada |
4 |
2024-07-08 |
Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models |
link |
Jung, Chani,..., Hyunwoo |
4 |
2024-02-21 |
SYNFAC-EDIT: Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization |
link |
Mishra, Prakamya,..., Hong |
4 |
2024-02-23 |
DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be Better Context-aware Translators |
link |
Lyu, Xinglin,..., Min |
4 |
2024-11-08 |
SpecHub: Provable Acceleration to Multi-Draft Speculative Decoding |
link |
Sun, Ryan,..., Lichao |
4 |
2024-04-23 |
Bayesian Example Selection Improves In-Context Learning for Speech, Text and Visual Modalities |
link |
Wang, Siyin,..., Chao |
4 |
2024-02-21 |
Investigating Multilingual Instruction-Tuning: Do Polyglot Models Demand for Multilingual Instructions? |
link |
Weber, Alexander Arno,..., Mehdi |
4 |
2024-06-27 |
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion |
link |
Flet-Berliac, Yannis,..., Matthieu |
4 |
None |
Can LLMs Learn Uncertainty on Their Own? Expressing Uncertainty Effectively in A Self-Training Manner |
link |
Liu, Shudong,..., Min |
4 |
2024-06-27 |
T-FREE: Subword Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings |
link |
Deiseroth, Bj{\"o}rn,..., Samuel |
4 |
2024-04-18 |
Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair |
link |
Sakai, Yusuke,..., Taro |
3 |
2023-11-13 |
Prompts have evil twins |
link |
Melamed, Rimon,..., Enric |
3 |
2023-12-21 |
EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models |
link |
de Seyssel, Maureen,..., Emmanuel |
3 |
None |
On Fake News Detection with LLM Enhanced Semantics Mining |
link |
Ma, Xiaoxiao,..., Hao |
3 |
2024-02-20 |
On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices |
link |
Pecher, Branislav,..., Maria |
3 |
2024-06-28 |
Learning Interpretable Legal Case Retrieval via Knowledge-Guided Case Reformulation |
link |
Deng, Chenlong,..., Zhicheng |
3 |
2023-11-27 |
Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation |
link |
Zhang, Yuhui,..., Alexander T |
3 |
2024-08-02 |
QUDSELECT: Selective Decoding for Questions Under Discussion Parsing |
link |
Suvarna, Ashima,..., Nanyun |
3 |
2024-08-21 |
UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation |
link |
Zhao, Xiangyu,..., Xiao-Ming |
3 |
2024-02-19 |
Standardize: Aligning Language Models with Expert-Defined Standards for Content Generation |
link |
Imperial, Joseph Marvin,..., Harish |
3 |
2024-06-26 |
MatchTime: Towards Automatic Soccer Game Commentary Generation |
link |
Rao, Jiayuan,..., Weidi |
3 |
2024-06-29 |
Advancing Process Verification for Large Language Models via Tree-Based Preference Learning |
link |
He, Mingqian,..., Weiming |
3 |
2024-07-03 |
VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values |
link |
Hu, Zhe,..., Yu |
3 |
2024-07-07 |
Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course |
link |
Chiang, Cheng-Han,..., Hung-yi |
3 |
2024-09-08 |
Seemingly Plausible Distractors in Multi-Hop Reasoning: Are Large Language Models Attentive Readers? |
link |
Bhuiya, Neeladri,..., Stefan |
3 |
2024-01-19 |
Knowledge Verification to Nip Hallucination in the Bud |
link |
Wan, Fanqi,..., Shuming |
3 |
2024-02-29 |
Whispers that Shake Foundations: Analyzing and Mitigating False Premise Hallucinations in Large Language Models |
link |
Yuan, Hongbang,..., Jun |
3 |
2024-02-02 |
KB-Plugin: A Plug-and-play Framework for Large Language Models to Induce Programs over Low-resourced Knowledge Bases |
link |
Zhang, Jiajie,..., Juanzi |
3 |
2024-09-23 |
CUTE: Measuring LLMs' Understanding of Their Tokens |
link |
Edman, Lukas,..., Alexander |
3 |
2024-04-11 |
On Training Data Influence of GPT Models |
link |
Chai, Yekun,..., Hua |
3 |
2023-12-19 |
Neuron-Level Knowledge Attribution in Large Language Models |
link |
Yu, Zeping,..., Sophia |
3 |
2024-10-04 |
Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering |
link |
Bonaldi, Helena,..., Marco |
3 |
2024-07-10 |
Decompose and Compare Consistency: Measuring VLMs' Answer Reliability via Task-Decomposition Consistency Comparison |
link |
Yang, Qian,..., Aishwarya |
3 |
2024-04-16 |
Incubating Text Classifiers Following User Instruction with Nothing but LLM |
link |
Peng, Letian,..., Jingbo |
3 |
2024-10-17 |
Advancing Large Language Model Attribution through Self-Improving |
link |
Huang, Lei,..., Bing |
3 |
2024-08-16 |
Where is the signal in tokenization space? |
link |
Geh, Renato,..., Guy |
3 |
2024-09-28 |
Crafting Personalized Agents through Retrieval-Augmented Generation on Editable Memory Graphs |
link |
Wang, Zheng,..., Wei |
3 |
None |
Modeling Nonnative Sentence Processing with L2 Language Models |
link |
Aoyama, Tatsuya,..., Nathan |
3 |
2024-04-07 |
Cross-Domain Audio Deepfake Detection: Dataset and Analysis |
link |
Li, Yuang,..., Hao |
3 |
2024-10-01 |
Concept Space Alignment in Multilingual LLMs |
link |
Peng, Qiwei,..., Anders |
3 |
2024-10-02 |
Quantifying the Gaps Between Translation and Native Perception in Training for Multimodal, Multilingual Retrieval |
link |
Buettner, Kyle,..., Adriana |
3 |
2022-10-09 |
Fine-Grained Detection of Solidarity for Women and Migrants in 155 Years of German Parliamentary Debates |
link |
Kostikova, Aida,..., Steffen |
3 |
2024-06-17 |
CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling |
link |
Bai, Yu,..., Jackie CK |
3 |
2024-08-26 |
Focused Large Language Models are Stable Many-Shot Learners |
link |
Yuan, Peiwen,..., Kan |
3 |
2024-07-09 |
ChatGPT Doesn`t Trust Chargers Fans: Guardrail Sensitivity in Context |
link |
Li, Victoria R,..., Naomi |
3 |
2024-06-16 |
Optimized Speculative Sampling for GPU Hardware Accelerators |
link |
Wagner, Dominik,..., Tobias |
3 |
2024-02-05 |
Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT Models |
link |
Chu, Junjie,..., Yang |
3 |
2024-02-04 |
Can Large Language Models Learn Independent Causal Mechanisms? |
link |
Gendron, Gael,..., Gillian |
3 |
2024-06-18 |
Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models |
link |
Mondorf, Philipp,..., Barbara |
3 |
2024-05-09 |
Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models |
link |
Raina, Vyas,..., Mark |
3 |
2023-11-15 |
XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs |
link |
Chen, Zichen,..., Misha |
3 |
2024-10-08 |
KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server |
link |
Wang, WenHao,..., Yanfeng |
3 |
2024-07-03 |
Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning |
link |
Shen, Zhili,..., Jeff Z. |
3 |
2024-05-22 |
Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners |
link |
Zhang, Shimao,..., Shujian |
3 |
2024-07-22 |
Walking in Others' Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias |
link |
Xu, Rongwu,..., Han |
3 |
2024-06-10 |
Annotation alignment: Comparing LLM and human annotations of conversational safety |
link |
Movva, Rajiv,..., Emma |
3 |
2024-10-04 |
CommonIT: Commonality-Aware Instruction Tuning for Large Language Models via Data Partitions |
link |
Rao, Jun,..., Min |
3 |
2024-04-30 |
ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers |
link |
Gu, Yuzhe,..., Enmao |
3 |
None |
Optimizing Language Models with Fair and Stable Reward Composition in Reinforcement Learning |
link |
Li, Jiahui,..., Jun |
3 |
2024-06-19 |
When Parts Are Greater Than Sums: Individual LLM Components Can Outperform Full Models |
link |
Chang, Ting-Yun,..., Robin |
3 |
2024-06-19 |
Enhancing Language Model Factuality via Activation-Based Confidence Calibration and Guided Decoding |
link |
Liu, Xin,..., Lu |
3 |
2024-07-08 |
Data, Data Everywhere: A Guide for Pretraining Dataset Construction |
link |
Parmar, Jupinder,..., Bryan |
3 |
2024-06-24 |
Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters |
link |
Yi, Euiin,..., Se-Young |
3 |
2024-09-04 |
What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations |
link |
Manohar, Kavya,..., Leena G |
3 |
2024-02-03 |
CodeAgent: Autonomous Communicative Agents for Code Review |
link |
Tang, Xunzhu,..., Tegawend{\'e} F. |
3 |
2024-01-12 |
Experimental Contexts Can Facilitate Robust Semantic Property Inference in Language Models, but Inconsistently |
link |
Misra, Kanishka,..., Kyle |
3 |
None |
ABSEval: An Agent-based Framework for Script Evaluation |
link |
Liang, Sirui,..., Kang |
3 |
2024-08-22 |
FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation |
link |
Shum, KaShun,..., Muhammad Omer |
3 |
2024-10-02 |
ACE: A LLM-based Negotiation Coaching System |
link |
Shea, Ryan,..., Zhou |
3 |
2024-08-28 |
CoGen: Learning from Feedback with Coupled Comprehension and Generation |
link |
Gul, Mustafa Omer,..., Yoav |
3 |
None |
Null-Shot Prompting: Rethinking Prompting Large Language Models With Hallucination |
link |
Taveekitworachai, Pittawat,..., Ruck |
3 |
2024-02-26 |
Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision |
link |
Jiang, Fan,..., Trevor |
3 |
2024-02-19 |
KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students |
link |
Shu, Matthew,..., Jordan Lee |
3 |
2024-02-02 |
Distractor Generation in Multiple-Choice Tasks: A Survey of Methods, Datasets, and Evaluation |
link |
Alhazmi, Elaf,..., Ahoud |
3 |
2024-10-10 |
Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation |
link |
Agrawal, Sweta,..., Andre |
3 |
2024-02-28 |
LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History |
link |
Gupta, Akash,..., Mario |
3 |
None |
Revisiting Automated Evaluation for Long-form Table Question Answering |
link |
Wang, Yuqi,..., Yilun |
3 |
None |
HalluMeasure: Fine-grained Hallucination Measurement Using Chain-of-Thought Reasoning |
link |
Akbar, Shayan Ali,..., Erwin |
3 |
2024-09-23 |
FineCops-Ref: A new Dataset and Task for Fine-Grained Compositional Referring Expression Comprehension |
link |
Liu, Junzhuo,..., Peng |
3 |
2024-10-01 |
VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models |
link |
Wang, Jiapeng,..., Lianwen |
3 |
2024-07-24 |
CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models |
link |
Gu, Jiawei,..., Fei |
3 |
2024-05-05 |
Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning Through Trap Problems |
link |
Zhao, Jun,..., Xuanjing |
3 |
None |
Working Memory Identifies Reasoning Limits in Language Models |
link |
Zhang, Chunhui,..., Soroush |
3 |
2024-06-18 |
Measuring Psychological Depth in Language Models |
link |
Harel-Canada, Fabrice Y,..., Nanyun |
3 |
2024-09-23 |
Knowledge Planning in Large Language Models for Domain-Aligned Counseling Summarization |
link |
Srivastava, Aseem,..., Md Shad |
3 |
2024-06-20 |
From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment |
link |
Hirota, Yusuke,..., Yuta |
3 |
None |
Game on Tree: Visual Hallucination Mitigation via Coarse-to-Fine View Tree and Game Theory |
link |
Zhuang, Xianwei,..., Yuexian |
3 |
2024-10-01 |
What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study |
link |
Savoldi, Beatrice,..., Luisa |
3 |
2024-06-25 |
Dual-Space Knowledge Distillation for Large Language Models |
link |
Zhang, Songming,..., Jinan |
3 |
2024-09-23 |
ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions with Path Planning and Feedback |
link |
Wu, Qinzhuo,..., Bin |
3 |
2024-02-14 |
Recurrent Alignment with Hard Attention for Hierarchical Text Rating |
link |
Lin, Chenxi,..., Xiaomin |
3 |
2024-09-02 |
CHESS: Optimizing LLM Inference via Channel-Wise Thresholding and Selective Sparsification |
link |
He, Junhui,..., Qingan |
3 |
2024-10-21 |
Surprise! Uniform Information Density Isn`t the Whole Story: Predicting Surprisal Contours in Long-form Discourse |
link |
Tsipidi, Eleftheria,..., Alex |
3 |
2024-10-07 |
Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality |
link |
Oh, Youngtaek,..., Junmo |
3 |
2024-02-25 |
DetoxLLM: A Framework for Detoxification with Explanations |
link |
Khondaker, Md Tawkat Islam,..., Laks V. S. |
3 |
2024-06-22 |
CaT-Bench: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans |
link |
Lal, Yash Kumar,..., Ray |
3 |
2024-10-03 |
CodeJudge: Evaluating Code Generation with Large Language Models |
link |
Tong, Weixi,..., Tianyi |
3 |
2024-06-28 |
Self-Training Large Language and Vision Assistant for Medical Question Answering |
link |
Sun, Guohao,..., Zhiqiang |
3 |
2024-06-12 |
Updating CLIP to Prefer Descriptions Over Captions |
link |
Zur, Amir,..., Atticus |
3 |
2024-10-14 |
AlphaLoRA: Assigning LoRA Experts Based on Layer Training Quality |
link |
Qing, Peijun,..., Soroush |
3 |
2024-06-24 |
Multi-LogiEval: Towards Evaluating Multi-Step Logical Reasoning Ability of Large Language Models |
link |
Patel, Nisarg,..., Chitta |
3 |
None |
Memorize Step by Step: Efficient Long-Context Prefilling with Incremental Memory and Decremental Chunk |
link |
Zeng, Zhiyuan,..., Xipeng |
3 |
2024-02-28 |
Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps |
link |
Attanasio, Giuseppe,..., Dirk |
3 |
2024-08-22 |
Preference-Guided Reflective Sampling for Aligning Language Models |
link |
Ye, Hai,..., Hwee Tou |
3 |
2024-06-20 |
xCOMET-lite: Bridging the Gap Between Efficiency and Quality in Learned MT Evaluation Metrics |
link |
Larionov, Daniil,..., Steffen |
3 |
2024-10-07 |
The LLM Effect: Are Humans Truly Using LLMs, or Are They Being Influenced By Them Instead? |
link |
Choi, Alexander,..., Antonios |
3 |
2024-09-29 |
Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code |
link |
Chae, Hyungjoo,..., Jinyoung |
3 |
2024-10-31 |
Nearest Neighbor Normalization Improves Multimodal Retrieval |
link |
Chowdhury, Neil,..., Tristan |
2 |
2024-04-15 |
Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation |
link |
Choi, Juhwan,..., YoungBin |
2 |
2024-04-17 |
FIZZ: Factual Inconsistency Detection by Zoom-in Summary and Zoom-out Document |
link |
Yang, Joonho,..., Hwanhee |
2 |
2024-02-16 |
Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model |
link |
Zhang, Xiangyu,..., Lina |
2 |
2024-05-27 |
HEART-felt Narratives: Tracing Empathy and Narrative Style in Personal Stories with LLMs |
link |
Shen, Jocelyn,..., Maarten |
2 |
None |
MAR: Matching-Augmented Reasoning for Enhancing Visual-based Entity Question Answering |
link |
Zhang, Zhengxuan,..., Nan |
2 |
2023-11-14 |
DA$^3$: A Distribution-Aware Adversarial Attack against Language Models |
link |
Wang, Yibo,..., Philip S. |
2 |
2024-09-24 |
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control |
link |
Zhang, Yu,..., Zhou |
2 |
2024-06-18 |
FuseGen: PLM Fusion for Data-generation based Zero-shot Learning |
link |
Zou, Tianyuan,..., Ya-Qin |
2 |
2024-07-20 |
I Need Help! Evaluating LLM`s Ability to Ask for Users' Support: A Case Study on Text-to-SQL Generation |
link |
Wu, Cheng-Kuang,..., Yun-Nung |
2 |
2024-03-01 |
Surveying the Dead Minds: Historical-Psychological Text Analysis with Contextualized Construct Representation (CCR) for Classical Chinese |
link |
Chen, Yuqi,..., Mohammad |
2 |
None |
Alignment-Enhanced Decoding: Defending Jailbreaks via Token-Level Adaptive Refining of Probability Distributions |
link |
Liu, Quan,..., Sen |
2 |
None |
DGLF: A Dual Graph-based Learning Framework for Multi-modal Sarcasm Detection |
link |
Zhu, Zhihong,..., Yefeng |
2 |
None |
BC-Prover: Backward Chaining Prover for Formal Theorem Proving |
link |
He, Yuhang,..., Wotao |
2 |
2024-04-16 |
Autoregressive Pre-Training on Pixels and Texts |
link |
Chai, Yekun,..., Hua |
2 |
2024-10-06 |
Fine-Grained Prediction of Reading Comprehension from Eye Movements |
link |
Shubi, Omer,..., Yevgeni |
2 |
None |
D2R: Dual-Branch Dynamic Routing Network for Multimodal Sentiment Detection |
link |
Chen, Yifan,..., Fenghuan |
2 |
2024-06-18 |
A Generic Method for Fine-grained Category Discovery in Natural Language Texts |
link |
Tian, Chang,..., Marie-Francine |
2 |
None |
VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation |
link |
Zou, Bocheng,..., Yong Jae |
2 |
2024-11-07 |
Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale |
link |
Palo, Flavio Di,..., Bilal H |
2 |
2024-10-09 |
Dissecting Fine-Tuning Unlearning in Large Language Models |
link |
Hong, Yihuai,..., Haiqin |
2 |
2024-10-05 |
Consistent Autoformalization for Constructing Mathematical Libraries |
link |
Zhang, Lan,..., Andre |
2 |
2024-01-13 |
MiTTenS: A Dataset for Evaluating Gender Mistranslation |
link |
Robinson, Kevin,..., Jasmijn |
2 |
2024-08-28 |
StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements |
link |
Fisher, Jillian,..., Yejin |
2 |
2024-07-08 |
VIMI: Grounding Video Generation through Multi-modal Instruction |
link |
Fang, Yuwei,..., Sergey |
2 |
None |
DVD: Dynamic Contrastive Decoding for Knowledge Amplification in Multi-Document Question Answering |
link |
Jin, Jing,..., Zhijiang |
2 |
2024-06-13 |
An Unsupervised Approach to Achieve Supervised-Level Explainability in Healthcare Records |
link |
Edin, Joakim,..., Tuukka |
2 |
2024-09-20 |
MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension |
link |
Liu, Ting,..., Quanjun |
2 |
2024-06-27 |
Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Utilization |
link |
Ko, Miyoung,..., Minjoon |
2 |
2024-06-20 |
An LLM Feature-based Framework for Dialogue Constructiveness Assessment |
link |
Zhou, Lexin,..., Andreas |
2 |
2024-07-11 |
Investigating LLMs as Voting Assistants via Contextual Augmentation: A Case Study on the European Parliament Elections 2024 |
link |
Chalkidis, Ilias |
2 |
2024-10-08 |
Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models |
link |
Wang, Siqi,..., Jingang |
2 |
None |
Teaching Small Language Models Reasoning through Counterfactual Distillation |
link |
Feng, Tao,..., Yin |
2 |
None |
Pretraining Language Models Using Translationese |
link |
Doshi, Meet,..., Pushpak |
2 |
2024-06-18 |
ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations |
link |
Xiao, Yunze,..., Roy Ka-Wei |
2 |
2024-06-17 |
Boosting Scientific Concepts Understanding: Can Analogy from Teacher Models Empower Student Models? |
link |
Yuan, Siyu,..., Deqing |
2 |
2023-07-18 |
Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP |
link |
Basu, Samyadeep,..., Soheil |
2 |
2024-06-16 |
Reconsidering Sentence-Level Sign Language Translation |
link |
Tanzer, Garrett,..., David |
2 |
2024-06-28 |
From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models |
link |
Bhatia, Mehar,..., Vered |
2 |
2024-02-21 |
Dynamic Multi-Reward Weighting for Multi-Style Controllable Generation |
link |
De Langis, Karin,..., Dongyeop |
2 |
2024-03-14 |
Revealing the Parallel Multilingual Learning within Large Language Models |
link |
Mu, Yongyu,..., JingBo |
2 |
2024-05-30 |
Encoding and Controlling Global Semantics for Long-form Video Question Answering |
link |
Nguyen, Thong Thanh,..., Anh Tuan |
2 |
2024-10-25 |
Taxonomy-guided Semantic Indexing for Academic Paper Search |
link |
Kang, SeongKu,..., Hwanjo |
2 |
2024-08-27 |
Advancing Adversarial Suffix Transfer Learning on Aligned Large Language Models |
link |
Liu, Hongfu,..., Michael |
2 |
2024-06-20 |
Aligning Large Language Models with Diverse Political Viewpoints |
link |
Stammbach, Dominik,..., Elliott |
2 |
None |
Rethinking the Reversal Curse of LLMs: a Prescription from Human Knowledge Reversal |
link |
Lu, Zhicong,..., Xunliang |
2 |
None |
GENRA: Enhancing Zero-shot Retrieval with Rank Aggregation |
link |
Katsimpras, Georgios,..., Georgios |
2 |
2024-07-02 |
MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring and Utilizing Latent Space |
link |
Tang, Yihong,..., Yuexian |
2 |
2024-10-08 |
Bridging Modalities: Enhancing Cross-Modality Hate Speech Detection with Few-Shot In-Context Learning |
link |
Hee, Ming Shan,..., Roy Ka-Wei |
2 |
2024-09-19 |
Efficient Performance Tracking: Leveraging Large Language Models for Automated Construction of Scientific Leaderboards |
link |
{\c{S}}ahinu{\c{c}}, Furkan,..., Iryna |
2 |
2024-10-17 |
AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning |
link |
Sun, Hao,..., Dawei |
2 |
2024-10-01 |
EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control |
link |
Chen, Haozhe,..., Julia |
2 |
2023-05-06 |
The Best Defense is Attack: Repairing Semantics in Textual Adversarial Examples |
link |
Yang, Heng,..., Ke |
2 |
2024-07-22 |
Perceptions of Linguistic Uncertainty by Language Models and Humans |
link |
Bel{\'e}m, Catarina G,..., Padhraic |
2 |
2024-07-09 |
LIONs: An Empirically Optimized Approach to Align Language Models |
link |
Yu, Xiao,..., Zhou |
2 |
2023-10-27 |
MOSEL: Inference Serving Using Dynamic Modality Selection |
link |
Hu, Bodun,..., Aditya |
2 |
2024-06-05 |
Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition |
link |
Su, Hsuan,..., Hung-yi |