801 |
2023-03-29 |
link |
G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment |
Liu, Yang,..., Chenguang |
586 |
2023-02-08 |
link |
Is ChatGPT a General-Purpose Natural Language Processing Task Solver? |
Qin, Chengwei,..., Diyi |
455 |
2023-05-17 |
link |
Evaluating Object Hallucination in Large Vision-Language Models |
Li, Yifan,..., Ji-Rong |
441 |
2022-10-20 |
link |
Large Language Models Can Self-Improve |
Huang, Jiaxin,..., Jiawei |
435 |
2023-05-23 |
link |
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation |
Min, Sewon,..., Hannaneh |
371 |
2023-05-22 |
link |
GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints |
Ainslie, Joshua,..., Sumit |
348 |
2023-05-23 |
link |
Enhancing Chat Language Models by Scaling High-quality Instructional Conversations |
Ding, Ning,..., Bowen |
344 |
2023-05-13 |
link |
CodeT5+: Open Code Large Language Models for Code Understanding and Generation |
Wang, Yue,..., Steven |
340 |
2023-05-24 |
link |
Reasoning with Language Model is Planning with World Model |
Hao, Shibo,..., Zhiting |
299 |
2023-03-16 |
link |
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models |
Manakul, Potsawee,..., Mark |
233 |
2023-05-24 |
link |
Enabling Large Language Models to Generate Text with Citations |
Gao, Tianyu,..., Danqi |
214 |
2023-05-04 |
link |
Automatic Prompt Optimization with "Gradient Descent" and Beam Search |
Pryzant, Reid,..., Michael |
213 |
2023-04-19 |
link |
Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agent |
Sun, Weiwei,..., Zhaochun |
212 |
2023-03-22 |
link |
MEGA: Multilingual Evaluation of Generative AI |
Ahuja, Kabir,..., Sunayana |
211 |
2023-05-22 |
link |
Editing Large Language Models: Problems, Methods, and Opportunities |
Yao, Yunzhi,..., Ningyu |
211 |
2023-05-24 |
link |
Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback |
Tian, Katherine,..., Christopher |
197 |
2023-04-28 |
link |
Dissecting Recall of Factual Associations in Auto-Regressive Language Models |
Geva, Mor,..., Amir |
188 |
2023-05-16 |
link |
StructGPT: A General Framework for Large Language Model to Reason over Structured Data |
Jiang, Jinhao,..., Ji-Rong |
165 |
2023-05-19 |
link |
HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models |
Li, Junyi,..., Ji-Rong |
161 |
2023-05-11 |
link |
Active Retrieval Augmented Generation |
Jiang, Zhengbao,..., Graham |
158 |
2023-04-04 |
link |
LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models |
Hu, Zhiqiang,..., Roy |
151 |
2023-03-22 |
link |
RepoCoder: Repository-Level Code Completion Through Iterative Retrieval and Generation |
Zhang, Fengji,..., Weizhu |
150 |
2023-05-24 |
link |
MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions |
Zhong, Zexuan,..., Danqi |
138 |
2023-05-22 |
link |
Can We Edit Factual Knowledge by In-Context Learning? |
Zheng, Ce,..., Baobao |
134 |
2023-10-16 |
link |
Character-LLM: A Trainable Agent for Role-Playing |
Shao, Yunfan,..., Xipeng |
134 |
2022-12-20 |
link |
SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization |
Kim, Hyunwoo,..., Yejin |
131 |
2023-05-23 |
link |
Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning |
Wang, Lean,..., Xu |
125 |
2023-05-24 |
link |
Adapting Language Models to Compress Contexts |
Chevalier, Alexis,..., Danqi |
100 |
2023-10-31 |
link |
Unlearn What You Want to Forget: Efficient Unlearning for LLMs |
Chen, Jiaao,..., Diyi |
96 |
2023-04-14 |
link |
API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs |
Li, Minghao,..., Yongbin |
90 |
2023-05-22 |
link |
LM vs LM: Detecting Factual Errors via Cross Examination |
Cohen, Roi,..., Amir |
87 |
2023-04-28 |
link |
Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4 |
Chang, Kent,..., David |
86 |
2023-10-08 |
link |
The Troubling Emergence of Hallucination in Large Language Models - An Extensive Definition, Quantification, and Prescriptive Remediations |
Rawte, Vipula,..., Amitava |
84 |
2023-04-05 |
link |
Document-Level Machine Translation with Large Language Models |
Wang, Longyue,..., Zhaopeng |
80 |
2023-05-23 |
link |
DetGPT: Detect What You Need via Reasoning |
Pi, Renjie,..., Tong |
80 |
2023-05-17 |
link |
Stop Uploading Test Data in Plain Text: Practical Strategies for Mitigating Data Contamination by Evaluation Benchmarks |
Jacovi, Alon,..., Yoav |
77 |
2023-05-02 |
link |
Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in Language Models |
Zhao, Shuai,..., Jie |
75 |
2023-04-25 |
link |
Answering Questions by Meta-Reasoning over Multiple Chains of Thought |
Yoran, Ori,..., Jonathan |
74 |
None |
link |
INSTRUCTSCORE: Towards Explainable Text Generation Evaluation with Automatic Feedback |
Xu, Wenda,..., Lei |
73 |
2023-02-10 |
link |
CodeBERTScore: Evaluating Code Generation with Pretrained Models of Code |
Zhou, Shuyan,..., Graham |
73 |
2023-05-21 |
link |
TheoremQA: A Theorem-driven Question Answering dataset |
Chen, Wenhu,..., Tony |
71 |
2023-03-14 |
link |
Query2doc: Query Expansion with Large Language Models |
Wang, Liang,..., Furu |
68 |
2023-03-02 |
link |
WiCE: Real-World Entailment for Claims in Wikipedia |
Kamoi, Ryo,..., Greg |
68 |
2023-10-09 |
link |
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models |
Jiang, Huiqiang,..., Lili |
67 |
2023-04-27 |
link |
We're Afraid Language Models Aren't Modeling Ambiguity |
Liu, Alisa,..., Yejin |
66 |
2023-05-24 |
link |
UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning |
Masry, Ahmed,..., Shafiq |
65 |
2023-02-17 |
link |
AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages |
Muhammad, Shamsuddeen,..., Stephen |
63 |
2023-05-18 |
link |
TrueTeacher: Learning Factual Consistency Evaluation with Large Language Models |
Gekhman, Zorik,..., Idan |
63 |
2023-05-23 |
link |
The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning |
Kim, Seungone,..., Minjoon |
63 |
2022-10-20 |
link |
Transcending Scaling Laws with 0.1% Extra Compute |
Tay, Yi,..., Mostafa |
62 |
2023-10-23 |
link |
LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers |
Olausson, Theo,..., Roger |
62 |
2023-05-15 |
link |
Symbol tuning improves in-context learning in language models |
Wei, Jerry,..., Quoc |
61 |
2023-03-07 |
link |
Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction |
Josifoski, Martin,..., Robert |
60 |
2023-05-03 |
link |
GPT-RE: In-context Learning for Relation Extraction using Large Language Models |
Wan, Zhen,..., Sadao |
59 |
2023-01-25 |
link |
XLM-V: Overcoming the Vocabulary Bottleneck in Multilingual Masked Language Models |
Liang, Davis,..., Madian |
58 |
2023-10-10 |
link |
Text Embeddings Reveal (Almost) As Much As Text |
Morris, John,..., Alexander |
58 |
2023-05-23 |
link |
Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models |
Ahia, Orevaoghene,..., Yulia |
57 |
2022-11-23 |
link |
SciRepEval: A Multi-Format Benchmark for Scientific Document Representations |
Singh, Amanpreet,..., Sergey |
57 |
2023-10-30 |
link |
What's "up" with vision-language models? Investigating their struggle with spatial reasoning |
Kamath, Amita,..., Kai-Wei |
56 |
2023-02-23 |
link |
Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions? |
Chen, Yang,..., Ming-Wei |
56 |
2022-08-08 |
link |
Investigating Efficiently Extending Transformers for Long Input Summarization |
Phang, Jason,..., Peter |
56 |
2023-10-19 |
link |
MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter |
Liu, Zhiyuan,..., Tat-Seng |
55 |
2023-10-24 |
link |
FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions |
Kim, Hyunwoo,..., Maarten |
55 |
2023-05-23 |
link |
Aligning Large Language Models through Synthetic Feedback |
Kim, Sungdong,..., Minjoon |
52 |
2023-05-24 |
link |
Do LLMs Understand Social Knowledge? Evaluating the Sociability of Large Language Models with SocKET Benchmark |
Choi, Minje,..., David |
52 |
2023-05-19 |
link |
How Does Generative Retrieval Scale to Millions of Passages? |
Pradeep, Ronak,..., Vinh |
52 |
2023-03-15 |
link |
UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation |
Cheng, Daixuan,..., Qi |
52 |
2023-03-17 |
link |
CoLT5: Faster Long-Range Transformers with Conditional Computation |
Ainslie, Joshua,..., Sumit |
50 |
2023-05-24 |
link |
GPTAraEval: A Comprehensive Evaluation of ChatGPT on Arabic NLP |
Khondaker, Md Tawkat Islam,..., Muhammad |
50 |
2023-04-13 |
link |
Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study |
Wang, Boxin,..., Bryan |
50 |
2022-11-03 |
link |
Inverse scaling can become U-shaped |
Wei, Jason,..., Quoc |
49 |
2023-05-24 |
link |
Don't Trust ChatGPT when your Question is not in English: A Study of Multilingual Abilities and Types of LLMs |
Zhang, Xiang,..., Grzegorz |
47 |
2023-10-23 |
link |
Evaluating Large Language Models on Controlled Generation Tasks |
Sun, Jiao,..., Xuezhe |
46 |
2023-10-24 |
link |
CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation |
Li, Minzhi,..., Diyi |
44 |
2023-10-23 |
link |
Cross-lingual Prompting: Improving Zero-shot Chain-of-Thought Reasoning across Languages |
Qin, Libo,..., Wanxiang |
44 |
2023-10-16 |
link |
Empirical Study of Zero-Shot NER with ChatGPT |
Xie, Tingyu,..., Hongwei |
44 |
2023-05-14 |
link |
FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge |
Feng, Shangbin,..., Yulia |
43 |
2023-10-17 |
link |
CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations |
Cheng, Myra,..., Diyi |
42 |
2023-04-24 |
link |
On the Challenges of Using Black-Box APIs for Toxicity Evaluation in Research |
Pozzobon, Luiza,..., Sara |
41 |
2023-10-11 |
link |
BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations |
Pei, Qizhi,..., Rui |
41 |
2023-05-23 |
link |
Multilingual Large Language Models Are Not (Yet) Code-Switchers |
Zhang, Ruochen,..., Alham |
40 |
2023-04-06 |
link |
Towards Interpretable Mental Health Analysis with Large Language Models |
Yang, Kailai,..., Sophia |
40 |
2023-10-23 |
link |
Federated Learning of Large Language Models with Parameter-Efficient Prompt Tuning and Adaptive Optimization |
Che, Tianshi,..., Dejing |
39 |
2023-03-11 |
link |
Consistency Analysis of ChatGPT |
Jang, Myeongjun,..., Thomas |
39 |
2023-10-25 |
link |
LLM-FP4: 4-Bit Floating-Point Quantized Transformers |
Liu, Shih-yang,..., Kwang-Ting |
39 |
2023-10-20 |
link |
Copyright Violations and Large Language Models |
Karamolegkou, Antonia,..., Anders |
38 |
2023-05-05 |
link |
Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements |
Liu, Jiacheng,..., Hannaneh |
38 |
2023-10-09 |
link |
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding |
Bae, Sangmin,..., Se-Young |
38 |
None |
link |
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs Through a Global Prompt Hacking Competition |
Schulhoff, Sander,..., Jordan |
38 |
2023-11-29 |
link |
ROBBIE: Robust Bias Evaluation of Large Generative Language Models |
Esiobu, David,..., Eric |
38 |
2023-10-16 |
link |
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models |
Qi, Jirui,..., Arianna |
37 |
2023-10-11 |
link |
Synthetic Data Generation with Large Language Models for Text Classification: Potential and Limitations |
Li, Zhuoyan,..., Ming |
37 |
2023-10-20 |
link |
Evaluation Metrics in the Era of GPT-4: Reliably Evaluating Large Language Models on Sequence to Sequence Tasks |
Sottana, Andrea,..., Zheng |
37 |
2023-10-10 |
link |
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition |
Radhakrishnan, Srijith,..., Jesper |
37 |
2023-10-09 |
link |
Compressing Context to Enhance Inference Efficiency of Large Language Models |
Li, Yucheng,..., Chenghua |
36 |
2023-05-23 |
link |
Grammar-Constrained Decoding for Structured NLP Tasks without Finetuning |
Geng, Saibo,..., Robert |
36 |
2023-10-29 |
link |
Poisoning Retrieval Corpora by Injecting Adversarial Passages |
Zhong, Zexuan,..., Danqi |
35 |
2023-04-21 |
link |
ReCEval: Evaluating Reasoning Chains via Correctness and Informativeness |
Prasad, Archiki,..., Mohit |
35 |
2023-10-16 |
link |
Theory of Mind for Multi-Agent Collaboration via Large Language Models |
Li, Huao,..., Katia |
35 |
2023-05-24 |
link |
Have LLMs Advanced Enough? A Challenging Problem Solving Benchmark For Large Language Models |
Arora, Daman,..., Mausam |
35 |
2022-10-16 |
link |
NormSAGE: Multi-Lingual Multi-Cultural Norm Discovery from Conversations On-the-Fly |
Fung, Yi,..., Heng |
34 |
2022-12-19 |
link |
DSI++: Updating Transformer Memory with New Documents |
Mehta, Sanket,..., Donald |
34 |
2023-10-13 |
link |
SeqXGPT: Sentence-Level AI-Generated Text Detection |
Wang, Pengyu,..., Xipeng |
34 |
2023-05-23 |
link |
Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation |
Yin, Da,..., Kai-Wei |
34 |
2023-05-22 |
link |
Rethinking the Evaluation for Conversational Recommendation in the Era of Large Language Models |
Wang, Xiaolei,..., Ji-Rong |
34 |
None |
link |
SummEdits: Measuring LLM Ability at Factual Reasoning Through The Lens of Summarization |
Laban, Philippe,..., Chien-Sheng |
33 |
2023-05-24 |
link |
ClusterLLM: Large Language Models as a Guide for Text Clustering |
Zhang, Yuwei,..., Jingbo |
32 |
2023-05-11 |
link |
When the Majority is Wrong: Modeling Annotator Disagreement for Subjective Tasks |
Fleisig, Eve,..., Dan |
32 |
2023-05-24 |
link |
A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis |
Stolfo, Alessandro,..., Mrinmaya |
31 |
2023-05-12 |
link |
NL2TL: Transforming Natural Languages to Temporal Logics using Large Language Models |
Chen, Yongchao,..., Chuchu |
31 |
2023-05-24 |
link |
Meta-Learning Online Adaptation of Language Models |
Hu, Nathan,..., Chelsea |
31 |
2023-10-11 |
link |
Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators |
Chen, Liang,..., Kam-Fai |
31 |
2023-11-03 |
link |
FinGPT: Large Generative Models for a Small Language |
Luukkonen, Risto,..., Sampo |
31 |
2023-02-26 |
link |
Navigating the Grey Area: How Expressions of Uncertainty and Overconfidence Affect Language Models |
Zhou, Kaitlyn,..., Tatsunori |
30 |
2023-11-27 |
link |
Cognitive Dissonance: Why Do Language Model Outputs Disagree with Internal Representations of Truthfulness? |
Liu, Kevin,..., Jacob |
30 |
2023-05-22 |
link |
ChatGPT to Replace Crowdsourcing of Paraphrases for Intent Classification: Higher Diversity and Comparable Model Robustness |
Cegin, Jan,..., Peter |
30 |
2023-10-23 |
link |
Exploring the Boundaries of GPT-4 in Radiology |
Liu, Qianchu,..., Javier |
30 |
2023-05-23 |
link |
Detecting and Mitigating Hallucinations in Multilingual Summarisation |
Qiu, Yifu,..., Shay |
29 |
2023-11-01 |
link |
Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks |
Kung, Po-Nien,..., Nanyun |
29 |
2023-12-04 |
link |
Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication |
Yin, Zhangyue,..., Xipeng |
29 |
2023-10-31 |
link |
DEPN: Detecting and Editing Privacy Neurons in Pretrained Language Models |
Wu, Xinwei,..., Deyi |
29 |
2023-10-13 |
link |
Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents |
Chae, Hyungjoo,..., Jinyoung |
29 |
2023-05-23 |
link |
Revisiting Machine Translation for Cross-lingual Classification |
Artetxe, Mikel,..., Luke |
29 |
2023-05-23 |
link |
Ties Matter: Meta-Evaluating Modern Metrics with Pairwise Accuracy and Tie Calibration |
Deutsch, Daniel,..., Markus |
28 |
2023-10-22 |
link |
Merging Generated and Retrieved Knowledge for Open-Domain QA |
Zhang, Yunxiang,..., Lu |
28 |
2023-03-01 |
link |
UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers |
Saad-Falcon, Jon,..., Christopher |
28 |
2023-10-11 |
link |
The Past, Present and Better Future of Feedback Learning in Large Language Models for Subjective Human Preferences and Values |
Kirk, Hannah,..., Scott |
27 |
2023-05-16 |
link |
Mirages: On Anthropomorphism in Dialogue Systems |
Abercrombie, Gavin,..., Zeerak |
27 |
2023-05-22 |
link |
SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation |
Clark, Elizabeth,..., Ankur |
27 |
2023-11-22 |
link |
Enhancing Uncertainty-Based Hallucination Detection with Stronger Focus |
Zhang, Tianhang,..., Luoyi |
27 |
2022-08-01 |
link |
Composable Text Controls in Latent Space with ODEs |
Liu, Guangyi,..., Zhiting |
26 |
2023-03-14 |
link |
Do Transformers Parse while Predicting the Masked Word? |
Zhao, Haoyu,..., Sanjeev |
26 |
2023-10-23 |
link |
Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models |
Hou, Yifan,..., Mrinmaya |
26 |
2022-10-31 |
link |
Where to start? Analyzing the potential value of intermediate models |
Choshen, Leshem,..., Yoav |
26 |
2023-05-22 |
link |
Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation |
Liang, Zhenwen,..., Ashwin |
26 |
2023-05-05 |
link |
NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial Reports |
Jullien, Mael,..., Andre |
26 |
2023-10-23 |
link |
Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts |
Liu, Tengxiao,..., Zheng |
26 |
2023-05-23 |
link |
Skill-Based Few-Shot Selection for In-Context Learning |
An, Shengnan,..., Jian-Guang |
26 |
2023-11-20 |
link |
Sparse Low-rank Adaptation of Pre-trained Language Models |
Ding, Ning,..., Maosong |
26 |
2023-10-07 |
link |
Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU |
Koto, Fajri,..., Timothy |
25 |
2023-10-14 |
link |
Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model |
Deng, Haikang,..., Colin |
25 |
None |
link |
trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback |
Havrilla, Alexander,..., Louis |
25 |
2023-10-24 |
link |
Fighting Fire with Fire: The Dual Role of LLMs in Crafting and Detecting Elusive Disinformation |
Lucas, Jason,..., Dongwon |
25 |
2023-05-19 |
link |
What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability |
Giulianelli, Mario,..., Barbara |
25 |
2023-05-16 |
link |
Generative Table Pre-training Empowers Models for Tabular Prediction |
Zhang, Tianping,..., Qian |
24 |
2023-05-23 |
link |
MemeCap: A Dataset for Captioning and Interpreting Memes |
Hwang, EunJeong,..., Vered |
24 |
2023-05-10 |
link |
Incorporating Structured Representations into Pretrained Vision & Language Models Using Scene Graphs |
Herzig, Roei,..., Amir |
24 |
2023-05-09 |
link |
MoT: Memory-of-Thought Enables ChatGPT to Self-Improve |
Li, Xiaonan,..., Xipeng |
24 |
2023-05-22 |
link |
clembench: Using Game Play to Evaluate Chat-Optimized Language Models as Conversational Agents |
Chalamalasetti, Kranti,..., David |
24 |
2023-05-24 |
link |
Towards Reliable Misinformation Mitigation: Generalization, Uncertainty, and GPT-4 |
Pelrine, Kellin,..., Reihaneh |
24 |
2023-05-23 |
link |
Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning |
Yu, Xiao,..., Zhou |
23 |
2023-10-12 |
link |
Prompting Large Language Models with Chain-of-Thought for Few-Shot Knowledge Base Question Generation |
Liang, Yuanyuan,..., Yunshi |
23 |
None |
link |
FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models |
Dobler, Konstantin,..., Gerard |
23 |
2023-05-23 |
link |
Goal-Driven Explainable Clustering via Language Descriptions |
Wang, Zihan,..., Ruiqi |
23 |
2023-10-19 |
link |
StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical Understanding |
Jiayang, Cheng,..., Zheng |
23 |
None |
link |
Preserving Privacy Through Dememorization: An Unlearning Technique For Mitigating Memorization Risks In Language Models |
Kassem, Aly,..., Sherif |
22 |
2023-05-23 |
link |
Can Large Language Models Capture Dissenting Human Voices? |
Lee, Noah,..., James |
22 |
2023-10-19 |
link |
From Multilingual Complexity to Emotional Clarity: Leveraging Commonsense to Unveil Emotions in Code-Mixed Dialogues |
Kumar, Shivani,..., Tanmoy |
22 |
2023-10-24 |
link |
Large Language Models are Temporal and Causal Reasoners for Video Question Answering |
Ko, Dohwan,..., Hyunwoo |
22 |
2023-10-13 |
link |
Precedent-Enhanced Legal Judgment Prediction with LLM and Domain-Model Collaboration |
Wu, Yiquan,..., Kun |
22 |
2023-10-25 |
link |
Improving Diversity of Demographic Representation in Large Language Models via Collective-Critiques and Self-Voting |
Lahoti, Preethi,..., Jilin |
22 |
2023-10-14 |
link |
Self-Detoxifying Language Models via Toxification Reversal |
Leong, Chak,..., Wenjie |
22 |
2023-10-23 |
link |
PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter |
Yang, Haoyan,..., Jing |
22 |
2023-10-13 |
link |
KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection |
Choi, Sehyun,..., Yangqiu |
21 |
2023-05-24 |
link |
Privacy Implications of Retrieval-Based Language Models |
Huang, Yangsibo,..., Danqi |
21 |
2023-10-12 |
link |
Can We Edit Multimodal Large Language Models? |
Cheng, Siyuan,..., Ningyu |
21 |
2023-05-24 |
link |
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning |
Lu, Ximing,..., Yejin |
21 |
2023-05-18 |
link |
Comparing Biases and the Impact of Multilingual Training across Multiple Languages |
Levy, Sharon,..., Dan |
21 |
2023-05-19 |
link |
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation |
Dale, David,..., Marta |
20 |
2023-05-24 |
link |
Self-ICL: Zero-Shot In-Context Learning with Self-Generated Demonstrations |
Chen, Wei-Lin,..., Hsin-Hsi |
20 |
2023-10-20 |
link |
Conversation Chronicles: Towards Diverse Temporal and Relational Dynamics in Multi-Session Conversations |
Jang, Jihyoung,..., Hyounghun |
20 |
2023-05-24 |
link |
Contrastive Learning of Sentence Embeddings from Scratch |
Zhang, Junlei,..., Junxian |
20 |
2023-10-24 |
link |
Instruct and Extract: Instruction Tuning for On-Demand Information Extraction |
Jiao, Yizhu,..., Jiawei |
20 |
2023-12-13 |
link |
Large Language Models are Complex Table Parsers |
Zhao, Bowen,..., Xiaobo |
19 |
2023-05-24 |
link |
Selectively Answering Ambiguous Questions |
Cole, Jeremy,..., Jacob |
19 |
2023-03-07 |
link |
Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation |
Liu, Yixin,..., Dragomir |
19 |
2023-05-23 |
link |
Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language Compositionality |
Singh, Harman,..., Yu |
18 |
None |
link |
Stance Detection on Social Media with Background Knowledge |
Li, Ang,..., Ruifeng |
18 |
2023-10-24 |
link |
Characterizing Mechanisms for Factual Recall in Language Models |
Yu, Qinan,..., Ellie |
18 |
2023-05-17 |
link |
Temporal Knowledge Graph Forecasting Without Knowledge Using In-Context Learning |
Lee, Dong-Ho,..., Jay |
18 |
2023-11-25 |
link |
Faster Minimum Bayes Risk Decoding with Confidence-based Pruning |
Cheng, Julius,..., Andreas |
18 |
2023-05-22 |
link |
Prompting is not a substitute for probability measurements in large language models |
Hu, Jennifer,..., Roger |
18 |
2023-10-22 |
link |
PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation |
Sahu, Gaurav,..., Issam |
18 |
2023-10-23 |
link |
Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Models |
Kim, Gangwoo,..., Jaewoo |
18 |
2023-10-07 |
link |
Crystal: Introspective Reasoners Reinforced with Self-Feedback |
Liu, Jiacheng,..., Asli |
17 |
2023-10-26 |
link |
CodeFusion: A Pre-trained Diffusion Model for Code Generation |
Singh, Mukul,..., Gust |
17 |
2023-04-18 |
link |
Outlier Suppression+: Accurate quantization of large language models by equivalent and effective shifting and scaling |
Wei, Xiuying,..., Xianglong |
17 |
2023-10-31 |
link |
Making Large Language Models Better Data Creators |
Lee, Dong-Ho,..., Sujay |
17 |
2023-10-18 |
link |
The Curious Case of Hallucinatory (Un)answerability: Finding Truths in the Hidden States of Over-Confident Large Language Models |
Slobodkin, Aviv,..., Shauli |
17 |
2023-05-23 |
link |
Evaluation of African American Language Bias in Natural Language Generation |
Deas, Nicholas,..., Kathleen |
17 |
2023-10-24 |
link |
BLESS: Benchmarking Large Language Models on Sentence Simplification |
Kew, Tannon,..., Matthew |
17 |
2023-05-24 |
link |
Evaluating Evaluation Metrics: A Framework for Analyzing NLG Evaluation Metrics using Measurement Theory |
Xiao, Ziang,..., Q. Vera |
17 |
2023-05-23 |
link |
IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions |
Yu, Wenhao,..., Ashish |
17 |
2023-11-29 |
link |
Unveiling the Implicit Toxicity in Large Language Models |
Wen, Jiaxin,..., Minlie |
17 |
2023-10-19 |
link |
The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions |
Ouyang, Siru,..., Jiawei |
17 |
2023-10-23 |
link |
ALCUNA: Large Language Models Meet New Knowledge |
Yin, Xunjian,..., Xiaojun |
16 |
2023-10-16 |
link |
Generating Summaries with Controllable Readability Levels |
Ribeiro, Leonardo F. R.,..., Markus |
16 |
None |
link |
Dr ChatGPT tell me what I want to hear: How different prompts impact health answer correctness |
Koopman, Bevan,..., Guido |
16 |
2023-11-02 |
link |
People Make Better Edits: Measuring the Efficacy of LLM-Generated Counterfactually Augmented Data for Harmful Language Detection |
Sen, Indira,..., Claudia |
16 |
2023-10-08 |
link |
Guideline Learning for In-context Information Extraction |
Pang, Chaoxu,..., Ping |
16 |
None |
link |
Enhancing Code-Switching for Cross-lingual SLU: A Unified View of Semantic and Grammatical Coherence |
Zhu, Zhihong,..., Yuexian |
16 |
2023-05-23 |
link |
CombLM: Adapting Black-Box Language Models through Small Fine-Tuned Models |
Ormazabal, Aitor,..., Eneko |
16 |
2023-10-23 |
link |
Specialist or Generalist? Instruction Tuning for Specific NLP Tasks |
Shi, Chufan,..., Deng |
15 |
2023-10-24 |
link |
Failures Pave the Way: Enhancing Large Language Models through Tuning-free Rule Accumulation |
Yang, Zeyuan,..., Yang |
15 |
2023-10-27 |
link |
Modeling Legal Reasoning: LM Annotation at the Edge of Human Agreement |
Thalken, Rosamond,..., Matthew |
15 |
2023-05-19 |
link |
Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs |
Aggarwal, Pranjal,..., Mausam |
15 |
2023-10-13 |
link |
Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration |
Wan, Fanqi,..., Shuming |
15 |
None |
link |
CoCo: Coherence-Enhanced Machine-Generated Text Detection Under Low Resource With Contrastive Learning |
Liu, Xiaoming,..., Chao |
15 |
2023-05-19 |
link |
Mitigating Backdoor Poisoning Attacks through the Lens of Spurious Correlation |
He, Xuanli,..., Trevor |
15 |
2023-04-10 |
link |
A Cheaper and Better Diffusion Language Model with Soft-Masked Noise |
Chen, Jiaao,..., Diyi |
15 |
2023-10-18 |
link |
A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation |
Attanasio, Giuseppe,..., Anne |
15 |
None |
link |
Hallucination Detection for Generative Large Language Models by Bayesian Sequential Estimation |
Wang, Xiaohua,..., Xuanjing |
15 |
2023-05-23 |
link |
Sociocultural Norm Similarities and Differences via Situational Alignment and Explainable Textual Entailment |
CH-Wang, Sky,..., Smaranda |
15 |
2023-05-23 |
link |
Can Language Models Understand Physical Concepts? |
Li, Lei,..., Qi |
15 |
2023-10-08 |
link |
Counter Turing Test CT^2: AI-Generated Text Detection is Not as Easy as You May Think - Introducing AI Detectability Index |
Chakraborty, Megha,..., Amitava |
15 |
None |
link |
Enhancing Generative Retrieval with Reinforcement Learning from Relevance Feedback |
Zhou, Yujia,..., Ji-Rong |
14 |
2023-10-23 |
link |
GeoLM: Empowering Language Models for Geospatially Grounded Language Understanding |
Li, Zekun,..., Muhao |
14 |
2023-10-12 |
link |
ClimateBERT-NetZero: Detecting and Assessing Net Zero and Reduction Targets |
Schimanski, Tobias,..., Markus |
14 |
2023-10-30 |
link |
Large Language Models: The Need for Nuance in Current Debates and a Pragmatic Perspective on Understanding |
van Dijk, Bram,..., Max Johannes |
14 |
2023-05-24 |
link |
Editing Common Sense in Transformers |
Gupta, Anshita,..., Niket |
14 |
2023-05-24 |
link |
Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation |
Alhafni, Bashar,..., Nizar |
14 |
2023-01-27 |
link |
Byte Pair Encoding for Symbolic Music |
Fradet, Nathan,..., Jean-Pierre |
14 |
2023-03-13 |
link |
Model-tuning Via Prompts Makes NLP Models Adversarially Robust |
Raman, Mrigank,..., Danish |
14 |
2023-10-29 |
link |
EtiCor: Corpus for Analyzing LLMs for Etiquettes |
Dwivedi, Ashutosh,..., Ashutosh |
14 |
2023-05-23 |
link |
Evaluating and Modeling Attribution for Cross-Lingual Question Answering |
Muller, Benjamin,..., Xinyi |
14 |
2023-10-26 |
link |
Global Voices, Local Biases: Socio-Cultural Prejudices across Languages |
Mukherjee, Anjishnu,..., Antonios |
14 |
2023-05-22 |
link |
How do languages influence each other? Studying cross-lingual data sharing during LLM fine-tuning |
Choenni, Rochelle,..., Ekaterina |
14 |
2023-10-20 |
link |
Decoding the Silent Majority: Inducing Belief Augmented Social Graph with Large Language Model for Response Forecasting |
Sun, Chenkai,..., Heng |
14 |
2023-05-22 |
link |
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer |
Liu, Huadai,..., Zhou |
13 |
2023-10-09 |
link |
Learning Language-guided Adaptive Hyper-modality Representation for Multimodal Sentiment Analysis |
Zhang, Haoyu,..., Tianshu |
13 |
None |
link |
UniMath: A Foundational and Multimodal Mathematical Reasoner |
Liang, Zhenwen,..., Xiangliang |
13 |
2023-10-23 |
link |
Towards LLM-driven Dialogue State Tracking |
Feng, Yujie,..., Xiao-Ming |
13 |
2023-05-22 |
link |
SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables |
Lu, Xinyuan,..., Min-Yen |
13 |
None |
link |
Language Model Quality Correlates with Psychometric Predictive Power in Multiple Languages |
Wilcox, Ethan,..., Tiago |
13 |
2023-02-02 |
link |
IC3: Image Captioning by Committee Consensus |
Chan, David,..., John |
13 |
2023-03-28 |
link |
Explicit Planning Helps Language Models in Logical Reasoning |
Zhao, Hongyu,..., Hongyuan |
13 |
2023-05-22 |
link |
Lion: Adversarial Distillation of Proprietary Large Language Models |
Jiang, Yuxin,..., Wei |
13 |
2023-11-06 |
link |
Incorporating Worker Perspectives into MTurk Annotation Practices for NLP |
Huang, Olivia,..., Dan |
13 |
2023-10-20 |
link |
Multi-level Contrastive Learning for Script-based Character Understanding |
Li, Dawei,..., Shiping |
13 |
2023-11-02 |
link |
Self-Influence Guided Data Reweighting for Language Model Pre-training |
Thakkar, Megh,..., Partha |
13 |
2023-03-15 |
link |
PRESTO: A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs |
Goel, Rahul,..., Zhou |
13 |
2023-05-23 |
link |
Multilingual Pixel Representations for Translation and Effective Cross-lingual Transfer |
Salesky, Elizabeth,..., Matt |
13 |
2023-10-23 |
link |
We are Who We Cite: Bridges of Influence Between Natural Language Processing and Other Academic Fields |
Wahle, Jan Philip,..., Saif |
13 |
2023-05-24 |
link |
Mitigating Temporal Misalignment by Discarding Outdated Facts |
Zhang, Michael,..., Eunsol |
13 |
2023-05-19 |
link |
Prompting with Pseudo-Code Instructions |
Mishra, Mayank,..., Srikanth |
13 |
2023-10-11 |
link |
How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances |
Zhang, Zihan,..., Jun |
13 |
None |
link |
Syllogistic Reasoning for Legal Judgment Analysis |
Deng, Wentao,..., Pengjie |
13 |
2023-03-16 |
link |
GLEN: General-Purpose Event Detection for Thousands of Types |
Li, Sha,..., Jiawei |
12 |
2023-10-23 |
link |
Non-autoregressive Streaming Transformer for Simultaneous Translation |
Ma, Zhengrui,..., Yang |
12 |
2023-11-09 |
link |
Enhancing Computation Efficiency in Large Language Models through Weight and Activation Quantization |
Lee, Janghwan,..., Jungwook |
12 |
None |
link |
On the Benefits of Learning to Route in Mixture-of-Experts Models |
Dikkala, Nishanth,..., Xin |
12 |
2023-10-20 |
link |
Optimizing Retrieval-augmented Reader Models via Token Elimination |
Berchansky, Moshe,..., Moshe |
12 |
None |
link |
Query Rewriting in Retrieval-Augmented Large Language Models |
Ma, Xinbei,..., Nan |
12 |
2023-10-16 |
link |
TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models |
Xiong, Jing,..., Qun |
12 |
2023-11-29 |
link |
Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings |
Wen-Yi, Andrea W,..., David |
12 |
2023-11-08 |
link |
Conversation Understanding using Relational Temporal Graph Neural Networks with Auxiliary Cross-Modality Interaction |
Nguyen, Cam Van Thi,..., Duc-Trong |
12 |
2023-10-17 |
link |
Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction |
Zhang, Chong,..., Tao |
12 |
2023-11-14 |
link |
TempTabQA: Temporal Question Answering for Semi-Structured Tables |
Gupta, Vivek,..., Vivek |
12 |
2023-11-30 |
link |
Unnatural Error Correction: GPT-4 Can Almost Perfectly Handle Unnatural Scrambled Text |
Cao, Qi,..., Yusuke |
12 |
2023-11-09 |
link |
Mirror: A Universal Framework for Various Information Extraction Tasks |
Zhu, Tong,..., Min |
12 |
2023-10-21 |
link |
MedEval: A Multi-Level, Multi-Task, and Multi-Domain Medical Benchmark for Language Model Evaluation |
He, Zexue,..., Chun-Nan |
12 |
2023-05-24 |
link |
A Question Answering Framework for Decontextualizing User-facing Snippets from Scientific Documents |
Newman, Benjamin,..., Kyle |
12 |
2023-10-29 |
link |
Pushdown Layers: Encoding Recursive Structure in Transformer Language Models |
Murty, Shikhar,..., Christopher |
12 |
2023-11-27 |
link |
DUnE: Dataset for Unified Editing |
Akyürek, Afra,..., Derry |
12 |
None |
link |
A Self-training Framework for Automated Medical Report Generation |
Wang, Siyuan,..., Bo |
12 |
None |
link |
SPT: Learning to Selectively Insert Prompts for Better Prompt Tuning |
Zhu, Wei,..., Ming |
12 |
2023-11-15 |
link |
Token Prediction as Implicit Classification to Identify LLM-Generated Text |
Chen, Yutian,..., Bhiksha |
12 |
2023-10-09 |
link |
Task-Adaptive Tokenization: Enhancing Long-Form Text Generation Efficacy in Mental Health and Beyond |
Liu, Siyang,..., Rada |
12 |
2023-05-09 |
link |
CaseEncoder: A Knowledge-enhanced Pre-trained Model for Legal Case Encoding |
Ma, Yixiao,..., Yiqun |
12 |
2023-01-02 |
link |
MAUD: An Expert-Annotated Legal NLP Dataset for Merger Agreement Understanding |
Wang, Steven,..., Dan |
12 |
2023-10-19 |
link |
Fast and Accurate Factual Inconsistency Detection Over Long Documents |
Lattimer, Barrett,..., Yi |
11 |
2022-12-19 |
link |
CiteBench: A benchmark for Scientific Citation Text Generation |
Funkquist, Martin,..., Iryna |
11 |
2023-05-24 |
link |
Calc-X and Calcformers: Empowering Arithmetical Chain-of-Thought through Interaction with Symbolic Systems |
Kadlčík, Marek,..., Vlastimil |
11 |
2023-05-24 |
link |
Text encoders bottleneck compositionality in contrastive vision-language models |
Kamath, Amita,..., Kai-Wei |
11 |
2023-10-21 |
link |
Tree Prompting: Efficient Task Adaptation without Fine-Tuning |
Singh, Chandan,..., Yuntian |
11 |
2023-05-22 |
link |
TaskWeb: Selecting Better Source Tasks for Multi-task NLP |
Kim, Joongwon,..., Hannaneh |
11 |
2023-10-08 |
link |
Generative Spoken Language Model based on continuous word-sized audio tokens |
Algayres, Robin,..., Emmanuel |
11 |
2023-10-23 |
link |
Why LLMs Hallucinate, and How to Get (Evidential) Closure: Perceptual, Intensional, and Extensional Learning for Faithful Natural Language Generation |
Bouyamourn, Adam |
11 |
2023-03-16 |
link |
Exploring Distributional Shifts in Large Language Models for Code Analysis |
Arakelyan, Shushan,..., Xiang |
11 |
None |
link |
Representative Demonstration Selection for In-Context Learning with Two-Stage Determinantal Point Process |
Yang, Zhao,..., Kang |
11 |
2023-11-02 |
link |
CRUSH4SQL: Collective Retrieval Using Schema Hallucination For Text2SQL |
Kothyari, Mayank,..., Soumen |
11 |
2023-10-08 |
link |
Hi Guys or Hi Folks? Benchmarking Gender-Neutral Machine Translation with the GeNTE Corpus |
Piergentili, Andrea,..., Luisa |
11 |
2023-10-15 |
link |
Homophone Disambiguation Reveals Patterns of Context Mixing in Speech Transformers |
Mohebbi, Hosein,..., Afra |
11 |
2023-05-19 |
link |
AutoTrial: Prompting Language Models for Clinical Trial Design |
Wang, Zifeng,..., Jimeng |
11 |
2023-10-24 |
link |
Improving Biomedical Abstractive Summarisation with Knowledge Aggregation from Citation Papers |
Tang, Chen,..., Chenghua |
11 |
2023-12-04 |
link |
APoLLo: Unified Adapter and Prompt Learning for Vision Language Models |
Chowdhury, Sanjoy,..., Dinesh |
11 |
2023-05-22 |
link |
Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale |
Costa-jussà, Marta,..., Carleigh |
11 |
2023-01-30 |
link |
STAIR: Learning Sparse Text and Image Representation in Grounded Tokens |
Chen, Chen,..., Yinfei |
11 |
2023-05-22 |
link |
Discovering Universal Geometry in Embeddings with ICA |
Yamagiwa, Hiroaki,..., Hidetoshi |
11 |
2023-10-24 |
link |
Do Differences in Values Influence Disagreements in Online Discussions? |
van der Meer, Michiel,..., Pradeep |
11 |
2023-11-07 |
link |
CRAB: Assessing the Strength of Causal Relationships Between Real-world Events |
Romanou, Angelika,..., Antoine |
11 |
2023-10-21 |
link |
Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning |
Juneja, Gurusha,..., Tanmoy |
11 |
2023-10-15 |
link |
Lifelong Sequence Generation with Dynamic Module Expansion and Adaptation |
Qin, Chengwei,..., Shafiq |
11 |
2023-10-12 |
link |
Context Compression for Auto-regressive Transformers with Sentinel Tokens |
Ren, Siyu,..., Kenny |
11 |
2023-10-18 |
link |
From Dissonance to Insights: Dissecting Disagreements in Rationale Construction for Case Outcome Classification |
Xu, Shanshan,..., Matthias |
11 |
2023-07-25 |
link |
3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding |
Wang, Zehan,..., Zhou |
11 |
2023-10-20 |
link |
A Diachronic Perspective on User Trust in AI under Uncertainty |
Dhuliawala, Shehzaad,..., Mrinmaya |
10 |
2023-05-13 |
link |
Multilingual Previously Fact-Checked Claim Retrieval |
Pikuliak, Matúš,..., Maria |
10 |
2023-10-20 |
link |
Information Value: Measuring Utterance Predictability as Distance from Plausible Alternatives |
Giulianelli, Mario,..., Raquel |
10 |
2023-05-22 |
link |
DADA: Dialect Adaptation via Dynamic Aggregation of Linguistic Rules |
Liu, Yanchen,..., Diyi |
10 |
2023-10-20 |
link |
Benchmarking and Improving Text-to-SQL Generation under Ambiguity |
Bhaskar, Adithya,..., Sunita |
10 |
2023-10-31 |
link |
Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans? |
Zhang, Yichi,..., Joyce |
10 |
2023-11-07 |
link |
Unified Low-Resource Sequence Labeling by Sample-Aware Dynamic Sparse Finetuning |
Das, Sarkar Snigdha Sarathi,..., Rui |
10 |
2023-12-12 |
link |
HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts |
Do, Truong Giang,..., Steven |
10 |
2023-05-17 |
link |
Elaborative Simplification as Implicit Questions Under Discussion |
Wu, Yating,..., Junyi Jessy |
10 |
2023-10-18 |
link |
Zero-shot Faithfulness Evaluation for Text Summarization with Foundation Language Model |
Jia, Qi,..., Kenny |
10 |
2023-10-16 |
link |
Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT |
Song, Xiaoshuai,..., Weiran |
10 |
2023-10-15 |
link |
Merging Experts into One: Improving Computational Efficiency of Mixture of Experts |
He, Shwai,..., Dacheng |
10 |
2023-12-30 |
link |
ReasoningLM: Enabling Structural Subgraph Reasoning in Pre-trained Language Models for Question Answering over Knowledge Graph |
Jiang, Jinhao,..., Ji-Rong |
10 |
None |
link |
Cultural Concept Adaptation on Multimodal Reasoning |
Li, Zhi,..., Yin |
10 |
2023-05-23 |
link |
Dancing Between Success and Failure: Edit-level Simplification Evaluation using SALSA |
Heineman, David,..., Wei |
10 |
2023-02-09 |
link |
Explanation Selection Using Unlabeled Data for Chain-of-Thought Prompting |
Ye, Xi,..., Greg |
10 |
2023-10-08 |
link |
DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models |
Han, Chengcheng,..., Baoyuan |
10 |
2023-10-16 |
link |
VIBE: Topic-Driven Temporal Adaptation for Twitter Classification |
Zhang, Yuji,..., Wenjie |
10 |
2023-05-21 |
link |
Continually Improving Extractive QA via Human Feedback |
Gao, Ge,..., Eunsol |
10 |
2023-10-23 |
link |
SLOG: A Structural Generalization Benchmark for Semantic Parsing |
Li, Bingzhi,..., Najoung |
10 |
2023-10-26 |
link |
"Fifty Shades of Bias": Normative Ratings of Gender Bias in GPT Generated English Text |
Hada, Rishav,..., Kalika |
10 |
2023-10-20 |
link |
Bridging Information-Theoretic and Geometric Compression in Language Models |
Cheng, Emily,..., Marco |
10 |
2023-05-23 |
link |
Question Answering as Programming for Solving Time-Sensitive Questions |
Zhu, Xinyu,..., Yujiu |
10 |
2023-11-07 |
link |
Language Representation Projection: Can We Transfer Factual Knowledge across Languages in Multilingual Language Models? |
Xu, Shaoyang,..., Deyi |
10 |
2023-10-23 |
link |
NormDial: A Comparable Bilingual Synthetic Dialog Dataset for Modeling Social Norm Adherence and Violation |
Li, Oliver,..., Smaranda |
10 |
2023-10-20 |
link |
Analyzing Cognitive Plausibility of Subword Tokenization |
Beinborn, Lisa,..., Yuval |
10 |
2023-05-19 |
link |
Appraising the Potential Uses and Harms of LLMs for Medical Systematic Reviews |
Yun, Hye,..., Byron |
10 |
None |
link |
Beat LLMs at Their Own Game: Zero-Shot LLM-Generated Text Detection via Querying ChatGPT |
Zhu, Biru,..., Ming |
10 |
2023-10-23 |
link |
When Language Models Fall in Love: Animacy Processing in Transformer Language Models |
Hanna, Michael,..., Sandro |
10 |
None |
link |
APrompt: Attention Prompt Tuning for Efficient Adaptation of Pre-trained Language Models |
Wang, Qifan,..., Dongfang |
10 |
2023-10-20 |
link |
Seq2seq is All You Need for Coreference Resolution |
Zhang, Wenzheng,..., Karl |
9 |
2023-10-12 |
link |
Simplicity Level Estimate (SLE): A Learned Reference-Less Metric for Sentence Simplification |
Cripwell, Liam,..., Claire |
9 |
2023-10-19 |
link |
CLAIR: Evaluating Image Captions with Large Language Models |
Chan, David,..., John |
9 |
2022-12-18 |
link |
Rainproof: An Umbrella To Shield Text Generators From Out-Of-Distribution Data |
Darrin, Maxime,..., Pierre |
9 |
2023-05-24 |
link |
DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4 |
Hu, Yebowen,..., Fei |
9 |
None |
link |
Towards Noise-Tolerant Speech-Referring Video Object Segmentation: Bridging Speech and Text |
Li, Xiang,..., Bhiksha |
9 |
2023-10-19 |
link |
MAF: Multi-Aspect Feedback for Improving Reasoning in Large Language Models |
Nathani, Deepak,..., William |
9 |
2023-11-20 |
link |
Adapt in Contexts: Retrieval-Augmented Domain Adaptation via In-Context Learning |
Long, Quanyu,..., Sinno |
9 |
2023-05-24 |
link |
ByteSized32: A Corpus and Challenge Task for Generating Task-Specific World Models Expressed as Text Games |
Wang, Ruoyao,..., Peter |
9 |
2023-10-24 |
link |
DALE: Generative Data Augmentation for Low-Resource Legal NLP |
Ghosh, Sreyan,..., Dinesh |
9 |
2022-12-19 |
link |
Norm of word embedding encodes information gain |
Oyama, Momose,..., Hidetoshi |
9 |
2023-10-20 |
link |
Primacy Effect of ChatGPT |
Wang, Yiwei,..., Bryan |
9 |
2023-10-11 |
link |
Sparse Universal Transformer |
Tan, Shawn,..., Chuang |
9 |
2023-05-03 |
link |
The Benefits of Label-Description Training for Zero-Shot Text Classification |
Gao, Lingyu,..., Kevin |
9 |
2023-05-03 |
link |
PTP: Boosting Stability and Performance of Prompt Tuning with Perturbation-Based Regularizer |
Chen, Lichang,..., Minhao |
9 |
2023-10-21 |
link |
Revisiting Instruction Fine-tuned Model Evaluation to Guide Industrial Applications |
Faysse, Manuel,..., Pierre |
9 |
2023-05-23 |
link |
Language Models with Rationality |
Kassner, Nora,..., Peter |
9 |
None |
link |
Cross-Cultural Analysis of Human Values, Morals, and Biases in Folk Tales |
Wu, Winston,..., Rada |
9 |
2023-11-15 |
link |
Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language Models |
Michaelov, James,..., Ben |
9 |
None |
link |
Argument-based Detection and Classification of Fallacies in Political Debates |
Goffredo, Pierpaolo,..., Elena |
9 |
None |
link |
Expand, Highlight, Generate: RL-driven Document Generation for Passage Reranking |
Askari, Arian,..., Suzan |
9 |
2023-10-18 |
link |
LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following |
Yang, Cheng-Fu,..., Kai-Wei |
9 |
2023-05-23 |
link |
PIEClass: Weakly-Supervised Text Classification with Prompting and Noise-Robust Iterative Ensemble Training |
Zhang, Yunyi,..., Jiawei |
9 |
2023-10-23 |
link |
JointMatch: A Unified Approach for Diverse and Collaborative Pseudo-Labeling to Semi-Supervised Text Classification |
Zou, Henry,..., Cornelia |
8 |
2023-10-08 |
link |
An Investigation of LLMs' Inefficacy in Understanding Converse Relations |
Qi, Chengwen,..., Yuanjun |
8 |
2023-10-16 |
link |
BioPlanner: Automatic Evaluation of LLMs on Protocol Planning in Biology |
O'Donoghue, Odhran,..., Samuel |
8 |
2023-12-07 |
link |
OpenAsp: A Benchmark for Multi-document Open Aspect-based Summarization |
Amar, Shmuel,..., Ido |
8 |
2022-12-21 |
link |
ZEROTOP: Zero-Shot Task-Oriented Semantic Parsing using Large Language Models |
Mekala, Dheeraj,..., Subhro |
8 |
2023-10-20 |
link |
ALDi: Quantifying the Arabic Level of Dialectness of Text |
Keleg, Amr,..., Walid |
8 |
2023-11-02 |
link |
The Effect of Scaling, Retrieval Augmentation and Form on the Factual Consistency of Language Models |
Hagström, Lovisa,..., Richard |
8 |
2023-11-22 |
link |
ViStruct: Visual Structural Knowledge Extraction via Curriculum Guided Code-Vision Representation |
Chen, Yangyi,..., Heng |
8 |
2023-10-27 |
link |
Evaluating Cross-Domain Text-to-SQL Models and Benchmarks |
Pourreza, Mohammadreza,..., Davood |
8 |
None |
link |
Target-to-Source Augmentation for Aspect Sentiment Triplet Extraction |
Zhang, Yice,..., Ruifeng |
8 |
2023-11-18 |
link |
Joyful: Joint Modality Fusion and Graph Contrastive Learning for Multimodal Emotion Recognition |
Li, Dongyuan,..., Manabu |
8 |
2023-05-24 |
link |
Gender Biases in Automatic Evaluation Metrics for Image Captioning |
Qiu, Haoyi,..., Nanyun |
8 |
2023-10-15 |
link |
Prompting Scientific Names for Zero-Shot Species Recognition |
Parashar, Shubham,..., Shu |
8 |
2023-10-18 |
link |
The Sentiment Problem: A Critical Survey towards Deconstructing Sentiment Analysis |
Venkit, Pranav,..., Shomir |
8 |
2023-11-09 |
link |
Bridging the Digital Divide: Performance Variation across Socio-Economic Factors in Vision-Language Models |
Nwatu, Joan,..., Rada |
8 |
2023-05-12 |
link |
Interactive Text-to-SQL Generation via Editable Step-by-Step Explanations |
Tian, Yuan,..., Tianyi |
8 |
2023-10-23 |
link |
Adaptive Policy with Wait-$k$ Model for Simultaneous Translation |
Zhao, Libo,..., Zhongqiang |
8 |
2023-05-23 |
link |
Target-Agnostic Gender-Aware Contrastive Learning for Mitigating Bias in Multilingual Machine Translation |
Lee, Minwoo,..., Kyomin |
8 |
2023-10-16 |
link |
Towards a Better Understanding of Variations in Zero-Shot Neural Machine Translation Performance |
Tan, Shaomu,..., Christof |
8 |
2023-05-23 |
link |
Robust Prompt Optimization for Large Language Models Against Distribution Shifts |
Li, Moxin,..., Tat-Seng |
8 |
2023-10-20 |
link |
Democratizing Reasoning Ability: Tailored Learning from Large Language Model |
Wang, Zhaoyang,..., Qi |
8 |
2022-11-30 |
link |
KRLS: Improving End-to-End Response Generation in Task Oriented Dialog with Reinforced Keywords Learning |
Yu, Xiao,..., Zhou |
8 |
2023-03-02 |
link |
CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network |
Ghosh, Sreyan,..., Dinesh |
8 |
2023-05-16 |
link |
A Video Is Worth 4096 Tokens: Verbalize Videos To Understand Them In Zero Shot |
Bhattacharyya, Aanisha,..., Changyou |
8 |
2023-10-23 |
link |
Air-Decoding: Attribute Distribution Reconstruction for Decoding-Time Controllable Text Generation |
Zhong, Tianqi,..., Zhendong |
8 |
None |
link |
SMoP: Towards Efficient and Effective Prompt Tuning with Sparse Mixture-of-Prompts |
Choi, Joon-Young,..., SangKeun |
8 |
2023-10-10 |
link |
Rationale-Enhanced Language Models are Better Continual Relation Learners |
Xiong, Weimin,..., Sujian |
8 |
2023-05-22 |
link |
Look-back Decoding for Open-Ended Text Generation |
Xu, Nan,..., Xuezhe |
8 |
2023-10-23 |
link |
Diversify Question Generation with Retrieval-Augmented Style Transfer |
Gou, Qi,..., Nguyen |
8 |
2023-11-06 |
link |
Instructed Language Models with Retrievers Are Powerful Entity Linkers |
Xiao, Zilin,..., Daxin |
8 |
2023-05-24 |
link |
The Art of SOCRATIC QUESTIONING: Recursive Thinking with Large Language Models |
Qi, Jingyuan,..., Lifu |
8 |
2023-10-23 |
link |
Localizing Active Objects from Egocentric Vision with Symbolic World Knowledge |
Wu, Te-Lin,..., Nanyun |
8 |
2022-02-24 |
link |
Oolong: Investigating What Makes Transfer Learning Hard with Controlled Studies |
Wu, Zhengxuan,..., Isabel |
8 |
2023-05-23 |
link |
Pre-training Language Models for Comparative Reasoning |
Yu, Mengxia,..., Meng |
8 |
2023-05-22 |
link |
Can LLMs facilitate interpretation of pre-trained language models? |
Mousi, Basel,..., Fahim |
8 |
2023-10-23 |
link |
SpEL: Structured Prediction for Entity Linking |
Shavarani, Hassan,..., Anoop |
8 |
2023-05-24 |
link |
The ACL OCL Corpus: advancing Open science in Computational Linguistics |
Rohatgi, Shaurya,..., Min-Yen |
8 |
2023-05-24 |
link |
Focus Your Attention (with Adaptive IIR Filters) |
Lutati, Shahar,..., Lior |
7 |
None |
link |
Personalized Distillation: Empowering Open-Sourced LLMs with Adaptive Learning for Code Generation |
Chen, Hailin,..., Shafiq |
7 |
2023-10-26 |
link |
Joint Entity and Relation Extraction with Span Pruning and Hypergraph Neural Networks |
Yan, Zhaohui,..., Kewei |
7 |
2023-05-24 |
link |
Universal Self-adaptive Prompting |
Wan, Xingchen,..., Tomas |
7 |
None |
link |
Better Quality Pre-training Data and T5 Models for African Languages |
Oladipo, Akintunde,..., Jimmy |
7 |
2023-05-23 |
link |
Beyond Shared Vocabulary: Increasing Representational Word Similarities across Languages for Multilingual Machine Translation |
Wu, Di,..., Christof |
7 |
2023-03-06 |
link |
Models See Hallucinations: Evaluating the Factuality in Video Captioning |
Liu, Hui,..., Xiaojun |
7 |
2023-10-23 |
link |
CoF-CoT: Enhancing Large Language Models with Coarse-to-Fine Chain-of-Thought Prompting for Multi-domain NLU Tasks |
Nguyen, Hoang,..., Philip |
7 |
None |
link |
A Fine-Grained Taxonomy of Replies to Hate Speech |
Yu, Xinchen,..., Lingzi |
7 |
2023-10-23 |
link |
M2DF: Multi-grained Multi-curriculum Denoising Framework for Multimodal Aspect-based Sentiment Analysis |
Zhao, Fei,..., Xinyu |
7 |
2023-05-21 |
link |
Multilingual Simplification of Medical Texts |
Joseph, Sebastian,..., Junyi Jessy |
7 |
None |
link |
Dual-Channel Span for Aspect Sentiment Triplet Extraction |
Li, Pan,..., Kai |
7 |
2023-10-24 |
link |
Enhancing Biomedical Lay Summarisation with External Knowledge Graphs |
Goldsack, Tomas,..., Chenghua |
7 |
2023-10-24 |
link |
A Diffusion Weighted Graph Framework for New Intent Discovery |
Shi, Wenkai,..., Ping |
7 |
2023-05-23 |
link |
Generating Data for Symbolic Language with Large Language Models |
Ye, Jiacheng,..., Tao |
7 |
None |
link |
Reduce Human Labor On Evaluating Conversational Information Retrieval System: A Human-Machine Collaboration Approach |
Huang, Chen,..., Jiancheng |
7 |
2023-10-11 |
link |
Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation |
Wang, Jian,..., Wenjie |
7 |
2023-10-10 |
link |
Rethinking Model Selection and Decoding for Keyphrase Generation with Pre-trained Sequence-to-Sequence Models |
Wu, Di,..., Kai-Wei |
7 |
2023-10-21 |
link |
Transductive Learning for Textual Few-Shot Classification in API-based Embedding Models |
Colombo, Pierre,..., Pablo |
7 |
None |
link |
Rumor Detection on Social Media with Crowd Intelligence and ChatGPT-Assisted Networks |
Yang, Chang,..., Jiaming |
7 |
None |
link |
Generating Commonsense Counterfactuals for Stable Relation Extraction |
Miao, Xin,..., Tieyun |
7 |
2023-10-23 |
link |
API-Assisted Code Generation for Question Answering on Varied Table Structures |
Cao, Yihan,..., Daniel |
7 |
2023-11-13 |
link |
Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models |
Li, Junpeng,..., Zilong |
7 |
2023-10-23 |
link |
QUDEVAL: The Evaluation of Questions Under Discussion Discourse Parsing |
Wu, Yating,..., Junyi Jessy |
7 |
2023-11-07 |
link |
Multitask Multimodal Prompted Training for Interactive Embodied Task Completion |
Pantazopoulos, Georgios,..., Alessandro |
7 |
2022-05-25 |
link |
Non-Programmers Can Label Programs Indirectly via Active Examples: A Case Study with Text-to-SQL |
Zhong, Ruiqi,..., Jason |
7 |
2022-12-19 |
link |
Query-as-context Pre-training for Dense Passage Retrieval |
W, Xing,..., Songlin |
7 |
2023-11-03 |
link |
Support or Refute: Analyzing the Stance of Evidence to Detect Out-of-Context Mis- and Disinformation |
Yuan, Xin,..., Shujun |
7 |
2023-11-01 |
link |
Text Rendering Strategies for Pixel Language Models |
Lotz, Jonas,..., Desmond |
7 |
2023-10-23 |
link |
Assessing Step-by-Step Reasoning against Lexical Negation: A Case Study on Syllogism |
Ye, Mengyu,..., Hiroaki |
7 |
2023-10-24 |
link |
CP-BCS: Binary Code Summarization Guided by Control Flow Graph and Pseudo Code |
Ye, Tong,..., Wenhai |
7 |
2023-12-01 |
link |
A Comprehensive Evaluation of Biomedical Entity Linking Models |
Kartchner, David,..., Cassie |
7 |
None |
link |
Cross-Document Event Coreference Resolution on Discourse Structure |
Chen, Xinyu,..., Qiaoming |
7 |
None |
link |
Interventional Rationalization |
Yue, Linan,..., Zhenya |
7 |
2022-12-22 |
link |
When are Lemons Purple? The Concept Association Bias of Vision-Language Models |
Tang, Yingtian,..., Ilker |
7 |
2023-05-05 |
link |
A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding |
Burns, Andrea,..., Mandy |
7 |
2023-10-25 |
link |
Critic-Driven Decoding for Mitigating Hallucinations in Data-to-text Generation |
Lango, Mateusz,..., Ondrej |
7 |
None |
link |
CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular Data |
Zhang, Zhehao,..., Jian-Guang |
7 |
None |
link |
Tagging-Assisted Generation Model with Encoder and Decoder Supervision for Aspect Sentiment Triplet Extraction |
Xianlong, Luo,..., Yihao |
7 |
2023-12-09 |
link |
Understanding the Effect of Model Compression on Social Bias in Large Language Models |
Gonçalves, Gustavo,..., Emma |
7 |
2022-12-20 |
link |
AnyTOD: A Programmable Task-Oriented Dialog System |
Zhao, Jeffrey,..., Yonghui |
7 |
2023-05-23 |
link |
Modeling Empathic Similarity in Personal Narratives |
Shen, Jocelyn,..., Cynthia |
7 |
2023-12-06 |
link |
AMR Parsing is Far from Solved: GrAPES, the Granular AMR Parsing Evaluation Suite |
Groschwitz, Jonas,..., Meaghan |
7 |
2023-02-13 |
link |
The Framework Tax: Disparities Between Inference Efficiency in Research and Deployment |
Fernandez, Jared,..., Emma |
7 |
2023-10-16 |
link |
Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset |
Amalvy, Arthur,..., Richard |
7 |
2023-05-23 |
link |
Preserving Knowledge Invariance: Rethinking Robustness Evaluation of Open Information Extraction |
Qi, Ji,..., Xu |
6 |
2024-02-01 |
link |
Hidding the Ghostwriters: An Adversarial Evaluation of AI-Generated Student Essay Detection |
Peng, Xinlin,..., Yingfei |
6 |
2023-11-15 |
link |
Accelerating Toeplitz Neural Network with Constant-time Inference Complexity |
Qin, Zhen,..., Yiran |
6 |
2023-05-23 |
link |
What Else Do I Need to Know? The Effect of Background Information on Users' Reliance on QA Systems |
Goyal, Navita,..., Hal |
6 |
2023-03-20 |
link |
ChatEdit: Towards Multi-turn Interactive Facial Image Editing via Dialogue |
Cui, Xing,..., Zhaofeng |
6 |
2023-10-20 |
link |
A Unified View of Evaluation Metrics for Structured Prediction |
Chen, Yunmo,..., Benjamin |
6 |
2023-11-06 |
link |
GLEN: Generative Retrieval via Lexical Index Learning |
Lee, Sunkyung,..., Jongwuk |
6 |
2024-03-05 |
link |
EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs |
Tang, Hanlin,..., Zhanhui |
6 |
2023-05-13 |
link |
SCENE: Self-Labeled Counterfactuals for Extrapolating to Negative Examples |
Fu, Deqing,..., Robin |
6 |
2023-10-11 |
link |
RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation |
Zhang, Yue,..., Shuming |
6 |
2023-10-23 |
link |
CorefPrompt: Prompt-based Event Coreference Resolution by Measuring Event Type and Argument Compatibilities |
Xu, Sheng,..., Qiaoming |
6 |
2023-02-07 |
link |
Augmenting Zero-Shot Dense Retrievers with Plug-in Mixture-of-Memories |
Ge, Suyu,..., Paul |
6 |
2023-11-14 |
link |
Improving Image Captioning via Predicting Structured Concepts |
Wang, Ting,..., Zhendong |
6 |
2023-11-25 |
link |
E-CORE: Emotion Correlation Enhanced Empathetic Dialogue Generation |
Fu, Fengyi,..., Zhendong |
6 |
2023-11-17 |
link |
Countering Misinformation via Emotional Response Generation |
Russo, Daniel,..., Marco |
6 |
None |
link |
It Ain't Over: A Multi-aspect Diverse Math Word Problem Dataset |
Kim, Jiwoo,..., Jongwuk |
6 |
None |
link |
Unifying Cross-Lingual Transfer across Scenarios of Resource Scarcity |
Ansell, Alan,..., Edoardo |
6 |
2023-10-24 |
link |
This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models |
García-Ferrero, Iker,..., German |
6 |
2023-10-17 |
link |
Balance Act: Mitigating Hubness in Cross-Modal Retrieval with Query and Gallery Banks |
Wang, Yimu,..., Bo |
6 |
None |
link |
Identification of Multimodal Stance Towards Frames of Communication |
Weinzierl, Maxwell,..., Sanda |
6 |
None |
link |
Superlim: A Swedish Language Understanding Evaluation Benchmark |
Berdicevskis, Aleksandrs,..., Nina |
6 |
2023-10-23 |
link |
Continual Named Entity Recognition without Catastrophic Forgetting |
Zhang, Duzhen,..., Zhen |
6 |
2023-05-03 |
link |
Can LMs Generalize to Future Data? An Empirical Analysis on Text Summarization |
Cheang, Chi,..., Lidia |
6 |
2023-10-10 |
link |
Crossing the Threshold: Idiomatic Machine Translation through Retrieval Augmentation and Loss Weighting |
Liu, Emmy,..., Graham |
6 |
2023-10-13 |
link |
Retrieval-Generation Alignment for End-to-End Task-Oriented Dialogue System |
Shen, Weizhou,..., Wei |
6 |
2023-05-24 |
link |
KNN-LM Does Not Improve Open-ended Text Generation |
Wang, Shufan,..., Mohit |
6 |
None |
link |
Promoting Topic Coherence and Inter-Document Consorts in Multi-Document Summarization via Simplicial Complex and Sheaf Graph |
Atri, Yash,..., Vikram |
6 |
None |
link |
Multimodal Embodied Plan Prediction Augmented with Synthetic Embodied Dialogue |
Padmakumar, Aishwarya,..., Dilek |
6 |
None |
link |
PALS: Personalized Active Learning for Subjective Tasks in NLP |
Kanclerz, Kamil,..., Przemyslaw |
6 |
2023-05-23 |
link |
Natural Language Decompositions of Implicit Content Enable Better Text Representations |
Hoyle, Alexander,..., Philip |
6 |
2023-10-18 |
link |
Rather a Nurse than a Physician - Contrastive Explanations under Investigation |
Eberle, Oliver,..., Stephanie |
6 |
2022-03-28 |
link |
Automatic Debate Evaluation with Argumentation Semantics and Natural Language Argument Graph Networks |
Ruiz-Dolz, Ramon,..., Ana |
6 |
2023-10-24 |
link |
RAPL: A Relation-Aware Prototype Learning Approach for Few-Shot Document-Level Relation Extraction |
Meng, Shiao,..., Lijie |
6 |
2023-05-24 |
link |
GlobalBench: A Benchmark for Global Progress in Natural Language Processing |
Song, Yueqi,..., Graham |
6 |
2023-10-09 |
link |
Empower Nested Boolean Logic via Self-Supervised Curriculum Learning |
Wu, Hongqiu,..., Min |
6 |
2023-10-23 |
link |
Fidelity-Enriched Contrastive Search: Reconciling the Faithfulness-Diversity Trade-Off in Text Generation |
Chen, Wei-Lin,..., Chung-Chi |
6 |
2023-10-24 |
link |
Length is a Curse and a Blessing for Document-level Semantics |
Xiao, Chenghao,..., Noura |
6 |
2023-10-25 |
link |
CoheSentia: A Novel Benchmark of Incremental versus Holistic Assessment of Coherence in Generated Texts |
Maimon, Aviya,..., Reut |
6 |
2023-04-07 |
link |
GEMINI: Controlling The Sentence-Level Summary Style in Abstractive Text Summarization |
Bao, Guangsheng,..., Yue |
6 |
2023-10-22 |
link |
Be Selfish, But Wisely: Investigating the Impact of Agent Personality in Mixed-Motive Human-Agent Interactions |
Chawla, Kushal,..., Jonathan |
6 |
None |
link |
Do Language Models Have a Common Sense regarding Time? Revisiting Temporal Commonsense Reasoning in the Era of Large Language Models |
Jain, Raghav,..., Sandipan |
6 |
None |
link |
You Told Me That Joke Twice: A Systematic Investigation of Transferability and Robustness of Humor Detection Models |
Baranov, Alexander,..., Pavel |
5 |
2023-10-16 |
link |
DNA: Denoised Neighborhood Aggregation for Fine-grained Category Discovery |
An, Wenbin,..., Ping |
5 |
2023-11-20 |
link |
Efficient Grammatical Error Correction Via Multi-Task Training and Optimized Training Schedule |
Bout, Andrey,..., Irina |
5 |
2023-05-08 |
link |
Non-Autoregressive Math Word Problem Solver with Unified Tree Structure |
Bin, Yi,..., Heng |
5 |
2023-10-23 |
link |
The BLA Benchmark: Investigating Basic Language Abilities of Pre-Trained Multimodal Models |
Chen, Xinyi,..., Sandro |
5 |
2023-05-24 |
link |
Chain-of-Questions Training with Latent Answers for Robust Multistep Question Answering |
Zhu, Wang,..., Robin |
5 |
2022-10-14 |
link |
InterFair: Debiasing with Natural Language Feedback for Fair Interpretable Predictions |
Majumder, Bodhisattwa,..., Julian |
5 |
2023-11-09 |
link |
Dialogizer: Context-aware Conversational-QA Dataset Generation from Textual Sources |
Hwang, Yerin,..., Kyomin |
5 |
2023-10-24 |
link |
ScanDL: A Diffusion Model for Generating Synthetic Scanpaths on Texts |
Bolliger, Lena,..., Lena |
5 |
2023-05-08 |
link |
HistAlign: Improving Context Dependency in Language Generation by Aligning with History |
Wan, David,..., Mohit |
5 |
2023-04-05 |
link |
Conceptual structure coheres in human cognition but not in large language models |
Suresh, Siddharth,..., Timothy |
5 |
2023-10-24 |
link |
CleanCoNLL: A Nearly Noise-Free Named Entity Recognition Dataset |
Rücker, Susanna,..., Alan |
5 |
2022-10-11 |
link |
Hybrid Inverted Index Is a Robust Accelerator for Dense Retrieval |
Zhang, Peitian,..., Jing |
5 |
None |
link |
Not all quantifiers are equal: Probing Transformer-based language models' understanding of generalised quantifiers |
Madusanka, Tharindu,..., Riza |
5 |
2023-10-23 |
link |
EXPLAIN, EDIT, GENERATE: Rationale-Sensitive Counterfactual Data Augmentation for Multi-hop Fact Verification |
Zhu, Yingjie,..., Yulan |
5 |
2023-05-05 |
link |
Expository Text Generation: Imitate, Retrieve, Paraphrase |
Balepur, Nishant,..., Kevin |
5 |
2023-05-23 |
link |
Weakly-Supervised Learning of Visual Relations in Multimodal Pretraining |
Bugliarello, Emanuele,..., Lisa |
5 |
2023-10-25 |
link |
ZGUL: Zero-shot Generalization to Unseen Languages using Multi-source Ensembling of Language Adapters |
Rathore, Vipul,..., Mausam |
5 |
2023-10-24 |
link |
Practical Computational Power of Linear Transformers and Their Recurrent and Self-Referential Extensions |
Irie, Kazuki,..., Jürgen |
5 |
None |
link |
An Iteratively Parallel Generation Method with the Pre-Filling Strategy for Document-level Event Extraction |
Huang, Guanhua,..., Weinan |
5 |
2023-11-15 |
link |
End-to-end Task-oriented Dialogue: A Survey of Tasks, Methods, and Future Directions |
Qin, Libo,..., Min |
5 |
2023-10-20 |
link |
Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation |
Guo, Wenyu,..., Yang |
5 |
2023-10-17 |
link |
An Empirical Study of Translation Hypothesis Ensembling with Large Language Models |
Farinhas, António,..., Andre |
5 |
2023-10-10 |
link |
Generating and Evaluating Tests for K-12 Students with Language Model Simulations: A Case Study on Sentence Reading Efficiency |
Zelikman, Eric,..., Nick |
5 |
None |
link |
Continual Learning for Multilingual Neural Machine Translation via Dual Importance-based Model Division |
Liu, Junpeng,..., Degen |
5 |
2023-12-06 |
link |
Revisiting the Optimality of Word Lengths |
Pimentel, Tiago,..., Ryan |
5 |
2023-05-22 |
link |
A Diachronic Analysis of Paradigm Shifts in NLP Research: When, How, and Why? |
Pramanick, Aniket,..., Iryna |
5 |
2023-10-25 |
link |
Physician Detection of Clinical Harm in Machine Translation: Quality Estimation Aids in Reliance and Backtranslation Identifies Critical Errors |
Mehandru, Nikita,..., Niloufar |
5 |
None |
link |
Fine-grained Medical Vision-Language Representation Learning for Radiology Report Generation |
Wang, Siyuan,..., Qi |
5 |
None |
link |
Deciphering Stereotypes in Pre-Trained Language Models |
Ma, Weicheng,..., Soroush |
5 |
2023-10-27 |
link |
Elevating Code-mixed Text Handling through Auditory Information of Words |
Mamta, Mamta,..., Asif |
5 |
None |
link |
Length Does Matter: Summary Length can Bias Summarization Metrics |
Guo, Xiaobo,..., Soroush |
5 |
2023-05-24 |
link |
Pre-training Intent-Aware Encoders for Zero- and Few-Shot Intent Classification |
Sung, Mujeen,..., Vittorio |
5 |
2023-10-23 |
link |
The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64 Languages |
Zhang, Chiyu,..., Muhammad |
5 |
None |
link |
Detection of Multiple Mental Disorders from Social Media with Two-Stream Psychiatric Experts |
Chen, Siyuan,..., Kenny |
5 |
None |
link |
TacoPrompt: A Collaborative Multi-Task Prompt Learning Method for Self-Supervised Taxonomy Completion |
Xu, Hongyuan,..., Xiaojie |
5 |
2023-10-26 |
link |
PAC-tuning: Fine-tuning Pretrained Language Models with PAC-driven Perturbed Gradient Descent |
Liu, Guangliang,..., Rongrong |
5 |
2023-11-30 |
link |
Evaluating the Rationale Understanding of Critical Reasoning in Logical Reading Comprehension |
Kawabata, Akira,..., Saku |
5 |
2023-11-03 |
link |
Emergence of Abstract State Representations in Embodied Sequence Modeling |
Yun, Tian,..., Chen |
5 |
2023-10-08 |
link |
FLatS: Principled Out-of-Distribution Detection with Feature-Based Likelihood Ratio Score |
Lin, Haowei,..., Yuntian |
5 |
2023-10-19 |
link |
FinEntity: Entity-level Sentiment Classification for Financial Texts |
Tang, Yixuan,..., Justin |
5 |
2023-10-08 |
link |
Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference? |
Zhang, Cheng,..., Yiren |
5 |
2023-10-09 |
link |
Can language models learn analogical reasoning? Investigating training objectives and comparisons to human performance |
Petersen, Molly,..., Lonneke |
5 |
2023-10-25 |
link |
Back Transcription as a Method for Evaluating Robustness of Natural Language Understanding Models to Speech Recognition Errors |
Kubis, Marek,..., Tomasz |
5 |
None |
link |
A Frustratingly Easy Post-Training Quantization Scheme for LLMs |
Jeon, Yongkweon,..., Ho-young |
5 |
None |
link |
HutCRS: Hierarchical User-Interest Tracking for Conversational Recommender System |
Qian, Mingjie,..., Liang |
5 |
2023-10-23 |
link |
GD-COMET: A Geo-Diverse Commonsense Inference Model |
Bhatia, Mehar,..., Vered |
5 |
None |
link |
GATITOS: Using a New Multilingual Lexicon for Low-resource Machine Translation |
Jones, Alexander,..., Ishank |
5 |
2023-11-20 |
link |
Human Learning by Model Feedback: The Dynamics of Iterative Prompting with Midjourney |
Don-Yehiya, Shachar,..., Omri |
5 |
2023-10-23 |
link |
System Combination via Quality Estimation for Grammatical Error Correction |
Qorib, Muhammad Reza,..., Hwee Tou |
5 |
None |
link |
Unveiling the Essence of Poetry: Introducing a Comprehensive Dataset and Benchmark for Poem Summarization |
Mahbub, Ridwan,..., Sabbir |
5 |
2023-10-12 |
link |
Rethinking Negative Pairs in Code Search |
Li, Haochen,..., Chunyan |
5 |
2023-10-14 |
link |
An Expression Tree Decoding Strategy for Mathematical Equation Generation |
Zhang, Wenqi,..., Weiming |
5 |
2022-12-06 |
link |
DiSTRICT: Dialogue State Tracking with Retriever Driven In-Context Tuning |
Venkateswaran, Praveen,..., Vatche |
5 |
None |
link |
TrojanSQL: SQL Injection against Natural Language Interface to Database |
Zhang, Jinchuan,..., Songlin |
5 |
2023-10-22 |
link |
Large Language Models are biased to overestimate profoundness |
Herrera-Berg, Eugenio,..., Cristian |
5 |
2023-10-26 |
link |
TIMELINE: Exhaustive Annotation of Temporal Relations Supporting the Automatic Ordering of Events in News Articles |
Alsayyahi, Sarah,..., Riza |
5 |
None |
link |
Referring Image Segmentation via Joint Mask Contextual Embedding Learning and Progressive Alignment Network |
Huang, Ziling,..., Shin'ichi |
5 |
None |
link |
PromptST: Abstract Prompt Learning for End-to-End Speech Translation |
Yu, Tengfei,..., Min |
5 |
2023-05-21 |
link |
"Are Your Explanations Reliable?" Investigating the Stability of LIME in Explaining Text Classifiers by Marrying XAI and Adversarial Attack |
Burger, Christopher,..., Thai |
5 |
2023-11-01 |
link |
Noisy Exemplars Make Large Language Models More Robust: A Domain-Agnostic Behavioral Analysis |
Zheng, Hongyi,..., Abulhair |
5 |
None |
link |
MMNMT: Modularizing Multilingual Neural Machine Translation with Flexibly Assembled MoE and Dense Blocks |
Li, Shangjie,..., Deyi |
5 |
2023-05-23 |
link |
BiasX: "Thinking Slow" in Toxic Content Moderation with Explanations of Implied Social Biases |
Zhang, Yiming,..., Maarten |
5 |
2023-10-18 |
link |
Prototype-based HyperAdapter for Sample-Efficient Multi-task Tuning |
Zhao, Hao,..., Zhaofeng |
5 |
2023-05-04 |
link |
How to Enhance Causal Discrimination of Utterances: A Case on Affective Reasoning |
Chen, Hang,..., Wenjing |
5 |
2023-05-23 |
link |
EDIS: Entity-Driven Image Search over Multimodal Web Content |
Liu, Siqi,..., William |
5 |
2023-11-27 |
link |
FreeAL: Towards Human-Free Active Learning in the Era of Large Language Models |
Xiao, Ruixuan,..., Haobo |
5 |
None |
link |
mRedditSum: A Multimodal Abstractive Summarization Dataset of Reddit Threads with Images |
Overbay, Keighley,..., Gunhee |
5 |
2023-03-14 |
link |
Adapting Offline Speech Translation Models for Streaming with Future-Aware Distillation and Inference |
Fu, Biao,..., Xiaodong |
5 |
2023-05-23 |
link |
CompoundPiece: Evaluating and Improving Decompounding Performance of Language Models |
Minixhofer, Benjamin,..., Ivan |
5 |
2023-10-24 |
link |
MarkQA: A large scale KBQA dataset with numerical reasoning |
Huang, Xiang,..., Yuzhong |
5 |
2023-12-13 |
link |
ToViLaG: Your Visual-Language Generative Model is Also An Evildoer |
Wang, Xinpeng,..., Xing |
5 |
2023-10-24 |
link |
Learning From Free-Text Human Feedback - Collect New Datasets Or Extend Existing Ones? |
Petrak, Dominic,..., Iryna |
5 |
None |
link |
Hallucination Mitigation in Natural Language Generation from Large-Scale Open-Domain Knowledge Graphs |
Shi, Xiao,..., Chengkai |
5 |
2023-05-22 |
link |
FACTIFY3M: A Benchmark for Multimodal Fact Verification with Explainability through 5W Question-Answering |
Chakraborty, Megha,..., Amitava |
5 |
2023-05-23 |
link |
Challenges in Context-Aware Neural Machine Translation |
Jin, Linghao,..., Xuezhe |
5 |
None |
link |
Multi-Source Multi-Type Knowledge Exploration and Exploitation for Dialogue Generation |
Ni, Xuanfan,..., Piji |
5 |
2023-10-28 |
link |
Translating away Translationese without Parallel Data |
Jalota, Rricha,..., Josef |
5 |
2023-10-18 |
link |
Improving Long Document Topic Segmentation Models With Enhanced Coherence Modeling |
Yu, Hai,..., Wen |
5 |
2023-10-23 |
link |
Towards Conceptualization of "Fair Explanation": Disparate Impacts of anti-Asian Hate Speech Explanations on Content Moderators |
Nguyen, Tin,..., Marine |
5 |
2023-05-18 |
link |
Causal Document-Grounded Dialogue Pre-training |
Zhao, Yingxiu,..., Nevin |
5 |
2023-10-19 |
link |
Knowledge-Augmented Language Model Verification |
Baek, Jinheon,..., Sung |
5 |
2023-10-19 |
link |
A Predictive Factor Analysis of Social Biases and Task-Performance in Pretrained Masked Language Models |
Zhou, Yi,..., Danushka |
5 |
2023-11-15 |
link |
Data Similarity is Not Enough to Explain Language Model Performance |
Yauney, Gregory,..., David |
5 |
2023-05-22 |
link |
DUMB: A Benchmark for Smart Evaluation of Dutch Models |
de Vries, Wietse,..., Malvina |
5 |
2022-10-14 |
link |
COFFEE: Counterfactual Fairness for Personalized Text Generation in Explainable Recommendation |
Wang, Nan,..., Shaoliang |
5 |
2023-12-08 |
link |
Predictive Chemistry Augmented with Text Retrieval |
Qian, Yujie,..., Regina |
4 |
2023-05-19 |
link |
Reducing Sequence Length by Predicting Edit Spans with Large Language Models |
Kaneko, Masahiro,..., Naoaki |
4 |
2023-05-24 |
link |
Don't Take This Out of Context! On the Need for Contextual Models and Evaluations for Stylistic Rewriting |
Yerukola, Akhila,..., Maarten |
4 |
2023-10-22 |
link |
TATA: Stance Detection via Topic-Agnostic and Topic-Aware Embeddings |
Hanley, Hans,..., Zakir |
4 |
2023-10-25 |
link |
An Integrative Survey on Mental Health Conversational Agents to Bridge Computer Science and Medical Perspectives |
Cho, Young Min,..., Sharath |
4 |
2023-11-02 |
link |
Revisiting the Knowledge Injection Frameworks |
Fu, Peng,..., Junbo |
4 |
None |
link |
GazeVQA: A Video Question Answering Dataset for Multiview Eye-Gaze Task-Oriented Collaborations |
Ilaslan, Muhammet,..., Mike |
4 |
None |
link |
Contextual Interaction for Argument Post Quality Assessment |
Wang, Yiran,..., Le |
4 |
2023-05-18 |
link |
Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation |
Zhu, Wanrong,..., William |
4 |
2023-05-20 |
link |
Pointwise Mutual Information Based Metric and Decoding Strategy for Faithful Generation in Document Grounded Dialogs |
Nandwani, Yatin,..., Luis |
4 |
2023-05-24 |
link |
Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy |
Wiegreffe, Sarah,..., Ashish |
4 |
2023-06-04 |
link |
Exploring the Impact of Model Scaling on Parameter-Efficient Tuning |
Su, Yusheng,..., Maosong |
4 |
2023-10-28 |
link |
All Things Considered: Detecting Partisan Events from News Media with Cross-Article Comparison |
Liu, Yujian,..., Lu |
4 |
2023-05-23 |
link |
Condensing Multilingual Knowledge with Lightweight Language-Specific Modules |
Xu, Haoran,..., Kenton |
4 |
2023-05-18 |
link |
Analyzing Norm Violations in Live-Stream Chat |
Moon, Jihyung,..., Sungjoon |
4 |
2023-05-23 |
link |
LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages |
Agarwal, Milind,..., Antonios |
4 |
2023-03-16 |
link |
A Picture is Worth a Thousand Words: Language Models Plan from Pixels |
Liu, Anthony,..., Honglak |
4 |
None |
link |
Out-of-Distribution Generalization in Natural Language Processing: Past, Present, and Future |
Yang, Linyi,..., Yue |
4 |
2023-10-19 |
link |
Predict the Future from the Past? On the Temporal Data Distribution Shift in Financial Sentiment Classifications |
Guo, Yue,..., Yi |
4 |
2023-10-20 |
link |
A Simple Baseline for Knowledge-Based Visual Question Answering |
Xenos, Alexandros,..., Georgios |
4 |
2023-05-15 |
link |
SuperDialseg: A Large-scale Dataset for Supervised Dialogue Segmentation |
Jiang, Junfeng,..., Akiko |
4 |
2023-05-23 |
link |
Towards A Unified View of Sparse Feed-Forward Network in Pretraining Large Language Model |
Liu, Zeyu,..., Xian |
4 |
2023-05-20 |
link |
Re-visiting Automated Topic Model Evaluation with Large Language Models |
Stammbach, Dominik,..., Elliott |
4 |
2023-10-27 |
link |
Lost in Translation, Found in Spans: Identifying Claims in Multilingual Social Media |
Mittal, Shubham,..., Preslav |
4 |
None |
link |
Confidence-based Ensembling of Perspective-aware Models |
Casola, Silvia,..., Cristina |
4 |
2023-05-05 |
link |
Open Information Extraction via Chunks |
Dong, Kuicai,..., Xiaoli |
4 |
2023-10-28 |
link |
Anaphor Assisted Document-Level Relation Extraction |
Lu, Chonggang,..., Yongyi |
4 |
2023-10-26 |
link |
GROOViST: A Metric for Grounding Objects in Visual Storytelling |
Surikuchi, Aditya,..., Raquel |
4 |
2023-05-22 |
link |
Cross-lingual Transfer Can Worsen Bias in Sentiment Analysis |
Goldfarb-Tarrant, Seraphina,..., Adam |
4 |
2023-10-18 |
link |
BanglaAbuseMeme: A Dataset for Bengali Abusive Meme Classification |
Das, Mithun,..., Animesh |
4 |
2023-11-07 |
link |
Analyzing Film Adaptation through Narrative Alignment |
Pial, Tanzir,..., Steven |
4 |
2023-10-26 |
link |
Language and Mental Health: Measures of Emotion Dynamics from Text as Linguistic Biosocial Markers |
Teodorescu, Daniela,..., Saif |
4 |
None |
link |
A Training-Free Debiasing Framework with Counterfactual Reasoning for Conversational Emotion Detection |
Tu, Geng,..., Ruifeng |
4 |
2023-10-22 |
link |
Can Language Models Laugh at YouTube Short-form Videos? |
Ko, Dayoon,..., Gunhee |
4 |
2023-10-16 |
link |
Investigating Bias in Multilingual Language Models: Cross-Lingual Transfer of Debiasing Techniques |
Reusens, Manon,..., Bart |
4 |
2023-10-11 |
link |
Hierarchical Pretraining on Multimodal Electronic Health Records |
Wang, Xiaochen,..., Fenglong |
4 |
2023-10-09 |
link |
An Attribution Method for Siamese Encoders |
Moeller, Lucas,..., Sebastian |
4 |
None |
link |
HyperRank: Hyperbolic Ranking Model for Unsupervised Keyphrase Extraction |
Song, Mingyang,..., Liping |
4 |
2022-11-14 |
link |
Semantic Similarity Models for Depression Severity Estimation |
Pérez, Anxo,..., Iryna |