Abstract
Generative Large Language Models (LLMs) have recently been widely utilized for their unprecedented capabil-ities across many tasks. Considering their use in high-stakes environments and for mission-critical applications, the fact that LLMs often can generate inaccurate or misleading results can be potentially harmful, which motivates us to study the correctness of generative LLM outputs. Uncertainty Estimation (UE) in generative LLMs is a developing area, with state-of-the-art probability-based techniques frequently using length-normalized scoring. As an alternative to length-normalized scoring in UE, in this work, we propose Meaning-Aware Response Scoring (MARS). The key idea of MARS is to consider the semantic contribution of each token of the generated sequence to the context of the question during UE. Through extensive experiments on three question-answering datasets across five pretrained LLMs, we show that utilizing MARS during UE results in a universal and significant improvement in UE performance.
Original language | English |
---|---|
Title of host publication | 2024 IEEE International Symposium on Information Theory (ISIT) |
Publisher | IEEE |
Pages | 2033-2037 |
Number of pages | 5 |
ISBN (Electronic) | 9798350382846 |
ISBN (Print) | 9798350382853 |
DOIs | |
Publication status | Published - 19 Aug 2024 |
Externally published | Yes |
Event | 2024 IEEE International Symposium on Information Theory (ISIT) - Athens, Greece Duration: 7 Jul 2024 → 12 Jul 2024 https://2024.ieee-isit.org/home |
Publication series
Name | IEEE International Symposium on Information Theory |
---|---|
Publisher | IEEE |
ISSN (Print) | 2157-8095 |
ISSN (Electronic) | 2157-8117 |
Conference
Conference | 2024 IEEE International Symposium on Information Theory (ISIT) |
---|---|
Abbreviated title | IEEE ISIT2024 |
Country/Territory | Greece |
City | Athens |
Period | 7/07/24 → 12/07/24 |
Internet address |
Keywords
- Uncertainty
- Large language models
- Semantics
- Mission critical systems
- Estimation
- Task analysis
- Information theory