Abstract
Manually labelled datasets inherently contain errors or uncertain/imprecise labelling as sometimes experts cannot agree or are not sure. This issue is even more prominent in multi-label datasets as some labels may be missing. However, discarding samples with high uncertainty may lead to the loss of valuable data. In this paper, we study two datasets where the uncertainty is explicit in the expert annotations. We give an overview of the different approaches available to deal with uncertainty and evaluate them on two dysfluency datasets. Our results show that adopting methods that embrace uncertainty leads to better results than using only labels with high certainty and performs better than current state of the art results.
Original language | English |
---|---|
Title of host publication | Speech and Computer |
Subtitle of host publication | 24th International Conference, SPECOM 2022, Gurugram, India, November 14–16, 2022, Proceedings |
Editors | S.R. Mahadeva Prasanna, Alexey Karpov, K. Samudravijaya, Shyam S. Agrawal |
Publisher | Springer |
Pages | 302-311 |
Number of pages | 10 |
ISBN (Electronic) | 9783031209802 |
ISBN (Print) | 9783031209796 |
DOIs | |
Publication status | Published - 10 Nov 2022 |
Event | 24th International Conference on Speech and Computer, SPECOM 2022 - Gurugram, India Duration: 14 Nov 2022 → 16 Nov 2022 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 13721 LNAI |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 24th International Conference on Speech and Computer, SPECOM 2022 |
---|---|
Country/Territory | India |
City | Gurugram |
Period | 14/11/22 → 16/11/22 |
Bibliographical note
Funding Information:Acknowledgments. This research was undertaken, in part, thanks to funding from the Canada 150 Research Chairs Program.
Publisher Copyright:
© 2022, Springer Nature Switzerland AG.
Keywords
- Dysfluency classification
- Transfer learning
- Uncertainty
ASJC Scopus subject areas
- Theoretical Computer Science
- General Computer Science