Abstract
Stuttering is a neuro-developmental disorder represented in 1% of the population. Dysfluency classification is still an open research question, with concerns of which feature representation or which classifier to use. Another issue, which has been neglected so far, is how to deal with audio samples that contain more than one type of dysfluency. Research has mostly preferred considering only single-labels problems, in part due to the lack of substantial multi-labels datasets. However, the FluencyBank and SEP-28K datasets are now available and contain multi-label data, which should pave the way for more research taking this aspect into account. In this paper, we give an overview of different ways to handle multi-label classification and compare them, while fine-tuning the ResNet50 network to perform multi-label dysfluency classification. We show that, fine-tuning the ResNet50, independently of the label representation, performs better than current state of the art results.
Original language | English |
---|---|
Title of host publication | Speech and Computer |
Subtitle of host publication | 24th International Conference, SPECOM 2022, Gurugram, India, November 14–16, 2022, Proceedings |
Editors | S.R. Mahadeva Prasanna, Alexey Karpov, K. Samudravijaya, Shyam S. Agrawal |
Publisher | Springer |
Pages | 290-301 |
Number of pages | 12 |
ISBN (Electronic) | 9783031209802 |
ISBN (Print) | 9783031209796 |
DOIs | |
Publication status | Published - 10 Nov 2022 |
Event | 24th International Conference on Speech and Computer, SPECOM 2022 - Gurugram, India Duration: 14 Nov 2022 → 16 Nov 2022 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 13721 LNAI |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 24th International Conference on Speech and Computer, SPECOM 2022 |
---|---|
Country/Territory | India |
City | Gurugram |
Period | 14/11/22 → 16/11/22 |
Bibliographical note
Funding Information:Acknowledgments. This research was undertaken, in part, thanks to funding from the Canada 150 Research Chairs Program.
Publisher Copyright:
© 2022, Springer Nature Switzerland AG.
Keywords
- Dysfluency classification
- Multi-label classification
- Transfer learning
ASJC Scopus subject areas
- Theoretical Computer Science
- General Computer Science