Multi-label Dysfluency Classification

Melanie Jouaiti*, Kerstin Dautenhahn

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Stuttering is a neuro-developmental disorder represented in 1% of the population. Dysfluency classification is still an open research question, with concerns of which feature representation or which classifier to use. Another issue, which has been neglected so far, is how to deal with audio samples that contain more than one type of dysfluency. Research has mostly preferred considering only single-labels problems, in part due to the lack of substantial multi-labels datasets. However, the FluencyBank and SEP-28K datasets are now available and contain multi-label data, which should pave the way for more research taking this aspect into account. In this paper, we give an overview of different ways to handle multi-label classification and compare them, while fine-tuning the ResNet50 network to perform multi-label dysfluency classification. We show that, fine-tuning the ResNet50, independently of the label representation, performs better than current state of the art results.

Original languageEnglish
Title of host publicationSpeech and Computer
Subtitle of host publication24th International Conference, SPECOM 2022, Gurugram, India, November 14–16, 2022, Proceedings
EditorsS.R. Mahadeva Prasanna, Alexey Karpov, K. Samudravijaya, Shyam S. Agrawal
PublisherSpringer
Pages290-301
Number of pages12
ISBN (Electronic)9783031209802
ISBN (Print)9783031209796
DOIs
Publication statusPublished - 10 Nov 2022
Event24th International Conference on Speech and Computer, SPECOM 2022 - Gurugram, India
Duration: 14 Nov 202216 Nov 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13721 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference24th International Conference on Speech and Computer, SPECOM 2022
Country/TerritoryIndia
CityGurugram
Period14/11/2216/11/22

Bibliographical note

Funding Information:
Acknowledgments. This research was undertaken, in part, thanks to funding from the Canada 150 Research Chairs Program.

Publisher Copyright:
© 2022, Springer Nature Switzerland AG.

Keywords

  • Dysfluency classification
  • Multi-label classification
  • Transfer learning

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Multi-label Dysfluency Classification'. Together they form a unique fingerprint.

Cite this