Overview of the 2018 spoken CALL shared task

Claudia Baur; Andrew Caines; Cathy Chua; Johanna Gerlach; Mengjie Qian; Manny Rayner; Martin Russell; Helmer Strik; Xizi Wei

doi:10.21437/Interspeech.2018-97

Overview of the 2018 spoken CALL shared task

Claudia Baur, Andrew Caines, Cathy Chua, Johanna Gerlach, Mengjie Qian, Manny Rayner, Martin Russell, Helmer Strik, Xizi Wei

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

7 Citations (Scopus)

214 Downloads (Pure)

Abstract

We present an overview of the second edition of the Spoken CALL Shared Task. Groups competed on a prompt-response task using English-language data collected, through an online CALL game, from Swiss German teens in their second and third years of learning English. Each item consists of a written German prompt and an audio file containing a spoken response. The task is to accept linguistically correct responses and reject linguistically incorrect ones, with “linguistically correct” defined by a gold standard derived from human annotations. Scoring was performed using a metric defined as the ratio of the relative rejection rates on incorrect and correct responses. The second edition received eighteen entries and showed very substantial improvement on the first edition; all entries were better than the best entry from the first edition and the best score was about four times higher. We present the task, the resources, the results, a discussion of the metrics used and an analysis of what makes items challenging. In particular, we present quantitative evidence suggesting that incorrect responses are much more difficult to process than correct responses and that the most significant factor in making a response challenging is its distance from the closest training example.

Original language	English
Title of host publication	Proceedings of Interspeech 2018
Place of Publication	Hyderabad, India
Publisher	ISCA
Pages	2354-2358
Number of pages	5
DOIs	https://doi.org/10.21437/Interspeech.2018-97
Publication status	Published - 3 Sept 2018
Event	Interspeech 2018 - Hyderabad International Convention Centre, Hyderabad , India Duration: 2 Sept 2018 → 6 Sept 2018

Publication series

Name	Interspeech
Volume	2018
ISSN (Electronic)	1990-9772

Conference

Conference	Interspeech 2018
Country/Territory	India
City	Hyderabad
Period	2/09/18 → 6/09/18

Keywords

CALL
shared tasks
speech recognition
metrics

Access to Document

10.21437/Interspeech.2018-97Licence: None: All rights reserved

Claudia_Baur_et_al_Overview_of_the_2018_Spoken_CALL_Shared_Task_Proc_Interspeech_2018
Checked for eligibility: 13/09/2018 Baur, C., Caines, A., Chua, C., Gerlach, J., Qian, M., Rayner, M., Russell, M., Strik, H., Wei, X. (2018) Overview of the 2018 Spoken CALL Shared Task. Proc. Interspeech 2018, 2354-2358, DOI: 10.21437/Interspeech.2018-97.
Final published version, 223 KBLicence: Other (please specify with Rights Statement)

Cite this

@inproceedings{1b696e81058a466ba04d15cbe3d18690,

title = "Overview of the 2018 spoken CALL shared task",

abstract = "We present an overview of the second edition of the Spoken CALL Shared Task. Groups competed on a prompt-response task using English-language data collected, through an online CALL game, from Swiss German teens in their second and third years of learning English. Each item consists of a written German prompt and an audio file containing a spoken response. The task is to accept linguistically correct responses and reject linguistically incorrect ones, with “linguistically correct” defined by a gold standard derived from human annotations. Scoring was performed using a metric defined as the ratio of the relative rejection rates on incorrect and correct responses. The second edition received eighteen entries and showed very substantial improvement on the first edition; all entries were better than the best entry from the first edition and the best score was about four times higher. We present the task, the resources, the results, a discussion of the metrics used and an analysis of what makes items challenging. In particular, we present quantitative evidence suggesting that incorrect responses are much more difficult to process than correct responses and that the most significant factor in making a response challenging is its distance from the closest training example.",

keywords = "CALL, shared tasks, speech recognition, metrics",

author = "Claudia Baur and Andrew Caines and Cathy Chua and Johanna Gerlach and Mengjie Qian and Manny Rayner and Martin Russell and Helmer Strik and Xizi Wei",

year = "2018",

month = sep,

day = "3",

doi = "10.21437/Interspeech.2018-97",

language = "English",

series = "Interspeech",

publisher = "ISCA",

pages = "2354--2358",

booktitle = "Proceedings of Interspeech 2018",

note = "Interspeech 2018 ; Conference date: 02-09-2018 Through 06-09-2018",

}

TY - GEN

T1 - Overview of the 2018 spoken CALL shared task

AU - Baur, Claudia

AU - Caines, Andrew

AU - Chua, Cathy

AU - Gerlach, Johanna

AU - Qian, Mengjie

AU - Rayner, Manny

AU - Russell, Martin

AU - Strik, Helmer

AU - Wei, Xizi

PY - 2018/9/3

Y1 - 2018/9/3

N2 - We present an overview of the second edition of the Spoken CALL Shared Task. Groups competed on a prompt-response task using English-language data collected, through an online CALL game, from Swiss German teens in their second and third years of learning English. Each item consists of a written German prompt and an audio file containing a spoken response. The task is to accept linguistically correct responses and reject linguistically incorrect ones, with “linguistically correct” defined by a gold standard derived from human annotations. Scoring was performed using a metric defined as the ratio of the relative rejection rates on incorrect and correct responses. The second edition received eighteen entries and showed very substantial improvement on the first edition; all entries were better than the best entry from the first edition and the best score was about four times higher. We present the task, the resources, the results, a discussion of the metrics used and an analysis of what makes items challenging. In particular, we present quantitative evidence suggesting that incorrect responses are much more difficult to process than correct responses and that the most significant factor in making a response challenging is its distance from the closest training example.

AB - We present an overview of the second edition of the Spoken CALL Shared Task. Groups competed on a prompt-response task using English-language data collected, through an online CALL game, from Swiss German teens in their second and third years of learning English. Each item consists of a written German prompt and an audio file containing a spoken response. The task is to accept linguistically correct responses and reject linguistically incorrect ones, with “linguistically correct” defined by a gold standard derived from human annotations. Scoring was performed using a metric defined as the ratio of the relative rejection rates on incorrect and correct responses. The second edition received eighteen entries and showed very substantial improvement on the first edition; all entries were better than the best entry from the first edition and the best score was about four times higher. We present the task, the resources, the results, a discussion of the metrics used and an analysis of what makes items challenging. In particular, we present quantitative evidence suggesting that incorrect responses are much more difficult to process than correct responses and that the most significant factor in making a response challenging is its distance from the closest training example.

KW - CALL

KW - shared tasks

KW - speech recognition

KW - metrics

U2 - 10.21437/Interspeech.2018-97

DO - 10.21437/Interspeech.2018-97

M3 - Conference contribution

T3 - Interspeech

SP - 2354

EP - 2358

BT - Proceedings of Interspeech 2018

PB - ISCA

CY - Hyderabad, India

T2 - Interspeech 2018

Y2 - 2 September 2018 through 6 September 2018

ER -

Overview of the 2018 spoken CALL shared task

Abstract

Publication series

Conference

Keywords

Access to Document

Fingerprint

Cite this