Text to Phoneme Alignment and Mapping for Speech Technology: a Neural Networks Approach

John Bullinaria

doi:10.1109/IJCNN.2011.6033279

Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

3 Citations (Scopus)

Abstract

A common problem in speech technology is the alignment of representations of text and phonemes, and the learning of a mapping between them that generalizes well to unseen inputs. The state-of-the-art technology appears to be symbolic rule-based systems, which is surprising given the number of neural network systems for text to phoneme mapping that have been developed over the years. This paper explores why that may be the case, and demonstrates that it is possible for neural networks to simultaneously perform text to phoneme alignment and mapping with performance levels at least comparable to the best existing systems.

Original language	English
Title of host publication	Neural Networks (IJCNN), The 2011 International Joint Conference on
Publisher	Institute of Electrical and Electronics Engineers (IEEE)
Pages	625-632
Number of pages	8
ISBN (Print)	978-1-4244-9635-8
DOIs	https://doi.org/10.1109/IJCNN.2011.6033279
Publication status	Published - 5 Jul 2011
Event	Proceedings of the International Joint Conference on Neural Networks (IJCNN 2011) - Duration: 5 Jul 2011 → …

Conference

Conference	Proceedings of the International Joint Conference on Neural Networks (IJCNN 2011)
Period	5/07/11 → …

Access to Document

10.1109/IJCNN.2011.6033279

Cite this

@inproceedings{77feb7baf8df4d33a3c4d416a7014fd9,
  title = "Text to Phoneme Alignment and Mapping for Speech Technology: a Neural Networks Approach",
  abstract = "A common problem in speech technology is the alignment of representations of text and phonemes, and the learning of a mapping between them that generalizes well to unseen inputs. The state-of-the-art technology appears to be symbolic rule-based systems, which is surprising given the number of neural network systems for text to phoneme mapping that have been developed over the years. This paper explores why that may be the case, and demonstrates that it is possible for neural networks to simultaneously perform text to phoneme alignment and mapping with performance levels at least comparable to the best existing systems.",
  author = "John Bullinaria",
  year = "2011",
  month = jul,
  day = "5",
  doi = "10.1109/IJCNN.2011.6033279",
  language = "English",
  isbn = "978-1-4244-9635-8",
  pages = "625--632",
  booktitle = "Neural Networks (IJCNN), The 2011 International Joint Conference on",
  publisher = "Institute of Electrical and Electronics Engineers (IEEE)",
  note = "Proceedings of the International Joint Conference on Neural Networks (IJCNN 2011) ; Conference date: 05-07-2011",
}

Bullinaria, J 2011, Text to Phoneme Alignment and Mapping for Speech Technology: a Neural Networks Approach. in Neural Networks (IJCNN), The 2011 International Joint Conference on. Institute of Electrical and Electronics Engineers (IEEE), pp. 625-632, Proceedings of the International Joint Conference on Neural Networks (IJCNN 2011), 5/07/11. https://doi.org/10.1109/IJCNN.2011.6033279

TY  - GEN
T1  - Text to Phoneme Alignment and Mapping for Speech Technology: a Neural Networks Approach
AU  - Bullinaria, John
PY  - 2011/7/5
Y1  - 2011/7/5
N2  - A common problem in speech technology is the alignment of representations of text and phonemes, and the learning of a mapping between them that generalizes well to unseen inputs. The state-of-the-art technology appears to be symbolic rule-based systems, which is surprising given the number of neural network systems for text to phoneme mapping that have been developed over the years. This paper explores why that may be the case, and demonstrates that it is possible for neural networks to simultaneously perform text to phoneme alignment and mapping with performance levels at least comparable to the best existing systems.
AB  - A common problem in speech technology is the alignment of representations of text and phonemes, and the learning of a mapping between them that generalizes well to unseen inputs. The state-of-the-art technology appears to be symbolic rule-based systems, which is surprising given the number of neural network systems for text to phoneme mapping that have been developed over the years. This paper explores why that may be the case, and demonstrates that it is possible for neural networks to simultaneously perform text to phoneme alignment and mapping with performance levels at least comparable to the best existing systems.
U2  - 10.1109/IJCNN.2011.6033279
DO  - 10.1109/IJCNN.2011.6033279
M3  - Conference contribution
SN  - 978-1-4244-9635-8
SP  - 625
EP  - 632
BT  - Neural Networks (IJCNN), The 2011 International Joint Conference on
PB  - Institute of Electrical and Electronics Engineers (IEEE)
T2  - Proceedings of the International Joint Conference on Neural Networks (IJCNN 2011)
Y2  - 5 July 2011
ER  -
