Does that sound right? A novel method of evaluating models of reading aloud:Rating nonword pronunciations

Abstract

Nonword pronunciation is a critical challenge for models of reading aloud but little attention has been given to identifying the best method for assessing model predictions. The most typical approach involves comparing the model’s pronunciations of nonwords to pronunciations of the same nonwords by human participants and deeming the model’s output correct if it matches with any transcription of the human pronunciations. The present paper introduces a new ratings-based method, in which participants are shown printed nonwords and asked to rate the plausibility of the provided pronunciations, generated here by a speech synthesiser. We demonstrate this method with reference to a previously published database of 915 disyllabic nonwords (Mousikou et al., 2017). We evaluated two well-known psychological models, RC00 and CDP++, as well as an additional grapheme-to-phoneme algorithm known as Sequitur, and compared our model assessment with the corpus-based method adopted by Mousikou et al. We find that the ratings method: a) is much easier to implement than a corpus-based method, b) has a high hit rate and low false-alarm rate in assessing nonword reading accuracy, and c) provided a similar outcome as the corpus-based method in its assessment of RC00 and CDP++. However, the two methods differed in their evaluation of Sequitur, which performed much better under the ratings method. Indeed, our evaluation of Sequitur revealed that the corpus-based method introduced a number of false positives and more often, false negatives. Implications of these findings are discussed.

Publication DOI: https://doi.org/10.3758/s13428-022-01794-8
Divisions: College of Health & Life Sciences > School of Psychology
College of Health & Life Sciences
Additional Information: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Uncontrolled Keywords: Computational reading models,Generalization,Pronunciation,Reading aloud,Experimental and Cognitive Psychology,Developmental and Educational Psychology,Arts and Humanities (miscellaneous),Psychology (miscellaneous),Psychology(all)
Publication ISSN: 1554-3528
Last Modified: 10 Apr 2024 07:20
Date Deposited: 06 Jun 2022 12:01
Full Text Link:
Related URLs: https://link.sp ... 428-022-01794-8 (Publisher URL)
http://www.scop ... tnerID=8YFLogxK (Scopus URL)
PURE Output Type: Article
Published Date: 2022-06-01
Published Online Date: 2022-06-01
Accepted Date: 2022-01-06
Authors: Gubian, Michele
Blything, Ryan (ORCID Profile 0000-0003-2285-7219)
Davis, Colin J.
Bowers, Jeffrey S.

Download

[img]

Version: Published Version

License: Creative Commons Attribution

| Preview

Export / Share Citation


Statistics

Additional statistics for this record