Contributo in atti di convegno, 2021, ENG, 10.36253/978-88-5518-449-6
Bruno E., Giulivi S., Cappa C., Marini M., Ferro M.
ILC-CNR; SUPSI; IFC-CNR; DICI-UNIPI; ILC-CNR;
Digital tools based on automatic speech recognition (ASR) could be a useful support for teachers in assessing the reading skills of the students. We focus on the evaluation of the decoding accuracy of children with grade level ranging from the 3rd to the 6th performing a reading aloud task on a narrative text displayed on an ordinary tablet using the ReadLet platform. On the basis of previously collected data, we built a gold dataset with sentences characterised by the audio data, the original text to be read, and the text actually spoken by the child. By using the open-source Kaldi toolkit an ASR system based on the GMM-HMM model was trained on the training portion of the gold dataset. The accuracy of the ASR system was calculated as the ability to correctly decode the test audio data with respect to the annotated text, and the decoding accuracy of the children was estimated by measuring the gap between the results obtained with the annotated text and the original text. A consistent trend with increasing grade level was found in terms of word correctness, substitutions and insertions, while the trained model appears to be significantly able to evaluate the children decoding accuracy.
12th International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications (MAVEBA'21), pp. 145–148, Firenze (Italy), 14-16/12/2021
speech recognition, decoding accuracy, reading aloud, voice parameters, Kaldi, GMM-HMM acoustic model
Bruno Ester, Ferro Marcello, Cappa Claudia
IFC – Istituto di fisiologia clinica, ILC – Istituto di linguistica computazionale "Antonio Zampolli"
ID: 461393
Year: 2021
Type: Contributo in atti di convegno
Creation: 2021-12-21 13:07:20.000
Last update: 2023-02-01 13:00:33.000
CNR authors
External IDs
CNR OAI-PMH: oai:it.cnr:prodotti:461393
DOI: 10.36253/978-88-5518-449-6