Paper Title

TLT-school: a Corpus of Non Native Children Speech

Authors

Roberto Gretter, Marco Matassoni, Stefano Bannò, Daniele Falavigna

Abstract

This paper describes "TLT-school", a corpus of speech utterances collected in schools of northern Italy to assess the performance of students learning both English and German. The corpus was recorded in 2017 and 2018 from students aged between nine and sixteen, attending primary, middle and high school. All utterances have been scored by human experts in terms of some predefined proficiency indicators. In addition, most of the utterances recorded in 2017 have been carefully transcribed by hand. The guidelines and procedures used for the manual transcriptions are described in detail, together with the results achieved by an automatic speech recognition system that we developed. Part of the corpus will be freely distributed to the scientific community, in particular to researchers interested in non-native speech recognition and in the automatic assessment of second language proficiency.
