Paper title
Towards Automated Assessment of Stuttering and Stuttering Therapy
Paper authors
Abstract
Stuttering is a complex speech disorder that can be identified by repetitions, prolongations of sounds, syllables, or words, and blocks while speaking. Severity is usually assessed by a speech therapist. Although automated assessment has been attempted, it is rarely used in therapy. Common methods for assessing stuttering severity include percent stuttered syllables (%SS), the average duration of the three longest stuttering symptoms during a speech task, and the recently introduced Speech Efficiency Score (SES). This paper introduces the Speech Control Index (SCI), a new method to evaluate the severity of stuttering. Unlike SES, it can also be used to assess therapy success for fluency shaping. We evaluate both SES and SCI on a new, comprehensively labeled dataset containing stuttered German speech of clients prior to, during, and after undergoing stuttering therapy. Phone alignments produced by an automatic speech recognition system are evaluated statistically with respect to their position relative to labeled stuttering events. The results indicate that phone length distributions differ depending on their position in and around labeled stuttering events.
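The two conventional severity measures named in the abstract can be illustrated with a minimal Python sketch. It is not taken from the paper: the function names, the input format (syllable counts and a list of symptom durations in seconds), and the choice to return 0.0 when no symptoms are present are all assumptions made for illustration. SES and SCI are not sketched because the abstract does not define them.

```python
from typing import Sequence


def percent_stuttered_syllables(stuttered: int, total: int) -> float:
    """Return %SS: stuttered syllables as a percentage of all syllables spoken.

    Hypothetical helper; the counts are assumed to come from manual labels.
    """
    if total <= 0:
        raise ValueError("total syllable count must be positive")
    if not 0 <= stuttered <= total:
        raise ValueError("stuttered count must lie between 0 and total")
    return 100.0 * stuttered / total


def mean_of_longest_symptoms(durations_s: Sequence[float], k: int = 3) -> float:
    """Return the mean duration (seconds) of the k longest stuttering symptoms.

    Hypothetical helper. If fewer than k symptoms were labeled, all of them
    are averaged; an empty list yields 0.0 (an assumption, not a standard).
    """
    if not durations_s:
        return 0.0
    longest = sorted(durations_s, reverse=True)[:k]
    return sum(longest) / len(longest)
```

For example, 12 stuttered syllables out of 200 spoken give `percent_stuttered_syllables(12, 200) == 6.0`, and `mean_of_longest_symptoms` averages only the three largest values in the list it receives.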