Paper Title

A Slot Is Not Built in One Utterance: Spoken Language Dialogs with Sub-Slots

Authors

Sai Zhang, Yuwei Hu, Yuchuan Wu, Jiaman Wu, Yongbin Li, Jian Sun, Caixia Yuan, Xiaojie Wang

Abstract

A slot value might be provided segment by segment over multi-turn interactions in a dialog, especially for important information such as phone numbers and names. This is a common phenomenon in daily life, but it has received little attention in previous work. To fill the gap, this paper defines a new task named Sub-Slot based Task-Oriented Dialog (SSTOD) and builds a Chinese dialog dataset, SSD, to support research on SSTOD. The dataset includes a total of 40K dialogs and 500K utterances from four different domains: Chinese names, phone numbers, ID numbers and license plate numbers. The data is annotated with sub-slot values, slot values, dialog states and actions. We find some new linguistic phenomena and interaction patterns in SSTOD that raise critical challenges for building dialog agents for the task. We test three state-of-the-art dialog models on SSTOD and find that they cannot handle the task well in any of the four domains. We also investigate an improved model that incorporates slot knowledge in a plug-in manner. More work is needed to meet the new challenges raised by SSTOD, which is widespread in real-life applications. The dataset and code are publicly available via https://github.com/shunjiu/SSTOD.
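
To make the idea of a slot value arriving "segment by segment" concrete, here is a minimal Python sketch of how sub-slot values could be accumulated across turns and revised after a correction. It is only an illustration under stated assumptions: the `SubSlotTracker` class, its field names, the fixed `target_length` completion check, and the sample phone number are all hypothetical, and it is not the paper's model or the SSD annotation schema.

```python
from dataclasses import dataclass, field


@dataclass
class SubSlotTracker:
    """Toy tracker that assembles one slot value from user-provided segments.

    Hypothetical illustration only; not the model or schema from the paper.
    """

    slot_name: str
    target_length: int  # assumed: number of characters in the complete value
    segments: list[str] = field(default_factory=list)

    def update(self, segment: str) -> None:
        """Append a newly provided segment (one sub-slot value)."""
        self.segments.append(segment)

    def revise_last(self, segment: str) -> None:
        """Replace the most recent segment, e.g. after the user corrects it."""
        if not self.segments:
            raise ValueError("no segment to revise")
        self.segments[-1] = segment

    @property
    def value(self) -> str:
        """The slot value assembled so far."""
        return "".join(self.segments)

    @property
    def complete(self) -> bool:
        """True once the assembled value reaches the assumed target length."""
        return len(self.value) == self.target_length


# The user gives an 11-digit phone number over three turns,
# then corrects the final segment.
tracker = SubSlotTracker(slot_name="phone_number", target_length=11)
for turn_segment in ["138", "1234", "5687"]:
    tracker.update(turn_segment)
tracker.revise_last("5678")

print(tracker.value, tracker.complete)
```

A real SSTOD agent would also have to decide when a turn carries a new segment and when it carries a correction or a confirmation. This sketch leaves that decision to the caller.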
