论文标题
星:零射中的汉字识别,并带有中风和激进水平分解
STAR: Zero-Shot Chinese Character Recognition with Stroke- and Radical-Level Decompositions
论文作者
论文摘要
近年来,零击中的汉字认可引起了人们的关注。该问题的现有方法主要基于某些基于低级中风的分解或中级自由基分解。考虑到中风和自由基水平的分解可以提供不同级别的信息,我们提出了一种有效的零击中汉字识别方法。提出的方法包括一个训练阶段和推理阶段。在训练阶段,我们采用了两个类似的编码器模型来产生中风和自由基编码的估计,然后将它们与真实的编码一起用于正式化相关的中风和训练的自由基损失。引入了相似性损失,以使中风和自由基编码器正式化,以产生具有高相关性的相同字符的特征。在推理阶段,引入了两个关键模块,即中风筛选模块(SSM)和特征匹配模块(FMM),以分别解决确定性和令人困惑的案例。特别是,我们在FMM中引入了一个有效的中风整流方案,以扩大候选人的最终推理字符集。在三个基准数据集上进行了许多实验,这些数据集涵盖手写,印刷艺术和街道视图方案,以证明该方法的有效性。数值结果表明,所提出的方法在字符和激进的零击设置中都优于最先进的方法,并在传统的可见角色设置中保持竞争性能。
Zero-shot Chinese character recognition has attracted rising attention in recent years. Existing methods for this problem are mainly based on either certain low-level stroke-based decomposition or medium-level radical-based decomposition. Considering that the stroke- and radical-level decompositions can provide different levels of information, we propose an effective zero-shot Chinese character recognition method by combining them. The proposed method consists of a training stage and an inference stage. In the training stage, we adopt two similar encoder-decoder models to yield the estimates of stroke and radical encodings, which together with the true encodings are then used to formalize the associated stroke and radical losses for training. A similarity loss is introduced to regularize stroke and radical encoders to yield features of the same characters with high correlation. In the inference stage, two key modules, i.e., the stroke screening module (SSM) and feature matching module (FMM) are introduced to tackle the deterministic and confusing cases respectively. In particular, we introduce an effective stroke rectification scheme in FMM to enlarge the candidate set of characters for final inference. Numerous experiments over three benchmark datasets covering the handwritten, printed artistic and street view scenarios are conducted to demonstrate the effectiveness of the proposed method. Numerical results show that the proposed method outperforms the state-of-the-art methods in both character and radical zero-shot settings, and maintains competitive performance in the traditional seen character setting.