论文标题
孟加拉语手写数字识别的Numtadb上的图像预处理
Image Pre-processing on NumtaDB for Bengali Handwritten Digit Recognition
论文作者
论文摘要
Numtadb是迄今为止最大的数据集集合,用于孟加拉语中的手写数字。这是一个包含85000多个图像的不同数据集。但是,这种多样性也使该数据集非常困难。本文的目的是找到预处理图像的基准,该图像可在任何机器学习模型上都具有良好的准确性。原因是,孟加拉人数字识别的可用数据可用,可以与MNIST的英语数字一起使用。
NumtaDB is by far the largest data-set collection for handwritten digits in Bengali. This is a diverse dataset containing more than 85000 images. But this diversity also makes this dataset very difficult to work with. The goal of this paper is to find the benchmark for pre-processed images which gives good accuracy on any machine learning models. The reason being, there are no available pre-processed data for Bengali digit recognition to work with like the English digits for MNIST.