论文标题

有限数据的医学图像分类中的两阶段联合转移学习框架:COVID-19案例研究

A Two-Stage Federated Transfer Learning Framework in Medical Images Classification on Limited Data: A COVID-19 Case Study

论文作者

Zhang, Alexandros Shikun, Li, Naomi Fengqi

论文摘要

Covid-19-大流行迅速蔓延,导致全球医疗资源短缺。 COVID-19诊断的效率已变得非常重要。由于深度学习和卷积神经网络(CNN)已被广泛使用并在分析医学图像中得到了验证,因此它已成为计算机辅助诊断的强大工具。但是,在深度学习和神经网络的帮助下,医学图像分类面临两个最重要的挑战,其中之一是难以获取足够的样本,这可能会导致模型过度拟合。隐私问题主要带来另一个挑战,因为与医疗相关的记录通常被视为患者的私人信息,并受到GDPR和HIPPA等法律的保护。联合学习可以确保模型培训在不同的设备上分散,并且它们之间没有共享数据,从而保证了隐私。但是,借助位于不同设备上的数据,每个设备的可访问数据可能受到限制。由于转移学习在处理有限的数据方面得到了验证,因此,在本文中,我们进行了一项试验,以使用CNN使用CNN实施联合学习和转移学习技术,以使用肺CT扫描对COVID-19进行分类。我们还探讨了数据集分布在客户端在联合学习中的影响,并培训了培训时期的培训时代的数量。最后,我们通过联合学习获得了非常高的表现,这证明了我们在利用准确性和隐私方面的成功。

COVID-19 pandemic has spread rapidly and caused a shortage of global medical resources. The efficiency of COVID-19 diagnosis has become highly significant. As deep learning and convolutional neural network (CNN) has been widely utilized and been verified in analyzing medical images, it has become a powerful tool for computer-assisted diagnosis. However, there are two most significant challenges in medical image classification with the help of deep learning and neural networks, one of them is the difficulty of acquiring enough samples, which may lead to model overfitting. Privacy concerns mainly bring the other challenge since medical-related records are often deemed patients' private information and protected by laws such as GDPR and HIPPA. Federated learning can ensure the model training is decentralized on different devices and no data is shared among them, which guarantees privacy. However, with data located on different devices, the accessible data of each device could be limited. Since transfer learning has been verified in dealing with limited data with good performance, therefore, in this paper, We made a trial to implement federated learning and transfer learning techniques using CNNs to classify COVID-19 using lung CT scans. We also explored the impact of dataset distribution at the client-side in federated learning and the number of training epochs a model is trained. Finally, we obtained very high performance with federated learning, demonstrating our success in leveraging accuracy and privacy.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源