论文标题
使用分销意识数据策展和增强的公平准确的年龄预测
Fair and accurate age prediction using distribution aware data curation and augmentation
论文作者
论文摘要
由于表现出不公平的行为,基于深度学习的面部识别系统已经增加了媒体的关注。因此,大型企业(例如IBM)关闭了他们的面部识别和年龄预测系统。年龄预测是一个特别困难的应用,公平性问题仍然存在一个开放的研究问题(例如,预测不同种族的年龄同样准确)。年龄预测方法中不公平行为的主要原因之一在于培训数据的分布和多样性。在这项工作中,我们提出了两种用于数据集策划和数据增强的新颖方法,以通过平衡的特征策展来提高公平性,并通过分配意识增强来增加多样性。为了实现这一目标,我们将分布外检测介绍给面部识别域,该域用于选择与深度神经网络(DNN)任务最相关的数据,而在年龄,种族和性别之间进行数据平衡。我们的方法显示出令人鼓舞的结果。我们受过良好训练的DNN模型在公平性方面优于所有学术和工业基线,最多超过4.92倍,也增强了DNN概括超过亚马逊AWS和Microsoft Azure公共云系统的能力,分别为31.88%和10.95%。
Deep learning-based facial recognition systems have experienced increased media attention due to exhibiting unfair behavior. Large enterprises, such as IBM, shut down their facial recognition and age prediction systems as a consequence. Age prediction is an especially difficult application with the issue of fairness remaining an open research problem (e.g., predicting age for different ethnicity equally accurate). One of the main causes of unfair behavior in age prediction methods lies in the distribution and diversity of the training data. In this work, we present two novel approaches for dataset curation and data augmentation in order to increase fairness through balanced feature curation and increase diversity through distribution aware augmentation. To achieve this, we introduce out-of-distribution detection to the facial recognition domain which is used to select the data most relevant to the deep neural network's (DNN) task when balancing the data among age, ethnicity, and gender. Our approach shows promising results. Our best-trained DNN model outperformed all academic and industrial baselines in terms of fairness by up to 4.92 times and also enhanced the DNN's ability to generalize outperforming Amazon AWS and Microsoft Azure public cloud systems by 31.88% and 10.95%, respectively.