论文标题
2021年的大数据尺寸调查
Survey of Big Data sizes in 2021
论文作者
论文摘要
数据生产的现代增加是由多种因素驱动的,来自各个部门的几个利益相关者为此做出了贡献。尽管由于缺乏官方数据而难以比较不同的大数据参与者的大小,但本报告试图通过挖掘几个在线资源来重建一些最重要的组织产生的年度数量级。该估计是基于为每个考虑的大数据源检索有意义的统一数据生产指标,然后通过猜想合理的每个单位尺寸来获得年度数量。最终结果以气泡图的形式汇总。
The modern increase in data production is driven by multiple factors, and several stakeholders from various sectors contribute to it. Although drawing a comparison of the sizes at stake for different big data players is hard due to the lack of official data, this report tries to reconstruct the yearly orders of magnitude generated by some of the most important organizations by mining several online sources. The estimation is based on retrieving meaningful unitary data production measures for each of the big data sources considered, and the yearly amounts are then obtained by conjecturing reasonable per-unit sizes. The final result is summarized in the form of a bubble plot.