论文标题
Nela-GT-2022:一个大型多标签新闻数据集,用于研究新闻文章中的错误信息
NELA-GT-2022: A Large Multi-Labelled News Dataset for The Study of Misinformation in News Articles
论文作者
论文摘要
在本文中,我们介绍了Nela-GT数据集的第五期Nela-GT-2022。该数据集包含来自2022年1月1日至2022年12月31日之间361个媒体的1,778,361篇文章。就像在数据集的过去版本中一样,Nela-GT-2022包括媒体偏见/事实检查的出口级真实性Labels,来自媒体偏见/事实检查和嵌入了收集到的新闻艺术中的推文。 Nela-GT-2022数据集可在以下网址找到:https://doi.org/10.7910/dvn/amcv2h
In this paper, we present the fifth installment of the NELA-GT datasets, NELA-GT-2022. The dataset contains 1,778,361 articles from 361 outlets between January 1st, 2022 and December 31st, 2022. Just as in past releases of the dataset, NELA-GT-2022 includes outlet-level veracity labels from Media Bias/Fact Check and tweets embedded in collected news articles. The NELA-GT-2022 dataset can be found at: https://doi.org/10.7910/DVN/AMCV2H