论文标题
在10年的网络安全用户研究中,统计报告的保真度
Fidelity of Statistical Reporting in 10 Years of Cyber Security User Studies
论文作者
论文摘要
在社会技术方面的研究通常依赖于用户研究和对研究关系的统计推断以提出案例。因此,他们使从业者和科学家都可以判断进行研究的有效性和可靠性。为了确定这种能力,我们研究了安全用户研究的报告保真度。基于2006--2016 10年中从选定场所进行的114美元的网络安全性用户研究的系统文献综述,我们使用\ textsf {r} package \ textsf {statcheck}评估了$ 1775 $统计推断的报告的保真度。我们对不完整的报告,报告不一致和决策错误进行了系统的分类,从而导致多项式逻辑回归(MLR)对出版物/年的影响以及与心理学兼容领域的比较。我们发现,一半的网络安全用户研究认为报告了不完整的结果,这与心理学领域的可比结果有明显的差异。随着时间的流逝,我们的分析结果MLR的测试可能性略有不完整,而汤的可能性比其他场所正确地报告了统计数据的可能性要高几%。在这项研究中,我们将对安全性研究的态度进行首次完全定量分析。尽管我们强调了不完整报告的影响和流行率,但我们还提供了有关如何应对情况的细粒度诊断和建议。
Studies in socio-technical aspects of security often rely on user studies and statistical inferences on investigated relations to make their case. They, thereby, enable practitioners and scientists alike to judge on the validity and reliability of the research undertaken. To ascertain this capacity, we investigated the reporting fidelity of security user studies. Based on a systematic literature review of $114$ user studies in cyber security from selected venues in the 10 years 2006--2016, we evaluated fidelity of the reporting of $1775$ statistical inferences using the \textsf{R} package \textsf{statcheck}. We conducted a systematic classification of incomplete reporting, reporting inconsistencies and decision errors, leading to multinomial logistic regression (MLR) on the impact of publication venue/year as well as a comparison to a compatible field of psychology. We found that half the cyber security user studies considered reported incomplete results, in stark difference to comparable results in a field of psychology. Our MLR on analysis outcomes yielded a slight increase of likelihood of incomplete tests over time, while SOUPS yielded a few percent greater likelihood to report statistics correctly than other venues. In this study, we offer the first fully quantitative analysis of the state-of-play of socio-technical studies in security. While we highlight the impact and prevalence of incomplete reporting, we also offer fine-grained diagnostics and recommendations on how to respond to the situation.