论文标题
测试和评估安全分析方法的方法
Methodology for Testing and Evaluation of Safety Analytics Approaches
论文作者
论文摘要
近年来,数据驱动的安全分析方法的发展显着增加。鉴于这些进步,必须以原则性的方式评估这种方法来确定其优点和局限性。为此,我们提出了一种以模拟环境为基础的评估方法,该方法允许对安全分析方法进行全面评估。毫无疑问,使用历史领域数据评估这些方法很重要,但这种评估具有有限的统计能力,因为它仅对应于一种实现。提出的方法可以对大量实现进行验证,从而规避评估对历史数据的统计局限性。此外,通过使用模拟环境,人们可以清楚地区分观察到的数据的可变性和安全方法之间的性能差异。模拟环境通过比较在受控情况下的方法来实现这一目标,从而对潜在的长期收益进行公平而系统的评估。我们通过比较一些候选安全分析方法的案例研究证明了拟议方法论的实用性。这些方法在吸收现场安全数据以评估安全风险并提出缓解行动方面有所不同。我们表明,所提出的方法确实揭示了有用的见解,并量化了不同方法的相对优点和缺点,否则这些方法在现实世界中很难客观地确定。
There has been a significant increase in the development of data-driven safety analytics approaches in recent years. In light of these advances it has become imperative to evaluate such approaches in a principled way to determine their merits and limitations. To that end, we propose an evaluation methodology underpinned by a simulated environment that allows for a comprehensive assessment of safety analytics approaches. While assessing those approaches with historical field data is undoubtedly important, such an assessment has limited statistical power because it corresponds to only one realization. The proposed methodology enables validation over a large number of realizations, thereby circumventing the statistical limitations of evaluation on historical data. Moreover, by using a simulated environment one is able to clearly distinguish between the variability in the observed data and differences in performance between safety approaches. A simulated environment does this by comparing the approaches under controlled circumstances, resulting in a fair and systematic evaluation of the potential long-term benefits. We demonstrate the utility of the proposed methodology via a case study that compares a few candidate safety analytics approaches. These approaches differ in how they assimilate field safety data to assess safety risk and suggest mitigative actions. We show that the proposed methodology indeed reveals useful insights and quantifies the relative merits and drawbacks of the different approaches, which would be otherwise difficult to objectively determine in a real-world scenario.