Paper Title

Approximate Nearest Neighbour Search on Privacy-aware Encoding of User Locations to Identify Susceptible Infections in Simulated Epidemics

Authors

Chandan Biswas, Debasis Ganguly, Ujjwal Bhattacharya

Abstract

Amid the growing number of infections during the Covid-19 pandemic, it is essential to trace, as early as possible, the susceptible people who may have been infected through close proximity to people who tested positive for the virus. Such early contact tracing is likely to limit the rate at which the infection spreads within a locality. In this paper, we investigate how effectively and efficiently such a list of susceptible people can be found, given a list of infected persons and their locations. To address this problem from an information retrieval (search) perspective, we represent the location of each person at each time instant as a point in a vector space. Using the locations of the given infected persons as queries, we investigate the feasibility of applying approximate nearest neighbour (ANN) based indexing and retrieval approaches to obtain a list of top-k suspected users in real time. Since leveraging true user location data can raise security and privacy concerns, we also investigate what effect distance-preserving encoding methods have on the effectiveness of the ANN methods. Experiments conducted on real and synthetic datasets demonstrate that the top-k lists of susceptible users retrieved with existing ANN approaches (KD-tree and HNSW) yield satisfactory precision and recall values, indicating that ANN approaches can potentially be applied in practice to facilitate real-time contact tracing even under imposed privacy constraints.
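The retrieval idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions: locations are plain 2-D points with the time dimension omitted, the KD-tree search is exact rather than approximate, and the distance-preserving encoding is a secret random rotation plus translation chosen only for illustration, since the abstract does not say which encoding the paper uses. All function names (`make_encoder`, `build_kdtree`, `knn`, `top_k_contacts`) are hypothetical.

```python
import heapq
import math
import random


def make_encoder(seed):
    """Illustrative encoding (an assumption, not the paper's method):
    a secret random rotation plus translation. Such a rigid motion
    preserves Euclidean distances while hiding raw coordinates."""
    rng = random.Random(seed)
    theta = rng.uniform(0.0, 2.0 * math.pi)
    tx, ty = rng.uniform(-1000.0, 1000.0), rng.uniform(-1000.0, 1000.0)
    c, s = math.cos(theta), math.sin(theta)

    def encode(p):
        x, y = p
        return (c * x - s * y + tx, s * x + c * y + ty)

    return encode


def build_kdtree(points, depth=0):
    """Build a KD-tree from a list of (vector, user_id) pairs."""
    if not points:
        return None
    axis = depth % len(points[0][0])
    points = sorted(points, key=lambda p: p[0][axis])
    mid = len(points) // 2
    return {
        "point": points[mid],
        "axis": axis,
        "left": build_kdtree(points[:mid], depth + 1),
        "right": build_kdtree(points[mid + 1:], depth + 1),
    }


def knn(tree, query, k):
    """Return the k nearest (distance, user_id) pairs, closest first."""
    heap = []  # max-heap on distance, stored as (-distance, user_id)

    def visit(node):
        if node is None:
            return
        vec, uid = node["point"]
        d = math.dist(vec, query)
        if len(heap) < k:
            heapq.heappush(heap, (-d, uid))
        elif d < -heap[0][0]:
            heapq.heapreplace(heap, (-d, uid))
        diff = query[node["axis"]] - vec[node["axis"]]
        near, far = ("left", "right") if diff < 0 else ("right", "left")
        visit(node[near])
        # Cross the splitting plane only if it could hide a closer point.
        if len(heap) < k or abs(diff) <= -heap[0][0]:
            visit(node[far])

    visit(tree)
    return sorted((-neg_d, uid) for neg_d, uid in heap)


def top_k_contacts(users, infected_location, k, encoder=None):
    """Rank users by proximity to one infected person's location."""
    encode = encoder or (lambda p: p)
    tree = build_kdtree([(encode(loc), uid) for uid, loc in users.items()])
    return [uid for _, uid in knn(tree, encode(infected_location), k)]


users = {"u1": (0.0, 0.0), "u2": (1.0, 0.0), "u3": (5.0, 5.0),
         "u4": (0.5, 0.2), "u5": (10.0, 0.0)}
print(top_k_contacts(users, (0.4, 0.1), k=3))
print(top_k_contacts(users, (0.4, 0.1), k=3, encoder=make_encoder(seed=7)))
```

Because the illustrative encoding preserves Euclidean distances up to floating-point rounding, the two calls should rank the same users. An approximate variant would trade some of that accuracy for speed, for example by capping the number of tree nodes visited or by using a graph index such as HNSW.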
