论文标题
学习产品排名对假用户
Learning Product Rankings Robust to Fake Users
论文作者
论文摘要
在许多在线平台中,客户的决策受到产品排名的很大影响,因为大多数客户仅检查一些排名最高的产品。同时,此类平台还使用与客户操作相对应的相同数据,以了解必须如何对这些产品进行排名或订购。但是,在基础学习过程中的这些相互作用可能会激励卖方通过雇用假用户来人为地膨胀其位置,这是点击农场的出现。在这种欺诈行为的驱动下,我们研究了一个平台的排名问题,该平台面临着彼此无法区分的真实和虚假用户的混合。我们首先表明,在没有伪造用户的情况下,现有的学习算法是最佳的 - 可能会收敛于假用户操纵下的高度次级优势排名。为了克服这种缺陷,我们在两个信息环境下开发了有效的学习算法:在第一个设置中,该平台意识到了伪造用户的数量,在第二个设置中,对于伪造用户的数量,它不可知。对于这两种环境,我们证明我们的算法会融合到最佳排名,同时对上述欺诈行为有力。我们还为我们的方法提供了最差的性能保证,并表明它们的表现明显优于现有算法。在高水平上,我们的工作采用了几种新颖的方法来保证鲁棒性,例如:(i)构建产品顺序的图表,以从客户的行动中推断出的产品之间编码成对的关系; (ii)在层次之间以明智的双向交叉学习来实施多个级别的学习。
In many online platforms, customers' decisions are substantially influenced by product rankings as most customers only examine a few top-ranked products. Concurrently, such platforms also use the same data corresponding to customers' actions to learn how these products must be ranked or ordered. These interactions in the underlying learning process, however, may incentivize sellers to artificially inflate their position by employing fake users, as exemplified by the emergence of click farms. Motivated by such fraudulent behavior, we study the ranking problem of a platform that faces a mixture of real and fake users who are indistinguishable from one another. We first show that existing learning algorithms---that are optimal in the absence of fake users---may converge to highly sub-optimal rankings under manipulation by fake users. To overcome this deficiency, we develop efficient learning algorithms under two informational environments: in the first setting, the platform is aware of the number of fake users, and in the second setting, it is agnostic to the number of fake users. For both these environments, we prove that our algorithms converge to the optimal ranking, while being robust to the aforementioned fraudulent behavior; we also present worst-case performance guarantees for our methods, and show that they significantly outperform existing algorithms. At a high level, our work employs several novel approaches to guarantee robustness such as: (i) constructing product-ordering graphs that encode the pairwise relationships between products inferred from the customers' actions; and (ii) implementing multiple levels of learning with a judicious amount of bi-directional cross-learning between levels.