Reinforcement Learning in Education: A Multi-Armed Bandit Approach

Paper title

Reinforcement Learning in Education: A Multi-Armed Bandit Approach

Authors

Herkulaas Combrink, Vukosi Marivate, Benjamin Rosman

Abstract

Advances in reinforcement learning research have demonstrated how different agent-based models can learn to perform a task optimally within a given environment. Reinforcement learning solves unsupervised problems in which agents move through a state-action-reward loop to maximize the agent's overall reward, which in turn optimizes the solution to a specific problem in a given environment. However, these algorithms are designed based on our understanding of the actions that should be taken in a real-world environment to solve a specific problem. One such problem is the ability to identify, recommend and execute an action within a system where the users are the subject, such as in education. In recent years, the use of blended learning approaches, which integrate face-to-face learning with online learning, has increased in the education context. Additionally, online platforms used for education require the automation of certain functions, such as the identification, recommendation or execution of actions that can benefit the user, in this case the student or learner. As promising as these scientific advances are, research is still needed in a variety of areas to ensure the successful deployment of these agents within education systems. Therefore, the aim of this study was to contextualise and simulate the cumulative reward within an environment for an intervention recommendation problem in the education context.
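
To make the idea of simulating cumulative reward concrete, the following is a minimal sketch of a multi-armed bandit simulation. It is not the authors' implementation: the abstract does not name an algorithm, so the epsilon-greedy rule, the Bernoulli reward model, the function name simulate_bandit, and the example success probabilities are all assumptions chosen for illustration. Each arm stands for a hypothetical intervention, and a reward of 1 stands for that intervention helping the learner.

```python
import random


def simulate_bandit(success_probs, steps=1000, epsilon=0.1, seed=0):
    """Run an epsilon-greedy agent on a Bernoulli bandit (illustrative only).

    success_probs: hypothetical probability that each intervention (arm) helps.
    Returns the cumulative reward after every step and the pull count per arm.
    """
    rng = random.Random(seed)
    n_arms = len(success_probs)
    counts = [0] * n_arms      # how often each arm was recommended
    values = [0.0] * n_arms    # running mean reward estimate per arm
    cumulative = []
    total = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimate (ties go to the lowest index).
            arm = max(range(n_arms), key=lambda a: values[a])

        # Bernoulli reward: 1 if the intervention helped, otherwise 0.
        reward = 1.0 if rng.random() < success_probs[arm] else 0.0

        # Incremental update of the mean reward estimate for the chosen arm.
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]

        total += reward
        cumulative.append(total)

    return cumulative, counts


if __name__ == "__main__":
    # Three hypothetical interventions with assumed success rates.
    curve, pulls = simulate_bandit([0.2, 0.5, 0.7], steps=2000)
    print("final cumulative reward:", curve[-1])
    print("pulls per intervention:", pulls)
```

Plotting the returned curve against the step index gives the kind of cumulative-reward trace the abstract refers to; other action-selection rules could be substituted for the epsilon-greedy step without changing the rest of the loop.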
