We consider online learning in repeated decision problems, within the framework of a repeated game against an arbitrary opponent. For repeated matrix games, well known results esta...
Making students aware of their learning styles and presenting them with learning material that incorporates their individual learning styles has potential to make learning easier ...
Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...
This paper presents a task selection model for personalised educational instruction. The proposed model is based on the student expertise level and it takes into account performan...
The existing reinforcement learning approaches have been suffering from the curse of dimension problem when they are applied to multiagent dynamic environments. One of the typical...