Sciweavers

699 search results - page 17 / 140
» Online Dynamic Value System for Machine Learning
Sort
View
COLT
2006
Springer
13 years 11 months ago
Online Learning with Variable Stage Duration
We consider online learning in repeated decision problems, within the framework of a repeated game against an arbitrary opponent. For repeated matrix games, well known results esta...
Shie Mannor, Nahum Shimkin
ICALT
2008
IEEE
14 years 1 months ago
Identifying Learning Styles in Learning Management Systems by Using Indications from Students' Behaviour
Making students aware of their learning styles and presenting them with learning material that incorporates their individual learning styles has potential to make learning easier ...
Sabine Graf, Kinshuk, Tzu-Chien Liu
COLT
2010
Springer
13 years 5 months ago
Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback
Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...
Alekh Agarwal, Ofer Dekel, Lin Xiao
ICALT
2008
IEEE
14 years 1 months ago
Expertise Measure for Dynamic Task Selection within Intelligent Educational Systems
This paper presents a task selection model for personalised educational instruction. The proposed model is based on the student expertise level and it takes into account performan...
François Courtemanche, Mehdi Najjar, Andr&e...
ROBOCUP
2007
Springer
167views Robotics» more  ROBOCUP 2007»
14 years 1 months ago
Cooperative/Competitive Behavior Acquisition Based on State Value Estimation of Others
The existing reinforcement learning approaches have been suffering from the curse of dimension problem when they are applied to multiagent dynamic environments. One of the typical...
Kentarou Noma, Yasutake Takahashi, Minoru Asada