Sciweavers

472 search results - page 56 / 95
» Linear programming with online learning
Sort
View
ML
2000
ACM
126views Machine Learning» more  ML 2000»
13 years 8 months ago
Learning to Play Chess Using Temporal Differences
In this paper we present TDLEAF( ), a variation on the TD( ) algorithm that enables it to be used in conjunction with game-tree search. We present some experiments in which our che...
Jonathan Baxter, Andrew Tridgell, Lex Weaver
COLT
2010
Springer
13 years 6 months ago
Convex Games in Banach Spaces
We study the regret of an online learner playing a multi-round game in a Banach space B against an adversary that plays a convex function at each round. We characterize the minima...
Karthik Sridharan, Ambuj Tewari
TNN
2010
159views Management» more  TNN 2010»
13 years 3 months ago
Multiple incremental decremental learning of support vector machines
We propose a multiple incremental decremental algorithm of Support Vector Machine (SVM). Conventional single incremental decremental SVM can update the trained model efficiently w...
Masayuki Karasuyama, Ichiro Takeuchi
ICANN
1997
Springer
14 years 29 days ago
On Learning Soccer Strategies
We use simulated soccer to study multiagent learning. Each team's players (agents) share action set and policy but may behave differently due to position-dependent inputs. All...
Rafal Salustowicz, Marco Wiering, Jürgen Schm...
EOR
2007
165views more  EOR 2007»
13 years 8 months ago
Adaptive credit scoring with kernel learning methods
Credit scoring is a method of modelling potential risk of credit applications. Traditionally, logistic regression, linear regression and discriminant analysis are the most popular...
Yingxu Yang