Sciweavers

» LEO - DB2's LEarning Optimizer
ECML
2004
Springer
15 years 11 months ago
Convergence and Divergence in Standard and Averaging Reinforcement Learning
Although tabular reinforcement learning (RL) methods have been proved to converge to an optimal policy, the combination of particular conventional reinforcement learning techniques...
Marco Wiering
PAKDD
2004
ACM
15 years 11 months ago
Spectral Energy Minimization for Semi-supervised Learning
The use of unlabeled data to aid classification is important as labeled data is often available in limited quantity. Instead of utilizing training samples directly into semi-super...
Chun Hung Li, Zhi-Li Wu
AGENTS
2001
Springer
15 years 10 months ago
Learning structured reactive navigation plans from executing MDP navigation policies
Autonomous robots, such as robot office couriers, need navigation routines that support flexible task execution and effective action planning. This paper describes XFRMLEARN, a s...
Michael Beetz, Thorsten Belker
AI
2001
Springer
15 years 10 months ago
Imitation and Reinforcement Learning in Agents with Heterogeneous Actions
Reinforcement learning techniques are increasingly being used to solve difficult problems in control and combinatorial optimization with promising results. Implicit imitation can acc...
Bob Price, Craig Boutilier
ICANN
2001
Springer
15 years 10 months ago
Learning and Prediction of the Nonlinear Dynamics of Biological Neurons with Support Vector Machines
Based on biological data we examine the ability of Support Vector Machines (SVMs) with Gaussian kernels to learn and predict the nonlinear dynamics of single biological neurons. We...
Thomas Frontzek, Thomas Navin Lal, Rolf Eckmiller