Sciweavers

13784 search results for "On Computing Functions with Uncertainty" (page 138 of 2757)
ICML
2007
IEEE
14 years 10 months ago
Tracking value function dynamics to improve reinforcement learning with piecewise linear function approximation
Reinforcement learning algorithms can become unstable when combined with linear function approximation. Algorithms that minimize the mean-square Bellman error are guaranteed to co...
Chee Wee Phua, Robert Fitch
AIPS
2006
13 years 10 months ago
Solving Factored MDPs with Exponential-Family Transition Models
Markov decision processes (MDPs) with discrete and continuous state and action components can be solved efficiently by hybrid approximate linear programming (HALP). The main idea ...
Branislav Kveton, Milos Hauskrecht
JAIR
2010
13 years 7 months ago
Kalman Temporal Differences
This paper deals with value (and Q-) function approximation in deterministic Markovian decision processes (MDPs). A general statistical framework based on the Kalman filtering pa...
Matthieu Geist, Olivier Pietquin
ICDE
2002
IEEE
14 years 10 months ago
GADT: A Probability Space ADT for Representing and Querying the Physical World
Large sensor networks are being widely deployed for measurement, detection, and monitoring applications. Many of these applications involve database systems to store and process d...
Anton Faradjian, Johannes Gehrke, Philippe Bonnet
IJCNN
2006
IEEE
14 years 3 months ago
Anti-swing control for overhead crane with neural compensation
This paper considers the problem of PD control of overhead crane in the presence of uncertainty associated with crane dynamics. By using radial basis function neural networks, ...
Rigoberto Toxqui Toxqui, Wen Yu, Xiaoou Li