Sciweavers

252 search results - page 29 / 51
» Learning Partially Observable Action Models: Efficient Algor...
Sort
View
166
Voted
PKDD
2010
Springer
179views Data Mining» more  PKDD 2010»
15 years 15 days ago
Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration
Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...
Tobias Jung, Peter Stone
128
Voted
DIS
2009
Springer
15 years 9 months ago
An Iterative Learning Algorithm for Within-Network Regression in the Transductive Setting
Within-network regression addresses the task of regression in partially labeled networked data where labels are sparse and continuous. Data for inference consist of entities associ...
Annalisa Appice, Michelangelo Ceci, Donato Malerba
101
Voted
ICPR
2008
IEEE
15 years 9 months ago
Optimal feature weighting for the discrete HMM
We propose a modified discrete HMM that includes a feature weighting discrimination component. We assume that the feature space is partitioned into subspaces and that the relevan...
Oualid Missaoui, Hichem Frigui
146
Voted
CSL
2010
Springer
15 years 2 months ago
Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems
This paper describes a statistically motivated framework for performing real-time dialogue state updates and policy learning in a spoken dialogue system. The framework is based on...
Blaise Thomson, Steve Young
116
Voted
AI
2006
Springer
15 years 6 months ago
Satisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games
So far, most equilibrium concepts in game theory require that the rewards and actions of the other agents are known and/or observed by all agents. However, in real life problems, a...
Stéphane Ross, Brahim Chaib-draa