Sciweavers

495 search results - page 86 / 99
» Constructing States for Reinforcement Learning
Sort
View
CORR
2010
Springer
187views Education» more  CORR 2010»
15 years 4 months ago
Learning in A Changing World: Non-Bayesian Restless Multi-Armed Bandit
We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics. In this problem, at each time, a player chooses K out of N (N > K) arms to play. The state of ...
Haoyang Liu, Keqin Liu, Qing Zhao
IJCAI
2003
15 years 5 months ago
Use of Off-line Dynamic Programming for Efficient Image Interpretation
An interpretation system finds the likely mappings from portions of an image to real-world objects. An interpretation policy specifies when to apply which imaging operator, to whi...
Ramana Isukapalli, Russell Greiner
RSS
2007
176views Robotics» more  RSS 2007»
15 years 5 months ago
Active Policy Learning for Robot Planning and Exploration under Uncertainty
Abstract— This paper proposes a simulation-based active policy learning algorithm for finite-horizon, partially-observed sequential decision processes. The algorithm is tested i...
Ruben Martinez-Cantin, Nando de Freitas, Arnaud Do...
135
Voted
LACL
2001
Springer
15 years 8 months ago
Structural Equations in Language Learning
In categorial systems with a fixed structural component, the learning problem comes down to finding the solution for a set of typeassignment equations. A hard-wired structural co...
Michael Moortgat
160
Voted
IROS
2009
IEEE
142views Robotics» more  IROS 2009»
15 years 10 months ago
Phoneme acquisition model based on vowel imitation using Recurrent Neural Network
- A phoneme-acquisition system was developed using a computational model that explains the developmental process of human infants in the early period of acquiring language. There a...
Hisashi Kanda, Tetsuya Ogata, Toru Takahashi, Kazu...