Sciweavers

1235 search results - page 156 / 247
» Reinforcement learning in a nutshell
Sort
View
GECCO
2009
Springer
150views Optimization» more  GECCO 2009»
14 years 4 months ago
Discrete dynamical genetic programming in XCS
A number of representation schemes have been presented for use within Learning Classifier Systems, ranging from binary encodings to neural networks. This paper presents results fr...
Richard Preen, Larry Bull
TSMC
2002
136views more  TSMC 2002»
13 years 9 months ago
Expertness based cooperative Q-learning
By using other agents' experiences and knowledge, a learning agent may learn faster, make fewer mistakes, and create some rules for unseen situations. These benefits would be ...
Majid Nili Ahmadabadi, Masoud Asadpour
NORDICHI
2004
ACM
14 years 3 months ago
Adaptivity in speech-based multilingual e-mail client
In speech interfaces users must be aware what can be done with the system – in other words, the system must provide information to help the users to know what to say. We have ad...
Esa-Pekka Salonen, Mikko Hartikainen, Markku Turun...
IROS
2006
IEEE
187views Robotics» more  IROS 2006»
14 years 4 months ago
Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic
— Recently, many researchers on humanoid robotics are interested in Quasi-Passive-Dynamic Walking (Quasi-PDW) which is similar to human walking. It is desirable that control para...
Tsuyoshi Ueno, Yutaka Nakamura, Takashi Takuma, To...
PKDD
2009
Springer
184views Data Mining» more  PKDD 2009»
14 years 2 months ago
Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm
Abstract. This paper focuses on Active Learning with a limited number of queries; in application domains such as Numerical Engineering, the size of the training set might be limite...
Philippe Rolet, Michèle Sebag, Olivier Teyt...