Sciweavers

1512 search results - page 171 / 303
» Qualitative reinforcement learning
Sort
View
NIPS
1996
13 years 10 months ago
Why did TD-Gammon Work?
Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...
Jordan B. Pollack, Alan D. Blair
GECCO
2008
Springer
144views Optimization» more  GECCO 2008»
13 years 10 months ago
Self-adaptive constructivism in Neural XCS and XCSF
For artificial entities to achieve high degrees of autonomy they will need to display appropriate adaptability. In this sense adaptability includes representational flexibility gu...
Gerard David Howard, Larry Bull, Pier Luca Lanzi
ICML
2010
IEEE
13 years 10 months ago
Finite-Sample Analysis of LSTD
In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...
Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...
ECIS
2003
13 years 10 months ago
Reflections on the use of grounded theory in interpretive information systems research
In Information Systems research there are a growing number of studies that must necessarily draw upon the contexts, experiences and narratives of practitioners. This calls for res...
Jim Hughes, Steven Jones
FORMATS
2007
Springer
14 years 3 months ago
On Timed Models of Gene Networks
Abstract. We present a systematic translation from timed models of genetic regulatory networks into products of timed automata to which one can apply verification tools in order l...
Grégory Batt, Ramzi Ben Salah, Oded Maler