Search Sciweavers | Sciweavers

1512 search results - page 171 / 303

» Qualitative reinforcement learning

151

click to vote

NIPS
1996

134views Information Technology» more NIPS 1996»

Why did TD-Gammon Work?

15 years 7 months ago

Download www.cse.unsw.edu.au

Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...

Jordan B. Pollack, Alan D. Blair

claim paper

Read More »

207

click to vote

GECCO
2008
Springer

144views Optimization» more GECCO 2008»

Self-adaptive constructivism in Neural XCS and XCSF

15 years 7 months ago

Download www.cems.uwe.ac.uk

For artificial entities to achieve high degrees of autonomy they will need to display appropriate adaptability. In this sense adaptability includes representational flexibility gu...

Gerard David Howard, Larry Bull, Pier Luca Lanzi

claim paper

Read More »

148

click to vote

ICML
2010
IEEE

167views Machine Learning» more ICML 2010»

Finite-Sample Analysis of LSTD

15 years 7 months ago

Download hal.inria.fr

In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...

Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...

claim paper

Read More »

144

click to vote

ECIS
2003

125views Information Technology» more ECIS 2003»

Reflections on the use of grounded theory in interpretive information systems research

15 years 7 months ago

Download is2.lse.ac.uk

In Information Systems research there are a growing number of studies that must necessarily draw upon the contexts, experiences and narratives of practitioners. This calls for res...

Jim Hughes, Steven Jones

claim paper

Read More »

139

click to vote

FORMATS
2007
Springer

89views Formal Methods» more FORMATS 2007»

On Timed Models of Gene Networks

16 years 1 days ago

Download www-verimag.imag.fr

Abstract. We present a systematic translation from timed models of genetic regulatory networks into products of timed automata to which one can apply veriﬁcation tools in order l...

Grégory Batt, Ramzi Ben Salah, Oded Maler

claim paper

Read More »

« Prev « First page 171 / 303 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers