Search Sciweavers | Sciweavers

1233 search results - page 176 / 247

» Feudal Reinforcement Learning

151

click to vote

ESANN
2003

152views Neural Networks» more ESANN 2003»

Improving iterative repair strategies for scheduling with the SVM

15 years 7 months ago

Download www2.in.tu-clausthal.de

The resource constraint project scheduling problem (RCPSP) is an NP-hard benchmark problem in scheduling which takes into account the limitation of resources’ availabilities in ...

Kai Gersmann, Barbara Hammer

claim paper

Read More »

185

Voted

UAI
2008

236views Artificial Intelligence» more UAI 2008»

CORL: A Continuous-state Offset-dynamics Reinforcement Learner

15 years 7 months ago

Download uai2008.cs.helsinki.fi

Continuous state spaces and stochastic, switching dynamics characterize a number of rich, realworld domains, such as robot navigation across varying terrain. We describe a reinfor...

Emma Brunskill, Bethany R. Leffler, Lihong Li, Mic...

claim paper

Read More »

170

click to vote

JCP
2008

139views more JCP 2008»

Agent Learning in Relational Domains based on Logical MDPs with Negation

15 years 5 months ago

Download www.academypublisher.com

In this paper, we propose a model named Logical Markov Decision Processes with Negation for Relational Reinforcement Learning for applying Reinforcement Learning algorithms on the ...

Song Zhiwei, Chen Xiaoping, Cong Shuang

claim paper

Read More »

169

click to vote

ICML
2008
IEEE

117views Machine Learning» more ICML 2008»

Sample-based learning and search with permanent and transient memories

16 years 6 months ago

Download www.cs.ualberta.ca

We present a reinforcement learning architecture, Dyna-2, that encompasses both samplebased learning and sample-based search, and that generalises across states during both learni...

David Silver, Martin Müller 0003, Richard S. ...

claim paper

Read More »

166

Voted

AAAI
2007

122views Intelligent Agents» more AAAI 2007»

RETALIATE: Learning Winning Policies in First-Person Shooter Games

15 years 8 months ago

Download www.cse.lehigh.edu

In this paper we present RETALIATE, an online reinforcement learning algorithm for developing winning policies in team firstperson shooter games. RETALIATE has three crucial chara...

Megan Smith, Stephen Lee-Urban, Hector Muño...

claim paper

Read More »

« Prev « First page 176 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers