Search Sciweavers | Sciweavers

377 search results - page 10 / 76

» Optimizing Production Manufacturing Using Reinforcement Lear...

189

click to vote

ICML
1999
IEEE

138views Machine Learning» more ICML 1999»

Using Reinforcement Learning to Spider the Web Efficiently

16 years 7 months ago

Download www.cs.iastate.edu

Consider the task of exploring the Web in order to find pages of a particular kind or on a particular topic. This task arises in the construction of search engines and Web knowled...

Jason Rennie, Andrew McCallum

claim paper

Read More »

180

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 5 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

199

click to vote

AAAI
2006

127views Intelligent Agents» more AAAI 2006»

Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance

15 years 8 months ago

Download robotic.media.mit.edu

As robots become a mass consumer product, they will need to learn new skills by interacting with typical human users. Past approaches have adapted reinforcement learning (RL) to a...

Andrea Lockerd Thomaz, Cynthia Breazeal

claim paper

Read More »

183

click to vote

ICML
2002
IEEE

146views Machine Learning» more ICML 2002»

Hierarchically Optimal Average Reward Reinforcement Learning

16 years 7 months ago

Download www.cs.ualberta.ca

Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

177

click to vote

GECCO
2009
Springer

135views Optimization» more GECCO 2009»

Neuroevolutionary reinforcement learning for generalized helicopter control

16 years 1 months ago

Download www.science.uva.nl

Helicopter hovering is an important challenge problem in the ﬁeld of reinforcement learning. This paper considers several neuroevolutionary approaches to discovering robust cont...

Rogier Koppejan, Shimon Whiteson

claim paper

Read More »

« Prev « First page 10 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers