Search Sciweavers | Sciweavers

77 search results - page 11 / 16

» Value Function Approximation in Reinforcement Learning Using...

138

click to vote

CORR
2010
Springer

152views Education» more CORR 2010»

Neuroevolutionary optimization

15 years 3 months ago

Download jmlr.csail.mit.edu

Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...

Eva Volná

claim paper

Read More »

114

Voted

P2P
2006
IEEE

101views Communications» more P2P 2006»

Reinforcement Learning for Query-Oriented Routing Indices in Unstructured Peer-to-Peer Networks

15 years 9 months ago

Download www.cc.gatech.edu

The idea of building query-oriented routing indices has changed the way of improving routing efﬁciency from the basis as it can learn the content distribution during the query r...

Cong Shi, Shicong Meng, Yuanjie Liu, Dingyi Han, Y...

claim paper

Read More »

151

click to vote

JAIR
2002

163views more JAIR 2002»

Efficient Reinforcement Learning Using Recursive Least-Squares Methods

15 years 2 months ago

Download www.jair.org

The recursive least-squares (RLS) algorithm is one of the most well-known algorithms used in adaptive filtering, system identification and adaptive control. Its popularity is main...

Xin Xu, Hangen He, Dewen Hu

claim paper

Read More »

119

click to vote

ESANN
2003

152views Neural Networks» more ESANN 2003»

Improving iterative repair strategies for scheduling with the SVM

15 years 4 months ago

Download www2.in.tu-clausthal.de

The resource constraint project scheduling problem (RCPSP) is an NP-hard benchmark problem in scheduling which takes into account the limitation of resources’ availabilities in ...

Kai Gersmann, Barbara Hammer

claim paper

Read More »

180

click to vote

AI
1998
Springer

177views Artificial Intelligence» more AI 1998»

Model-Based Average Reward Reinforcement Learning

15 years 2 months ago

Download web.engr.oregonstate.edu

Reinforcement Learning (RL) is the study of programs that improve their performance by receiving rewards and punishments from the environment. Most RL methods optimize the discoun...

Prasad Tadepalli, DoKyeong Ok

claim paper

Read More »

« Prev « First page 11 / 16 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers