Search Sciweavers | Sciweavers

187

IROS
2009
IEEE

206views Robotics» more IROS 2009»

Bayesian reinforcement learning in continuous POMDPs with gaussian processes

16 years 25 days ago

— Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle realworld sequential decision processes but require a known model to be solv...

Patrick Dallaire, Camille Besse, Stéphane R...

claim paper

Read More »

142

click to vote

AAMAS
2005
Springer

133views Intelligent Agents» more AAMAS 2005»

Advice-Exchange Between Evolutionary Algorithms and Reinforcement Learning Agents: Experiments in the Pursuit Domain

15 years 11 months ago

Download iscte.pt

This research aims at studying the effects of exchanging information during the learning process in Multiagent Systems. The concept of advice-exchange, introduced in (Nunes and Ol...

Luís Nunes, Eugénio C. Oliveira

claim paper

Read More »

193

click to vote

ATAL
2007
Springer

147views Intelligent Agents» more ATAL 2007»

A reinforcement learning based distributed search algorithm for hierarchical peer-to-peer information retrieval systems

15 years 10 months ago

Download www.haizhengzhang.com

The dominant existing routing strategies employed in peerto-peer(P2P) based information retrieval(IR) systems are similarity-based approaches. In these approaches, agents depend o...

Haizheng Zhang, Victor R. Lesser

claim paper

Read More »

156

click to vote

GECCO
2005
Springer

111views Optimization» more GECCO 2005»

XCS with eligibility traces

15 years 11 months ago

Download www.bcs.rochester.edu

The development of the XCS Learning Classiﬁer System has produced a robust and stable implementation that performs competitively in direct-reward environments. Although investig...

Jan Drugowitsch, Alwyn Barry

claim paper

Read More »

154

click to vote

ESANN
2003

152views Neural Networks» more ESANN 2003»

Improving iterative repair strategies for scheduling with the SVM

15 years 7 months ago

Download www2.in.tu-clausthal.de

The resource constraint project scheduling problem (RCPSP) is an NP-hard benchmark problem in scheduling which takes into account the limitation of resources’ availabilities in ...

Kai Gersmann, Barbara Hammer

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers