Search Sciweavers | Sciweavers

170 search results - page 24 / 34

» Heuristic Selection of Actions in Multiagent Reinforcement L...

187

click to vote

JAIR
2011

134views more JAIR 2011»

Scaling up Heuristic Planning with Relational Decision Trees

15 years 17 days ago

Download www.jair.org

Current evaluation functions for heuristic planning are expensive to compute. In numerous planning problems these functions provide good guidance to the solution, so they are wort...

Tomás de la Rosa, Sergio Jiménez, Ra...

claim paper

Read More »

166

click to vote

ATAL
2010
Springer

146views Intelligent Agents» more ATAL 2010»

PAC-MDP learning with knowledge-based admissible models

15 years 5 months ago

Download www.aamas-conference.org

PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...

Marek Grzes, Daniel Kudenko

claim paper

Read More »

177

click to vote

LION
2007
Springer

192views Optimization» more LION 2007»

Learning While Optimizing an Unknown Fitness Surface

15 years 11 months ago

Download www.science.unitn.it

This paper is about Reinforcement Learning (RL) applied to online parameter tuning in Stochastic Local Search (SLS) methods. In particular a novel application of RL is considered i...

Roberto Battiti, Mauro Brunato, Paolo Campigotto

claim paper

Read More »

130

click to vote

ATAL
2005
Springer

117views Intelligent Agents» more ATAL 2005»

Multi-agent reward analysis for learning in noisy domains

15 years 11 months ago

Download ti.arc.nasa.gov

In many multi agent learning problems, it is difﬁcult to determine, a priori, the agent reward structure that will lead to good performance. This problem is particularly pronoun...

Adrian K. Agogino, Kagan Tumer

claim paper

Read More »

158

click to vote

JAIR
2011

144views more JAIR 2011»

Non-Deterministic Policies in Markovian Decision Processes

15 years 17 days ago

Download www.jair.org

Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making proble...

Mahdi Milani Fard, Joelle Pineau

claim paper

Read More »

« Prev « First page 24 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers