Search Sciweavers | Sciweavers

179 search results - page 6 / 36

» Learning Relational Navigation Policies


RSS
2007

176views Robotics» more RSS 2007»

Active Policy Learning for Robot Planning and Exploration under Uncertainty

15 years 4 months ago

Download www.roboticsproceedings.org

Abstract: This paper proposes a simulation-based active policy learning algorithm for finite-horizon, partially-observed sequential decision processes. The algorithm is tested i...

Ruben Martinez-Cantin, Nando de Freitas, Arnaud Do...



IJCAI
2007

223views Artificial Intelligence» more IJCAI 2007»

Relational Knowledge with Predictive State Representations

15 years 4 months ago

Download web.mit.edu

Most work on Predictive Representations of State (PSRs) has focused on learning and planning in unstructured domains (for example, those represented by flat POMDPs). This paper e...

David Wingate, Vishal Soni, Britton Wolfe, Satinde...



IJCAI
2007

175views Artificial Intelligence» more IJCAI 2007»

An Experts Algorithm for Transfer Learning

15 years 4 months ago

Download www.ijcai.org

A long-lived agent continually faces new tasks in its environment. Such an agent may be able to use knowledge learned in solving earlier tasks to produce candidate policies for it...

Erik Talvitie, Satinder Singh



ICML
2009
IEEE

172views Machine Learning» more ICML 2009»

Model-free reinforcement learning as mixture learning

16 years 4 months ago

Download user.cs.tu-berlin.de

We cast model-free reinforcement learning as the problem of maximizing the likelihood of a probabilistic mixture model via sampling, addressing both the infinite and finite horizo...

Nikos Vlassis, Marc Toussaint



JAIR
2011

144views more JAIR 2011»

Non-Deterministic Policies in Markovian Decision Processes

14 years 10 months ago

Download www.jair.org

Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making proble...

Mahdi Milani Fard, Joelle Pineau

