Search Sciweavers | Sciweavers

113 search results - page 16 / 23

» Learning Representation and Control in Continuous Markov Dec...

click to vote

ILP
2007
Springer

283views Automated Reasoning» more ILP 2007»

Building Relational World Models for Reinforcement Learning

14 years 3 months ago

Download ftp.cs.wisc.edu

Abstract. Many reinforcement learning domains are highly relational. While traditional temporal-difference methods can be applied to these domains, they are limited in their capaci...

Trevor Walker, Lisa Torrey, Jude W. Shavlik, Richa...

claim paper

Read More »

click to vote

IJCAI
2007

175views Artificial Intelligence» more IJCAI 2007»

An Experts Algorithm for Transfer Learning

13 years 10 months ago

Download www.ijcai.org

A long-lived agent continually faces new tasks in its environment. Such an agent may be able to use knowledge learned in solving earlier tasks to produce candidate policies for it...

Erik Talvitie, Satinder Singh

claim paper

Read More »

click to vote

AAAI
2006

142views Intelligent Agents» more AAAI 2006»

Learning Basis Functions in Hybrid Domains

13 years 10 months ago

Download www.aaai.org

Markov decision processes (MDPs) with discrete and continuous state and action components can be solved efficiently by hybrid approximate linear programming (HALP). The main idea ...

Branislav Kveton, Milos Hauskrecht

claim paper

Read More »

click to vote

AIPS
1998

127views Artificial Intelligence» more AIPS 1998»

Solving Stochastic Planning Problems with Large State and Action Spaces

13 years 10 months ago

Download www.cs.brown.edu

Planning methods for deterministic planning problems traditionally exploit factored representations to encode the dynamics of problems in terms of a set of parameters, e.g., the l...

Thomas Dean, Robert Givan, Kee-Eung Kim

claim paper

Read More »

click to vote

ICRA
2007
IEEE

155views Robotics» more ICRA 2007»

Value Function Approximation on Non-Linear Manifolds for Robot Motor Control

14 years 3 months ago

Download sugiyama-www.cs.titech.ac.jp

— The least squares approach works efﬁciently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...

Masashi Sugiyama, Hirotaka Hachiya, Christopher To...

claim paper

Read More »

« Prev « First page 16 / 23 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers