Search Sciweavers | Sciweavers

423 search results - page 72 / 85

» Multi-objective Model Checking of Markov Decision Processes

ATAL
2008
Springer

104 views, Intelligent Agents

Expediting RL by using graphical structures

15 years 5 months ago

Download www.cs.washington.edu

The goal of reinforcement learning (RL) is to maximize reward (minimize cost) in a Markov decision process (MDP) without knowing the underlying model a priori. RL algorithms tend ...

Peng Dai, Alexander L. Strehl, Judy Goldsmith

NIPS
2003

145 views, Information Technology

A Nonlinear Predictive State Representation

15 years 4 months ago

Download books.nips.cc

Predictive state representations (PSRs) use predictions of a set of tests to represent the state of controlled dynamical systems. One reason why this representation is exciting as...

Matthew R. Rudary, Satinder P. Singh

ATAL
2010
Springer

157 views, Intelligent Agents

Augmenting appearance-based localization and navigation using belief update

15 years 4 months ago

Download www.aamas-conference.org

Appearance-based localization compares the current image taken from a robot's camera to a set of pre-recorded images in order to estimate the current location of the robot. S...

George Chrysanthakopoulos, Guy Shani

CSL
2010
Springer

163 views, Automated Reasoning

Evaluation of a hierarchical reinforcement learning spoken dialogue system

15 years 3 months ago

Download www.cstr.ed.ac.uk

We describe an evaluation of spoken dialogue strategies designed using hierarchical reinforcement learning agents. The dialogue strategies were learnt in a simulated environment a...

Heriberto Cuayáhuitl, Steve Renals, Oliver ...

ICTAI
2009
IEEE

86 views, Artificial Intelligence

TiMDPpoly: An Improved Method for Solving Time-Dependent MDPs

15 years 28 days ago

Download www.montefiore.ulg.ac.be

We introduce TiMDPpoly, an algorithm designed to solve planning problems with durative actions, under probabilistic uncertainty, in a non-stationary, continuous-time context. Miss...

Emmanuel Rachelson, Patrick Fabiani, Fréd...

