Search Sciweavers | Sciweavers

» Experts in a Markov Decision Process

ALT
2006
Springer

Probabilistic Generalization of Simple Grammars and Its Application to Reinforcement Learning

16 years 2 months ago

Download www.logos.t.u-tokyo.ac.jp

Abstract. Recently, some non-regular subclasses of context-free grammars have been found to be efficiently learnable from positive data. In order to use these efficient algorithms ...

Takeshi Shibata, Ryo Yoshinaka, Takashi Chikayama

DSS
2008

Human decision-making behavior and modeling effects

15 years 5 months ago

Download www.pacis-net.org

Previous research indicates that the human decision-making process is somewhat nonlinear and that nonlinear models would be more suitable than linear models for developing advance...

Choong Nyoung Kim, Kyung Hoon Yang, Jaekyung Kim

ICML
2006
IEEE

PAC model-free reinforcement learning

16 years 6 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm, Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

ICML
2006
IEEE

An intrinsic reward mechanism for efficient exploration

16 years 6 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto

ICRA
2007
IEEE

A formal framework for robot learning and control under model uncertainty

15 years 12 months ago

Download www.cs.mcgill.ca

While the Partially Observable Markov Decision Process (POMDP) provides a formal framework for the problem of robot control under uncertainty, it typically assumes a known and ...

Robin Jaulmes, Joelle Pineau, Doina Precup
