Search Sciweavers | Sciweavers

286 search results - page 32 / 58

» Using inaccurate models in reinforcement learning

124

Voted

AAAI
2008

207views Intelligent Agents» more AAAI 2008»

Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation

15 years 6 months ago

Download sugiyama-www.cs.titech.ac.jp

Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...

Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...

claim paper

Read More »

124

click to vote

CVPR
2010
IEEE

254views Computer Vision» more CVPR 2010»

Lymph Node Detection in 3-D Chest CT using a Spatial Prior Probability

15 years 7 months ago

Download www5.informatik.uni-erlangen.de

Lymph nodes have high clinical relevance but detection is challenging as they are hard to see due to low contrast and irregular shape. In this paper, a method for fully automatic ...

Johannes Feulner, Kevin Zhou, Martin Huber, Joachi...

claim paper

Read More »

129

click to vote

AIED
2009
Springer

129views Artificial Intelligence» more AIED 2009»

Transfer Learning and Representation Discovery in Intelligent Tutoring Systems

15 years 10 months ago

Download www.cs.umass.edu

We describe a novel framework developed for transfer learning within reinforcement learning (RL) problems. Then we exhibit how this framework can be extended to intelligent tutorin...

Kimberly Ferguson, Beverly Park Woolf, Sridhar Mah...

claim paper

Read More »

179

click to vote

JAIR
2008

148views more JAIR 2008»

Learning Partially Observable Deterministic Action Models

15 years 4 months ago

Download www.jair.org

We present exact algorithms for identifying deterministic-actions' effects and preconditions in dynamic partially observable domains. They apply when one does not know the ac...

Eyal Amir, Allen Chang

claim paper

Read More »

141

click to vote

NIPS
2008

129views Information Technology» more NIPS 2008»

Structure Learning in Human Sequential Decision-Making

15 years 5 months ago

Download www-users.cs.umn.edu

We use graphical models and structure learning to explore how people learn policies in sequential decision making tasks. Studies of sequential decision-making in humans frequently...

Daniel Acuña, Paul R. Schrater

claim paper

Read More »

« Prev « First page 32 / 58 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers