Sciweavers

286 search results - page 32 / 58
» Using inaccurate models in reinforcement learning
Sort
View
AAAI
2008
13 years 10 months ago
Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation
Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...
Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...
CVPR
2010
IEEE
13 years 11 months ago
Lymph Node Detection in 3-D Chest CT using a Spatial Prior Probability
Lymph nodes have high clinical relevance but detection is challenging as they are hard to see due to low contrast and irregular shape. In this paper, a method for fully automatic ...
Johannes Feulner, Kevin Zhou, Martin Huber, Joachi...
AIED
2009
Springer
14 years 2 months ago
Transfer Learning and Representation Discovery in Intelligent Tutoring Systems
We describe a novel framework developed for transfer learning within reinforcement learning (RL) problems. Then we exhibit how this framework can be extended to intelligent tutorin...
Kimberly Ferguson, Beverly Park Woolf, Sridhar Mah...
JAIR
2008
148views more  JAIR 2008»
13 years 7 months ago
Learning Partially Observable Deterministic Action Models
We present exact algorithms for identifying deterministic-actions' effects and preconditions in dynamic partially observable domains. They apply when one does not know the ac...
Eyal Amir, Allen Chang
NIPS
2008
13 years 9 months ago
Structure Learning in Human Sequential Decision-Making
We use graphical models and structure learning to explore how people learn policies in sequential decision making tasks. Studies of sequential decision-making in humans frequently...
Daniel Acuña, Paul R. Schrater