Search Sciweavers | Sciweavers

149

ECML
2005
Springer

120views Machine Learning» more ECML 2005»

Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes

15 years 8 months ago

Partially Observable Markov Decision Processes (POMDP) provide a standard framework for sequential decision making in stochastic environments. In this setting, an agent takes actio...

Masoumeh T. Izadi, Doina Precup

claim paper

Read More »

147

Voted

ICMLA
2010

207views Machine Learning» more ICMLA 2010»

Multi-Agent Inverse Reinforcement Learning

15 years 14 days ago

Download ftp.cs.wisc.edu

Learning the reward function of an agent by observing its behavior is termed inverse reinforcement learning and has applications in learning from demonstration or apprenticeship l...

Sriraam Natarajan, Gautam Kunapuli, Kshitij Judah,...

claim paper

Read More »

83

Voted

IDEAL
2004
Springer

90views Intelligent Agents» more IDEAL 2004»

Generating and Applying Rules for Interval Valued Fuzzy Observations

15 years 7 months ago

Download cms.dt.uh.edu

Abstract. One of the objectives of intelligent data engineering and automated learning is to develop algorithms that learn the environment, generate rules, and take possible course...

André de Korvin, Chenyi Hu, Ping Chen

claim paper

Read More »

106

click to vote

ECAI
2004
Springer

172views Artificial Intelligence» more ECAI 2004»

Combining Multiple Answers for Learning Mathematical Structures from Visual Observation

15 years 8 months ago

Download www.comp.leeds.ac.uk

Learning general truths from the observation of simple domains and, further, learning how to use this knowledge are essential capabilities for any intelligent agent to understand ...

Paulo Santos, Derek R. Magee, Anthony G. Cohn, Dav...

claim paper

Read More »

147

Voted

ICML
1995
IEEE

213views Machine Learning» more ICML 1995»

Learning Policies for Partially Observable Environments: Scaling Up

16 years 3 months ago

Download reference.kfupm.edu.sa

Partially observable Markov decision processes (pomdp's) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...

Michael L. Littman, Anthony R. Cassandra, Leslie P...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers