Search Sciweavers | Sciweavers

3274 search results - page 189 / 655

» Using Learning in a Control Agent

ECML
2005
Springer

120 views, Machine Learning

Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes

15 years 7 months ago

Download www.cs.mcgill.ca

Partially Observable Markov Decision Processes (POMDP) provide a standard framework for sequential decision making in stochastic environments. In this setting, an agent takes actio...

Masoumeh T. Izadi, Doina Precup

AIED
2007
Springer

87 views, Artificial Intelligence

Does Learner Control Affect Learning?

15 years 8 months ago

Download www.cs.cmu.edu

Many intelligent tutoring systems permit some degree of learner control. A natural question is whether the increased student engagement and motivation such control provides results...

Joseph E. Beck

ATAL
2003
Springer

101 views, Intelligent Agents

How to calm hyperactive agents

15 years 7 months ago

Download www.isi.edu

System performance in multi-agent resource allocation systems can often improve if individual agents reduce their activity. Agents in such systems need a way to modulate their ind...

H. Van Dyke Parunak, Sven Brueckner, Robert S. Mat...

PKDD
2009
Springer

181 views, Data Mining

Active Learning for Reward Estimation in Inverse Reinforcement Learning

15 years 8 months ago

Download users.isr.ist.utl.pt

Inverse reinforcement learning addresses the general problem of recovering a reward function from samples of a policy provided by an expert/demonstrator. In this paper, w...

Manuel Lopes, Francisco S. Melo, Luis Montesano

GECCO
2007
Springer

186 views, Optimization

Evolving controllers for simulated car racing using object oriented genetic programming

15 years 8 months ago

Download julian.togelius.com

Several different controller representations are compared on a non-trivial problem in simulated car racing, with respect to learning speed and final fitness. The controller rep...

Alexandros Agapitos, Julian Togelius, Simon M. Luc...
