Search Sciweavers | Sciweavers

21

DAGSTUHL
2007

107views Software Engineering» more DAGSTUHL 2007»

Learning Probabilistic Relational Dynamics for Multiple Tasks

13 years 8 months ago

The ways in which an agent’s actions affect the world can often be modeled compactly using a set of relational probabilistic planning rules. This paper addresses the problem of ...

Ashwin Deshpande, Brian Milch, Luke S. Zettlemoyer...

claim paper

Read More »

21

click to vote

GECCO
2006
Springer

133views Optimization» more GECCO 2006»

On-line evolutionary computation for reinforcement learning in stochastic domains

13 years 11 months ago

Download userweb.cs.utexas.edu

In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...

Shimon Whiteson, Peter Stone

claim paper

Read More »

23

click to vote

ECML
2005
Springer

120views Machine Learning» more ECML 2005»

Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes

14 years 27 days ago

Download www.cs.mcgill.ca

Partially Observable Markov Decision Processes (POMDP) provide a standard framework for sequential decision making in stochastic environments. In this setting, an agent takes actio...

Masoumeh T. Izadi, Doina Precup

claim paper

Read More »

22

click to vote

CHI
2004
ACM

113views Human Computer Interaction» more CHI 2004»

Understanding the micronote lifecycle: improving mobile support for informal note taking

14 years 7 months ago

Download userpages.umbc.edu

People frequently write messages to themselves. These informal, hurried personal jottings serve as temporary storage for notable information as well as reminders for future action...

Min Lin, Wayne G. Lutters, Tina S. Kim

claim paper

Read More »

21

click to vote

NIPS
2007

149views Information Technology» more NIPS 2007»

Online Linear Regression and Its Application to Model-Based Reinforcement Learning

13 years 8 months ago

Download books.nips.cc

We provide a provably efﬁcient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Speciﬁcally, we take a mo...

Alexander L. Strehl, Michael L. Littman

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers