Search Sciweavers | Sciweavers

128 search results - page 24 / 26

» Hierarchically Optimal Average Reward Reinforcement Learning

199

click to vote

PKDD
2010
Springer

164views Data Mining» more PKDD 2010»

Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations

15 years 1 months ago

Download users.ics.tkk.fi

Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...

Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...

claim paper

Read More »

164

click to vote

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

16 years 4 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

134

click to vote

ICML
2006
IEEE

144views Machine Learning» more ICML 2006»

Probabilistic inference for solving discrete and continuous state Markov Decision Processes

16 years 4 months ago

Download eprints.pascal-network.org

Inference in Markov Decision Processes has recently received interest as a means to infer goals of an observed action, policy recognition, and also as a tool to compute policies. ...

Marc Toussaint, Amos J. Storkey

claim paper

Read More »

154

Voted

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

16 years 4 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

127

click to vote

RECSYS
2009
ACM

141views Control Systems» more RECSYS 2009»

Improving rating estimation in recommender systems using aggregation- and variance-based hierarchical models

15 years 10 months ago

Download pages.stern.nyu.edu

Previous work on using external aggregate rating information showed that this information can be incorporated in several diﬀerent types of recommender systems and improves their...

Akhmed Umyarov, Alexander Tuzhilin

claim paper

Read More »

« Prev « First page 24 / 26 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers