Search Sciweavers | Sciweavers

109 search results - page 14 / 22

» Policy teaching through reward function learning

209

Voted

Publication

334views

Rollout Sampling Approximate Policy Iteration

16 years 13 days ago

Download www.springerlink.com

Several researchers have recently investigated the connection between reinforcement learning and classification. We are motivated by proposals of approximate policy iteration schem...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

123

Voted

CDC
2008
IEEE

197views Control Systems» more CDC 2008»

Dynamic spectrum access policies for cognitive radio

15 years 10 months ago

Download www.ifp.illinois.edu

—We study the problem of dynamic spectrum sensing and access in cognitive radio systems as a partially observed Markov decision process (POMDP). A group of cognitive users cooper...

Jayakrishnan Unnikrishnan, Venugopal V. Veeravalli

claim paper

Read More »

155

click to vote

HICSS
2003
IEEE

142views Biometrics» more HICSS 2003»

Evolution of a Knowledge Focused Computer Supported Learning System by Ensuring Extensibility through Generalization and Replica

15 years 8 months ago

Download www.hicss.hawaii.edu

If sufficient attention is not paid to the information models on which Learning Platforms are based the ability to deliver rich functionality is hindered. This paper describes the...

David White, Lesley A. Gardner, Don Sheridan

claim paper

Read More »

109

click to vote

ATAL
2004
Springer

120views Intelligent Agents» more ATAL 2004»

Communication for Improving Policy Computation in Distributed POMDPs

15 years 8 months ago

Download teamcore.usc.edu

Distributed Partially Observable Markov Decision Problems (POMDPs) are emerging as a popular approach for modeling multiagent teamwork where a group of agents work together to joi...

Ranjit Nair, Milind Tambe, Maayan Roth, Makoto Yok...

claim paper

Read More »

160

Voted

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

16 years 4 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

« Prev « First page 14 / 22 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers