Search Sciweavers | Sciweavers

2566 search results - page 92 / 514

EPIA
1995
Springer

110 views, Artificial Intelligence

Using Stochastic Grammars to Learn Robotic Tasks

15 years 7 months ago

Download welcome.isr.ist.utl.pt

Abstract. The paper introduces a reinforcement learning-based methodology for performance improvement of Intelligent Controllers. The translation interfaces of a 3-level Hierarchic...

Pedro U. Lima, George N. Saridis

JMLR
2012

200 views, Programming Languages

Contextual Bandit Learning with Predictable Rewards

13 years 6 months ago

Download www.cs.princeton.edu

Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...

Alekh Agarwal, Miroslav Dudík, Satyen Kale,...

ICMLA
2009

167 views, Machine Learning

Learning Parameters for Relational Probabilistic Models with Noisy-Or Combining Rule

15 years 1 months ago

Download ftp.cs.wisc.edu

Languages that combine predicate logic with probabilities are needed to succinctly represent knowledge in many real-world domains. We consider a formalism based on universally qua...

Sriraam Natarajan, Prasad Tadepalli, Gautam Kunapu...

SMC
2007
IEEE

102 views, Control Systems

An improved immune Q-learning algorithm

15 years 10 months ago

Download web2.uwindsor.ca

Reinforcement learning is a framework in which an agent can learn behavior without knowledge on a task or an environment by exploration and exploitation. Striking a balance betw...

Zhengqiao Ji, Q. M. Jonathan Wu, Maher A. Sid-Ahme...

ICML
2010
IEEE

222 views, Machine Learning

Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda

15 years 2 months ago

Download www.icml2010.org

Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...

Carlton Downey, Scott Sanner
