Search Sciweavers | Sciweavers

473 search results - page 49 / 95

» Optimal policy switching algorithms for reinforcement learni...

click to vote

ATAL
2006
Springer

192views Intelligent Agents» more ATAL 2006»

A hierarchical approach to efficient reinforcement learning in deterministic domains

14 years 22 days ago

Download paul.rutgers.edu

Factored representations, model-based learning, and hierarchies are well-studied techniques for improving the learning efficiency of reinforcement-learning algorithms in large-sca...

Carlos Diuk, Alexander L. Strehl, Michael L. Littm...

claim paper

Read More »

click to vote

IWLCS
2005
Springer

161views Machine Learning» more IWLCS 2005»

Counter Example for Q-Bucket-Brigade Under Prediction Problem

14 years 2 months ago

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classiﬁer System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara

claim paper

Read More »

click to vote

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

PAC model-free reinforcement learning

14 years 9 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

claim paper

Read More »

click to vote

ATAL
2007
Springer

151views Intelligent Agents» more ATAL 2007»

Batch reinforcement learning in a complex domain

14 years 3 months ago

Download userweb.cs.utexas.edu

Temporal diﬀerence reinforcement learning algorithms are perfectly suited to autonomous agents because they learn directly from an agent’s experience based on sequential actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

click to vote

ITNG
2007
IEEE

118views Information Technology» more ITNG 2007»

Input Fuzzy Modeling for the Recognition of Handwritten Hindi Numerals

14 years 3 months ago

Download eprints.qut.edu.au

This paper presents the recognition of Handwritten Hindi Numerals based on the modified exponential membership function fitted to the fuzzy sets derived from normalized distance f...

Madasu Hanmandlu, J. Grover, Vamsi Krishna Madasu,...

claim paper

Read More »

« Prev « First page 49 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers