Search Sciweavers | Sciweavers

4544 search results - page 222 / 909

» Reinforcement Learning with Time


AAAI
1996

97 views, Intelligent Agents

Learning Efficient Rules by Maintaining the Explanation Structure

15 years 5 months ago

Download www.isi.edu

Many learning systems suffer from the utility problem; that is, that time after learning is greater than time before learning. Discovering how to assure that learned knowledge wil...

Jihie Kim, Paul S. Rosenbloom


ICML
2008
IEEE

117 views, Machine Learning

Sample-based learning and search with permanent and transient memories

16 years 4 months ago

Download www.cs.ualberta.ca

We present a reinforcement learning architecture, Dyna-2, that encompasses both sample-based learning and sample-based search, and that generalises across states during both learni...

David Silver, Martin Müller, Richard S. ...


ATAL
2008
Springer

146 views, Intelligent Agents

Adaptive Kanerva-based function approximation for multi-agent systems

15 years 5 months ago

Download www.aamas-conference.org

In this paper, we show how adaptive prototype optimization can be used to improve the performance of function approximation based on Kanerva Coding when solving large-scale instanc...

Cheng Wu, Waleed Meleis


SIGCSE
2009
ACM

119 views, Education

Implications of integrating test-driven development into CS1/CS2 curricula

16 years 4 months ago

Download users.csc.calpoly.edu

Many academic and industry professionals have called for more testing in computer science curricula. Test-driven development (TDD) has been proposed as a solution to improve testi...

Chetan Desai, David S. Janzen, John Clements


GECCO
2009
Springer

82 views, Optimization

On the scalability of XCS(F)

15 years 10 months ago

Download www.coboslab.psychologie.uni-wuerzburg.de

Many successful applications have proven the potential of Learning Classifier Systems and the XCS classifier system in particular in datamining, reinforcement learning, and func...

Patrick O. Stalph, Martin V. Butz, David E. Goldbe...
