Search Sciweavers | Sciweavers

494 search results - page 30 / 99

» Evaluating a Reinforcement Learning Algorithm with a General...

128

Voted

NIPS
1993

128views Information Technology» more NIPS 1993»

Convergence of Stochastic Iterative Dynamic Programming Algorithms

15 years 4 months ago

Download www.bitsavers.org

Recent developments in the area of reinforcement learning have yielded a number of new algorithms for the prediction and control of Markovian environments. These algorithms,includ...

Tommi Jaakkola, Michael I. Jordan, Satinder P. Sin...

claim paper

Read More »

152

Voted

KI
2002
Springer

254views Artificial Intelligence» more KI 2002»

Advantages, Opportunities and Limits of Empirical Evaluations: Evaluating Adaptive Systems

15 years 2 months ago

Download www.easy-hub.org

While empirical evaluations are a common research method in some areas of Artificial Intelligence (AI), others still neglect this approach. This article outlines both the opportun...

Stephan Weibelzahl, Gerhard Weber

claim paper

Read More »

129

Voted

IAT
2005
IEEE

138views Intelligent Agents» more IAT 2005»

Multiagent Reputation Management to Achieve Robust Software Using Redundancy

15 years 8 months ago

Download www.cse.sc.edu

This paper explains the building of robust software using multiagent reputation. One of the major goals of software engineering is to achieve robust software. Our hypothesis is th...

Rajesh Turlapati, Michael N. Huhns

claim paper

Read More »

149

Voted

ECML
2007
Springer

143views Machine Learning» more ECML 2007»

Generalization-Based Similarity for Conceptual Clustering

15 years 9 months ago

Download www.di.uniba.it

The availability of techniques for comparing descriptions has many applications in Artiﬁcial Intelligence, ranging from description selection to ﬂexible matching, from instance...

Stefano Ferilli, Teresa Maria Altomare Basile, Nic...

claim paper

Read More »

128

Voted

ECAI
2006
Springer

245views Artificial Intelligence» more ECAI 2006»

Least Squares SVM for Least Squares TD Learning

15 years 6 months ago

Download homepages.feis.herts.ac.uk

Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...

Tobias Jung, Daniel Polani

claim paper

Read More »

« Prev « First page 30 / 99 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers