Search Sciweavers | Sciweavers

82 search results - page 4 / 17

» MDPs: Learning in Varying Environments

150

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Planning in the Presence of Cost Functions Controlled by an Adversary

16 years 6 months ago

Download www.cs.cmu.edu

We investigate methods for planning in a Markov Decision Process where the cost function is chosen by an adversary after we fix our policy. As a running example, we consider a rob...

H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum

claim paper

Read More »

171

Voted

JMLR
2008

129views more JMLR 2008»

Finite-Time Bounds for Fitted Value Iteration

15 years 5 months ago

Download www.sztaki.hu

In this paper we develop a theoretical analysis of the performance of sampling-based fitted value iteration (FVI) to solve infinite state-space, discounted-reward Markovian decisi...

Rémi Munos, Csaba Szepesvári

claim paper

Read More »

129

click to vote

AMAI
2004
Springer

96views Artificial Intelligence» more AMAI 2004»

Learning via Finitely Many Queries

15 years 11 months ago

Download rutcor.rutgers.edu

This work introduces a new query inference model that can access data and communicate with a teacher by asking ﬁnitely many boolean queries in a language L. In this model the pa...

Andrew C. Lee

claim paper

Read More »

164

click to vote

IADIS
2003

125views Internet Technology» more IADIS 2003»

Knowledge Acquisition Strategies and Navigation in Hypermedia Learning Environments: THe Influence of Instructional Design Prope

15 years 7 months ago

Download www.l3s.de

In order to understand and enhance the value of new media in education it is necessary to develop criteria for the evaluation of the effectiveness of learning with hypermedia envi...

Mattias Steinke, Thomas Huk, Christian Floto

claim paper

Read More »

157

click to vote

ICPR
2008
IEEE

191views Computer Vision» more ICPR 2008»

Incremental learning in non-stationary environments with concept drift using a multiple classifier based approach

16 years 7 months ago

Download users.rowan.edu

We outline an incremental learning algorithm designed for nonstationary environments where the underlying data distribution changes over time. With each dataset drawn from a new e...

Matthew T. Karnick, Michael Muhlbaier, Robi Polika...

claim paper

Read More »

« Prev « First page 4 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers