Search Sciweavers | Sciweavers

94 search results - page 15 / 19

» Sequential cost-sensitive decision making with reinforcement...

ECML
2005
Springer

120 views, Machine Learning

Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes

15 years 11 months ago

Download www.cs.mcgill.ca

Partially Observable Markov Decision Processes (POMDP) provide a standard framework for sequential decision making in stochastic environments. In this setting, an agent takes actio...

Masoumeh T. Izadi, Doina Precup

COLT
2007
Springer

108 views, Machine Learning

Minimax Bounds for Active Learning

16 years 5 days ago

Download www.ee.columbia.edu

This paper analyzes the potential advantages and theoretical challenges of “active learning” algorithms. Active learning involves sequential sampling procedures that use infor...

Rui Castro, Robert D. Nowak

IAT
2010
IEEE

167 views, Intelligent Agents

Selecting Operator Queries Using Expected Myopic Gain

15 years 4 months ago

Download www.eecs.umich.edu

When its human operator cannot continuously supervise (much less teleoperate) an agent, the agent should be able to recognize its limitations and ask for help when it risks making...

Robert Cohn, Michael Maxim, Edmund H. Durfee, Sati...

CORR
2010
Springer

152 views, Education

Neuroevolutionary optimization

15 years 6 months ago

Download jmlr.csail.mit.edu

Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...

Eva Volná

ECCV
2010
Springer

251 views, Computer Vision

Discriminative Tracking by Metric Learning

15 years 10 months ago

Download www.eecs.northwestern.edu

We present a discriminative model that casts appearance modeling and visual matching into a single objective for visual tracking. Most previous discriminative models for visual tra...
