Search Sciweavers | Sciweavers

252 search results - page 20 / 51

» Optimal Sequential Exploration: A Binary Learning Model

click to vote

CVPR
2010
IEEE

286views Computer Vision» more CVPR 2010»

Exploring Features in a Bayesian Framework for Material Recognition

14 years 4 months ago

Download people.csail.mit.edu

We are interested in identifying the material category, e.g. glass, metal, fabric, plastic or wood, from a single image of a surface. Unlike other visual recognition tasks in comp...

Ce Liu, Lavanya Sharan, Edward Adelson, Ruth Rosen...

claim paper

Read More »

click to vote

COLT
2010
Springer

207views Machine Learning» more COLT 2010»

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

13 years 5 months ago

Download www.colt2010.org

Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

claim paper

Read More »

click to vote

ICMLA
2009

167views Machine Learning» more ICMLA 2009»

Structured Prediction Models for Chord Transcription of Music Audio

13 years 5 months ago

Download www.cs.columbia.edu

Chord sequences are a compact and useful description of music, representing each beat or measure in terms of a likely distribution over individual notes without specifying the not...

Adrian Weller, Daniel P. W. Ellis, Tony Jebara

claim paper

Read More »

click to vote

ICANNGA
2009
Springer

212views Algorithms» more ICANNGA 2009»

Evolutionary Regression Modeling with Active Learning: An Application to Rainfall Runoff Modeling

14 years 2 months ago

Download www.sumo.intec.ugent.be

Many complex, real world phenomena are difﬁcult to study directly using controlled experiments. Instead, the use of computer simulations has become commonplace as a feasible alte...

Ivo Couckuyt, Dirk Gorissen, Hamed Rouhani, Eric L...

claim paper

Read More »

click to vote

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

13 years 9 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

« Prev « First page 20 / 51 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers