Search Sciweavers | Sciweavers

360 search results - page 10 / 72

» Learning Evaluation Functions for Large Acyclic Domains

click to vote

ICCBR
2005
Springer

210views Automated Reasoning» more ICCBR 2005»

CBR for State Value Function Approximation in Reinforcement Learning

14 years 2 months ago

Download ml.informatik.uni-freiburg.de

CBR is one of the techniques that can be applied to the task of approximating a function over high-dimensional, continuous spaces. In Reinforcement Learning systems a learning agen...

Thomas Gabel, Martin A. Riedmiller

claim paper

Read More »

click to vote

ICIP
2009
IEEE

232views Image Processing» more ICIP 2009»

Learning Large Margin Likelihoods For Realtime Head Pose Tracking

14 years 9 months ago

Download www.idiap.ch

We consider the problem of head tracking and pose estimation in realtime from low resolution images. Tracking and pose recognition are treated as two coupled problems in a probabi...

claim paper

Read More »

click to vote

PKDD
2009
Springer

103views Data Mining» more PKDD 2009»

Kernels for Periodic Time Series Arising in Astronomy

14 years 3 months ago

Download www.cs.tufts.edu

Abstract. We present a method for applying machine learning algorithms to the automatic classiﬁcation of astronomy star surveys using time series of star brightness. Currently su...

Gabriel Wachman, Roni Khardon, Pavlos Protopapas, ...

claim paper

Read More »

click to vote

AINA
2010
IEEE

130views Computer Networks» more AINA 2010»

Routing Loops in DAG-Based Low Power and Lossy Networks

14 years 1 months ago

Download www.emmanuelbaccelli.org

Abstract—Directed Acyclic Graphs (DAGs), rooted at popular/default destinations, have emerged as a preferred mechanism to provide IPv6 routing functionality in large scale low po...

Weigao Xie, Mukul Goyal, Hossein Hosseini, Jerald ...

claim paper

Read More »

click to vote

ICMLA
2010

207views Machine Learning» more ICMLA 2010»

Multi-Agent Inverse Reinforcement Learning

13 years 6 months ago

Download ftp.cs.wisc.edu

Learning the reward function of an agent by observing its behavior is termed inverse reinforcement learning and has applications in learning from demonstration or apprenticeship l...

Sriraam Natarajan, Gautam Kunapuli, Kshitij Judah,...

claim paper

Read More »

« Prev « First page 10 / 72 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers