Search Sciweavers | Sciweavers

3395 search results - page 126 / 679

» Learning to efficiently rank

ECML 2007, Springer

167 views, Machine Learning

Efficient Continuous-Time Reinforcement Learning with Adaptive State Graphs

15 years 7 months ago

Download www.igi.tugraz.at

Abstract. We present a new reinforcement learning approach for deterministic continuous control problems in environments with unknown, arbitrary reward functions. The difficulty of...

Gerhard Neumann, Michael Pfeiffer, Wolfgang Maass

AAAI 2006

105 views, Intelligent Agents

Learning Partially Observable Action Models: Efficient Algorithms

15 years 5 months ago

Download www.aaai.org

We present tractable, exact algorithms for learning actions' effects and preconditions in partially observable domains. Our algorithms maintain a propositional logical repres...

Dafna Shahaf, Allen Chang, Eyal Amir

ICCV 2007, IEEE

174 views, Computer Vision

Boosting Invariance and Efficiency in Supervised Learning

16 years 5 months ago

Download www.vlfeat.org

In this paper we present a novel boosting algorithm for supervised learning that incorporates invariance to data transformations and has high generalization capabilities. While on...

Andrea Vedaldi, Paolo Favaro, Enrico Grisan

CVPR 2003, IEEE

167 views, Computer Vision

An Efficient Approach to Learning Inhomogeneous Gibbs Model

16 years 5 months ago

Download www.cs.cmu.edu

Inhomogeneous Gibbs model (IGM) [4] is an effective maximum entropy model in characterizing complex high-dimensional distributions. However, its training process is so slow that th...

Ziqiang Liu, Hong Chen, Heung-Yeung Shum

ICML 2009, ACM

159 views, Machine Learning

Efficient learning algorithms for changing environments

16 years 4 months ago

Download www.cs.princeton.edu

We study online learning in an oblivious changing environment. The standard measure of regret bounds the difference between the cost of the online learner and the best decision in...

Elad Hazan, C. Seshadhri
