Search Sciweavers | Sciweavers

360 search results - page 26 / 72

» Learning Evaluation Functions for Large Acyclic Domains

IJCV
2000

141 views

Reliable Estimation of Dense Optical Flow Fields with Large Displacements

13 years 8 months ago

Download ami.dis.ulpgc.es

In this paper we show that a classic optical flow technique by Nagel and Enkelmann (1986) can be regarded as an early anisotropic diffusion method with a diffusion tensor. We introduc...

Luis Álvarez, Joachim Weickert, Javier S&aa...

ICML
2008
IEEE

122 views, Machine Learning

Reinforcement learning in the presence of rare events

14 years 9 months ago

Download www.ece.mcgill.ca

We consider the task of reinforcement learning in an environment in which rare significant events occur independently of the actions selected by the controlling agent. If these ev...

Jordan Frank, Shie Mannor, Doina Precup

GECCO
2007
Springer

235 views, Optimization

Expensive optimization, uncertain environment: an EA-based solution

14 years 2 months ago

Download www.cs.bham.ac.uk

Real-life optimization problems often require finding an optimal solution to complex, high-dimensional, multimodal problems involving computationally very expensive fitness function e...

Maumita Bhattacharya

ICPR
2000
IEEE

129 views, Computer Vision

General Bias/Variance Decomposition with Target Independent Variance of Error Functions Derived from the Exponential Family of D

14 years 9 months ago

Download www.vogdrup-hansen.dk

An important theoretical tool in machine learning is the bias/variance decomposition of the generalization error. It was introduced for the mean square error in [3]. The bias/vari...

Jakob Vogdrup Hansen, Tom Heskes

IJCAI
2007

179 views, Artificial Intelligence

Heuristic Selection of Actions in Multiagent Reinforcement Learning

13 years 10 months ago

Download www.ijcai.org

This work presents a new algorithm, called Heuristically Accelerated Minimax-Q (HAMMQ), that allows the use of heuristics to speed up the well-known Multiagent Reinforcement Learni...

Reinaldo A. C. Bianchi, Carlos H. C. Ribeiro, Anna...
