Search Sciweavers | Sciweavers

286 search results - page 35 / 58

» Using inaccurate models in reinforcement learning

154

click to vote

FLAIRS
2009

135views Artificial Intelligence» more FLAIRS 2009»

Beating the Defense: Using Plan Recognition to Inform Learning Agents

15 years 2 months ago

Download www.knexusresearch.com

In this paper, we investigate the hypothesis that plan recognition can significantly improve the performance of a casebased reinforcement learner in an adversarial action selectio...

Matthew Molineaux, David W. Aha, Gita Sukthankar

claim paper

Read More »

124

click to vote

CEEMAS
2005
Springer

87views Intelligent Agents» more CEEMAS 2005»

A Direct Reputation Model for VO Formation

15 years 10 months ago

Download www.dcs.kcl.ac.uk

We show that reputation is a basic ingredient in the Virtual Organisation (VO) formation process. Agents can use their experiences gained in direct past interactions to model other...

Arturo Avila-Rosas, Michael Luck

claim paper

Read More »

127

click to vote

ACL
2010

135views Computational Linguistics» more ACL 2010»

Reading between the Lines: Learning to Map High-Level Instructions to Commands

15 years 2 months ago

Download ai.cs.washington.edu

In this paper, we address the task of mapping high-level instructions to sequences of commands in an external environment. Processing these instructions is challenging--they posit...

S. R. K. Branavan, Luke S. Zettlemoyer, Regina Bar...

claim paper

Read More »

126

click to vote

ML
2002
ACM

100views Machine Learning» more ML 2002»

Structure in the Space of Value Functions

15 years 4 months ago

Download www.gatsby.ucl.ac.uk

Solving in an efficient manner many different optimal control tasks within the same underlying environment requires decomposing the environment into its computationally elemental ...

David J. Foster, Peter Dayan

claim paper

Read More »

147

click to vote

CVPR
2008
IEEE

213views Computer Vision» more CVPR 2008»

Kernel-based learning of cast shadows from a physical model of light sources and surfaces for low-level segmentation

16 years 6 months ago

Download vision.gel.ulaval.ca

In background subtraction, cast shadows induce silhouette distortions and object fusions hindering performance of high level algorithms in scene monitoring. We introduce a nonpara...

André Zaccarin, Nicolas Martel-Brisson

claim paper

Read More »

« Prev « First page 35 / 58 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers