Search Sciweavers | Sciweavers

451 search results - page 37 / 91

» Performance evaluation with temporal rewards

152

click to vote

ATAL
2010
Springer

171views Intelligent Agents» more ATAL 2010»

Closing the learning-planning loop with predictive state representations

15 years 5 months ago

Download www.cs.cmu.edu

A central problem in artificial intelligence is to choose actions to maximize reward in a partially observable, uncertain environment. To do so, we must learn an accurate model of ...

Byron Boots, Sajid M. Siddiqi, Geoffrey J. Gordon

claim paper

Read More »

115

click to vote

CVPR
2009
IEEE

287views Computer Vision» more CVPR 2009»

Contextualizing histogram

15 years 11 months ago

Download www.lv-nus.org

In this paper, we investigate how to incorporate spatial and/or temporal contextual information into classical histogram features with the aim of boosting visual classiﬁcation p...

Bingbing Ni, Shuicheng Yan, Ashraf A. Kassim

claim paper

Read More »

151

click to vote

ATAL
2010
Springer

146views Intelligent Agents» more ATAL 2010»

PAC-MDP learning with knowledge-based admissible models

15 years 4 months ago

Download www.aamas-conference.org

PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...

Marek Grzes, Daniel Kudenko

claim paper

Read More »

101

click to vote

AIPS
2009

102views Artificial Intelligence» more AIPS 2009»

Using Distance Estimates in Heuristic Search

15 years 5 months ago

Download www.cs.unh.edu

This paper explores the use of an oft-ignored information source in heuristic search: a search-distance-to-go estimate. Operators frequently have different costs and cost-to-go is...

Jordan Tyler Thayer, Wheeler Ruml

claim paper

Read More »

181

click to vote

NCA
2008
IEEE

165views Computer Networks» more NCA 2008»

Neurodynamic programming: a case study of the traveling salesman problem

15 years 4 months ago

Download www.ece.uic.edu

The paper focuses on the study of solving the large-scale traveling salesman problem (TSP) based on neurodynamic programming. From this perspective, two methods, temporal differenc...

Jia Ma, Tao Yang, Zeng-Guang Hou, Min Tan, Derong ...

claim paper

Read More »

« Prev « First page 37 / 91 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers