Search Sciweavers | Sciweavers

85 search results - page 12 / 17

» Solving Stochastic Planning Problems with Large State and Ac...

click to vote

AAAI
2007

102views Intelligent Agents» more AAAI 2007»

Thresholded Rewards: Acting Optimally in Timed, Zero-Sum Games

13 years 9 months ago

Download www.cs.cmu.edu

In timed, zero-sum games, the goal is to maximize the probability of winning, which is not necessarily the same as maximizing our expected reward. We consider cumulative intermedi...

Colin McMillen, Manuela M. Veloso

claim paper

Read More »

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

14 years 8 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

click to vote

AI
2011
Springer

170views Artificial Intelligence» more AI 2011»

A unifying action calculus

13 years 2 months ago

Download www.computational-logic.org

Abstract McCarthy’s Situation Calculus is arguably the oldest special-purpose knowledge representation formalism, designed to axiomatize knowledge of actions and their eﬀects. ...

Michael Thielscher

claim paper

Read More »

click to vote

JAIR
2011

134views more JAIR 2011»

Scaling up Heuristic Planning with Relational Decision Trees

13 years 2 months ago

Download www.jair.org

Current evaluation functions for heuristic planning are expensive to compute. In numerous planning problems these functions provide good guidance to the solution, so they are wort...

Tomás de la Rosa, Sergio Jiménez, Ra...

claim paper

Read More »

click to vote

AAAI
2010

131views Intelligent Agents» more AAAI 2010»

SixthSense: Fast and Reliable Recognition of Dead Ends in MDPs

13 years 9 months ago

Download www.cs.washington.edu

The results of the latest International Probabilistic Planning Competition (IPPC-2008) indicate that the presence of dead ends, states with no trajectory to the goal, makes MDPs h...

Andrey Kolobov, Mausam, Daniel S. Weld

claim paper

Read More »

« Prev « First page 12 / 17 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers