Search Sciweavers | Sciweavers

231 search results - page 27 / 47

» Active Learning in Partially Observable Markov Decision Proc...

click to vote

IJCAI
2003

173views Artificial Intelligence» more IJCAI 2003»

A Planning Algorithm for Predictive State Representations

13 years 10 months ago

Download dli.iiit.ac.in

We address the problem of optimally controlling stochastic environments that are partially observable. The standard method for tackling such problems is to define and solve a Part...

Masoumeh T. Izadi, Doina Precup

claim paper

Read More »

click to vote

ATAL
2006
Springer

107views Intelligent Agents» more ATAL 2006»

Winning back the CUP for distributed POMDPs: planning over continuous belief spaces

14 years 11 days ago

Download teamcore.usc.edu

Distributed Partially Observable Markov Decision Problems (Distributed POMDPs) are evolving as a popular approach for modeling multiagent systems, and many different algorithms ha...

Pradeep Varakantham, Ranjit Nair, Milind Tambe, Ma...

claim paper

Read More »

click to vote

ICTAI
2005
IEEE

117views Artificial Intelligence» more ICTAI 2005»

Planning with POMDPs Using a Compact, Logic-Based Representation

14 years 2 months ago

Download www.cs.tufts.edu

Partially Observable Markov Decision Processes (POMDPs) provide a general framework for AI planning, but they lack the structure for representing real world planning problems in a...

Chenggang Wang, James G. Schmolze

claim paper

Read More »

click to vote

IJCAI
2007

176views Artificial Intelligence» more IJCAI 2007»

Opponent Modeling in Scrabble

13 years 10 months ago

Download www.ijcai.org

Computers have already eclipsed the level of human play in competitive Scrabble, but there remains room for improvement. In particular, there is much to be gained by incorporating...

Mark Richards, Eyal Amir

claim paper

Read More »

click to vote

ML
2002
ACM

121views Machine Learning» more ML 2002»

Near-Optimal Reinforcement Learning in Polynomial Time

13 years 8 months ago

Download www.cis.upenn.edu

We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

« Prev « First page 27 / 47 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers