Search Sciweavers | Sciweavers

771 search results - page 86 / 155

» Markov Decision Processes with Arbitrary Reward Processes

107

Voted

IJCAI
2007

176views Artificial Intelligence» more IJCAI 2007»

Opponent Modeling in Scrabble

15 years 5 months ago

Download www.ijcai.org

Computers have already eclipsed the level of human play in competitive Scrabble, but there remains room for improvement. In particular, there is much to be gained by incorporating...

Mark Richards, Eyal Amir

claim paper

Read More »

129

click to vote

ATAL
2006
Springer

107views Intelligent Agents» more ATAL 2006»

Winning back the CUP for distributed POMDPs: planning over continuous belief spaces

15 years 7 months ago

Download teamcore.usc.edu

Distributed Partially Observable Markov Decision Problems (Distributed POMDPs) are evolving as a popular approach for modeling multiagent systems, and many different algorithms ha...

Pradeep Varakantham, Ranjit Nair, Milind Tambe, Ma...

claim paper

Read More »

155

click to vote

ICCD
2006
IEEE

171views Hardware» more ICCD 2006»

Stochastic Dynamic Thermal Management: A Markovian Decision-based Approach

16 years 1 months ago

Download atrak.usc.edu

This paper proposes a stochastic dynamic thermal management (DTM) technique in high-performance VLSI system with especial attention to the uncertainty in temperature observation. ...

Hwisung Jung, Massoud Pedram

claim paper

Read More »

279

click to vote

Publication

151views

Robust Bayesian reinforcement learning through tight lower bounds

14 years 2 months ago

Download arxiv.org

In the Bayesian approach to sequential decision making, exact calculation of the (subjective) utility is intractable. This extends to most special cases of interest, such as reinfo...

Christos Dimitrakakis

posted by olethros

Read More »

190

click to vote

AAAI
2011

246views Intelligent Agents» more AAAI 2011»

An Online Spectral Learning Algorithm for Partially Observable Nonlinear Dynamical Systems

14 years 4 months ago

Download www.cs.cmu.edu

Recently, a number of researchers have proposed spectral algorithms for learning models of dynamical systems—for example, Hidden Markov Models (HMMs), Partially Observable Marko...

Byron Boots, Geoffrey J. Gordon

claim paper

Read More »

« Prev « First page 86 / 155 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers