Search Sciweavers | Sciweavers

71 search results - page 4 / 15

» A Behavior Adaptation Algorithm based on Hierarchical Partia...

click to vote

ICML
2006
IEEE

144views Machine Learning» more ICML 2006»

Probabilistic inference for solving discrete and continuous state Markov Decision Processes

14 years 8 months ago

Download eprints.pascal-network.org

Inference in Markov Decision Processes has recently received interest as a means to infer goals of an observed action, policy recognition, and also as a tool to compute policies. ...

Marc Toussaint, Amos J. Storkey

claim paper

Read More »

click to vote

ICWS
2004
IEEE

164views Internet Technology» more ICWS 2004»

Dynamic Workflow Composition using Markov Decision Processes

13 years 8 months ago

Download www.cs.uga.edu

The advent of Web services has made automated workflow composition relevant to Web based applications. One technique that has received some attention, for automatically composing ...

Prashant Doshi, Richard Goodwin, Rama Akkiraju, Ku...

claim paper

Read More »

click to vote

CONNECTION
2008

178views more CONNECTION 2008»

Spoken language interaction with model uncertainty: an adaptive human-robot interaction system

13 years 7 months ago

Download people.csail.mit.edu

Spoken language is one of the most intuitive forms of interaction between humans and agents. Unfortunately, agents that interact with people using natural language often experienc...

Finale Doshi, Nicholas Roy

claim paper

Read More »

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

13 years 11 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

click to vote

FOCS
2007
IEEE

157views Theoretical Computer Science» more FOCS 2007»

Approximation Algorithms for Partial-Information Based Stochastic Control with Markovian Rewards

14 years 1 months ago

Download www.cis.upenn.edu

We consider a variant of the classic multi-armed bandit problem (MAB), which we call FEEDBACK MAB, where the reward obtained by playing each of n independent arms varies according...

Sudipto Guha, Kamesh Munagala

claim paper

Read More »

« Prev « First page 4 / 15 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers