Search Sciweavers | Sciweavers

93 search results - page 15 / 19

» A Novel Prioritization Technique for Solving Markov Decision...

CSL
2012
Springer

311views Automated Reasoning» more CSL 2012»

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

12 years 3 months ago

Download mi.eng.cam.ac.uk

Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...

Filip Jurčíček, Blaise Thomson, Steve Young

JAIR
2006

101views more JAIR 2006»

Resource Allocation Among Agents with MDP-Induced Preferences

13 years 7 months ago

Download www.jair.org

Allocating scarce resources among agents to maximize global utility is, in general, computationally challenging. We focus on problems where resources enable agents to execute acti...

Dmitri A. Dolgov, Edmund H. Durfee

ATAL
2009
Springer

134views Intelligent Agents» more ATAL 2009»

Improving adjustable autonomy strategies for time-critical domains

14 years 2 months ago

Download www.aamas-conference.org

As agents begin to perform complex tasks alongside humans as collaborative teammates, it becomes crucial that the resulting human-multiagent teams adapt to time-critical domains. I...

Nathan Schurr, Janusz Marecki, Milind Tambe

JAIR
2010

115views more JAIR 2010»

An Investigation into Mathematical Programming for Finite Horizon Decentralized POMDPs

13 years 5 months ago

Download www.jair.org

Decentralized planning in uncertain environments is a complex task generally dealt with by using a decision-theoretic approach, mainly through the framework of Decentralized Parti...

Raghav Aras, Alain Dutech

AIPS
2007

174views Artificial Intelligence» more AIPS 2007»

Learning to Plan Using Harmonic Analysis of Diffusion Models

13 years 9 months ago

Download www.cs.umass.edu

This paper summarizes research on a new emerging framework for learning to plan using the Markov decision process model (MDP). In this paradigm, two approaches to learning to plan...

Sridhar Mahadevan, Sarah Osentoski, Jeffrey Johns,...
