Search Sciweavers | Sciweavers

332 search results - page 29 / 67

» Ranking policies in discrete Markov decision processes

click to vote

ICIS
2003

142views Information Technology» more ICIS 2003»

A Computational Approach to Compare Information Revelation Policies

13 years 9 months ago

Download www.heinz.cmu.edu

Revelation policies in an e-marketplace differ in terms of the level of competitive information disseminated to participating sellers. Since sellers who repeatedly compete against...

Amy R. Greenwald, Karthik Kannan, Ramayya Krishnan

claim paper

Read More »

click to vote

ATAL
2006
Springer

107views Intelligent Agents» more ATAL 2006»

Winning back the CUP for distributed POMDPs: planning over continuous belief spaces

13 years 11 months ago

Download teamcore.usc.edu

Distributed Partially Observable Markov Decision Problems (Distributed POMDPs) are evolving as a popular approach for modeling multiagent systems, and many different algorithms ha...

Pradeep Varakantham, Ranjit Nair, Milind Tambe, Ma...

claim paper

Read More »

click to vote

SIGECOM
2009
ACM

114views ECommerce» more SIGECOM 2009»

Policy teaching through reward function learning

14 years 2 months ago

Download www.eecs.harvard.edu

Policy teaching considers a Markov Decision Process setting in which an interested party aims to inﬂuence an agent’s decisions by providing limited incentives. In this paper, ...

Haoqi Zhang, David C. Parkes, Yiling Chen

claim paper

Read More »

click to vote

ISLPED
1999
ACM

91views Hardware» more ISLPED 1999»

Stochastic modeling of a power-managed system: construction and optimization

14 years 10 days ago

Download hydrogen.ws.binghamton.edu

-- The goal of a dynamic power management policy is to reduce the power consumption of an electronic system by putting system components into different states, each representing ce...

Qinru Qiu, Qing Wu, Massoud Pedram

claim paper

Read More »

click to vote

JMLR
2010

125views more JMLR 2010»

Variational methods for Reinforcement Learning

13 years 2 months ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

claim paper

Read More »

« Prev « First page 29 / 67 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers