Search Sciweavers | Sciweavers

802 search results - page 3 / 161

» Experts in a Markov Decision Process

189

click to vote

EWRL
2008

129views Machine Learning» more EWRL 2008»

Markov Decision Processes with Arbitrary Reward Processes

15 years 9 months ago

Download www.cim.mcgill.ca

Abstract. We consider a control problem where the decision maker interacts with a standard Markov decision process with the exception that the reward functions vary arbitrarily ove...

Jia Yuan Yu, Shie Mannor, Nahum Shimkin

claim paper

Read More »

148

Voted

AUTOMATICA
2010

92views more AUTOMATICA 2010»

Simulation-based optimization of Markov decision processes: An empirical process theory approach

15 years 7 months ago

Download paleale.eecs.berkeley.edu

Rahul Jain, Pravin Varaiya

claim paper

Read More »

199

Voted

IPMU
2010
Springer

145views Information Technology» more IPMU 2010»

A New Adaptive Consensus Reaching Process Based on the Experts' Importance

16 years 6 days ago

Download sci2s.ugr.es

Usually, in a group decision context, the importance level, conﬁdence degree and amount of knowledge are very diﬀerent among individuals. So, when all the individuals have to r...

Ignacio J. Pérez, Francisco Javier Cabreriz...

claim paper

Read More »

202

Voted

NIPS
2000

127views Information Technology» more NIPS 2000»

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 8 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

claim paper

Read More »

176

Voted

CDC
2008
IEEE

140views Control Systems» more CDC 2008»

Information state for Markov decision processes with network delays

16 years 1 months ago

Download wsl.stanford.edu

We consider a networked control system, where each subsystem evolves as a Markov decision process (MDP). Each subsystem is coupled to its neighbors via communication links over wh...

Sachin Adlakha, Sanjay Lall, Andrea J. Goldsmith

claim paper

Read More »

« Prev « First page 3 / 161 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers