Search Sciweavers | Sciweavers

67 search results - page 7 / 14

» Limits of Multi-Discounted Markov Decision Processes

144

click to vote

JMLR
2010

125views more JMLR 2010»

Variational methods for Reinforcement Learning

15 years 22 days ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

claim paper

Read More »

146

click to vote

CALCO
2007
Springer

100views Mathematics» more CALCO 2007»

Applications of Metric Coinduction

16 years 3 days ago

Download www.cs.cornell.edu

Metric coinduction is a form of coinduction that can be used to establish properties of objects constructed as a limit of ﬁnite approximations. One can prove a coinduction step s...

Dexter Kozen, Nicholas Ruozzi

claim paper

Read More »

176

click to vote

ICRA
2010
IEEE

133views Robotics» more ICRA 2010»

Variable resolution decomposition for robotic navigation under a POMDP framework

15 years 4 months ago

Download www.cs.mcgill.ca

— Partially Observable Markov Decision Processes (POMDPs) offer a powerful mathematical framework for making optimal action choices in noisy and/or uncertain environments, in par...

Robert Kaplow, Amin Atrash, Joelle Pineau

claim paper

Read More »

161

click to vote

ICASSP
2009
IEEE

147views Signal Processing» more ICASSP 2009»

Evolution of social P2P networks based on the dynamics of heterogeneous multimedia peers

15 years 9 months ago

Download medianetlab.ee.ucla.edu

In this paper, we consider social peer-to-peer (P2P) networks, where peers are sharing their resources (i.e., multimedia content and upload bandwidth). In the considered P2P netwo...

Hyunggon Park, Mihaela van der Schaar

claim paper

Read More »

179

click to vote

ICC
2007
IEEE

121views Communications» more ICC 2007»

Structure and Optimality of Myopic Sensing for Opportunistic Spectrum Access

16 years 8 days ago

Download www.ece.ucdavis.edu

We consider opportunistic spectrum access for secondary users over multiple channels whose occupancy by primary users is modeled as discrete-time Markov processes. Due to hardware...

Qing Zhao, Bhaskar Krishnamachari

claim paper

Read More »

« Prev « First page 7 / 14 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers