Search Sciweavers | Sciweavers


» Optimal Resource Allocation and Policy Formulation in Loosel...


ICML
2006
IEEE

Machine Learning

An intrinsic reward mechanism for efficient exploration

14 years 8 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto


ML
2002
ACM

Machine Learning

Near-Optimal Reinforcement Learning in Polynomial Time

13 years 7 months ago

Download www.cis.upenn.edu

We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...

Michael J. Kearns, Satinder P. Singh


AAAI
2006

Intelligent Agents

Targeting Specific Distributions of Trajectories in MDPs

13 years 8 months ago

Download www.cc.gatech.edu

We define TTD-MDPs, a novel class of Markov decision processes where the traditional goal of an agent is changed from finding an optimal trajectory through a state space to realiz...

David L. Roberts, Mark J. Nelson, Charles Lee Isbe...


GLOBECOM
2008
IEEE

Communications

Foresighted Resource Reciprocation Strategies in P2P Networks

14 years 1 months ago

Download medianetlab.ee.ucla.edu

We consider peer-to-peer (P2P) networks, where multiple peers are interested in sharing content. While sharing resources, autonomous and self-interested peers need to make decis...

Hyunggon Park, Mihaela van der Schaar


GLOBECOM
2006
IEEE

Communications

Optimal Routing Between Alternate Paths With Different Network Transit Delays

14 years 1 months ago

Download www.cs.ucr.edu

We consider the path-determination problem in Internet core routers that distribute flows across alternate paths leading to the same destination. We assume that the remainder ...

Essia Hamouda Elhafsi, Mart Molle
