Search Sciweavers | Sciweavers

We present an efficient "sparse sampling" technique for approximating Bayes optimal decision making in reinforcement learning, addressing the well-known exploration vers...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...

ICC 2007, IEEE (Communications)

Structure and Optimality of Myopic Sensing for Opportunistic Spectrum Access

16 years 1 month ago

Download www.ece.ucdavis.edu

We consider opportunistic spectrum access for secondary users over multiple channels whose occupancy by primary users is modeled as discrete-time Markov processes. Due to hardware...

Qing Zhao, Bhaskar Krishnamachari

ICASSP 2009, IEEE (Signal Processing)

The speed of greed: Characterizing myopic gossip through network voracity

16 years 2 months ago

Download www.tsp.ece.mcgill.ca

This paper analyzes the rate of convergence of greedy gossip with eavesdropping (GGE). In previous work, we proposed GGE, a fast gossip algorithm based on exploiting the broadcast...

Deniz Üstebay, Boris N. Oreshkin, Mark Coates...

IAT 2010, IEEE (Intelligent Agents)

Selecting Operator Queries Using Expected Myopic Gain

15 years 5 months ago

Download www.eecs.umich.edu

When its human operator cannot continuously supervise (much less teleoperate) an agent, the agent should be able to recognize its limitations and ask for help when it risks making...

Robert Cohn, Michael Maxim, Edmund H. Durfee, Sati...
