Search Sciweavers | Sciweavers

182

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 8 months ago

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

178

Voted

IIE
2007

63views more IIE 2007»

Investigation of Q-Learning in the Context of a Virtual Learning Environment

15 years 7 months ago

Download www.mii.lt

We investigate the possibility to apply a known machine learning algorithm of Q-learning in the domain of a Virtual Learning Environment (VLE). It is important in this problem doma...

Dalia Baziukaite

claim paper

Read More »

188

click to vote

TWC
2008

130views more TWC 2008»

On myopic sensing for multi-channel opportunistic access: structure, optimality, and performance

15 years 7 months ago

Download anrg.usc.edu

We consider a multi-channel opportunistic communication system where the states of these channels evolve as independent and statistically identical Markov chains (the Gilbert-Elli...

Qing Zhao, Bhaskar Krishnamachari, Keqin Liu

claim paper

Read More »

239

click to vote

IAT
2010
IEEE

166views Intelligent Agents» more IAT 2010»

Using a Social Orientation Model for the Evolution of Cooperative Societies

15 years 5 months ago

Download www.cs.umd.edu

We utilize evolutionary game theory to study the evolution of cooperative societies and the behaviors of individual agents (i.e., players) in such societies. We present a novel pla...

Kan-Leung Cheng, Inon Zuckerman, Ugur Kuter, Dana ...

claim paper

Read More »

207

click to vote

ICDM
2007
IEEE

159views Data Mining» more ICDM 2007»

Spectral Regression: A Unified Approach for Sparse Subspace Learning

15 years 11 months ago

Download www.cs.uiuc.edu

Recently the problem of dimensionality reduction (or, subspace learning) has received a lot of interests in many fields of information processing, including data mining, informati...

Deng Cai, Xiaofei He, Jiawei Han

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers