Search Sciweavers | Sciweavers

96 search results - page 7 / 20

» Adding Reinforcement Learning Features to the Neural-Gas Met...

201

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 7 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

201

click to vote

ACL
2012

225views Computational Linguistics» more ACL 2012»

Fast Online Training with Frequency-Adaptive Learning Rates for Chinese Word Segmentation and New Word Detection

13 years 9 months ago

Download aclweb.org

We present a joint model for Chinese word segmentation and new word detection. We present high dimensional new features, including word-based features and enriched edge (label-tra...

Xu Sun, Houfeng Wang, Wenjie Li

claim paper

Read More »

227

click to vote

CORR
2010
Springer

204views Education» more CORR 2010»

Predictive State Temporal Difference Learning

15 years 5 months ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identiﬁcation. In practical applications...

Byron Boots, Geoffrey J. Gordon

claim paper

Read More »

197

click to vote

JSAC
2007

189views more JSAC 2007»

Non-Cooperative Power Control for Wireless Ad Hoc Networks with Repeated Games

15 years 7 months ago

Download www.cs.ust.hk

— One of the distinctive features in a wireless ad hoc network is lack of any central controller or single point of authority, in which each node/link then makes its own decision...

Chengnian Long, Qian Zhang, Bo Li, Huilong Yang, X...

claim paper

Read More »

178

click to vote

CVPR
2005
IEEE

270views Computer Vision» more CVPR 2005»

Learning Feature Distance Measures for Image Correspondences

16 years 9 months ago

Download web.mysites.ntu.edu.sg

Standard but ad hoc measures such as sum-of-squared pixel differences (SSD) are often used when comparing and registering two images that have not been previously observed before....

Xi Chen, Tat-Jen Cham

claim paper

Read More »

« Prev « First page 7 / 20 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers