Search Sciweavers | Sciweavers

592 search results - page 75 / 119

» Online Self-Assessment as a Learning Method

NIPS
1993

86 views, Information Technology

Robust Reinforcement Learning in Motion Planning

15 years 7 months ago

Download www.cs.cmu.edu

While exploring to find better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...

Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...

SIGECOM
2011
ACM

259 views, ECommerce

Designing adaptive trading agents

14 years 8 months ago

Download www.sigecom.org

This extended abstract summarizes the research presented in Dr. Pardoe’s recently-completed Ph.D. thesis [Pardoe 2011]. The thesis considers how adaptive trading agents can take advantag...

David Pardoe, Peter Stone

NLE
2007

148 views

Abbreviated text input using language modeling

15 years 5 months ago

Download www.eecs.harvard.edu

We address the problem of improving the efficiency of natural language text input under degraded conditions (for instance, on mobile computing devices or by disabled users), by ta...

Stuart M. Shieber, Rani Nelken

TMA
2010
Springer

137 views, Management

K-Dimensional Trees for Continuous Traffic Classification

15 years 3 months ago

Download www.pam2010.ethz.ch

Abstract. The network measurement community has proposed multiple machine learning (ML) methods for traffic classification during the last years. Although several research works ha...

Valentín Carela-Español, Pere Barlet...

ACL
2009

165 views, Computational Linguistics

Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty

15 years 3 months ago

Download www.aclweb.org

Stochastic gradient descent (SGD) uses approximate gradients estimated from subsets of the training data and updates the parameters in an online fashion. This learning framework i...

Yoshimasa Tsuruoka, Jun-ichi Tsujii, Sophia Anania...
