Search Sciweavers | Sciweavers

599 search results - page 77 / 120

» Online learning by ellipsoid method

IWANN
1999
Springer

115 views, Neural Networks

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

15 years 10 months ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson

HT
2000
ACM

127 views, Internet Technology

Reusable hypertext structures for distance and JIT learning

15 years 10 months ago

Download www.cs.brown.edu

Software components for distance and just-in-time (JIT) learning are an increasingly common method of encouraging reuse and facilitating the development process [58], but no analog...

Anne Morgan Spalter, Rosemary Michelle Simpson

ICML
2007
IEEE

141 views, Machine Learning

Exponentiated gradient algorithms for log-linear structured prediction

16 years 6 months ago

Download www.machinelearning.org

Conditional log-linear models are a commonly used method for structured prediction. Efficient learning of parameters in these models is therefore an important problem. This paper ...

Amir Globerson, Terry Koo, Xavier Carreras, Michae...

JMLR
2010

189 views

Adaptive Step-size Policy Gradients with Average Reward Metric

15 years 16 days ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

HICSS
2005
IEEE

160 views, Biometrics

Using Content and Process Scaffolds to Support Collaborative Discourse in Asynchronous Learning Networks

15 years 11 months ago

Download csdl2.computer.org

Discourse, a form of collaborative learning [44], is one of the most widely used methods of teaching and learning in the online environment. Particularly in large courses, discour...

I. Wong-Bushby, Starr Roxanne Hiltz, Michael Biebe...
