Search Sciweavers | Sciweavers

16

SIGECOM
2009
ACM

114views ECommerce» more SIGECOM 2009»

Policy teaching through reward function learning

14 years 2 months ago

Policy teaching considers a Markov Decision Process setting in which an interested party aims to inﬂuence an agent’s decisions by providing limited incentives. In this paper, ...

Haoqi Zhang, David C. Parkes, Yiling Chen

claim paper

Read More »

25

click to vote

IJCNN
2007
IEEE

103views Neural Networks» more IJCNN 2007»

A Functional Link Network With Ordered Basis Functions

14 years 1 months ago

Download www-ee.uta.edu

—A procedure is presented for selecting and ordering the polynomial basis functions in the functional link net (FLN). This procedure, based upon a modified Gram Schmidt orthonorm...

Saurabh Sureka, Michael T. Manry

claim paper

Read More »

26

click to vote

ICML
1995
IEEE

184views Machine Learning» more ICML 1995»

Residual Algorithms: Reinforcement Learning with Function Approximation

14 years 8 months ago

Download www.leemon.com

A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...

Leemon C. Baird III

claim paper

Read More »

25

click to vote

COLT
2008
Springer

115views Machine Learning» more COLT 2008»

Competing in the Dark: An Efficient Algorithm for Bandit Linear Optimization

13 years 9 months ago

Download www-stat.wharton.upenn.edu

We introduce an efficient algorithm for the problem of online linear optimization in the bandit setting which achieves the optimal O ( T) regret. The setting is a natural general...

Jacob Abernethy, Elad Hazan, Alexander Rakhlin

claim paper

Read More »

22

click to vote

ICONIP
2008

93views Information Technology» more ICONIP 2008»

Improvement of Practical Recurrent Learning Method and Application to a Pattern Classification Task

13 years 9 months ago

Download shws.cc.oita-u.ac.jp

Practical Recurrent Learning (PRL) has been proposed as a simple learning algorithm for recurrent neural networks[1][2]. This algorithm enables learning with practical order O(n2 )...

Mohamad Faizal Bin Samsudin, Katsunari Shibata

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers