Search Sciweavers | Sciweavers

1234 search results - page 175 / 247

» Multi-criteria Reinforcement Learning

134

click to vote

AE
2003
Springer

123views Artificial Intelligence» more AE 2003»

An Agent Model for First Price and Second Price Private Value Auctions

15 years 8 months ago

Download www.uea.ac.uk

The aim of this research is to develop an adaptive agent based model of auction scenarios commonly used in auction theory to help understand how competitors in auctions reach equil...

Anthony J. Bagnall, Iain Toft

claim paper

Read More »

click to vote

COMAD
2008

157views Knowledge Management» more COMAD 2008»

Personalized Web-page Rendering System

15 years 4 months ago

Download www.cse.iitb.ac.in

Personalized rendering of web pages gives the users greater control to view only what they prefer. The goal of this work is to provide a tool that will let users customize the con...

Swapna Raj Prabakara Raj, Balaraman Ravindran

claim paper

Read More »

129

click to vote

SMC
2007
IEEE

102views Control Systems» more SMC 2007»

An improved immune Q-learning algorithm

15 years 9 months ago

Download web2.uwindsor.ca

—Reinforcement learning is a framework in which an agent can learn behavior without knowledge on a task or an environment by exploration and exploitation. Striking a balance betw...

Zhengqiao Ji, Q. M. Jonathan Wu, Maher A. Sid-Ahme...

claim paper

Read More »

141

click to vote

IROS
2006
IEEE

113views Robotics» more IROS 2006»

Policy Gradient Methods for Robotics

15 years 9 months ago

Download www.cs.utah.edu

— The aquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-struc...

Jan Peters, Stefan Schaal

claim paper

Read More »

150

click to vote

ECML
2005
Springer

193views Machine Learning» more ECML 2005»

Natural Actor-Critic

15 years 8 months ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal

claim paper

Read More »

« Prev « First page 175 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers