Sciweavers

» Multi-criteria Reinforcement Learning
AE
2003
Springer
14 years 3 months ago
An Agent Model for First Price and Second Price Private Value Auctions
The aim of this research is to develop an adaptive agent based model of auction scenarios commonly used in auction theory to help understand how competitors in auctions reach equil...
Anthony J. Bagnall, Iain Toft
COMAD
2008
13 years 11 months ago
Personalized Web-page Rendering System
Personalized rendering of web pages gives the users greater control to view only what they prefer. The goal of this work is to provide a tool that will let users customize the con...
Swapna Raj Prabakara Raj, Balaraman Ravindran
SMC
2007
IEEE
14 years 4 months ago
An improved immune Q-learning algorithm
Reinforcement learning is a framework in which an agent can learn behavior without knowledge on a task or an environment by exploration and exploitation. Striking a balance betw...
Zhengqiao Ji, Q. M. Jonathan Wu, Maher A. Sid-Ahme...
IROS
2006
IEEE
14 years 4 months ago
Policy Gradient Methods for Robotics
The acquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-struc...
Jan Peters, Stefan Schaal
ECML
2005
Springer
14 years 3 months ago
Natural Actor-Critic
This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...
Jan Peters, Sethu Vijayakumar, Stefan Schaal