Search Sciweavers | Sciweavers

ANSS
1996
IEEE

Modeling and Simulation

Computation of the Asymptotic Bias and Variance for Simulation of Markov Reward Models

15 years 11 months ago

The asymptotic bias and variance are important determinants of the quality of a simulation run. In particular, the asymptotic bias can be used to approximate the bias introduced b...

Aad P. A. van Moorsel, Latha A. Kant, William H. S...

ECRA
2010

RDRP: Reward-Driven Request Prioritization for e-Commerce web sites

15 years 7 months ago

Download static.googleusercontent.com

Meeting client Quality-of-Service (QoS) expectations proves to be a difficult task for the providers of e-Commerce services, especially when web servers experience overload condit...

Alexander Totok, Vijay Karamcheti

AIIDE
2008

Artificial Intelligence

Constructing Complex NPC Behavior via Multi-Objective Neuroevolution

15 years 9 months ago

Download nn.cs.utexas.edu

It is difficult to discover effective behavior for NPCs automatically. For instance, evolutionary methods can learn sophisticated behaviors based on a single objective, but realis...

Jacob Schrum, Risto Miikkulainen

ICML
2002
IEEE

Machine Learning

Reinforcement Learning and Shaping: Encouraging Intended Behaviors

16 years 8 months ago

Download www.grappa.univ-lille3.fr

We explore dynamic shaping to integrate our prior beliefs of the final policy into a conventional reinforcement learning system. Shaping provides a positive or negative artificial...

Adam Laud, Gerald DeJong

EUROCAST
2007
Springer

Hardware

A k-NN Based Perception Scheme for Reinforcement Learning

16 years 1 month ago

Download www.dia.fi.upm.es

Reinforcement Learning (RL) is a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...
