Sciweavers

26 search results - page 4 / 6
» Space-indexed dynamic programming: learning to follow trajec...
Sort
View
JMLR
2010
148views more  JMLR 2010»
13 years 1 months ago
A Generalized Path Integral Control Approach to Reinforcement Learning
With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...
Evangelos Theodorou, Jonas Buchli, Stefan Schaal
ICCS
2007
Springer
14 years 1 months ago
Creating Individual Based Models of the Plankton Ecosystem
Abstract. The Virtual Ecology Workbench (VEW) is a suite of utilities for creating, executing and analysing biological models of the ocean. At its core is a mathematical language a...
Wes Hinsley, Tony Field, John Woods
NIPS
1996
13 years 8 months ago
Multidimensional Triangulation and Interpolation for Reinforcement Learning
Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...
Scott Davies
TAP
2008
Springer
93views Hardware» more  TAP 2008»
13 years 7 months ago
Pex-White Box Test Generation for .NET
Pex automatically produces a small test suite with high code coverage for a .NET program. To this end, Pex performs a systematic program analysis (using dynamic symbolic execution,...
Nikolai Tillmann, Jonathan de Halleux
CIMCA
2008
IEEE
14 years 1 months ago
Seller's Strategies for Predicting Winning Bid Prices in Online Auctions
Online auctions have become extremely popular in recent years. Ability to predict winning bid prices accurately can help bidders to maximize their profit. This paper proposes a nu...
Yevgeniya Kovalchuk