Sciweavers

567 search results - page 14 / 114
» Regularized Policy Iteration
Sort
View
CDC
2008
IEEE
206views Control Systems» more  CDC 2008»
14 years 1 months ago
Approximate dynamic programming using support vector regression
— This paper presents a new approximate policy iteration algorithm based on support vector regression (SVR). It provides an overview of commonly used cost approximation architect...
Brett Bethke, Jonathan P. How, Asuman E. Ozdaglar
ANLP
1997
96views more  ANLP 1997»
13 years 8 months ago
Identifying Topics by Position
This paper addresses the problem of identifying likely topics of texts by their position in the text. It describes the automated training and evaluation of an Optimal Position Pol...
Chin-Yew Lin, Eduard H. Hovy
UAI
2004
13 years 8 months ago
Heuristic Search Value Iteration for POMDPs
We present a novel POMDP planning algorithm called heuristic search value iteration (HSVI). HSVI is an anytime algorithm that returns a policy and a provable bound on its regret w...
Trey Smith, Reid G. Simmons
IAT
2007
IEEE
14 years 1 months ago
A Study of an Approach to the Collective Iterative Task Allocation Problem
A major challenge in the field of Multi-Agent Systems is to enable autonomous agents to allocate tasks efficiently. This paper extends previous work on an approach to the collec...
Christian Guttmann, Iyad Rahwan, Michael P. George...
EOR
2008
109views more  EOR 2008»
13 years 7 months ago
A dynamic model for managing overlapped iterative product development
Intense competition in many industries impels firms to develop more products in less time. Overlapping of development activities is regarded as one of the most promising strategie...
Jun Lin, Kah Hin Chai, Yoke San Wong, Aarnout Brom...