Sciweavers

1619 search results - page 26 / 324
» Structure in the Space of Value Functions
JMLR
2008
Finite-Time Bounds for Fitted Value Iteration
In this paper we develop a theoretical analysis of the performance of sampling-based fitted value iteration (FVI) to solve infinite state-space, discounted-reward Markovian decisi...
Rémi Munos, Csaba Szepesvári
ICML
1996
IEEE
Learning Evaluation Functions for Large Acyclic Domains
Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...
Justin A. Boyan, Andrew W. Moore
UAI
2000
Utilities as Random Variables: Density Estimation and Structure Discovery
Decision theory does not traditionally include uncertainty over utility functions. We argue that a person's utility value for a given outcome can be treated as we treat o...
Urszula Chajewska, Daphne Koller
BMCBI
2005
Functional annotation by identification of local surface similarities: a novel tool for structural genomics
Background: Protein function is often dependent on subsets of solvent-exposed residues that may exist in a similar three-dimensional configuration in non-homologous proteins thus ...
Fabrizio Ferrè, Gabriele Ausiello, Andreas ...
FOCS
1992
IEEE
Dynamic Half-Space Reporting, Geometric Optimization, and Minimum Spanning Trees
We describe dynamic data structures for half-space range reporting and for maintaining the minima of a decomposable function. Using these data structures, we obtain efficient dyna...
Pankaj K. Agarwal, David Eppstein, Jirí Mat...