Sciweavers

98 search results - page 12 / 20
» Some Experiments with Real-time Decision Algorithms
Sort
View
150
Voted
CIKM
2010
Springer
15 years 2 months ago
Ranking under temporal constraints
This paper introduces the notion of temporally constrained ranked retrieval, which, given a query and a time constraint, produces the best possible ranked list within the specifi...
Lidan Wang, Donald Metzler, Jimmy Lin
140
Voted
COLT
2006
Springer
15 years 7 months ago
Online Learning with Constraints
In this paper, we study a sequential decision making problem. The objective is to maximize the total reward while satisfying constraints, which are defined at every time step. The...
Shie Mannor, John N. Tsitsiklis
134
Voted
ISDA
2006
IEEE
15 years 9 months ago
Modular Neural Network Task Decomposition Via Entropic Clustering
The use of monolithic neural networks (such as a multilayer perceptron) has some drawbacks: e.g. slow learning, weight coupling, the black box effect. These can be alleviated by t...
Jorge M. Santos, Luís A. Alexandre, Joaquim...
127
Voted
CAV
1993
Springer
127views Hardware» more  CAV 1993»
15 years 7 months ago
Symbolic Equivalence Checking
Abstract. We describe the implementation, within ALDEBARAN of an algorithmic method allowing the generation of a minimal labeled transition rom an abstract model ; this minimality ...
Jean-Claude Fernandez, Alain Kerbrat, Laurent Moun...
115
Voted
WSC
2008
15 years 6 months ago
On step sizes, stochastic shortest paths, and survival probabilities in Reinforcement Learning
Reinforcement Learning (RL) is a simulation-based technique useful in solving Markov decision processes if their transition probabilities are not easily obtainable or if the probl...
Abhijit Gosavi