Sciweavers

325 search results - page 40 / 65
» Structured Reachability Analysis for Markov Decision Process...
Sort
View
172
Voted
JMLR
2010
189views more  JMLR 2010»
14 years 10 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
157
Voted
DIALM
2000
ACM
112views Algorithms» more  DIALM 2000»
15 years 8 months ago
A decision-theoretic approach to resource allocation in wireless multimedia networks
The allocation of scarce spectral resources to support as many user applications as possible while maintaining reasonable quality of service is a fundamental problem in wireless c...
Zygmunt J. Haas, Joseph Y. Halpern, Erran L. Li, S...
MABS
2000
Springer
15 years 7 months ago
Agent-Based Social Simulation with Coalitions in Social Reasoning
There is a growing belief that the agents' cognitive structures play a central role on the enhancement of predicative capacities of decision-making strategies. This paper anal...
Nuno David, Jaime Simão Sichman, Helder Coe...
IANDC
2008
112views more  IANDC 2008»
15 years 4 months ago
Inclusion dynamics hybrid automata
Hybrid systems are dynamical systems with the ability to describe mixed discretecontinuous evolution of a wide range of systems. Consequently, at first glance, hybrid systems appe...
Alberto Casagrande, Carla Piazza, Alberto Policrit...
DSOM
2008
Springer
15 years 5 months ago
SYMIAN: A Simulation Tool for the Optimization of the IT Incident Management Process
Incident Management is the process through which IT support organizations manage to restore normal service operation after a service disruption. The complexity of IT support organi...
Claudio Bartolini, Cesare Stefanelli, Mauro Torton...