Sciweavers

1167 search results - page 118 / 234
» policy 2007
Sort
View
NIPS
1993
15 years 5 months ago
Packet Routing in Dynamically Changing Networks: A Reinforcement Learning Approach
This paper describes the Q-routing algorithm for packet routing, in which a reinforcement learning module is embedded into each node of a switching network. Only local communicati...
Justin A. Boyan, Michael L. Littman
ICML
2007
IEEE
16 years 5 months ago
Conditional random fields for multi-agent reinforcement learning
Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained with using a set of obser...
Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...
GLOBECOM
2007
IEEE
15 years 11 months ago
Efficient Queue Based Dynamic Bandwidth Allocation Scheme for Ethernet PONs
Abstract—We propose an Optical Line Terminal (OLT) centric Dynamic Bandwidth Allocation (DBA) scheme based on individual requests from service queues in Optical Network Units (ON...
Pallab K. Choudhury, Poompat Saengudomlert
AIL
2007
105views more  AIL 2007»
15 years 4 months ago
The application of fuzzy logic to the precautionary principle
One of the major problems in the implementation of the precautionary principle in environmental cases is the estimation of the weight of evidence. In this paper we propose a forma...
Mirit Shamir, Lior Shamir, Mary H. Durfee
ACL
2000
15 years 5 months ago
Spoken Dialogue Management Using Probabilistic Reasoning
Spoken dialogue managers have benefited from using stochastic planners such as Markov Decision Processes (MDPs). However, so far, MDPs do not handle well noisy and ambiguous speec...
Nicholas Roy, Joelle Pineau, Sebastian Thrun