Sciweavers

2566 search results - page 183 / 514
» Relating reinforcement learning performance to classificatio...
Sort
View
ACL
2010
15 years 2 months ago
Optimising Information Presentation for Spoken Dialogue Systems
We present a novel approach to Information Presentation (IP) in Spoken Dialogue Systems (SDS) using a data-driven statistical optimisation framework for content planning and attri...
Verena Rieser, Oliver Lemon, Xingkun Liu
FLAIRS
2010
15 years 2 months ago
Decision-Theoretic Simulated Annealing
The choice of a good annealing schedule is necessary for good performance of simulated annealing for combinatorial optimization problems. In this paper, we pose the simulated anne...
Todd W. Neller, Christopher J. La Pilla
CDC
2009
IEEE
160views Control Systems» more  CDC 2009»
15 years 2 months ago
Exploring and exploiting routing opportunities in wireless ad-hoc networks
Abstract--In this paper, d-AdaptOR, a distributed opportunistic routing scheme for multi-hop wireless ad-hoc networks is proposed. The proposed scheme utilizes a reinforcement lear...
Abhijeet Bhorkar, Mohammad Naghshvar, Tara Javidi,...
PE
2011
Springer
215views Optimization» more  PE 2011»
14 years 11 months ago
Energy-aware routing in the Cognitive Packet Network
An energy aware routing protocol (EARP) is proposed to minimise a performance metric that combines the total consumed power in the network and the QoS that is specified for the ...
Toktam Mahmoodi
CDC
2010
IEEE
160views Control Systems» more  CDC 2010»
14 years 11 months ago
Adaptive bases for Q-learning
Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...
Dotan Di Castro, Shie Mannor