Sciweavers

802 search results - page 105 / 161
» Experts in a Markov Decision Process
Sort
View
PKDD
2010
Springer
122views Data Mining» more  PKDD 2010»
13 years 8 months ago
Exploration in Relational Worlds
Abstract. One of the key problems in model-based reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large relational domains, in wh...
Tobias Lang, Marc Toussaint, Kristian Kersting
STACS
2012
Springer
12 years 5 months ago
Stabilization of Branching Queueing Networks
Queueing networks are gaining attraction for the performance analysis of parallel computer systems. A Jackson network is a set of interconnected servers, where the completion of a...
Tomás Brázdil, Stefan Kiefer
ESEM
2007
ACM
14 years 2 months ago
Using Context Distance Measurement to Analyze Results across Studies
Providing robust decision support for software engineering (SE) requires the collection of data across multiple contexts so that one can begin to elicit the context variables that...
Daniela Cruzes, Victor R. Basili, Forrest Shull, M...
GECCO
2008
Springer
178views Optimization» more  GECCO 2008»
13 years 11 months ago
Agent Smith: a real-time game-playing agent for interactive dynamic games
The goal of this project is to develop an agent capable of learning and behaving autonomously and making decisions quickly in a dynamic environment. The agent’s environment is a...
Ryan K. Small
VTC
2007
IEEE
14 years 4 months ago
Q-Learning-based Hybrid ARQ for High Speed Downlink Packet Access in UMTS
Abstract-In this paper, a Q-learning-based hybrid automatic repeat request (Q-HARQ) scheme is proposed to achieve efficient resource utilization for high speed downlink packet acc...
Chung-Ju Chang, Chia-Yuan Chang, Fang-Ching Ren