Search Sciweavers | Sciweavers

802 search results - page 105 / 161

» Experts in a Markov Decision Process

167

click to vote

PKDD
2010
Springer

122views Data Mining» more PKDD 2010»

Exploration in Relational Worlds

15 years 4 months ago

Download user.cs.tu-berlin.de

Abstract. One of the key problems in model-based reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large relational domains, in wh...

Tobias Lang, Marc Toussaint, Kristian Kersting

claim paper

Read More »

194

click to vote

STACS
2012
Springer

260views Theoretical Computer Science» more STACS 2012»

Stabilization of Branching Queueing Networks

14 years 1 months ago

Download www.model.in.tum.de

Queueing networks are gaining attraction for the performance analysis of parallel computer systems. A Jackson network is a set of interconnected servers, where the completion of a...

Tomás Brázdil, Stefan Kiefer

claim paper

Read More »

159

click to vote

ESEM
2007
ACM

101views Software Engineering» more ESEM 2007»

Using Context Distance Measurement to Analyze Results across Studies

15 years 9 months ago

Download www.cs.umd.edu

Providing robust decision support for software engineering (SE) requires the collection of data across multiple contexts so that one can begin to elicit the context variables that...

Daniela Cruzes, Victor R. Basili, Forrest Shull, M...

claim paper

Read More »

210

click to vote

GECCO
2008
Springer

178views Optimization» more GECCO 2008»

Agent Smith: a real-time game-playing agent for interactive dynamic games

15 years 6 months ago

Download www.cs.bham.ac.uk

The goal of this project is to develop an agent capable of learning and behaving autonomously and making decisions quickly in a dynamic environment. The agent’s environment is a...

Ryan K. Small

claim paper

Read More »

145

click to vote

VTC
2007
IEEE

91views Communications» more VTC 2007»

Q-Learning-based Hybrid ARQ for High Speed Downlink Packet Access in UMTS

16 years 2 days ago

Download ntserver.cm.nctu.edu.tw

Abstract-In this paper, a Q-learning-based hybrid automatic repeat request (Q-HARQ) scheme is proposed to achieve efﬁcient resource utilization for high speed downlink packet acc...

Chung-Ju Chang, Chia-Yuan Chang, Fang-Ching Ren

claim paper

Read More »

« Prev « First page 105 / 161 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers