Search Sciweavers | Sciweavers

2990 search results - page 592 / 598

» Hidden Markov processes

237

Voted

ATAL
2010
Springer

158views Intelligent Agents» more ATAL 2010»

Combining manual feedback with subsequent MDP reward signals for reinforcement learning

15 years 8 months ago

Download www.cs.utexas.edu

As learning agents move from research labs to the real world, it is increasingly important that human users, including those without programming skills, be able to teach agents de...

W. Bradley Knox, Peter Stone

claim paper

Read More »

174

click to vote

CORR
2008
Springer

122views Education» more CORR 2008»

Strategy Improvement for Concurrent Safety Games

15 years 7 months ago

Download www.soe.ucsc.edu

We consider concurrent games played on graphs. At every round of the game, each player simultaneously and independently selects a move; the moves jointly determine the transition ...

Krishnendu Chatterjee, Luca de Alfaro, Thomas A. H...

claim paper

Read More »

226

click to vote

CSL
2010
Springer

238views Automated Reasoning» more CSL 2010»

Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems

15 years 7 months ago

Download mi.eng.cam.ac.uk

This paper describes a statistically motivated framework for performing real-time dialogue state updates and policy learning in a spoken dialogue system. The framework is based on...

Blaise Thomson, Steve Young

claim paper

Read More »

192

click to vote

CORR
2008
Springer

173views Education» more CORR 2008»

Decomposition Principles and Online Learning in Cross-Layer Optimization for Delay-Sensitive Applications

15 years 7 months ago

Download documents.scribd.com

In this paper, we propose a general cross-layer optimization framework in which we explicitly consider both the heterogeneous and dynamically changing characteristics of delay-sens...

Fangwen Fu, Mihaela van der Schaar

claim paper

Read More »

191

click to vote

AI
2006
Springer

145views Artificial Intelligence» more AI 2006»

Backward-chaining evolutionary algorithms

15 years 7 months ago

Download www.cs.ucl.ac.uk

Starting from some simple observations on a popular selection method in Evolutionary Algorithms (EAs)--tournament selection--we highlight a previously-unknown source of inefficien...

Riccardo Poli, William B. Langdon

claim paper

Read More »

« Prev « First page 592 / 598 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers