Search Sciweavers | Sciweavers

1234 search results - page 66 / 247

» Multi-criteria Reinforcement Learning

NCA
2010
IEEE

163 views, Computer Networks

Genetic algorithm-based training for semi-supervised SVM

15 years 4 months ago

Download www.synchromedia.ca

The Support Vector Machine (SVM) is an interesting classifier with excellent power of generalization. In this paper, we consider applying the SVM to semi-supervised learning. We p...

Mathias M. Adankon, Mohamed Cheriet

INTERSPEECH
2010

175 views, Signal Processing

Still talking to machines (cognitively speaking)

15 years 26 days ago

Download mi.eng.cam.ac.uk

This overview article reviews the structure of a fully statistical spoken dialogue system (SDS), using as illustration, various systems and components built at Cambridge over the ...

Steve Young

JMLR
2010

189 views

Adaptive Step-size Policy Gradients with Average Reward Metric

15 years 25 days ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

ATAL
2003
Springer

154 views, Intelligent Agents

Coordination in multiagent reinforcement learning: a Bayesian approach

15 years 11 months ago

Download www.cs.toronto.edu

Much emphasis in multiagent reinforcement learning (MARL) research is placed on ensuring that MARL algorithms (eventually) converge to desirable equilibria. As in standard reinfor...

Georgios Chalkiadakis, Craig Boutilier

IJCAI
2007

173 views, Artificial Intelligence

Reinforcement Learning of Local Shape in the Game of Go

15 years 7 months ago

Download webdocs.cs.ualberta.ca

We explore an application to the game of Go of a reinforcement learning approach based on a linear evaluation function and large numbers of binary features. This strategy has prov...

David Silver, Richard S. Sutton, Martin Mülle...

