Search Sciweavers | Sciweavers

301 search results - page 36 / 61

» Learning to Select a Good Translation


ICML
2004
IEEE

120 views, Machine Learning

Utile distinction hidden Markov models

14 years 10 months ago

Download www.idsia.ch

This paper addresses the problem of constructing good action selection policies for agents acting in partially observable environments, a class of problems generally known as Part...

Daan Wierstra, Marco Wiering


NIPS
2003

196 views, Information Technology

Approximate Policy Iteration with a Policy Language Bias

13 years 11 months ago

Download www.jair.org

We study an approach to policy selection for large relational Markov Decision Processes (MDPs). We consider a variant of approximate policy iteration (API) that replaces the usual...

Alan Fern, Sung Wook Yoon, Robert Givan


AAAI
1998

81 views, Intelligent Agents

Feature Generation for Sequence Categorization

13 years 11 months ago

Download www.aaai.org

The problem of sequence categorization is to generalize from a corpus of labeled sequences procedures for accurately labeling future unlabeled sequences. The choice of representat...

Daniel Kudenko, Haym Hirsh


EUROCAST
2007
Springer

182 views, Hardware

A k-NN Based Perception Scheme for Reinforcement Learning

14 years 3 months ago

Download www.dia.fi.upm.es

Reinforcement Learning (RL) is a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...


IWANN
1999
Springer

115 views, Neural Networks

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

14 years 1 month ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson


