Search Sciweavers | Sciweavers

495 search results - page 59 / 99

» Constructing States for Reinforcement Learning

TASLP
2010

106 views

Efficient and Robust Music Identification With Weighted Finite-State Transducers

14 years 10 months ago

Download static.googleusercontent.com

We present an approach to music identification based on weighted finite-state transducers and Gaussian mixture models, inspired by techniques used in large-vocabulary speech recogn...

Mehryar Mohri, Pedro Moreno, Eugene Weinstein


ICML
2003
IEEE

150 views, Machine Learning

The Significance of Temporal-Difference Learning in Self-Play Training: TD-Rummy versus EVO-rummy

15 years 9 months ago

Download www.hpl.hp.com

Reinforcement learning has been used for training game playing agents. The value function for a complex game must be approximated with a continuous function because the number of ...

Clifford Kotnik, Jugal K. Kalita


IVA
2005
Springer

118 views, Intelligent Agents

Teaching Virtual Characters How to Use Body Language

15 years 9 months ago

Download eprints.ucl.ac.uk

Non-verbal communication, or “body language”, is a critical component in constructing believable virtual characters. Most often, body language is implemented by a set...

Doron A. Friedman, Marco Gillies


FLAIRS
2006

109 views, Artificial Intelligence

Refining Human Behavior Models in a Context-based Architecture

15 years 5 months ago

Download www.aaai.org

This paper describes an investigation into the refinement of context-based human behavior models through the use of experiential learning. Specifically, a tactical agent was endow...

David Aihe, Avelino J. Gonzalez


JMLR
2010

189 views

Adaptive Step-size Policy Gradients with Average Reward Metric

14 years 11 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

