Search Sciweavers | Sciweavers

1236 search results - page 220 / 248

» Opposition-Based Reinforcement Learning

148

click to vote

GECCO
2006
Springer

195views Optimization» more GECCO 2006»

Studying XCS/BOA learning in Boolean functions: structure encoding and random Boolean functions

15 years 9 months ago

Download www.coboslab.psychologie.uni-wuerzburg.de

Recently, studies with the XCS classifier system on Boolean functions have shown that in certain types of functions simple crossover operators can lead to disruption and, conseque...

Martin V. Butz, Martin Pelikan

claim paper

Read More »

157

Voted

NIPS
2001

106views Information Technology» more NIPS 2001»

Improvisation and Learning

15 years 7 months ago

Download books.nips.cc

This article presents a 2-phase computational learning model and application. As a demonstration, a system has been built, called CHIME for Computer Human Interacting Musical Enti...

Judy A. Franklin

claim paper

Read More »

187

click to vote

LWA
2007

160views Software Engineering» more LWA 2007»

Towards Learning User-Adaptive State Models in a Conversational Recommender System

15 years 7 months ago

Download users.informatik.uni-halle.de

Typical conversational recommender systems support interactive strategies that are hard-coded in advance and followed rigidly during a recommendation session. In fact, Reinforceme...

Tariq Mahmood, Francesco Ricci

claim paper

Read More »

156

click to vote

ICML
2009
IEEE

155views Machine Learning» more ICML 2009»

Near-Bayesian exploration in polynomial time

16 years 7 months ago

Download ai.stanford.edu

We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

201

Voted

Publication

222views

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

16 years 3 months ago

Download arxiv.org

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

« Prev « First page 220 / 248 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers