Search Sciweavers | Sciweavers

1235 search results - page 215 / 247

» ABC Reinforcement Learning

ICML
1999
IEEE

Least-Squares Temporal Difference Learning

16 years 7 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

ATAL
2009
Springer

Integrating organizational control into multi-agent learning

16 years 1 months ago

Download www.aamas-conference.org

Multi-Agent Reinforcement Learning (MARL) algorithms suffer from slow convergence and even divergence, especially in large-scale systems. In this work, we develop an organization-b...

Chongjie Zhang, Sherief Abdallah, Victor R. Lesser

GECCO
2006
Springer

Studying XCS/BOA learning in Boolean functions: structure encoding and random Boolean functions

15 years 10 months ago

Download www.coboslab.psychologie.uni-wuerzburg.de

Recently, studies with the XCS classifier system on Boolean functions have shown that in certain types of functions simple crossover operators can lead to disruption and, conseque...

Martin V. Butz, Martin Pelikan

NIPS
2001

Improvisation and Learning

15 years 8 months ago

Download books.nips.cc

This article presents a 2-phase computational learning model and application. As a demonstration, a system has been built, called CHIME for Computer Human Interacting Musical Enti...

Judy A. Franklin

LWA
2007

Towards Learning User-Adaptive State Models in a Conversational Recommender System

15 years 8 months ago

Download users.informatik.uni-halle.de

Typical conversational recommender systems support interactive strategies that are hard-coded in advance and followed rigidly during a recommendation session. In fact, Reinforceme...

Tariq Mahmood, Francesco Ricci
