Search Sciweavers | Sciweavers

358 search results - page 17 / 72

» Online Testing with Reinforcement Learning

175

click to vote

ICMAS
2000

169views Intelligent Agents» more ICMAS 2000»

Evolutionary On-line Learning of Cooperative Behavior with Situation-Action-Pairs

15 years 8 months ago

Download pages.cpsc.ucalgary.ca

We present a concept to use off-line learning approaches to achieve on-line learning of cooperative behavior of agents and instantiate this concept for evolutionary learning with ...

Jörg Denzinger, Michael Kordt

claim paper

Read More »

206

click to vote

ICRA
2008
IEEE

173views Robotics» more ICRA 2008»

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

16 years 1 months ago

Download www.cs.cmu.edu

— We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

166

click to vote

FLAIRS
2006

103views Artificial Intelligence» more FLAIRS 2006»

Using Active Relocation to Aid Reinforcement Learning

15 years 8 months ago

Download www.cs.utexas.edu

We propose a new framework for aiding a reinforcement learner by allowing it to relocate, or move, to a state it selects so as to decrease the number of steps it needs to take in ...

Lilyana Mihalkova, Raymond J. Mooney

claim paper

Read More »

217

click to vote

ACMICEC
2008
ACM

272views ECommerce» more ACMICEC 2008»

Adapting the interaction state model in conversational recommender systems

15 years 9 months ago

Download www.inf.unibz.it

Conventional conversational recommender systems support interaction strategies that are hard-coded into the system in advance. In this context, Reinforcement Learning techniques h...

Tariq Mahmood, Francesco Ricci

claim paper

Read More »

180

click to vote

ISNN
2007
Springer

116views Neural Networks» more ISNN 2007»

Online Dynamic Value System for Machine Learning

16 years 1 months ago

Download www.ent.ohiou.edu

A novel online dynamic value system for machine learning is proposed in this paper. The proposed system has a dual network structure: data processing network (DPN) and information ...

Haibo He, Janusz A. Starzyk

claim paper

Read More »

« Prev « First page 17 / 72 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers