Search Sciweavers | Sciweavers

1236 search results - page 123 / 248

» Opposition-Based Reinforcement Learning

213

click to vote

ATAL
2007
Springer

147views Intelligent Agents» more ATAL 2007»

A reinforcement learning based distributed search algorithm for hierarchical peer-to-peer information retrieval systems

15 years 10 months ago

Download www.haizhengzhang.com

The dominant existing routing strategies employed in peerto-peer(P2P) based information retrieval(IR) systems are similarity-based approaches. In these approaches, agents depend o...

Haizheng Zhang, Victor R. Lesser

claim paper

Read More »

223

click to vote

ICML
1995
IEEE

196views Machine Learning» more ICML 1995»

Ant-Q: A Reinforcement Learning Approach to the Traveling Salesman Problem

16 years 7 months ago

Download www.idsia.ch

In this paper we introduce Ant-Q, a family of algorithms which present many similarities with Q-learning (Watkins, 1989), and which we apply to the solution of symmetric and asymm...

Luca Maria Gambardella, Marco Dorigo

claim paper

Read More »

180

click to vote

ICRA
2008
IEEE

173views Robotics» more ICRA 2008»

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

16 years 1 months ago

Download www.cs.cmu.edu

— We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

181

click to vote

HICSS
2000
IEEE

134views Biometrics» more HICSS 2000»

Peer-to-Peer Valuation as a Mechanism for Reinforcing Active Learning in Virtual Communities: Actualizing Social Exchange Theory

15 years 11 months ago

Download www.bus.iastate.edu

As knowledge becomes the primary focus of work in many industries, virtual communities and groups are emerging as part of new organizational forms. Within these virtual forms, eff...

Amrit Tiwana, Ashley A. Bush

claim paper

Read More »

178

click to vote

AAAI
2008

199views Intelligent Agents» more AAAI 2008»

Maximum Entropy Inverse Reinforcement Learning

15 years 9 months ago

Download www.andrew.cmu.edu

Recent research has shown the benefit of framing problems of imitation learning as solutions to Markov Decision Problems. This approach reduces learning to the problem of recoveri...

Brian Ziebart, Andrew L. Maas, J. Andrew Bagnell, ...

claim paper

Read More »

« Prev « First page 123 / 248 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers