Search Sciweavers | Sciweavers

128 search results - page 19 / 26

» Hierarchically Optimal Average Reward Reinforcement Learning

206

click to vote

AAAI
2007

104views Intelligent Agents» more AAAI 2007»

Active Imitation Learning

15 years 9 months ago

Download www.cs.washington.edu

Imitation learning, also called learning by watching or programming by demonstration, has emerged as a means of accelerating many reinforcement learning tasks. Previous work has s...

Aaron P. Shon, Deepak Verma, Rajesh P. N. Rao

claim paper

Read More »

221

Voted

ATAL
2004
Springer

102views Intelligent Agents» more ATAL 2004»

A Pheromone-Based Utility Model for Collaborative Foraging

16 years 23 days ago

Download cs.gmu.edu

Multi-agent research often borrows from biology, where remarkable examples of collective intelligence may be found. One interesting example is ant colonies’ use of pheromones as...

Liviu Panait, Sean Luke

claim paper

Read More »

200

click to vote

COGSR
2011

71views more COGSR 2011»

Psychological models of human and optimal performance in bandit problems

15 years 2 months ago

Download www.socsci.uci.edu

In bandit problems, a decision-maker must choose between a set of alternatives, each of which has a ﬁxed but unknown rate of reward, to maximize their total number of rewards ov...

Michael D. Lee, Shunan Zhang, Miles Munro, Mark St...

claim paper

Read More »

178

Voted

IIE
2007

63views more IIE 2007»

Investigation of Q-Learning in the Context of a Virtual Learning Environment

15 years 7 months ago

Download www.mii.lt

We investigate the possibility to apply a known machine learning algorithm of Q-learning in the domain of a Virtual Learning Environment (VLE). It is important in this problem doma...

Dalia Baziukaite

claim paper

Read More »

252

click to vote

CVPR
2011
IEEE

446views Computer Vision» more CVPR 2011»

Shape Grammar Parsing via Reinforcement Learning

15 years 4 months ago

Download www.mas.ecp.fr

This paper tackles shape grammar parsing for facade segmentation using a novel optimization approach based on reinforcement learning (RL). To this end, we use a binary recursive g...

Olivier Teboul, Iasonas Kokkinos, Panagiotis Kouts...

claim paper

Read More »

« Prev « First page 19 / 26 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers