Search Sciweavers | Sciweavers

1233 search results - page 148 / 247

» Feudal Reinforcement Learning

193

click to vote

ECML
2007
Springer

170views Machine Learning» more ECML 2007»

Sequence Labeling with Reinforcement Learning and Ranking Algorithms

15 years 7 months ago

Download nieme.lip6.fr

Many problems in areas such as Natural Language Processing, Information Retrieval, or Bioinformatic involve the generic task of sequence labeling. In many cases, the aim is to assi...

Francis Maes, Ludovic Denoyer, Patrick Gallinari

claim paper

Read More »

148

click to vote

AAAI
2004

135views Intelligent Agents» more AAAI 2004»

Performance Bounded Reinforcement Learning in Strategic Interactions

15 years 7 months ago

Download www.aaai.org

Despite increasing deployment of agent technologies in several business and industry domains, user confidence in fully automated agent driven applications is noticeably lacking. T...

Bikramjit Banerjee, Jing Peng

claim paper

Read More »

137

click to vote

AAAI
2008

105views Intelligent Agents» more AAAI 2008»

Potential-based Shaping in Model-based Reinforcement Learning

15 years 8 months ago

Download www.aaai.org

Potential-based shaping was designed as a way of introducing background knowledge into model-free reinforcement-learning algorithms. By identifying states that are likely to have ...

John Asmuth, Michael L. Littman, Robert Zinkov

claim paper

Read More »

161

click to vote

NIPS
2007

149views Information Technology» more NIPS 2007»

Online Linear Regression and Its Application to Model-Based Reinforcement Learning

15 years 7 months ago

Download books.nips.cc

We provide a provably efﬁcient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Speciﬁcally, we take a mo...

Alexander L. Strehl, Michael L. Littman

claim paper

Read More »

146

click to vote

EUSFLAT
2001

144views Fuzzy Logic» more EUSFLAT 2001»

Adaptive torque control using a connectionist reinforcement learning agent

15 years 7 months ago

Download www.eusflat.org

The correction of angular misalignment between mating components is a fundamental requirement for their successful assembly. In this paper we present how a learning agent based on...

Lorenzo Brignone, Martin Howarth, S. Sivayoganatha...

claim paper

Read More »

« Prev « First page 148 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers