Search Sciweavers | Sciweavers

199 search results - page 14 / 40

» Efficient Reinforcement Learning with Relocatable Action Mod...

163

Voted

NIPS
2001

104views Information Technology» more NIPS 2001»

Switch Packet Arbitration via Queue-Learning

15 years 8 months ago

Download books.nips.cc

In packet switches, packets queue at switch inputs and contend for outputs. The contention arbitration policy directly affects switch performance. The best policy depends on the c...

Timothy X. Brown

claim paper

Read More »

217

click to vote

JCP
2008

139views more JCP 2008»

Agent Learning in Relational Domains based on Logical MDPs with Negation

15 years 7 months ago

Download www.academypublisher.com

In this paper, we propose a model named Logical Markov Decision Processes with Negation for Relational Reinforcement Learning for applying Reinforcement Learning algorithms on the ...

Song Zhiwei, Chen Xiaoping, Cong Shuang

claim paper

Read More »

187

click to vote

ICML
2006
IEEE

142views Machine Learning» more ICML 2006»

An intrinsic reward mechanism for efficient exploration

16 years 8 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto

claim paper

Read More »

241

click to vote

INLG
2010
Springer

134views Natural Language Processing» more INLG 2010»

Hierarchical Reinforcement Learning for Adaptive Text Generation

15 years 5 months ago

Download www.aclweb.org

We present a novel approach to natural language generation (NLG) that applies hierarchical reinforcement learning to text generation in the wayfinding domain. Our approach aims to...

Nina Dethlefs, Heriberto Cuayáhuitl

claim paper

Read More »

267

click to vote

AAAI
2012

205views Intelligent Agents» more AAAI 2012»

Competing with Humans at Fantasy Football: Team Formation in Large Partially-Observable Domains

13 years 9 months ago

Download www.intelligence.tuc.gr

We present the ﬁrst real-world benchmark for sequentiallyoptimal team formation, working within the framework of a class of online football prediction games known as Fantasy Foo...

Tim Matthews, Sarvapali D. Ramchurn, Georgios Chal...

claim paper

Read More »

« Prev « First page 14 / 40 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers