Search Sciweavers | Sciweavers

95 search results - page 17 / 19

» Policy Gradients for Cryptanalysis

138

Voted

NIPS
2003

207views Information Technology» more NIPS 2003»

Extending Q-Learning to General Adaptive Multi-Agent Systems

15 years 4 months ago

Download books.nips.cc

Recent multi-agent extensions of Q-Learning require knowledge of other agents’ payoffs and Q-functions, and assume game-theoretic play at all times by all other agents. This pap...

Gerald Tesauro

claim paper

Read More »

118

Voted

ICONIP
2007

147views Information Technology» more ICONIP 2007»

Finding Exploratory Rewards by Embodied Evolution and Constrained Reinforcement Learning in the Cyber Rodents

15 years 4 months ago

Download www.nc.irp.oist.jp

The aim of the Cyber Rodent project [1] is to elucidate the origin of our reward and aﬀective systems by building artiﬁcial agents that share the natural biological constraints...

Eiji Uchibe, Kenji Doya

claim paper

Read More »

162

click to vote

SIGDIAL
2010

137views Natural Language Processing» more SIGDIAL 2010»

Modeling Spoken Decision Making Dialogue and Optimization of its Dialogue Strategy

15 years 18 days ago

Download mastarpj.nict.go.jp

This paper presents a spoken dialogue framework that helps users in making decisions. Users often do not have a definite goal or criteria for selecting from a list of alternatives...

Teruhisa Misu, Komei Sugiura, Kiyonori Ohtake, Chi...

claim paper

Read More »

114

Voted

ACL
2009

123views Computational Linguistics» more ACL 2009»

Reinforcement Learning for Mapping Instructions to Actions

15 years 17 days ago

Download www.aclweb.org

In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function tha...

S. R. K. Branavan, Harr Chen, Luke S. Zettlemoyer,...

claim paper

Read More »

128

Voted

EDBT
2008
ACM

144views Database» more EDBT 2008»

BI batch manager: a system for managing batch workloads on enterprise data-warehouses

16 years 2 months ago

Download www.edbt.org

Modern enterprise data warehouses have complex workloads that are notoriously difficult to manage. An important problem in workload management is to run these complex workloads `o...

Abhay Mehta, Chetan Gupta, Umeshwar Dayal

claim paper

Read More »

« Prev « First page 17 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers