Search Sciweavers | Sciweavers

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

235

click to vote

AAAI
2012

205views Intelligent Agents» more AAAI 2012»

Competing with Humans at Fantasy Football: Team Formation in Large Partially-Observable Domains

13 years 9 months ago

Download www.intelligence.tuc.gr

We present the ﬁrst real-world benchmark for sequentiallyoptimal team formation, working within the framework of a class of online football prediction games known as Fantasy Foo...

Tim Matthews, Sarvapali D. Ramchurn, Georgios Chal...

claim paper

Read More »

137

click to vote

ICML
2010
IEEE

175views Machine Learning» more ICML 2010»

Telling cause from effect based on high-dimensional observations

15 years 7 months ago

Download www.kyb.tuebingen.mpg.de

Dominik Janzing, Patrik O. Hoyer, Bernhard Sch&oum...

claim paper

Read More »

190

Voted

COLING
2010

138views Computational Linguistics» more COLING 2010»

Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes

15 years 1 months ago

Download aclweb.org

This paper investigates how to automatically create a dialogue control component of a listening agent to reduce the current high cost of manually creating such components. We coll...

Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Min...

claim paper

Read More »

« Prev « First page 7 / 352 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers