Search Sciweavers | Sciweavers

44 search results - page 9 / 9

» Single-Player Monte-Carlo Tree Search

155

click to vote

ACL
2000

85views Computational Linguistics» more ACL 2000»

An Improved Parser for Data-Oriented Lexical-Functional Analysis

15 years 7 months ago

Download acl.ldc.upenn.edu

We present an LFG-DOP parser which uses fragments from LFG-annotated sentences to parse new sentences. Experiments with the Verbmobil and Homecentre corpora show that (1) Viterbi ...

Rens Bod

claim paper

Read More »

154

click to vote

AAAI
2010

171views Intelligent Agents» more AAAI 2010»

Reinforcement Learning via AIXI Approximation

15 years 7 months ago

Download jveness.info

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. This approach is based on a direct approximation of AIXI, a Bayesian...

Joel Veness, Kee Siong Ng, Marcus Hutter, David Si...

claim paper

Read More »

191

click to vote

DATAMINE
1999

143views more DATAMINE 1999»

Partitioning Nominal Attributes in Decision Trees

15 years 5 months ago

Download sci2s.ugr.es

To find the optimal branching of a nominal attribute at a node in an L-ary decision tree, one is often forced to search over all possible L-ary partitions for the one that yields t...

Don Coppersmith, Se June Hong, Jonathan R. M. Hosk...

claim paper

Read More »

181

click to vote

ICML
2007
IEEE

136views Machine Learning» more ICML 2007»

Combining online and offline knowledge in UCT

16 years 6 months ago

Download www.machinelearning.org

The UCT algorithm learns a value function online using sample-based search. The TD() algorithm can learn a value function offline for the on-policy distribution. We consider three...

Sylvain Gelly, David Silver

claim paper

Read More »

« Prev « First page 9 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers