Search Sciweavers | Sciweavers

1916 search results - page 181 / 384

Reconfiguring a state machine

ICML
2003
IEEE

151 views, Machine Learning

Hierarchical Policy Gradient Algorithms

14 years 9 months ago

Download www.hpl.hp.com

Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...

Mohammad Ghavamzadeh, Sridhar Mahadevan

ICML
2000
IEEE

155 views, Machine Learning

Maximum Entropy Markov Models for Information Extraction and Segmentation

14 years 9 months ago

Download www.seas.upenn.edu

Hidden Markov models (HMMs) are a powerful probabilistic tool for modeling sequential data, and have been applied with success to many text-related tasks, such as part-of-speech t...

Andrew McCallum, Dayne Freitag, Fernando C. N. Per...

COLT
2007
Springer

120 views, Machine Learning

Online Learning with Prior Knowledge

14 years 2 months ago

Download www.cs.princeton.edu

The standard so-called experts algorithms are methods for utilizing a given set of “experts” to make good choices in a sequential decision-making problem. In the standard setti...

Elad Hazan, Nimrod Megiddo

ICCAD
1997
IEEE

126 views, Hardware

An output encoding problem and a solution technique

14 years 11 days ago

Download www-crc.stanford.edu

We present a new output encoding problem as follows: Given a specification table, such as a truth table or a finite state machine state table, where some of the outputs are specif...

Subhasish Mitra, LaNae J. Avra, Edward J. McCluske...

COLT
1995
Springer

106 views, Machine Learning

Exactly Learning Automata with Small Cover Time

13 years 11 months ago

Download www.cs.huji.ac.il

We present algorithms for exactly learning unknown environments that can be described by deterministic finite automata. The learner performs a walk on the target automaton, where at...

Dana Ron, Ronitt Rubinfeld
