Search Sciweavers | Sciweavers

1916 search results - page 234 / 384

» Reconfiguring a state machine

147

click to vote

ICML
2005
IEEE

200views Machine Learning» more ICML 2005»

2D Conditional Random Fields for Web information extraction

16 years 3 months ago

Download research.microsoft.com

The Web contains an abundance of useful semistructured information about real world objects, and our empirical study shows that strong sequence characteristics exist for Web infor...

Jun Zhu, Zaiqing Nie, Ji-Rong Wen, Bo Zhang, Wei-Y...

claim paper

Read More »

128

click to vote

ICML
2004
IEEE

145views Machine Learning» more ICML 2004»

Convergence of synchronous reinforcement learning with linear function approximation

16 years 3 months ago

Download www.machinelearning.org

Synchronous reinforcement learning (RL) algorithms with linear function approximation are representable as inhomogeneous matrix iterations of a special form (Schoknecht & Merk...

Artur Merke, Ralf Schoknecht

claim paper

Read More »

133

click to vote

ICML
2004
IEEE

156views Machine Learning» more ICML 2004»

Learning to fly by combining reinforcement learning with behavioural cloning

16 years 3 months ago

Download ccc.inaoep.mx

Reinforcement learning deals with learning optimal or near optimal policies while interacting with the environment. Application domains with many continuous variables are difficul...

Eduardo F. Morales, Claude Sammut

claim paper

Read More »

130

click to vote

ICML
2004
IEEE

120views Machine Learning» more ICML 2004»

Utile distinction hidden Markov models

16 years 3 months ago

Download www.idsia.ch

This paper addresses the problem of constructing good action selection policies for agents acting in partially observable environments, a class of problems generally known as Part...

Daan Wierstra, Marco Wiering

claim paper

Read More »

114

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Q-Decomposition for Reinforcement Learning Agents

16 years 3 months ago

Download www.hpl.hp.com

The paper explores a very simple agent design method called Q-decomposition, wherein a complex agent is built from simpler subagents. Each subagent has its own reward function and...

Stuart J. Russell, Andrew Zimdars

claim paper

Read More »

« Prev « First page 234 / 384 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers