Search Sciweavers | Sciweavers

475 search results - page 11 / 95

» TSSP: A Reinforcement Algorithm to Find Related Papers

160

click to vote

ICML
2002
IEEE

146views Machine Learning» more ICML 2002»

Hierarchically Optimal Average Reward Reinforcement Learning

16 years 6 months ago

Download www.cs.ualberta.ca

Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

146

click to vote

FLAIRS
1998

132views Artificial Intelligence» more FLAIRS 1998»

Analytical Design of Reinforcement Learning Tasks

15 years 7 months ago

Download www.aaai.org

Reinforcement learning (RL) problems constitute an important class of learning and control problems faced by artificial intelligence systems. In these problems, one is faced with ...

Robert E. Smith

claim paper

Read More »

131

click to vote

UAI
2004

117views Artificial Intelligence» more UAI 2004»

On Finding Minimal w-cutset

15 years 7 months ago

Download uai.sis.pitt.edu

The complexity of a reasoning task over a graphical model is tied to the induced width of the underlying graph. It is well-known that the conditioning (assigning values) on a subs...

Bozhena Bidyuk, Rina Dechter

claim paper

Read More »

192

click to vote

SDM
2004
SIAM

194views Data Mining» more SDM 2004»

Finding Frequent Patterns in a Large Sparse Graph

15 years 7 months ago

Download glaros.dtc.umn.edu

Graph-based modeling has emerged as a powerful abstraction capable of capturing in a single and unified framework many of the relational, spatial, topological, and other characteri...

Michihiro Kuramochi, George Karypis

claim paper

Read More »

144

click to vote

ICML
2000
IEEE

126views Machine Learning» more ICML 2000»

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 6 months ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled ??? ?s. We introduce ??? ?, a...

Jonathan Baxter, Peter L. Bartlett

claim paper

Read More »

« Prev « First page 11 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers