Search Sciweavers | Sciweavers

1799 search results - page 232 / 360

» Filtered Reinforcement Learning

UIC
2007
Springer

106 views | Applied Computing

Devising a Context Selection-Based Reasoning Engine for Context-Aware Ubiquitous Computing Middleware

15 years 8 months ago

Download uclab.khu.ac.kr

We propose a novel reasoning engine for context-aware ubiquitous computing middleware in this paper. Our reasoning engine supports both rule-based reasoning and machine learning rea...

Donghai Guan, Weiwei Yuan, Seong Jin Cho, Andrey G...

DATAMINE
2010

161 views

Predicting labels for dyadic data

15 years 3 hours ago

Download dollar.biz.uiowa.edu

In dyadic prediction, the input consists of a pair of items (a dyad), and the goal is to predict the value of an observation related to the dyad. Special cases of dyadic predicti...

Aditya Krishna Menon, Charles Elkan

ICML
2009
IEEE

131 views | Machine Learning

Monte-Carlo simulation balancing

16 years 3 months ago

Download www.cs.ualberta.ca

In this paper we introduce the first algorithms for efficiently learning a simulation policy for Monte-Carlo search. Our main idea is to optimise the balance of a simulation polic...

David Silver, Gerald Tesauro

ICML
2001
IEEE

159 views | Machine Learning

Direct Policy Search using Paired Statistical Tests

16 years 3 months ago

Download www.autonlab.org

Direct policy search is a practical way to solve reinforcement learning problems involving continuous state and action spaces. The goal becomes finding policy parameters that maxi...

Malcolm J. A. Strens, Andrew W. Moore

ECML
2007
Springer

192 views | Machine Learning

Policy Gradient Critics

15 years 8 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber
