Sciweavers

1799 search results - page 232 / 360
» Filtered Reinforcement Learning
Sort
View
129
Voted
UIC
2007
Springer
15 years 8 months ago
Devising a Context Selection-Based Reasoning Engine for Context-Aware Ubiquitous Computing Middleware
We propose a novel reasoning engine for context-aware ubiquitous computing middleware in this paper. Our reasoning engine supports both rulebased reasoning and machine learning rea...
Donghai Guan, Weiwei Yuan, Seong Jin Cho, Andrey G...
177
Voted
DATAMINE
2010
161views more  DATAMINE 2010»
15 years 3 hour ago
Predicting labels for dyadic data
: In dyadic prediction, the input consists of a pair of items (a dyad), and the goal is to predict the value of an observation related to the dyad. Special cases of dyadic predicti...
Aditya Krishna Menon, Charles Elkan
107
Voted
ICML
2009
IEEE
16 years 3 months ago
Monte-Carlo simulation balancing
In this paper we introduce the first algorithms for efficiently learning a simulation policy for Monte-Carlo search. Our main idea is to optimise the balance of a simulation polic...
David Silver, Gerald Tesauro
144
Voted
ICML
2001
IEEE
16 years 3 months ago
Direct Policy Search using Paired Statistical Tests
Direct policy search is a practical way to solve reinforcement learning problems involving continuous state and action spaces. The goal becomes finding policy parameters that maxi...
Malcolm J. A. Strens, Andrew W. Moore
ECML
2007
Springer
15 years 8 months ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber