Search Sciweavers | Sciweavers

30

TSMC
2011

258views more TSMC 2011»

Cross-Entropy Optimization of Control Policies With Adaptive Basis Functions

13 years 2 months ago

—This paper introduces an algorithm for direct search of control policies in continuous-state discrete-action Markov decision processes. The algorithm looks for the best closed-l...

Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...

claim paper

Read More »

22

click to vote

IJCAI
2007

135views Artificial Intelligence» more IJCAI 2007»

Using Learned Policies in Heuristic-Search Planning

13 years 9 months ago

Download www2.parc.com

Many current state-of-the-art planners rely on forward heuristic search. The success of such search typically depends on heuristic distance-to-the-goal estimates derived from the ...

Sung Wook Yoon, Alan Fern, Robert Givan

claim paper

Read More »

30

click to vote

CIKM
2009
Springer

197views Information Technology» more CIKM 2009»

Applying differential privacy to search queries in a policy based interactive framework

14 years 2 months ago

Download ebiquity.umbc.edu

Web search logs are of growing importance to researchers as they help understanding search behavior and search engine performance. However, search logs typically contain sensitive...

Palanivel Balaji Kodeswaran, Evelyne Viegas

claim paper

Read More »

29

click to vote

UAI
2000

133views Artificial Intelligence» more UAI 2000»

PEGASUS: A policy search method for large MDPs and POMDPs

13 years 9 months ago

Download ai.stanford.edu

We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...

Andrew Y. Ng, Michael I. Jordan

claim paper

Read More »

28

click to vote

CCIA
2005
Springer

117views Artificial Intelligence» more CCIA 2005»

Direct Policy Search Reinforcement Learning for Robot Control

14 years 1 months ago

Download vicorob.udg.es

— This paper proposes a high-level Reinforcement Learning (RL) control system for solving the action selection problem of an autonomous robot. Although the dominant approach, whe...

Andres El-Fakdi, Marc Carreras, Narcís Palo...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers