Search Sciweavers | Sciweavers

1760 search results - page 75 / 352

» Learning from Partial Observations

DMIN
2006

141 views, Data Mining

Extracting Forensic Explanation from Intrusion Alerts

15 years 8 months ago

Download ww1.ucmss.com

Since it is desirable for an intrusion detection system to operate with real-time performance, it is not unusual for an intrusion detection engine to perform a "lazy ...

Bon Sy, Negmat Mullodzhanov

IROS
2008
IEEE

165 views, Robotics

Mutual development of behavior acquisition and recognition based on value system

16 years 1 month ago

Download www.er.ams.eng.osaka-u.ac.jp

Both self-learning architecture (embedded structure) and explicit/implicit teaching from other agents (environmental design issue) are necessary not only for one behavior...

Yasutake Takahashi, Yoshihiro Tamura, Minoru Asada

GECCO
2009
Springer

162 views, Optimization

Uncertainty handling CMA-ES for reinforcement learning

15 years 5 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

The covariance matrix adaptation evolution strategy (CMA-ES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...

Verena Heidrich-Meisner, Christian Igel

ICANN
2007
Springer

95 views, Neural Networks

Solving Deep Memory POMDPs with Recurrent Policy Gradients

16 years 1 month ago

Download www.idsia.ch

This paper presents Recurrent Policy Gradients, a model-free reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

INFOCOM
2012
IEEE

189 views, Communications

Approximately optimal adaptive learning in opportunistic spectrum access

13 years 9 months ago

Download web.eecs.umich.edu

In this paper we develop an adaptive learning algorithm which is approximately optimal for an opportunistic spectrum access (OSA) problem with polynomial complexity. In this OSA...

Cem Tekin, Mingyan Liu
