Search Sciweavers | Sciweavers

95 search results - page 9 / 19

» Policy Gradients for Cryptanalysis

201

click to vote

EWRL
2008

148views Machine Learning» more EWRL 2008»

Policy Learning - A Unified Perspective with Applications in Robotics

15 years 8 months ago

Download www.kyb.tuebingen.mpg.de

Policy Learning approaches are among the best suited methods for high-dimensional, continuous control systems such as anthropomorphic robot arms and humanoid robots. In this paper,...

Jan Peters, Jens Kober, Duy Nguyen-Tuong

claim paper

Read More »

181

click to vote

CTRSA
2008
Springer

150views Cryptology» more CTRSA 2008»

Improving the Efficiency of Impossible Differential Cryptanalysis of Reduced Camellia and MISTY1

15 years 8 months ago

Download www.cosic.esat.kuleuven.be

Abstract. Camellia and MISTY1 are Feistel block ciphers. In this paper, we observe that, when conducting impossible differential cryptanalysis on Camellia and MISTY1, their round s...

Jiqiang Lu, Jongsung Kim, Nathan Keller, Orr Dunke...

claim paper

Read More »

188

Voted

NIPS
2003

180views Information Technology» more NIPS 2003»

Bounded Finite State Controllers

15 years 8 months ago

Download books.nips.cc

We describe a new approximation algorithm for solving partially observable MDPs. Our bounded policy iteration approach searches through the space of bounded-size, stochastic ﬁni...

Pascal Poupart, Craig Boutilier

claim paper

Read More »

173

click to vote

ICML
2000
IEEE

126views Machine Learning» more ICML 2000»

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 7 months ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled ??? ?s. We introduce ??? ?, a...

Jonathan Baxter, Peter L. Bartlett

claim paper

Read More »

188

click to vote

IOR
2011

107views more IOR 2011»

Information Collection on a Graph

15 years 1 months ago

Download www.castlelab.princeton.edu

We derive a knowledge gradient policy for an optimal learning problem on a graph, in which we use sequential measurements to reﬁne Bayesian estimates of individual edge values i...

Ilya O. Ryzhov, Warren B. Powell

claim paper

Read More »

« Prev « First page 9 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers