Search Sciweavers | Sciweavers

22

CIMCA
2008
IEEE

125views Intelligent Agents» more CIMCA 2008»

Tree Exploration for Bayesian RL Exploration

14 years 2 months ago

Research in reinforcement learning has produced algorithms for optimal decision making under uncertainty that fall within two main types. The ﬁrst employs a Bayesian framework, ...

Christos Dimitrakakis

posted by olethros

Read More »

29

click to vote

JMLR
2008

230views more JMLR 2008»

Exponentiated Gradient Algorithms for Conditional Random Fields and Max-Margin Markov Networks

13 years 7 months ago

Download www.stat.berkeley.edu

Log-linear and maximum-margin models are two commonly-used methods in supervised machine learning, and are frequently used in structured prediction problems. Efficient learning of...

Michael Collins, Amir Globerson, Terry Koo, Xavier...

claim paper

Read More »

22

click to vote

GECCO
2009
Springer

124views Optimization» more GECCO 2009»

Reinforcement learning for games: failures and successes

14 years 8 days ago

Download www.gm.fh-koeln.de

We apply CMA-ES, an evolution strategy with covariance matrix adaptation, and TDL (Temporal Difference Learning) to reinforcement learning tasks. In both cases these algorithms se...

Wolfgang Konen, Thomas Bartz-Beielstein

claim paper

Read More »

21

click to vote

JMLR
2006

118views more JMLR 2006»

Learning Factor Graphs in Polynomial Time and Sample Complexity

13 years 7 months ago

Download jmlr.csail.mit.edu

We study the computational and sample complexity of parameter and structure learning in graphical models. Our main result shows that the class of factor graphs with bounded degree...

Pieter Abbeel, Daphne Koller, Andrew Y. Ng

claim paper

Read More »

26

click to vote

COLT
2006
Springer

179views Machine Learning» more COLT 2006»

Logarithmic Regret Algorithms for Online Convex Optimization

13 years 11 months ago

Download www.cs.princeton.edu

In an online convex optimization problem a decision-maker makes a sequence of decisions, i.e., chooses a sequence of points in Euclidean space, from a fixed feasible set. After ea...

Elad Hazan, Adam Kalai, Satyen Kale, Amit Agarwal

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers