Sciweavers

190 search results - page 33 / 38
» Abstraction and Generalization in Reinforcement Learning: A ...
Sort
View
MLG
2007
Springer
14 years 3 months ago
Abductive Stochastic Logic Programs for Metabolic Network Inhibition Learning
Abstract. We revisit an application developed originally using Inductive Logic Programming (ILP) by replacing the underlying Logic Program (LP) description with Stochastic Logic Pr...
Jianzhong Chen, Stephen Muggleton, Jose Santos
EKAW
2004
Springer
14 years 2 months ago
Semantic Webs for Learning: A Vision and Its Realization
Abstract. Augmenting web pages with semantic contents, i.e., building a ‘Semantic Web’, promises a number of benefits for web users in general and learners in particular. Seman...
Arthur Stutt, Enrico Motta
ICIA
2007
13 years 11 months ago
Learning Interaction between Conflicting Human Agents and Their Assistants
We build the generic methodology based on machine learning and reasoning to detect the patterns of interaction between conflicting agents, including humans and their assistants. L...
Boris Galitsky, Boris Kovalerchuk
CORR
2007
Springer
130views Education» more  CORR 2007»
13 years 9 months ago
Lagrangian Relaxation for MAP Estimation in Graphical Models
Abstract— We develop a general framework for MAP estimation in discrete and Gaussian graphical models using Lagrangian relaxation techniques. The key idea is to reformulate an in...
Jason K. Johnson, Dmitry M. Malioutov, Alan S. Wil...
ALT
2009
Springer
14 years 6 months ago
Pure Exploration in Multi-armed Bandits Problems
Abstract. We consider the framework of stochastic multi-armed bandit problems and study the possibilities and limitations of strategies that explore sequentially the arms. The stra...
Sébastien Bubeck, Rémi Munos, Gilles...