Search Sciweavers | Sciweavers

499 search results - page 76 / 100

» Model Minimization in Markov Decision Processes

MR
2007

173 views, Robotics

A maintenance planning and business case development model for the application of prognostics and health management (PHM) to ele

13 years 8 months ago

Download www.enme.umd.edu

This paper presents a model that enables the optimal interpretation of Prognostics and Health Management (PHM) results for electronic systems. In this context, optimal interpreta...

Peter A. Sandborn, Chris Wilkinson


ICML
2007
IEEE

172 views, Machine Learning

Conditional random fields for multi-agent reinforcement learning

14 years 9 months ago

Download www.machinelearning.org

Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained using a set of obser...

Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...


ICML
2007
IEEE

200 views, Machine Learning

Multi-task reinforcement learning: a hierarchical Bayesian approach

14 years 9 months ago

Download www.machinelearning.org

We consider the problem of multi-task reinforcement learning, where the agent needs to solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but unknow...

Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepa...


PRIMA
2007
Springer

98 views, Intelligent Agents

Multiagent Planning with Trembling-Hand Perfect Equilibrium in Multiagent POMDPs

14 years 2 months ago

Download lang.is.kyushu-u.ac.jp

Multiagent Partially Observable Markov Decision Processes are a popular model of multiagent systems with uncertainty. Since the computational cost for finding an optimal joint pol...

Yuichi Yabu, Makoto Yokoo, Atsushi Iwasaki


ICML
1999
IEEE

168 views, Machine Learning

Least-Squares Temporal Difference Learning

14 years 9 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

