Search Sciweavers | Sciweavers

260 search results - page 49 / 52

» Quasi-Deterministic Partially Observable Markov Decision Pro...

170

click to vote

CPAIOR
2008
Springer

198views Operations Research» more CPAIOR 2008»

Amsaa: A Multistep Anticipatory Algorithm for Online Stochastic Combinatorial Optimization

15 years 6 months ago

Download cs.brown.edu

The one-step anticipatory algorithm (1s-AA) is an online algorithm making decisions under uncertainty by ignoring future non-anticipativity constraints. It makes near-optimal decis...

Luc Mercier, Pascal Van Hentenryck

claim paper

Read More »

172

click to vote

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

16 years 5 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

161

Voted

DEXA
2003
Springer

147views Database» more DEXA 2003»

Context-Aware Data Mining Framework for Wireless Medical Application

15 years 10 months ago

Download www.sice.umkc.edu

Abstract. Data mining, which aims at extracting interesting information from large collections of data, has been widely used as an eﬀective decision making tool. Mining the datas...

Pravin Vajirkar, Sachin Singh, Yugyung Lee

claim paper

Read More »

171

click to vote

ICML
2007
IEEE

200views Machine Learning» more ICML 2007»

Multi-task reinforcement learning: a hierarchical Bayesian approach

16 years 5 months ago

Download www.machinelearning.org

We consider the problem of multi-task reinforcement learning, where the agent needs to solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but unknow...

Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepa...

claim paper

Read More »

157

click to vote

ICML
2004
IEEE

214views Machine Learning» more ICML 2004»

Apprenticeship learning via inverse reinforcement learning

16 years 5 months ago

Download ai.stanford.edu

We consider learning in a Markov decision process where we are not explicitly given a reward function, but where instead we can observe an expert demonstrating the task that we wa...

Pieter Abbeel, Andrew Y. Ng

claim paper

Read More »

« Prev « First page 49 / 52 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers