Search Sciweavers | Sciweavers

262 search results - page 50 / 53

» Bounded-Parameter Partially Observable Markov Decision Proce...

IAT
2009
IEEE

120 views, Intelligent Agents

Introducing Communication in Dis-POMDPs with Finite State Machines

14 years 5 months ago

Download www.eecs.harvard.edu

Distributed Partially Observable Markov Decision Problems (DisPOMDPs) are emerging as a popular approach for modeling sequential decision making in teams operating under uncertain...

Yuki Iwanari, Makoto Tasaki, Makoto Yokoo, Atsushi...

ICML
1999
IEEE

168 views, Machine Learning

Least-Squares Temporal Difference Learning

14 years 11 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

DEXA
2003
Springer

147 views, Database

Context-Aware Data Mining Framework for Wireless Medical Application

14 years 4 months ago

Download www.sice.umkc.edu

Abstract. Data mining, which aims at extracting interesting information from large collections of data, has been widely used as an effective decision making tool. Mining the datas...

Pravin Vajirkar, Sachin Singh, Yugyung Lee

MOBIHOC
2008
ACM

136 views, Computer Networks

Routing in a cyclic mobispace

14 years 10 months ago

Download www.cse.fau.edu

A key challenge of routing in delay tolerant networks (DTNs) is to find routes that have high delivery rates and low end-to-end delays. When oracles are not available for future co...

Cong Liu, Jie Wu

ICML
2007
IEEE

200 views, Machine Learning

Multi-task reinforcement learning: a hierarchical Bayesian approach

14 years 11 months ago

Download www.machinelearning.org

We consider the problem of multi-task reinforcement learning, where the agent needs to solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but unknow...

Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepa...
