Search Sciweavers | Sciweavers

91 search results - page 12 / 19

» Magnifying-Lens Abstraction for Markov Decision Processes

click to vote

PKDD
2010
Springer

122views Data Mining» more PKDD 2010»

Exploration in Relational Worlds

13 years 6 months ago

Download user.cs.tu-berlin.de

Abstract. One of the key problems in model-based reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large relational domains, in wh...

Tobias Lang, Marc Toussaint, Kristian Kersting

claim paper

Read More »

click to vote

ATAL
2011
Springer

169views Intelligent Agents» more ATAL 2011»

Towards a unifying characterization for quantifying weak coupling in dec-POMDPs

12 years 7 months ago

Download ai.eecs.umich.edu

Researchers in the ﬁeld of multiagent sequential decision making have commonly used the terms “weakly-coupled” and “loosely-coupled” to qualitatively classify problems i...

Stefan J. Witwicki, Edmund H. Durfee

claim paper

Read More »

click to vote

JSAC
2010

107views more JSAC 2010»

Online learning in autonomic multi-hop wireless networks for transmitting mission-critical applications

13 years 6 months ago

Download medianetlab.ee.ucla.edu

Abstract—In this paper, we study how to optimize the transmission decisions of nodes aimed at supporting mission-critical applications, such as surveillance, security monitoring,...

Hsien-Po Shiang, Mihaela van der Schaar

claim paper

Read More »

click to vote

AAAI
2000

144views Intelligent Agents» more AAAI 2000»

Back to the Future for Consistency-Based Trajectory Tracking

13 years 9 months ago

Download people.csail.mit.edu

Given a model of a physical process and a sequence of commands and observations received over time, the task of an autonomous controller is to determine the likely states of the p...

James Kurien, P. Pandurang Nayak

claim paper

Read More »

click to vote

ICML
2004
IEEE

167views Machine Learning» more ICML 2004»

Bellman goes relational

14 years 8 months ago

Download people.csail.mit.edu

Motivated by the interest in relational reinforcement learning, we introduce a novel relational Bellman update operator called ReBel. It employs a constraint logic programming lan...

Kristian Kersting, Martijn Van Otterlo, Luc De Rae...

claim paper

Read More »

« Prev « First page 12 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers