Search Sciweavers | Sciweavers

771 search results - page 77 / 155

» Markov Decision Processes with Arbitrary Reward Processes

141

Voted

ACMICEC
2007
ACM

154views ECommerce» more ACMICEC 2007»

Learning and adaptivity in interactive recommender systems

15 years 8 months ago

Download www.inf.unibz.it

Recommender systems are intelligent E-commerce applications that assist users in a decision-making process by offering personalized product recommendations during an interaction s...

Tariq Mahmood, Francesco Ricci

claim paper

Read More »

120

click to vote

ECML
2006
Springer

88views Machine Learning» more ECML 2006»

Reinforcement Learning for MDPs with Constraints

15 years 6 months ago

Download www.peter-geibel.de

In this article, I will consider Markov Decision Processes with two criteria, each defined as the expected value of an infinite horizon cumulative return. The second criterion is e...

Peter Geibel

claim paper

Read More »

164

Voted

SCIA
2005
Springer

211views Image Analysis» more SCIA 2005»

Perception-Action Based Object Detection from Local Descriptor Combination and Reinforcement Learning

15 years 9 months ago

Download www.mobvis.org

This work proposes to learn visual encodings of attention patterns that enables sequential attention for object detection in real world environments. The system embeds a saccadic d...

Lucas Paletta, Gerald Fritz, Christin Seifert

claim paper

Read More »

126

click to vote

IJCAI
2007

143views Artificial Intelligence» more IJCAI 2007»

Direct Code Access in Self-Organizing Neural Networks for Reinforcement Learning

15 years 5 months ago

Download www.aaai.org

TD-FALCON is a self-organizing neural network that incorporates Temporal Difference (TD) methods for reinforcement learning. Despite the advantages of fast and stable learning, TD...

Ah-Hwee Tan

claim paper

Read More »

141

click to vote

QEST
2005
IEEE

137views Modeling and Simulation» more QEST 2005»

iLTLChecker: A Probabilistic Model Checker for Multiple DTMCs

15 years 9 months ago

Download osl.cs.uiuc.edu

iLTL is a probabilistic temporal logic that can specify properties of multiple discrete time Markov chains (DTMCs). In this paper, we describe two related tools: MarkovEstimator a...

YoungMin Kwon, Gul A. Agha

claim paper

Read More »

« Prev « First page 77 / 155 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers