Search Sciweavers | Sciweavers

332 search results - page 19 / 67

» Ranking policies in discrete Markov decision processes

click to vote

ECML
2007
Springer

170views Machine Learning» more ECML 2007»

Sequence Labeling with Reinforcement Learning and Ranking Algorithms

13 years 9 months ago

Download nieme.lip6.fr

Many problems in areas such as Natural Language Processing, Information Retrieval, or Bioinformatic involve the generic task of sequence labeling. In many cases, the aim is to assi...

Francis Maes, Ludovic Denoyer, Patrick Gallinari

claim paper

Read More »

click to vote

ICTAI
2007
IEEE

96views Artificial Intelligence» more ICTAI 2007»

Multi-criteria Decision Making for Local Coordination in Multi-agent Systems

14 years 2 months ago

Download users.info.unicaen.fr

Unlike mono-agent systems, multi-agent planing addresses the problem of resolving conﬂicts between individual and group interests. In this paper, we are using a Decentralized Ve...

Matthieu Boussard, Maroua Bouzid, Abdel-Illah Moua...

claim paper

Read More »

click to vote

ICML
1995
IEEE

213views Machine Learning» more ICML 1995»

Learning Policies for Partially Observable Environments: Scaling Up

14 years 8 months ago

Download reference.kfupm.edu.sa

Partially observable Markov decision processes (pomdp's) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...

Michael L. Littman, Anthony R. Cassandra, Leslie P...

claim paper

Read More »

click to vote

DATE
2008
IEEE

136views Hardware» more DATE 2008»

A Framework of Stochastic Power Management Using Hidden Markov Model

14 years 2 months ago

Download www.date-conference.com

- The effectiveness of stochastic power management relies on the accurate system and workload model and effective policy optimization. Workload modeling is a machine learning proce...

Ying Tan, Qinru Qiu

claim paper

Read More »

click to vote

ISCC
2000
IEEE

104views Communications» more ISCC 2000»

Dynamic Routing and Wavelength Assignment Using First Policy Iteration

14 years 11 days ago

Download www.netlab.tkk.fi

With standard assumptions the routing and wavelength assignment problem (RWA) can be viewed as a Markov Decision Process (MDP). The problem, however, deﬁes an exact solution bec...

Esa Hyytiä, Jorma T. Virtamo

claim paper

Read More »

« Prev « First page 19 / 67 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers