Search Sciweavers | Sciweavers

2415 search results - page 422 / 483

» Markov Processes on Curves

149

click to vote

AIPS
2003

131views Artificial Intelligence» more AIPS 2003»

A Framework for Planning in Continuous-time Stochastic Domains

15 years 6 months ago

Download www.aaai.org

We propose a framework for policy generation in continuoustime stochastic domains with concurrent actions and events of uncertain duration. We make no assumptions regarding the co...

Håkan L. S. Younes, David J. Musliner, Reid ...

claim paper

Read More »

158

click to vote

LWA
2004

124views Software Engineering» more LWA 2004»

Dirichlet Enhanced Latent Semantic Analysis

15 years 6 months ago

Download www.gatsby.ucl.ac.uk

This paper describes nonparametric Bayesian treatments for analyzing records containing occurrences of items. The introduced model retains the strength of previous approaches that...

Kai Yu, Shipeng Yu, Volker Tresp

claim paper

Read More »

151

click to vote

IJCAI
2001

174views Artificial Intelligence» more IJCAI 2001»

Complexity of Probabilistic Planning under Average Rewards

15 years 6 months ago

Download www.informatik.uni-freiburg.de

A general and expressive model of sequential decision making under uncertainty is provided by the Markov decision processes (MDPs) framework. Complex applications with very large ...

Jussi Rintanen

claim paper

Read More »

128

click to vote

FLAIRS
2003

108views Artificial Intelligence» more FLAIRS 2003»

Orthographic Case Restoration Using Supervised Learning Without Manual Annotation

15 years 6 months ago

Download www.aaai.org

One challenge in text processing is the treatment of case insensitive documents such as speech recognition results. The traditional approach is to re-train a language model exclud...

Cheng Niu, Wei Li 0003, Jihong Ding, Rohini K. Sri...

claim paper

Read More »

142

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 6 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

« Prev « First page 422 / 483 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers