Search Sciweavers | Sciweavers

125 search results - page 22 / 25

» Reinforcement Learning in Continuous Time and Space

181

click to vote

UAI
2008

242views Artificial Intelligence» more UAI 2008»

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

15 years 6 months ago

Download uai2008.cs.helsinki.fi

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...

claim paper

Read More »

160

click to vote

ICML
2010
IEEE

247views Machine Learning» more ICML 2010»

Inverse Optimal Control with Linearly-Solvable MDPs

15 years 6 months ago

Download www.cs.washington.edu

We present new algorithms for inverse optimal control (or inverse reinforcement learning, IRL) within the framework of linearlysolvable MDPs (LMDPs). Unlike most prior IRL algorit...

Dvijotham Krishnamurthy, Emanuel Todorov

claim paper

Read More »

154

click to vote

AGENTS
2000
Springer

119views Security Privacy» more AGENTS 2000»

Adaptivity in agent-based routing for data networks

15 years 9 months ago

Download web.engr.oregonstate.edu

Adaptivity, both of the individual agents and of the interaction structure among the agents, seems indispensable for scaling up multi-agent systems MAS's in noisy environme...

David Wolpert, Sergey Kirshner, Christopher J. Mer...

claim paper

Read More »

166

click to vote

NRHM
2000

149views more NRHM 2000»

Navigable history: a reader's view of writer's time

15 years 5 months ago

Download www.csdl.tamu.edu

Collecting, analyzing, and sharing information via a hypertext results in the continuous modification of information content over a long period of time. Such tasks will benefit fr...

Frank M. Shipman III, Hao-wei Hsieh

claim paper

Read More »

154

click to vote

ICVGIP
2004

143views Computer Vision» more ICVGIP 2004»

Modeling Signs Using Functional Data Analysis

15 years 6 months ago

Download www.csee.usf.edu

1 We present a functional data analysis (FDA) based method to statistically model continuous signs of the American Sign Language (ASL) for use in the recognition of signs in contin...

Sunita Nayak, Sudeep Sarkar, Kuntal Sengupta

claim paper

Read More »

« Prev « First page 22 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers