Search Sciweavers | Sciweavers

231 search results - page 34 / 47

» Stochastic Optimization is (Almost) as easy as Deterministic...

click to vote

ICRA
2010
IEEE

145views Robotics» more ICRA 2010»

Reinforcement learning of motor skills in high dimensions: A path integral approach

13 years 6 months ago

Download www-personal.acfr.usyd.edu.au

— Reinforcement learning (RL) is one of the most general approaches to learning control. Its applicability to complex motor systems, however, has been largely impossible so far d...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

claim paper

Read More »

click to vote

SIGECOM
2003
ACM

107views ECommerce» more SIGECOM 2003»

Using value queries in combinatorial auctions

14 years 24 days ago

Download www.cs.cmu.edu

Combinatorial auctions, where bidders can bid on bundles of items are known to be desirable auction mechanisms for selling items that are complementary and/or substitutable. Howev...

Benoît Hudson, Tuomas Sandholm

claim paper

Read More »

click to vote

PAMI
2007

107views more PAMI 2007»

Segmentation of Multivariate Mixed Data via Lossy Data Coding and Compression

13 years 7 months ago

Download perception.csl.uiuc.edu

—In this paper, based on ideas from lossy data coding and compression, we present a simple but effective technique for segmenting multivariate mixed data that are drawn from a mi...

Yi Ma, Harm Derksen, Wei Hong, John Wright

claim paper

Read More »

click to vote

AAAI
2008

123views Intelligent Agents» more AAAI 2008»

Computing Observation Vectors for Max-Fault Min-Cardinality Diagnoses

13 years 10 months ago

Download www.cs.ucc.ie

Model-Based Diagnosis (MBD) typically focuses on diagnoses, minimal under some minimality criterion, e.g., the minimal-cardinality set of faulty components that explain an observa...

Alexander Feldman, Gregory M. Provan, Arjan J. C. ...

claim paper

Read More »

click to vote

CDC
2010
IEEE

139views Control Systems» more CDC 2010»

Q-learning and enhanced policy iteration in discounted dynamic programming

13 years 2 months ago

Download web.mit.edu

We consider the classical finite-state discounted Markovian decision problem, and we introduce a new policy iteration-like algorithm for finding the optimal state costs or Q-facto...

Dimitri P. Bertsekas, Huizhen Yu

claim paper

Read More »

« Prev « First page 34 / 47 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers