Search Sciweavers | Sciweavers

312 search results - page 35 / 63

» Learning Partially Observable Deterministic Action Models

131

click to vote

ATAL
2010
Springer

146views Intelligent Agents» more ATAL 2010»

PAC-MDP learning with knowledge-based admissible models

15 years 2 months ago

Download www.aamas-conference.org

PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...

Marek Grzes, Daniel Kudenko

claim paper

Read More »

125

click to vote

ICRA
2008
IEEE

208views Robotics» more ICRA 2008»

Unsupervised body scheme learning through self-perception

15 years 8 months ago

Download www.informatik.uni-freiburg.de

— In this paper, we present an approach allowing a robot to learn a generative model of its own physical body from scratch using self-perception with a single monocular camera. O...

Jürgen Sturm, Christian Plagemann, Wolfram Bu...

claim paper

Read More »

134

click to vote

ECAI
2010
Springer

238views Artificial Intelligence» more ECAI 2010»

The Dynamics of Multi-Agent Reinforcement Learning

15 years 3 months ago

Download www.doc.ic.ac.uk

Abstract. Infinite-horizon multi-agent control processes with nondeterminism and partial state knowledge have particularly interesting properties with respect to adaptive control, ...

Luke Dickens, Krysia Broda, Alessandra Russo

claim paper

Read More »

132

click to vote

DATE
2008
IEEE

136views Hardware» more DATE 2008»

A Framework of Stochastic Power Management Using Hidden Markov Model

15 years 8 months ago

Download www.date-conference.com

- The effectiveness of stochastic power management relies on the accurate system and workload model and effective policy optimization. Workload modeling is a machine learning proce...

Ying Tan, Qinru Qiu

claim paper

Read More »

113

click to vote

ICML
1990
IEEE

106views Machine Learning» more ICML 1990»

Explanations of Empirically Derived Reactive Plans

15 years 6 months ago

Download www.cs.uwyo.edu

Given an adequate simulation model of the task environment and payoff function that measures the quality of partially successful plans, competition-based heuristics such as geneti...

Diana F. Gordon, John J. Grefenstette

claim paper

Read More »

« Prev « First page 35 / 63 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers