Search Sciweavers | Sciweavers

3084 search results - page 201 / 617

» Learning to Take Actions

124

Voted

TSMC
2002

98views more TSMC 2002»

The STAR automaton: expediency and optimality properties

15 years 4 months ago

Download www.conta.uom.gr

Abstract--We present the STack ARchitecture (STAR) automaton. It is a fixed structure, multiaction, reward-penalty learning automaton, characterized by a star-shaped state transiti...

Anastasios A. Economides, Athanasios Kehagias

claim paper

Read More »

175

click to vote

NN
2007
Springer

105views Neural Networks» more NN 2007»

Guiding exploration by pre-existing knowledge without modifying reward

15 years 4 months ago

Download www.cs.hut.fi

Reinforcement learning is based on exploration of the environment and receiving reward that indicates which actions taken by the agent are good and which ones are bad. In many app...

Kary Främling

claim paper

Read More »

197

click to vote

CORR
2012
Springer

216views Education» more CORR 2012»

Fractional Moments on Bandit Problems

14 years 8 days ago

Download www.cse.iitm.ac.in

Reinforcement learning addresses the dilemma between exploration to ﬁnd profitable actions and exploitation to act according to the best observations already made. Bandit proble...

Ananda Narayanan B., Balaraman Ravindran

claim paper

Read More »

162

Voted

JMLR
2012

229views Programming Languages» more JMLR 2012»

Hierarchical Relative Entropy Policy Search

13 years 7 months ago

Download www.ias.informatik.tu-darmstadt.de

Many real-world problems are inherently hierarchically structured. The use of this structure in an agent’s policy may well be the key to improved scalability and higher performa...

Christian Daniel, Gerhard Neumann, Jan Peters

claim paper

Read More »

153

click to vote

CVPR
2000
IEEE

252views Computer Vision» more CVPR 2000»

Multimodal Speaker Detection Using Error Feedback Dynamic Bayesian Networks

16 years 6 months ago

Download www.cc.gatech.edu

Design and development of novel human-computer interfaces poses a challenging problem: actions and intentions of users have to be inferred from sequences of noisy and ambiguous mu...

Vladimir Pavlovic, James M. Rehg, Ashutosh Garg, T...

claim paper

Read More »

« Prev « First page 201 / 617 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers