Search Sciweavers | Sciweavers

343 search results - page 55 / 69

» Action discovery for reinforcement learning

click to vote

CDC
2010
IEEE

160views Control Systems» more CDC 2010»

Adaptive bases for Q-learning

13 years 2 months ago

Download webee.technion.ac.il

Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

claim paper

Read More »

click to vote

AI
2006
Springer

197views Artificial Intelligence» more AI 2006»

Adaptive Fraud Detection Using Benford's Law

13 years 11 months ago

Download csc.lsu.edu

Abstract. Adaptive Benford's Law [1] is a digital analysis technique that specifies the probabilistic distribution of digits for many commonly occurring phenomena, even for in...

Fletcher Lu, J. Efrim Boritz, H. Dominic Covvey

claim paper

Read More »

click to vote

IJCNN
2006
IEEE

111views Neural Networks» more IJCNN 2006»

Training Coordination Proxy Agents

14 years 1 months ago

Download cs.itd.nrl.navy.mil

— Delegating the coordination role to proxy agents can improve the overall outcome of the task at the expense of cognitive overload due to switching subtasks. Stability and commi...

Myriam Abramson, William Chao, Ranjeev Mittu

claim paper

Read More »

click to vote

CEC
2003
IEEE

102views Artificial Intelligence» more CEC 2003»

Real-time adaptation technique to real robots: an experiment with a humanoid robot

14 years 29 days ago

Download www.iba.t.u-tokyo.ac.jp

We introduce a technique that allows a real robot to execute real-time learning, in which GP and RL are integrated. In our former research, we showed the result of an experiment wi...

Shotaro Kamio, Hitoshi Iba

claim paper

Read More »

click to vote

EWRL
2008

129views Machine Learning» more EWRL 2008»

Markov Decision Processes with Arbitrary Reward Processes

13 years 9 months ago

Download www.cim.mcgill.ca

Abstract. We consider a control problem where the decision maker interacts with a standard Markov decision process with the exception that the reward functions vary arbitrarily ove...

Jia Yuan Yu, Shie Mannor, Nahum Shimkin

claim paper

Read More »

« Prev « First page 55 / 69 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers