Search Sciweavers | Sciweavers

1233 search results - page 245 / 247

» Reinforcement learning

199

click to vote

IJCAI
2003

99views Artificial Intelligence» more IJCAI 2003»

Use of Off-line Dynamic Programming for Efficient Image Interpretation

15 years 8 months ago

Download dli.iiit.ac.in

An interpretation system finds the likely mappings from portions of an image to real-world objects. An interpretation policy specifies when to apply which imaging operator, to whi...

Ramana Isukapalli, Russell Greiner

claim paper

Read More »

218

Voted

NIPS
1998

164views Information Technology» more NIPS 1998»

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

15 years 8 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

200

click to vote

BC
2006

124views more BC 2006»

Motor-maps, navigation and implicit space representation in the hippocampus

15 years 7 months ago

Download ece.ut.ac.ir

Abstract Multiple sensory-motor maps located in the brainstem and the cortex are involved in spatial orientation. Guiding movements of eyes, head, neck and arms they provide an app...

Alexander Kaske, Gösta Winberg, Joakim Cö...

claim paper

Read More »

223

click to vote

JMLR
2006

124views more JMLR 2006»

Policy Gradient in Continuous Time

15 years 7 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

claim paper

Read More »

210

click to vote

JSAC
2007

189views more JSAC 2007»

Non-Cooperative Power Control for Wireless Ad Hoc Networks with Repeated Games

15 years 7 months ago

Download www.cs.ust.hk

— One of the distinctive features in a wireless ad hoc network is lack of any central controller or single point of authority, in which each node/link then makes its own decision...

Chengnian Long, Qian Zhang, Bo Li, Huilong Yang, X...

claim paper

Read More »

« Prev « First page 245 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers