Search Sciweavers | Sciweavers

659 search results - page 73 / 132

» Invisible Learning in Online Systems

130

click to vote

IJCAI
2007

172views Artificial Intelligence» more IJCAI 2007»

Optimistic Active-Learning Using Mutual Information

15 years 5 months ago

Download www.ijcai.org

An “active learning system” will sequentially decide which unlabeled instance to label, with the goal of efﬁciently gathering the information necessary to produce a good cla...

Yuhong Guo, Russell Greiner

claim paper

Read More »

129

click to vote

AR
2002

157views more AR 2002»

Acquiring state from control dynamics to learn grasping policies for robot hands

15 years 4 months ago

Download www.mit.edu

Abstract--A prominent emerging theory of sensorimotor development in biological systems proposes that control knowledge is encoded in the dynamics of physical interaction with the ...

Roderic A. Grupen, Jefferson A. Coelho Jr.

claim paper

Read More »

150

click to vote

ICTAI
2010
IEEE

211views Artificial Intelligence» more ICTAI 2010»

Combining Mixed Integer Programming and Supervised Learning for Fast Re-planning

15 years 2 months ago

Download www.montefiore.ulg.ac.be

We introduce a new plan repair method for problems cast as Mixed Integer Programs. In order to tackle the inherent complexity of these NP-hard problems, our approach relies on the ...

Emmanuel Rachelson, Ala Ben Abbes, Sebastien Dieme...

claim paper

Read More »

132

click to vote

CDC
2010
IEEE

106views Control Systems» more CDC 2010»

Optimal cross-layer wireless control policies using TD learning

14 years 11 months ago

Download www.stanford.edu

We present an on-line crosslayer control technique to characterize and approximate optimal policies for wireless networks. Our approach combines network utility maximization and ad...

Sean P. Meyn, Wei Chen, Daniel O'Neill

claim paper

Read More »

151

click to vote

NIPS
1996

192views Information Technology» more NIPS 1996»

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 5 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

claim paper

Read More »

« Prev « First page 73 / 132 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers