Search Sciweavers | Sciweavers

908 search results - page 83 / 182

» Stochastic Finite Learning

212

click to vote

ICML
1994
IEEE

151views Machine Learning» more ICML 1994»

Learning Without State-Estimation in Partially Observable Markovian Decision Processes

15 years 11 months ago

Download www.eecs.umich.edu

Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...

claim paper

Read More »

329

click to vote

Publication

233views

Sparse reward processes

14 years 6 months ago

Download arxiv.org

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is relation among those tasks, then the information gained duri...

Christos Dimitrakakis

posted by olethros

Read More »

161

click to vote

ICML
2003
IEEE

156views Machine Learning» more ICML 2003»

AWESOME: A General Multiagent Learning Algorithm that Converges in Self-Play and Learns a Best Response Against Stationary Oppon

16 years 8 months ago

Download www-2.cs.cmu.edu

A satisfactory multiagent learning algorithm should, at a minimum, learn to play optimally against stationary opponents and converge to a Nash equilibrium in self-play. The algori...

Vincent Conitzer, Tuomas Sandholm

claim paper

Read More »

309

click to vote

Lab

652views

Electronic Enterprises Laboratory

17 years 6 months ago

Download lcm.csa.iisc.ernet.in

Our research is motivated by a strong conviction that business processes in electronic enterprises can be designed to deliver high levels of performance through the use of mathemat...

posted by sujit

Read More »

199

click to vote

COCOON
1995
Springer

137views Combinatorics» more COCOON 1995»

Constructing Craig Interpolation Formulas

15 years 11 months ago

Download www.cs.uwaterloo.ca

A Craig interpolant of two inconsistent theories is a formula which is true in one and false in the other. This paper gives an eificient method for constructing a Craig interpolant...

Guoxiang Huang

claim paper

Read More »

« Prev « First page 83 / 182 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers