Search Sciweavers | Sciweavers

238 search results - page 47 / 48

» Value-Function Approximations for Partially Observable Marko...

213

click to vote

DSN
2009
IEEE

131views Computer Networks» more DSN 2009»

RRE: A game-theoretic intrusion Response and Recovery Engine

15 years 4 months ago

Download netfiles.uiuc.edu

Preserving the availability and integrity of networked computing systems in the face of fast-spreading intrusions requires advances not only in detection algorithms, but also in a...

Saman A. Zonouz, Himanshu Khurana, William H. Sand...

claim paper

Read More »

195

click to vote

ATAL
2003
Springer

126views Intelligent Agents» more ATAL 2003»

Performance models for large scale multiagent systems: using distributed POMDP building blocks

16 years 4 days ago

Download teamcore.usc.edu

Given a large group of cooperative agents, selecting the right coordination or conﬂict resolution strategy can have a signiﬁcant impact on their performance (e.g., speed of co...

Hyuckchul Jung, Milind Tambe

claim paper

Read More »

194

click to vote

HRI
2007
ACM

133views Human Computer Interaction» more HRI 2007»

Efficient model learning for dialog management

15 years 10 months ago

Download www.eecs.ucf.edu

Intelligent planning algorithms such as the Partially Observable Markov Decision Process (POMDP) have succeeded in dialog management applications [10, 11, 12] because of their rob...

Finale Doshi, Nicholas Roy

claim paper

Read More »

194

click to vote

ATAL
2009
Springer

135views Intelligent Agents» more ATAL 2009»

An empirical analysis of value function-based and policy search reinforcement learning

16 years 1 months ago

Download userweb.cs.utexas.edu

In several agent-oriented scenarios in the real world, an autonomous agent that is situated in an unknown environment must learn through a process of trial and error to take actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

189

click to vote

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

15 years 6 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

« Prev « First page 47 / 48 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers