Search Sciweavers | Sciweavers

24 search results - page 3 / 5

» Learning Policy Improvements with Path Integrals

ATAL
2009
Springer

172 views | Intelligent Agents

Integrating organizational control into multi-agent learning

16 years 1 month ago

Download www.aamas-conference.org

Multi-Agent Reinforcement Learning (MARL) algorithms suffer from slow convergence and even divergence, especially in large-scale systems. In this work, we develop an organization-b...

Chongjie Zhang, Sherief Abdallah, Victor R. Lesser

RTCSA
2006
IEEE

144 views | Embedded Systems

Integrating Compiler and System Toolkit Flow for Embedded VLIW DSP Processors

16 years 1 month ago

Download www.cs.nctu.edu.tw

To support high performance and low power consumption for multimedia applications and hand-held devices, embedded VLIW DSP processors are a focus of research. With the tight resource constr...

Chi Wu, Kun-Yuan Hsieh, Yung-Chia Lin, Chung-Ju Wu...

AAAI
2000

147 views | Intelligent Agents

ADVISOR: A Machine Learning Architecture for Intelligent Tutor Construction

15 years 8 months ago

Download www.aaai.org

We have constructed ADVISOR, a two-agent machine learning architecture for intelligent tutoring systems (ITS). The purpose of this architecture is to centralize the reasoning of a...

Joseph Beck, Beverly Park Woolf, Carole R. Beal

ICML
2006
ACM

156 views | Machine Learning

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

16 years 8 months ago

Download animatlab.lip6.fr

Recent decision-theoretic planning algorithms are able to find optimal solutions in large problems, using Factored Markov Decision Processes (FMDPs). However, these algorithms need ...

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

ESOP
2010
Springer

160 views | Programming Languages

A Semantic Framework for Declassification and Endorsement

15 years 10 months ago

Download www.cs.cornell.edu

Language-based information flow methods offer a principled way to enforce strong security properties, but enforcing noninterference is too inflexible for realistic applications. Se...

Aslan Askarov, Andrew Myers
