Search Sciweavers | Sciweavers

30

ACL
2006

161views Computational Linguistics» more ACL 2006»

Minimum Risk Annealing for Training Log-Linear Models

13 years 9 months ago

When training the parameters for a natural language system, one would prefer to minimize 1-best loss (error) on an evaluation set. Since the error surface for many natural languag...

David A. Smith, Jason Eisner

claim paper

Read More »

25

click to vote

HIS
2004

195views Information Technology» more HIS 2004»

Stigmergy in Multi Agent Reinforcement Learning

13 years 9 months ago

Download hal.inria.fr

In this paper, we describe how certain aspects of the biological phenomena of stigmergy can be imported into multiagent reinforcement learning (MARL), with the purpose of better e...

Raghav Aras, Alain Dutech, François Charpil...

claim paper

Read More »

28

click to vote

NIPS
2001

174views Information Technology» more NIPS 2001»

K-Local Hyperplane and Convex Distance Nearest Neighbor Algorithms

13 years 9 months ago

Download books.nips.cc

Guided by an initial idea of building a complex (non linear) decision surface with maximal local margin in input space, we give a possible geometrical intuition as to why K-Neares...

Pascal Vincent, Yoshua Bengio

claim paper

Read More »

31

click to vote

ATAL
2010
Springer

224views Intelligent Agents» more ATAL 2010»

Asynchronous algorithms for approximate distributed constraint optimization with quality bounds

13 years 8 months ago

Download teamcore.usc.edu

Distributed Constraint Optimization (DCOP) is a popular framework for cooperative multi-agent decision making. DCOP is NPhard, so an important line of work focuses on developing f...

Christopher Kiekintveld, Zhengyu Yin, Atul Kumar, ...

claim paper

Read More »

21

click to vote

AI
2008
Springer

101views Artificial Intelligence» more AI 2008»

An approach to efficient planning with numerical fluents and multi-criteria plan quality

13 years 7 months ago

Download www.informatik.uni-freiburg.de

Dealing with numerical information is practically important in many real-world planning domains where the executability of an action can depend on certain numerical conditions, an...

Alfonso Gerevini, Alessandro Saetti, Ivan Serina

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers