Search Sciweavers | Sciweavers

892 search results - page 88 / 179

» Action respecting embedding

NIPS
2008

Information Technology

Particle Filter-based Policy Gradient in POMDPs

15 years 6 months ago

Download eprints.pascal-network.org

Our setting is a Partially Observable Markov Decision Process with continuous state, observation and action spaces. Decisions are based on a Particle Filter for estimating the bel...

Pierre-Arnaud Coquelin, Romain Deguest, Rém...

NAACL
2007

Computational Linguistics

Incremental Non-Projective Dependency Parsing

15 years 6 months ago

Download acl.ldc.upenn.edu

An open issue in data-driven dependency parsing is how to handle non-projective dependencies, which seem to be required by linguistically adequate representations, but which pose ...

Joakim Nivre

AIPS
2003

Artificial Intelligence

A Mixed-initiative Framework for Robust Plan Sketching

15 years 5 months ago

Download www.ai.sri.com

Sketching provides a natural and compact means for a user to outline a plan for a high-level objective. Previous work on plan sketching required that sketches be valid, meaning th...

Karen L. Myers, Peter Jarvis, Mabry Tyson, Michael...

ATAL
2010
Springer

Intelligent Agents

TacTex09: a champion bidding agent for ad auctions

15 years 5 months ago

Download www.cs.utexas.edu

In the Trading Agent Competition Ad Auctions Game, agents compete to sell products by bidding to have their ads shown in a search engine's sponsored search results. We report...

David Pardoe, Doran Chakraborty, Peter Stone

CORR
2008
Springer

Education

A Minimum Relative Entropy Principle for Learning and Acting

15 years 4 months ago

Download www.jair.org

This paper proposes a method to construct an adaptive agent that is universal with respect to a given class of experts, where each expert is designed specifically for a particular...

Pedro A. Ortega, Daniel A. Braun
