Search Sciweavers | Sciweavers

632 search results - page 17 / 127

» Updating with incomplete observations

click to vote

ML
2002
ACM

154views Machine Learning» more ML 2002»

Technical Update: Least-Squares Temporal Difference Learning

13 years 7 months ago

Download www.research.rutgers.edu

TD() is a popular family of algorithms for approximate policy evaluation in large MDPs. TD() works by incrementally updating the value function after each observed transition. It h...

Justin A. Boyan

claim paper

Read More »

click to vote

JUCS
2008

104views more JUCS 2008»

Optimal Transit Price Negotiation: The Distributed Learning Perspective

13 years 7 months ago

Download www.jucs.org

: We present a distributed learning algorithm for optimizing transit prices in the inter-domain routing framework. We present a combined game theoretical and distributed algorithmi...

Dominique Barth, Loubna Echabbi, Chahinez Hamlaoui

claim paper

Read More »

click to vote

AAAI
2006

121views Intelligent Agents» more AAAI 2006»

Reasoning about Partially Observed Actions

13 years 9 months ago

Download www.cs.stanford.edu

Partially observed actions are observations of action executions in which we are uncertain about the identity of objects, agents, or locations involved in the actions (e.g., we kn...

Megan Nance, Adam Vogel, Eyal Amir

claim paper

Read More »

click to vote

ICASSP
2009
IEEE

144views Signal Processing» more ICASSP 2009»

A mixed time-scale algorithm for distributed parameter estimation : Nonlinear observation models and imperfect communication

14 years 2 months ago

Download www.ece.cmu.edu

Abstract— The paper considers the algorithm NLU for distributed (vector) parameter estimation in sensor networks, where, the local observation models are nonlinear, and inter-sen...

Soummya Kar, José M. F. Moura

claim paper

Read More »

click to vote

NIPS
2008

252views Information Technology» more NIPS 2008»

An Homotopy Algorithm for the Lasso with Online Observations

13 years 9 months ago

Download www.eecs.berkeley.edu

It has been shown that the problem of 1-penalized least-square regression commonly referred to as the Lasso or Basis Pursuit DeNoising leads to solutions that are sparse and there...

Pierre Garrigues, Laurent El Ghaoui

claim paper

Read More »

« Prev « First page 17 / 127 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers