Search Sciweavers | Sciweavers

118 search results - page 20 / 24

» An Evolutionary Random Policy Search Algorithm for Solving M...

click to vote

CDC
2010
IEEE

136views Control Systems» more CDC 2010»

Pathologies of temporal difference methods in approximate dynamic programming

13 years 2 months ago

Download web.mit.edu

Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...

Dimitri P. Bertsekas

claim paper

Read More »

click to vote

WECWIS
2005
IEEE

141views ECommerce» more WECWIS 2005»

An Adaptive Bilateral Negotiation Model for E-Commerce Settings

14 years 1 months ago

Download eprints.ecs.soton.ac.uk

This paper studies adaptive bilateral negotiation between software agents in e-commerce environments. Speciﬁcally, we assume that the agents are self-interested, the environment...

Vidya Narayanan, Nicholas R. Jennings

claim paper

Read More »

click to vote

ICML
2007
IEEE

200views Machine Learning» more ICML 2007»

Multi-task reinforcement learning: a hierarchical Bayesian approach

14 years 8 months ago

Download www.machinelearning.org

We consider the problem of multi-task reinforcement learning, where the agent needs to solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but unknow...

Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepa...

claim paper

Read More »

click to vote

ICML
2008
IEEE

122views Machine Learning» more ICML 2008»

Reinforcement learning in the presence of rare events

14 years 8 months ago

Download www.ece.mcgill.ca

We consider the task of reinforcement learning in an environment in which rare significant events occur independently of the actions selected by the controlling agent. If these ev...

Jordan Frank, Shie Mannor, Doina Precup

claim paper

Read More »

click to vote

QUESTA
2010

112views more QUESTA 2010»

Admission control for a multi-server queue with abandonment

13 years 5 months ago

Download www-bcf.usc.edu

In a M/M/N+M queue, when there are many customers waiting, it may be preferable to reject a new arrival rather than risk that arrival later abandoning without receiving service. O...

Yasar Levent Koçaga, Amy R. Ward

claim paper

Read More »

« Prev « First page 20 / 24 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers