Search Sciweavers | Sciweavers

1176 search results - page 135 / 236

» Sparse reward processes

156

click to vote

ATAL
2005
Springer

101views Intelligent Agents» more ATAL 2005»

Automated resource-driven mission phasing techniques for constrained agents

15 years 6 months ago

Download www.cs.huji.ac.il

A constrained agent is limited in the actions that it can take at any given time, and a challenging problem is to design policies for such agents to do the best they can despite t...

Jianhui Wu, Edmund H. Durfee

claim paper

Read More »

127

click to vote

AAAI
2010

154views Intelligent Agents» more AAAI 2010»

Towards Multiagent Meta-level Control

15 years 6 months ago

Download coitweb.uncc.edu

Embedded systems consisting of collaborating agents capable of interacting with their environment are becoming ubiquitous. It is crucial for these systems to be able to adapt to t...

Shanjun Cheng, Anita Raja, Victor R. Lesser

claim paper

Read More »

138

click to vote

AAAI
2010

178views Intelligent Agents» more AAAI 2010»

Multi-Task Active Learning with Output Constraints

15 years 6 months ago

Download www.cs.cmu.edu

Many problems in information extraction, text mining, natural language processing and other fields exhibit the same property: multiple prediction tasks are related in the sense th...

Yi Zhang 0010

claim paper

Read More »

136

click to vote

NIPS
2008

109views Information Technology» more NIPS 2008»

Biasing Approximate Dynamic Programming with a Lower Discount Factor

15 years 5 months ago

Download hal.inria.fr

Most algorithms for solving Markov decision processes rely on a discount factor, which ensures their convergence. It is generally assumed that using an artificially low discount f...

Marek Petrik, Bruno Scherrer

claim paper

Read More »

178

click to vote

NIPS
2007

207views Information Technology» more NIPS 2007»

Bayes-Adaptive POMDPs

15 years 5 months ago

Download books.nips.cc

Bayesian Reinforcement Learning has generated substantial interest recently, as it provides an elegant solution to the exploration-exploitation trade-off in reinforcement learning...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

« Prev « First page 135 / 236 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers