Search Sciweavers | Sciweavers

881 search results - page 70 / 177

» Diagnosing decision quality

168

click to vote

COLT
2006
Springer

63views Machine Learning» more COLT 2006»

Online Learning with Constraints

15 years 9 months ago

Download isaim2008.unl.edu

In this paper, we study a sequential decision making problem. The objective is to maximize the total reward while satisfying constraints, which are defined at every time step. The...

Shie Mannor, John N. Tsitsiklis

claim paper

Read More »

153

click to vote

AAAI
2008

141views Intelligent Agents» more AAAI 2008»

Online Learning with Expert Advice and Finite-Horizon Constraints

15 years 8 months ago

Download www.aaai.org

In this paper, we study a sequential decision making problem. The objective is to maximize the average reward accumulated over time subject to temporal cost constraints. The novel...

Branislav Kveton, Jia Yuan Yu, Georgios Theocharou...

claim paper

Read More »

142

click to vote

EWRL
2008

144views Machine Learning» more EWRL 2008»

Regularized Fitted Q-Iteration: Application to Planning

15 years 7 months ago

Download eprints.pascal-network.org

We consider planning in a Markovian decision problem, i.e., the problem of finding a good policy given access to a generative model of the environment. We propose to use fitted Q-i...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

claim paper

Read More »

167

Voted

EMNLP
2008

144views Natural Language Processing» more EMNLP 2008»

Generalizing Local and Non-Local Word-Reordering Patterns for Syntax-Based Machine Translation

15 years 7 months ago

Download www.aclweb.org

Syntactic word reordering is essential for translations across different grammar structures between syntactically distant languagepairs. In this paper, we propose to embed local a...

Bing Zhao, Yaser Al-Onaizan

claim paper

Read More »

143

click to vote

IJCAI
2007

170views Artificial Intelligence» more IJCAI 2007»

Memory-Bounded Dynamic Programming for DEC-POMDPs

15 years 7 months ago

Download anytime.cs.umass.edu

Decentralized decision making under uncertainty has been shown to be intractable when each agent has different partial information about the domain. Thus, improving the applicabil...

Sven Seuken, Shlomo Zilberstein

claim paper

Read More »

« Prev « First page 70 / 177 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers