Sciweavers

983 search results - page 173 / 197
» A Better Update Policy
Sort
View
ATAL
2010
Springer
13 years 8 months ago
Combining manual feedback with subsequent MDP reward signals for reinforcement learning
As learning agents move from research labs to the real world, it is increasingly important that human users, including those without programming skills, be able to teach agents de...
W. Bradley Knox, Peter Stone
EOR
2008
123views more  EOR 2008»
13 years 7 months ago
Fixed versus flexible production systems: A real options analysis
In this work, we address investment decisions in production systems by using real options. As is standard in literature, the stochastic variable is assumed to be normally distribu...
Dalila B. M. M. Fontes
PVLDB
2010
85views more  PVLDB 2010»
13 years 6 months ago
Xplus: A SQL-Tuning-Aware Query Optimizer
The need to improve a suboptimal execution plan picked by the query optimizer for a repeatedly run SQL query arises routinely. Complex expressions, skewed or correlated data, and ...
Herodotos Herodotou, Shivnath Babu
TKDE
2010
393views more  TKDE 2010»
13 years 2 months ago
Adaptive Join Operators for Result Rate Optimization on Streaming Inputs
Adaptive join algorithms have recently attracted a lot of attention in emerging applications where data is provided by autonomous data sources through heterogeneous network environ...
Mihaela A. Bornea, Vasilis Vassalos, Yannis Kotidi...
ICIP
2004
IEEE
14 years 9 months ago
Robust ego-motion estimation and 3d model refinement using depth based parallax model
We present an iterative algorithm for robustly estimating the egomotion and refining and updating a coarse, noisy and partial depth map using a depth based parallax model and brig...
Amit K. Agrawal, Rama Chellappa