Sciweavers

188 search results - page 9 / 38
» Finding Best k Policies
Sort
View
SIGCPR
2005
ACM
66views Hardware» more  SIGCPR 2005»
14 years 4 months ago
Who is "the IT workforce"?: challenges facing policy makers, educators, management, and research
Despite over six decades of research on the role and definition of the IT workforce, there remain numerous, and often contradictory, definitions related to the identity of this gr...
Michelle L. Kaarst-Brown, Indira R. Guzman
JMLR
2010
103views more  JMLR 2010»
13 years 5 months ago
Regret Bounds and Minimax Policies under Partial Monitoring
This work deals with four classical prediction settings, namely full information, bandit, label efficient and bandit label efficient as well as four different notions of regret: p...
Jean-Yves Audibert, Sébastien Bubeck
ML
2002
ACM
133views Machine Learning» more  ML 2002»
13 years 10 months ago
Finite-time Analysis of the Multiarmed Bandit Problem
Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...
Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...
ANOR
2005
83views more  ANOR 2005»
13 years 10 months ago
Optimal Consumption Portfolio and No-Arbitrage with Nonproportional Transaction Costs
In this paper we consider a finite-state financial market with non-proportional transaction cost and bid-ask spreads. The transaction cost consists of two parts: a fixed cost and a...
Xiuli Chao, K. K. Lai, Shouyang Wang, Mei Yu
ARSCOM
2004
86views more  ARSCOM 2004»
13 years 10 months ago
New Conditions for k-ordered Hamiltonian Graphs
We show that in any graph G on n vertices with d(x) + d(y) n for any two nonadjacent vertices x and y, we can fix the order of k vertices on a given cycle and find a hamiltonian c...
Guantao Chen, Ronald J. Gould, Florian Pfender