Sciweavers

258 search results - page 40 / 52
» Continuous Capacities on Continuous State Spaces
Sort
View
ICML
2004
IEEE
14 years 9 months ago
Learning to fly by combining reinforcement learning with behavioural cloning
Reinforcement learning deals with learning optimal or near optimal policies while interacting with the environment. Application domains with many continuous variables are difficul...
Eduardo F. Morales, Claude Sammut
SODA
2008
ACM
108views Algorithms» more  SODA 2008»
13 years 10 months ago
Price based protocols for fair resource allocation: convergence time analysis and extension to Leontief utilities
We analyze several distributed, continuous time protocols for a fair allocation of bandwidths to flows in a network (or resources to agents). Our protocols converge to an allocati...
Ashish Goel, Hamid Nazerzadeh
QEST
2005
IEEE
14 years 2 months ago
iLTLChecker: A Probabilistic Model Checker for Multiple DTMCs
iLTL is a probabilistic temporal logic that can specify properties of multiple discrete time Markov chains (DTMCs). In this paper, we describe two related tools: MarkovEstimator a...
YoungMin Kwon, Gul A. Agha
ICML
2001
IEEE
14 years 9 months ago
Direct Policy Search using Paired Statistical Tests
Direct policy search is a practical way to solve reinforcement learning problems involving continuous state and action spaces. The goal becomes finding policy parameters that maxi...
Malcolm J. A. Strens, Andrew W. Moore
HICSS
2006
IEEE
120views Biometrics» more  HICSS 2006»
14 years 2 months ago
Systems Thinking and Information Literacy: Elements of a Knowledge Enabling Workplace Environment
Dynamic technology-driven circumstances fortify academic librarians’ reconsideration of their professional purposes, processes and relationships. In response, California Polytec...
Mary M. Somerville, Anita Mirijamdotter, Lydia Col...