Sciweavers

4544 search results - page 174 / 909
» Reinforcement Learning with Time
Sort
View
AAAI
2012
12 years 18 days ago
Competing with Humans at Fantasy Football: Team Formation in Large Partially-Observable Domains
We present the first real-world benchmark for sequentiallyoptimal team formation, working within the framework of a class of online football prediction games known as Fantasy Foo...
Tim Matthews, Sarvapali D. Ramchurn, Georgios Chal...
ATAL
2006
Springer
14 years 1 months ago
Efficient agents for cliff-edge environments with a large set of decision options
This paper proposes an efficient agent for competing in Cliff Edge (CE) environments, such as sealed-bid auctions, dynamic pricing and the ultimatum game. The agent competes in on...
Ron Katz, Sarit Kraus
GECCO
2009
Springer
150views Optimization» more  GECCO 2009»
14 years 4 months ago
Discrete dynamical genetic programming in XCS
A number of representation schemes have been presented for use within Learning Classifier Systems, ranging from binary encodings to neural networks. This paper presents results fr...
Richard Preen, Larry Bull
TSMC
2002
136views more  TSMC 2002»
13 years 9 months ago
Expertness based cooperative Q-learning
By using other agents' experiences and knowledge, a learning agent may learn faster, make fewer mistakes, and create some rules for unseen situations. These benefits would be ...
Majid Nili Ahmadabadi, Masoud Asadpour
IROS
2006
IEEE
187views Robotics» more  IROS 2006»
14 years 4 months ago
Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic
— Recently, many researchers on humanoid robotics are interested in Quasi-Passive-Dynamic Walking (Quasi-PDW) which is similar to human walking. It is desirable that control para...
Tsuyoshi Ueno, Yutaka Nakamura, Takashi Takuma, To...