Search Sciweavers | Sciweavers

60 search results - page 7 / 12

» Iteratively Extending Time Horizon Reinforcement Learning

AI
1998
Springer

177views Artificial Intelligence» more AI 1998»

Model-Based Average Reward Reinforcement Learning

15 years 6 months ago

Download web.engr.oregonstate.edu

Reinforcement Learning (RL) is the study of programs that improve their performance by receiving rewards and punishments from the environment. Most RL methods optimize the discoun...

Prasad Tadepalli, DoKyeong Ok

ATAL
2011
Springer

220views Intelligent Agents» more ATAL 2011»

Using iterated reasoning to predict opponent strategies

14 years 7 months ago

Download paul.rutgers.edu

The field of multiagent decision making is extending its tools from classical game theory by embracing reinforcement learning, statistical analysis, and opponent modeling. For ex...

Michael Wunder, Michael Kaisers, John Robert Yaros...

ROBOCUP
2009
Springer

134views Robotics» more ROBOCUP 2009»

Learning Complementary Multiagent Behaviors: A Case Study

16 years 1 month ago

Download teamcore.usc.edu

As the reach of multiagent reinforcement learning extends to more and more complex tasks, it is likely that the diverse challenges posed by some of these tasks can only be address...

Shivaram Kalyanakrishnan, Peter Stone

AAAI
2006

118views Intelligent Agents» more AAAI 2006»

Hard Constrained Semi-Markov Decision Processes

15 years 8 months ago

Download www.aaai.org

In multiple criteria Markov Decision Processes (MDP) where multiple costs are incurred at every decision point, current methods solve them by minimising the expected primary cost ...

Wai-Leong Yeow, Chen-Khong Tham, Wai-Choong Wong

SIGIR
2003
ACM

116views Information Technology» more SIGIR 2003»

ReCoM: reinforcement clustering of multi-type interrelated data objects

16 years 6 days ago

Download research.microsoft.com

Most existing clustering algorithms cluster highly related data objects such as Web pages and Web users separately. The interrelation among different types of data objects is eith...

Jidong Wang, Hua-Jun Zeng, Zheng Chen, Hongjun Lu,...
