Search Sciweavers | Sciweavers

170 search results - page 27 / 34

» Heuristic Selection of Actions in Multiagent Reinforcement L...

181

click to vote

ICML
2000
IEEE

153views Machine Learning» more ICML 2000»

Eligibility Traces for Off-Policy Policy Evaluation

16 years 6 months ago

Download www.cs.ualberta.ca

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh

claim paper

Read More »

192

click to vote

ATAL
2011
Springer

220views Intelligent Agents» more ATAL 2011»

Using iterated reasoning to predict opponent strategies

14 years 5 months ago

Download paul.rutgers.edu

The ﬁeld of multiagent decision making is extending its tools from classical game theory by embracing reinforcement learning, statistical analysis, and opponent modeling. For ex...

Michael Wunder, Michael Kaisers, John Robert Yaros...

claim paper

Read More »

173

click to vote

JIRS
2000

144views more JIRS 2000»

An Integrated Approach of Learning, Planning, and Execution

15 years 5 months ago

Download laboratorios.fi.uba.ar

Agents (hardware or software) that act autonomously in an environment have to be able to integrate three basic behaviors: planning, execution, and learning. This integration is man...

Ramón García-Martínez, Daniel...

claim paper

Read More »

159

click to vote

NIPS
2008

129views Information Technology» more NIPS 2008»

Structure Learning in Human Sequential Decision-Making

15 years 7 months ago

Download www-users.cs.umn.edu

We use graphical models and structure learning to explore how people learn policies in sequential decision making tasks. Studies of sequential decision-making in humans frequently...

Daniel Acuña, Paul R. Schrater

claim paper

Read More »

161

click to vote

SMC
2007
IEEE

102views Control Systems» more SMC 2007»

An improved immune Q-learning algorithm

15 years 12 months ago

Download web2.uwindsor.ca

—Reinforcement learning is a framework in which an agent can learn behavior without knowledge on a task or an environment by exploration and exploitation. Striking a balance betw...

Zhengqiao Ji, Q. M. Jonathan Wu, Maher A. Sid-Ahme...

claim paper

Read More »

« Prev « First page 27 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers