Search Sciweavers | Sciweavers

1630 search results - page 82 / 326

» Coordinated Reinforcement Learning

VLSID
2005
IEEE

105 views, VLSI

Placement and Routing for 3D-FPGAs Using Reinforcement Learning and Support Vector Machines

15 years 9 months ago

Download www.cse.psu.edu

The primary advantage of using 3D-FPGAs over 2D-FPGAs is that the vertical stacking of active layers reduces the Manhattan distance between the components in a 3D-FPGA compared with when placed...

R. Manimegalai, E. Siva Soumya, V. Muralidharan, B...


PRICAI
1999
Springer

108 views, Artificial Intelligence

Rationality of Reward Sharing in Multi-agent Reinforcement Learning

15 years 7 months ago

Download svrrd2.niad.ac.jp

In multi-agent reinforcement learning systems, it is important to share a reward among all agents. We focus on the Rationality Theorem of Profit Sharing [5] and analyze ...

Kazuteru Miyazaki, Shigenobu Kobayashi


NIPS
1994

90 views, Information Technology

Reinforcement Learning with Soft State Aggregation

15 years 4 months ago

Download www.eecs.umich.edu

It is widely accepted that the use of more compact representations than lookup tables is crucial to scaling reinforcement learning (RL) algorithms to real-world problems. Unfortun...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...


CSL
2010
Springer

163 views, Automated Reasoning

Evaluation of a hierarchical reinforcement learning spoken dialogue system

15 years 3 months ago

Download www.cstr.ed.ac.uk

We describe an evaluation of spoken dialogue strategies designed using hierarchical reinforcement learning agents. The dialogue strategies were learnt in a simulated environment a...

Heriberto Cuayáhuitl, Steve Renals, Oliver ...


CORR
2010
Springer

105 views, Education

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 2 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

