Search Sciweavers | Sciweavers

91 search results - page 17 / 19

» Parameter-exploring policy gradients

157

click to vote

ICONIP
2007

147views Information Technology» more ICONIP 2007»

Finding Exploratory Rewards by Embodied Evolution and Constrained Reinforcement Learning in the Cyber Rodents

15 years 7 months ago

Download www.nc.irp.oist.jp

The aim of the Cyber Rodent project [1] is to elucidate the origin of our reward and aﬀective systems by building artiﬁcial agents that share the natural biological constraints...

Eiji Uchibe, Kenji Doya

claim paper

Read More »

162

click to vote

EDBT
2008
ACM

144views Database» more EDBT 2008»

BI batch manager: a system for managing batch workloads on enterprise data-warehouses

16 years 6 months ago

Download www.edbt.org

Modern enterprise data warehouses have complex workloads that are notoriously difficult to manage. An important problem in workload management is to run these complex workloads `o...

Abhay Mehta, Chetan Gupta, Umeshwar Dayal

claim paper

Read More »

143

click to vote

ATAL
2007
Springer

81views Intelligent Agents» more ATAL 2007»

Multiagent learning in adaptive dynamic systems

16 years 8 days ago

Download www.damas.ift.ulaval.ca

Classically, an approach to the multiagent policy learning supposed that the agents, via interactions and/or by using preliminary knowledge about the reward functions of all playe...

Andriy Burkov, Brahim Chaib-draa

claim paper

Read More »

216

Voted

JMLR
2010

148views more JMLR 2010»

A Generalized Path Integral Control Approach to Reinforcement Learning

15 years 26 days ago

Download jmlr.csail.mit.edu

With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

claim paper

Read More »

177

Voted

DAC
2009
ACM

210views Computer Architecture» more DAC 2009»

Throughput optimal task allocation under thermal constraints for multi-core processors

16 years 27 days ago

Download veda.eas.asu.edu

It is known that temperature gradients and thermal hotspots aﬀect the reliability of microprocessors. Temperature is also an important constraint when maximizing the performance...

Vinay Hanumaiah, Ravishankar Rao, Sarma B. K. Vrud...

claim paper

Read More »

« Prev « First page 17 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers