Search Sciweavers | Sciweavers

57 search results - page 7 / 12

» Optimizing time warp simulation with reinforcement learning ...

201

click to vote

SASO
2009
IEEE

172views Control Systems» more SASO 2009»

Distributed W-Learning: Multi-Policy Optimization in Self-Organizing Systems

16 years 2 months ago

Download www.scss.tcd.ie

—Large-scale agent-based systems are required to self-optimize towards multiple, potentially conﬂicting, policies of varying spatial and temporal scope. As a result, not all ag...

Ivana Dusparic, Vinny Cahill

claim paper

Read More »

221

click to vote

AEI
2004

150views more AEI 2004»

Ant colony optimization techniques for the vehicle routing problem

15 years 7 months ago

Download www.joydivisionman.com

This research applies the meta-heuristic method of ant colony optimization (ACO) to an established set of vehicle routing problems (VRP). The procedure simulates the decision-maki...

John E. Bell, Patrick R. McMullen

claim paper

Read More »

182

click to vote

NETCOOP
2007
Springer

130views Computer Networks» more NETCOOP 2007»

Load Shared Sequential Routing in MPLS Networks: System and User Optimal Solutions

16 years 1 months ago

Download www.tsp.ece.mcgill.ca

Recently Gerald Ash has shown through case studies that event dependent routing is attractive in large scale multi-service MPLS networks. In this paper, we consider the application...

Gilles Brunet, Fariba Heidari, Lorne Mason

claim paper

Read More »

160

Voted

GLOBECOM
2009
IEEE

106views Communications» more GLOBECOM 2009»

A Fresh Look at Multicanonical Monte Carlo from a Telecom Perspective

16 years 2 months ago

Download www.tlc.unipr.it

—The Multicanonical Monte Carlo (MMC) technique is a new form of adaptive importance sampling (IS). Thanks to its blind adaptation algorithm, it does not require an in-depth syst...

Alberto Bononi, Leslie A. Rusch, Amirhossein Ghazi...

claim paper

Read More »

182

click to vote

AAAI
2008

207views Intelligent Agents» more AAAI 2008»

Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation

15 years 9 months ago

Download sugiyama-www.cs.titech.ac.jp

Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...

Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...

claim paper

Read More »

« Prev « First page 7 / 12 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers