Search Sciweavers | Sciweavers

205 search results - page 5 / 41

» Simulation Optimization Using Balanced Explorative and Explo...

IJCAI
2007

201 views, Artificial Intelligence

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

13 years 9 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is finding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

GECCO
2003
Springer

107 views, Optimization

Exploration of a Two Sided Rendezvous Search Problem Using Genetic Algorithms

14 years 23 days ago

Download www.cs.york.ac.uk

The problem of searching for a walker that wants to be found, when the walker moves toward the helicopter when it can hear it, is an example of a two sided search problem which is ...

T. Q. S. Truong, A. Stacey

WSC
2004

132 views, Modeling and Simulation

Stochastic Approximation with Simulated Annealing as an Approach to Global Discrete-Event Simulation Optimization

13 years 9 months ago

Download www.informs-sim.org

This paper explores an approach to global, stochastic, simulation optimization which combines stochastic approximation (SA) with simulated annealing (SAN). SA directs a search of ...

Matthew H. Jones, K. Preston White

AAIM
2005
Springer

110 views, Algorithms

Dynamically Updating the Exploiting Parameter in Improving Performance of Ant-Based Algorithms

14 years 1 month ago

Download www.engr.uconn.edu

Abstract. The utilization of pseudo-random proportional rule to balance between the exploitation and exploration of the search process was shown in Ant Colony System (ACS) algorith...

Hoang Trung Dinh, Abdullah Al Mamun, Hieu T. Dinh

COGSR
2011

71 views

Psychological models of human and optimal performance in bandit problems

13 years 2 months ago

Download www.socsci.uci.edu

In bandit problems, a decision-maker must choose between a set of alternatives, each of which has a fixed but unknown rate of reward, to maximize their total number of rewards ov...

Michael D. Lee, Shunan Zhang, Miles Munro, Mark St...
