Search Sciweavers | Sciweavers

119 search results - page 7 / 24

» Average Reward Timed Games

214

click to vote

COCOON
2008
Springer

137views Combinatorics» more COCOON 2008»

Average-Case Competitive Analyses for One-Way Trading

15 years 9 months ago

Download www.algo.ics.tut.ac.jp

Consider a trader who exchanges one dollar into yen and assume that the exchange rate fluctuates within the interval [m, M]. The game ends without advance notice, then the trader ...

Hiroshi Fujiwara, Kazuo Iwama, Yoshiyuki Sekiguchi

claim paper

Read More »

204

click to vote

ICML
2009
IEEE

104views Machine Learning» more ICML 2009»

Learning when to stop thinking and do something!

16 years 8 months ago

Download www.cs.ualberta.ca

An anytime algorithm is capable of returning a response to the given task at essentially any time; typically the quality of the response improves as the time increases. Here, we c...

Barnabás Póczos, Csaba Szepesv&aacut...

claim paper

Read More »

199

click to vote

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

15 years 6 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

205

click to vote

DIGRA
2005
Springer

112views Computer Graphics» more DIGRA 2005»

History of Digital Games in Turkey

16 years 26 days ago

Download www.digra.org

As an important entertainment tool, “digital games” has been used by several hundred millions of people all around the world for almost 30 years. Although the number of game p...

Erdal Yilmaz, Kursat Cagiltay

claim paper

Read More »

188

Voted

EUROCAST
2007
Springer

182views Hardware» more EUROCAST 2007»

A k-NN Based Perception Scheme for Reinforcement Learning

16 years 1 months ago

Download www.dia.fi.upm.es

Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...

claim paper

Read More »

« Prev « First page 7 / 24 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers