Sciweavers

4544 search results - page 216 / 909
» Reinforcement Learning with Time
Sort
View
NIPS
2004
13 years 11 months ago
Responding to Modalities with Different Latencies
Motor control depends on sensory feedback in multiple modalities with different latencies. In this paper we consider within the framework of reinforcement learning how different s...
Fredrik Bissmarck, Hiroyuki Nakahara, Kenji Doya, ...
AIS
2006
Springer
13 years 10 months ago
Context enhancement for co-intentionality and co-reference in asynchronous CMC
The regulative and semantic `distance' of electronic conferencing may impede the topical alignment and the unambiguous interpretation of messages, hindering collaborative lear...
J. van der Pol, Wilfried Admiraal, P. Simons
JCP
2007
143views more  JCP 2007»
13 years 10 months ago
Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization
Abstract— We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-bestpaths algorithm. We consider an a...
Nicolas Chapados, Yoshua Bengio
COLING
2002
13 years 10 months ago
Fine Grained Classification of Named Entities
While Named Entity extraction is useful in many natural language applications, the coarse categories that most NE extractors work with prove insufficient for complex applications ...
Michael Fleischman, Eduard H. Hovy
NN
2002
Springer
13 years 10 months ago
Opponent interactions between serotonin and dopamine
Anatomical and pharmacological evidence suggests that the dorsal raphe serotonin system and the ventral tegmental and substantia nigra dopamine system may act as mutual opponents....
Nathaniel D. Daw, Sham Kakade, Peter Dayan