Search Sciweavers | Sciweavers

97 search results - page 10 / 20

» Learning Investment Functions for Controlling the Utility of...

150

click to vote

JCP
2007

143views more JCP 2007»

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

15 years 3 months ago

Download www.academypublisher.com

Abstract— We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-bestpaths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio

claim paper

Read More »

138

click to vote

CDC
2009
IEEE

160views Control Systems» more CDC 2009»

Exploring and exploiting routing opportunities in wireless ad-hoc networks

15 years 1 months ago

Download circuit.ucsd.edu

Abstract--In this paper, d-AdaptOR, a distributed opportunistic routing scheme for multi-hop wireless ad-hoc networks is proposed. The proposed scheme utilizes a reinforcement lear...

Abhijeet Bhorkar, Mohammad Naghshvar, Tara Javidi,...

claim paper

Read More »

115

click to vote

ECTEL
2009
Springer

126views Machine Learning» more ECTEL 2009»

Getting to Know Your User - Unobtrusive User Model Maintenance within Work-Integrated Learning Environments

15 years 7 months ago

Download www.know-center.tugraz.at

Work-integrated learning (WIL) poses unique challenges for user model design: on the one hand users’ knowledge levels need to be determined based on their work activities – tes...

Stefanie N. Lindstaedt, Günter Beham, Barbara...

claim paper

Read More »

244

Voted

ARTCOM
2009
IEEE

397views Communications» more ARTCOM 2009»

ANFIS Approach for Navigation of Mobile Robots

15 years 10 months ago

Download dspace.nitrkl.ac.in

— This paper, discusses about navigation control of mobile robot using adaptive neuro-fuzzy inference system (ANFIS) in a real word dynamic environment. In the ANFIS controller a...

Mukesh Kumar Singh, Dayal R. Parhi, Jayanta Kumar ...

claim paper

Read More »

116

click to vote

ICML
2000
IEEE

126views Machine Learning» more ICML 2000»

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 4 months ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled ??? ?s. We introduce ??? ?, a...

Jonathan Baxter, Peter L. Bartlett

claim paper

Read More »

« Prev « First page 10 / 20 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers