Search Sciweavers | Sciweavers

60 search results - page 8 / 12

» Iteratively Extending Time Horizon Reinforcement Learning

119

click to vote

IAT
2008
IEEE

151views Intelligent Agents» more IAT 2008»

Introducing Communication in Dis-POMDPs with Locality of Interaction

15 years 9 months ago

Download teamcore.usc.edu

The Networked Distributed POMDPs (ND-POMDPs) can model multiagent systems in uncertain domains and has begun to scale-up the number of agents. However, prior work in ND-POMDPs has ...

Makoto Tasaki, Yuichi Yabu, Yuki Iwanari, Makoto Y...

claim paper

Read More »

134

Voted

KBSE
2005
IEEE

127views Software Engineering» more KBSE 2005»

Learning to verify branching time properties

15 years 8 months ago

Download osl.cs.uiuc.edu

We present a new model checking algorithm for verifying computation tree logic (CTL) properties. Our technique is based on using language inference to learn the ﬁxpoints necessar...

Abhay Vardhan, Mahesh Viswanathan

claim paper

Read More »

click to vote

IJCNN
2007
IEEE

92views Neural Networks» more IJCNN 2007»

On Extending the SMO Algorithm Sub-Problem

15 years 9 months ago

Download ml.cecs.ucf.edu

—The Support Vector Machine is a widely employed machine learning model due to its repeatedly demonstrated superior generalization performance. The Sequential Minimal Optimizatio...

Christopher Sentelle, Michael Georgiopoulos, Georg...

claim paper

Read More »

114

click to vote

ICML
2009
IEEE

155views Machine Learning» more ICML 2009»

Near-Bayesian exploration in polynomial time

16 years 3 months ago

Download ai.stanford.edu

We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

112

click to vote

IIE
2007

63views more IIE 2007»

Investigation of Q-Learning in the Context of a Virtual Learning Environment

15 years 2 months ago

Download www.mii.lt

We investigate the possibility to apply a known machine learning algorithm of Q-learning in the domain of a Virtual Learning Environment (VLE). It is important in this problem doma...

Dalia Baziukaite

claim paper

Read More »

« Prev « First page 8 / 12 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers