Search Sciweavers | Sciweavers

180

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

15 years 7 months ago

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

180

Voted

NIPS
2003

145views Information Technology» more NIPS 2003»

A Nonlinear Predictive State Representation

15 years 7 months ago

Download books.nips.cc

Predictive state representations (PSRs) use predictions of a set of tests to represent the state of controlled dynamical systems. One reason why this representation is exciting as...

Matthew R. Rudary, Satinder P. Singh

claim paper

Read More »

132

click to vote

GRAPHICSINTERFACE
2000

105views Computer Graphics» more GRAPHICSINTERFACE 2000»

Are We All In the Same "Bloat"?

15 years 7 months ago

Download www.cs.ubc.ca

"Bloat", a term that has existed in the technical community for many years, has recently received attention in the popular press. The term has a negative connotation imp...

Joanna McGrenere, Gale Moore

claim paper

Read More »

153

click to vote

IPCO
1998

99views Optimization» more IPCO 1998»

Non-approximability Results for Scheduling Problems with Minsum Criteria

15 years 7 months ago

Download www.win.tue.nl

We provide several non-approximability results for deterministic scheduling problems whose objective is to minimize the total job completion time. Unless P = NP, none of the probl...

Han Hoogeveen, Petra Schuurman, Gerhard J. Woeging...

claim paper

Read More »

178

Voted

IJCAI
1989

122views Artificial Intelligence» more IJCAI 1989»

Constrained Heuristic Search

15 years 7 months ago

Download agi-conf.org

Cognitive architectures aspire for generality both in terms of problem solving and learning across a range of problems, yet to date few examples of domain independent learning has...

Mark S. Fox, Norman M. Sadeh, Can A. Baykan

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers