Search Sciweavers | Sciweavers

88

FOCS
2003
IEEE

107views Theoretical Computer Science» more FOCS 2003»

Approximation Algorithms for Orienteering and Discounted-Reward TSP

15 years 7 months ago

In this paper, we give the rst constant-factor approximationalgorithmfor the rooted Orienteering problem, as well as a new problem that we call the Discounted-Reward TSP, motivate...

Avrim Blum, Shuchi Chawla, David R. Karger, Terran...

claim paper

Read More »

107

click to vote

ESAW
2004
Springer

105views Intelligent Agents» more ESAW 2004»

Motivation-Based Selection of Negotiation Opponents

15 years 7 months ago

Download www.irit.fr

Abstract. If we are to enable agents to handle increasingly greater levels of complexity, it is necessary to equip them with mechanisms that support greater degrees of autonomy. Th...

Stephen J. Munroe, Michael Luck

claim paper

Read More »

116

Voted

ICML
2002
IEEE

146views Machine Learning» more ICML 2002»

Hierarchically Optimal Average Reward Reinforcement Learning

16 years 3 months ago

Download www.cs.ualberta.ca

Two notions of optimality have been explored in previous work on hierarchical reinforcement learning (HRL): hierarchical optimality, or the optimal policy in the space defined by ...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

115

Voted

WEBI
2009
Springer

152views Internet Technology» more WEBI 2009»

Zero-Sum Reward and Punishment Collaborative Filtering Recommendation Algorithm

15 years 8 months ago

Download dm.thss.tsinghua.edu.cn

In this paper, we propose a novel memory-based collaborative ﬁltering recommendation algorithm. Our algorithm use a new metric named inﬂuence weight, which is adjusted with ze...

Nan Li, Chunping Li

claim paper

Read More »

161

Voted

ATAL
2010
Springer

158views Intelligent Agents» more ATAL 2010»

Combining manual feedback with subsequent MDP reward signals for reinforcement learning

15 years 3 months ago

Download www.cs.utexas.edu

As learning agents move from research labs to the real world, it is increasingly important that human users, including those without programming skills, be able to teach agents de...

W. Bradley Knox, Peter Stone

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers