Search Sciweavers | Sciweavers

2393 search results - page 335 / 479

» Bounds-Consistent Local Search

ECAI
2008
Springer

143 views, Artificial Intelligence

Combining Domain-Independent Planning and HTN Planning: The Duet Planner

15 years 4 months ago

Download zeus.ing.unibs.it

Abstract. Despite the recent advances in planning for classical domains, the question of how to use domain knowledge in planning is yet to be completely and clearly answered. Some ...

Alfonso Gerevini, Ugur Kuter, Dana S. Nau, Alessan...

IJCAI
2001

163 views, Artificial Intelligence

Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning

15 years 3 months ago

Download www.cs.colorado.edu

Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...

Gregory Z. Grudic, Lyle H. Ungar

IJCAI
2001

95 views, Artificial Intelligence

The Exponentiated Subgradient Algorithm for Heuristic Boolean Programming

15 years 3 months ago

Download www.cs.ubc.ca

Boolean linear programs (BLPs) are ubiquitous in AI. Satisfiability testing, planning with resource constraints, and winner determination in combinatorial auctions are all example...

Dale Schuurmans, Finnegan Southey, Robert C. Holte

IADIS
2003

135 views, Internet Technology

Web Information Management System: Personalization and Generalization

15 years 3 months ago

Download www.iadis.net

Our research focuses on web information management for people who want to monitor and use World Wide Web (WWW) information as their information resource. Web information is m...

Sung Sik Park, Yang Sok Kim, Byeong Ho Kang

IJCAI
2003

142 views, Artificial Intelligence

Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings

15 years 3 months ago

Download dli.iiit.ac.in

The problem of deriving joint policies for a group of agents that maximize some joint reward function can be modeled as a decentralized partially observable Markov decision proces...

Ranjit Nair, Milind Tambe, Makoto Yokoo, David V. ...
