Search Sciweavers | Sciweavers

543 search results - page 22 / 109

» Exploiting content redundancy for web information extraction

ECIR
2009
Springer

155 views, Information Technology

PathRank: Web Page Retrieval with Navigation Path

13 years 6 months ago

Download goanna.cs.rmit.edu.au

Abstract. This paper describes a path-based method to use the multi-step navigation information discovered from website structures for web page ranking. Use of hyperlinks to enhanc...

Jianqiang Li, Yu Zhao 0002

ECAI
2006
Springer

125 views, Artificial Intelligence

Identifying Inter-Domain Similarities Through Content-Based Analysis of Hierarchical Web-Directories

14 years 13 days ago

Download www.inf.unibz.it

Providing accurate personalized information services to the users requires knowing their interests and needs, as defined by their User Models (UMs). Since the quality of the person...

Shlomo Berkovsky, Dan Goldwasser, Tsvi Kuflik, Fra...

AUSAI
2003
Springer

81 views, Artificial Intelligence

Information Extraction via Path Merging

14 years 2 months ago

Download www.ict.csiro.au

Abstract. In this paper, we describe a new approach to information extraction that neatly integrates top-down hypothesis driven information with bottom-up data driven information. ...

Robert Dale, Cécile Paris, Marc Tilbrook

WSDM
2012
ACM

214 views, Data Mining

Selecting actions for resource-bounded information extraction using reinforcement learning

12 years 4 months ago

Download people.cs.umass.edu

Given a database with missing or uncertain content, our goal is to correct and fill the database by extracting specific information from a large corpus such as the Web, and to d...

Pallika H. Kanani, Andrew K. McCallum

WWW
2007
ACM

186 views, Internet Technology

Organizing and searching the world wide web of facts -- step two: harnessing the wisdom of the crowds

14 years 9 months ago

Download www2007.org

As part of a large effort to acquire large repositories of facts from unstructured text on the Web, a seed-based framework for textual information extraction allows for weakly sup...

Marius Pasca
