Search Sciweavers | Sciweavers

563 search results - page 51 / 113

» Crawling the web for structured documents

WISE 2002, Springer

Towards Translating Authorizations for Transformed XML Documents

15 years 9 months ago

Download turtle.ee.ncku.edu.tw

Web based services and applications have increased the availability and accessibility of information. XML has recently emerged as an important standard in the area of information ...

Somchai Chatvichienchai, Mizuho Iwaihara, Yahiko K...

AAAI 1997

Template-Based Information Mining from HTML Documents

15 years 5 months ago

Download research.microsoft.com

Tools for mining information from data can create added value for the Internet. As the majority of electronic documents available over the network are in unstructured textual form...

Jane Yung-jen Hsu, Wen-tau Yih

SIGIR 2006, ACM

AggregateRank: bringing order to web sites

15 years 10 months ago

Download research.microsoft.com

Since the website is one of the most important organizational structures of the Web, how to effectively rank websites has been essential to many Web applications, such as Web sear...

Guang Feng, Tie-Yan Liu, Ying Wang, Ying Bao, Zhim...

VLDB 2011, ACM

Harvesting relational tables from lists on the web

14 years 11 months ago

Download www.vldb.org

A large number of web pages contain data structured in the form of “lists”. Many such lists can be further split into multi-column tables, which can then be used in more seman...

Hazem Elmeleegy, Jayant Madhavan, Alon Y. Halevy

WWW 2007, ACM

Extraction and classification of dense communities in the web

16 years 4 months ago

Download www2007.org

The World Wide Web (WWW) is rapidly becoming important for society as a medium for sharing data, information and services, and there is a growing interest in tools for understandi...

Yon Dourisboure, Filippo Geraci, Marco Pellegrini
