Search Sciweavers | Sciweavers

309 search results - page 44 / 62

» An Analysis of Web Documents Retrieved and Viewed

click to vote

WWW
2002
ACM

148views Internet Technology» more WWW 2002»

A machine learning based approach for table detection on the web

14 years 8 months ago

Download www.math.ucla.edu

Table is a commonly used presentation scheme, especially for describing relational information. However, table understanding remains an open problem. In this paper, we consider th...

Yalin Wang, Jianying Hu

claim paper

Read More »

click to vote

WWW
2010
ACM

220views Internet Technology» more WWW 2010»

Not so creepy crawler: easy crawler generation with standard xml queries

14 years 2 months ago

Download www2.pms.ifi.lmu.de

Web crawlers are increasingly used for focused tasks such as the extraction of data from Wikipedia or the analysis of social networks like last.fm. In these cases, pages are far m...

Franziska von dem Bussche, Klara A. Weiand, Benedi...

claim paper

Read More »

click to vote

WWW
2010
ACM

257views Internet Technology» more WWW 2010»

CETR: content extraction via tag ratios

14 years 2 months ago

Download www.cs.illinois.edu

We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...

Tim Weninger, William H. Hsu, Jiawei Han

claim paper

Read More »

click to vote

WSDM
2010
ACM

265views Data Mining» more WSDM 2010»

Data-oriented Content Query System: Searching for Data into Text on the Web

14 years 5 months ago

Download www.ews.uiuc.edu

As the Web provides rich data embedded in the immense contents inside pages, we witness many ad-hoc efforts for exploiting fine granularity information across Web text, such as We...

Kevin Chen-Chuan Chang, Mianwei Zhou, Tao Cheng

claim paper

Read More »

click to vote

WWW
2005
ACM

183views Internet Technology» more WWW 2005»

Improving Web search efficiency via a locality based static pruning method

14 years 8 months ago

Download www2005.org

The unarguably fast, and continuous, growth of the volume of indexed (and indexable) documents on the Web poses a great challenge for search engines. This is true regarding not on...

Edleno Silva de Moura, Célia Francisca dos ...

claim paper

Read More »

« Prev « First page 44 / 62 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers