Sciweavers

330 search results - page 48 / 66
» Unexpected results in automatic list extraction on the web
ICDE
2007
IEEE
173 views, Database
14 years 9 months ago
Annotating Structured Data of the Deep Web
An increasing number of databases have become Web accessible through HTML form-based search interfaces. The data units returned from the underlying database are usually encoded in...
Yiyao Lu, Hai He, Hongkun Zhao, Weiyi Meng, Clemen...
MICAI
2007
Springer
14 years 1 month ago
Taking Advantage of the Web for Text Classification with Imbalanced Classes
A problem of supervised approaches for text classification is that they commonly require high-quality training data to construct an accurate classifier. Unfortunately, in many real...
Rafael Guzmán-Cabrera, Manuel Montes-y-Gó...
ECIR
2010
Springer
13 years 8 months ago
Analyzing Information Retrieval Methods to Recover Broken Web Links
In this work we compare different techniques to automatically find candidate web pages to substitute broken links. We extract information from the anchor text, the content of the p...
Juan Martinez-Romo, Lourdes Araujo
KDD
2007
ACM
193 views, Data Mining
14 years 8 months ago
Joint optimization of wrapper generation and template detection
Many websites have large collections of pages generated dynamically from an underlying structured source like a database. The data of a category are typically encoded into similar...
Shuyi Zheng, Ruihua Song, Ji-Rong Wen, Di Wu
ECWEB
2005
Springer
127 views, ECommerce
14 years 1 month ago
Knowledge Discovery in Web-Directories: Finding Term-Relations to Build a Business Ontology
The Web continues to grow at a tremendous rate. Search engines find it increasingly difficult to provide useful results. To manage this explosively large number of Web documents,...
Sandip Debnath, Tracy Mullen, Arun Upneja, C. Lee ...