Sciweavers

2677 search results - page 65 / 536
» Extracting Structured Data from Web Pages
Sort
View
HICSS
2008
IEEE
105views Biometrics» more  HICSS 2008»
14 years 3 months ago
Using Visual Features for Fine-Grained Genre Classification of Web Pages
The field of automatic genre classification has primarily focused on extracting textual features from documents. The goal of this research is to investigate whether visual feature...
Ryan Levering, Michal Cutler, Lei Yu
LREC
2008
169views Education» more  LREC 2008»
13 years 10 months ago
A Large-Scale Web Data Collection as a Natural Language Processing Infrastructure
In recent years, language resources acquired from the Web are released, and these data improve the performance of applications in several NLP tasks. Although the language resource...
Keiji Shinzato, Daisuke Kawahara, Chikara Hashimot...
AAAI
2007
13 years 11 months ago
Mining Web Query Hierarchies from Clickthrough Data
In this paper, we propose to mine query hierarchies from clickthrough data, which is within the larger area of automatic acquisition of knowledge from the Web. When a user submits...
Dou Shen, Min Qin, Weizhu Chen, Qiang Yang, Zheng ...
CGF
2008
126views more  CGF 2008»
13 years 8 months ago
From Web Data to Visualization via Ontology Mapping
In this paper, we propose a novel approach for automatic generation of visualizations from domain-specific data available on the web. We describe a general system pipeline that co...
O. Gilson, N. Silva, Phil W. Grant, Min Chen
WEBDB
2004
Springer
100views Database» more  WEBDB 2004»
14 years 2 months ago
Spam, Damn Spam, and Statistics: Using Statistical Analysis to Locate Spam Web Pages
The increasing importance of search engines to commercial web sites has given rise to a phenomenon we call “web spam”, that is, web pages that exist only to mislead search eng...
Dennis Fetterly, Mark Manasse, Marc Najork