Sciweavers

664 search results - page 45 / 133
» The internet measurement data catalog
Sort
View
WISE
2005
Springer
14 years 1 months ago
Extracting Web Data Using Instance-Based Learning
This paper studies structured data extraction from Web pages, e.g., online product description pages. Existing approaches to data extraction include wrapper induction and automatic...
Yanhong Zhai, Bing Liu
IMC
2010
ACM
13 years 5 months ago
Network traffic characteristics of data centers in the wild
Although there is tremendous interest in designing improved networks for data centers, very little is known about the network-level traffic characteristics of current data centers...
Theophilus Benson, Aditya Akella, David A. Maltz
WWW
2007
ACM
14 years 8 months ago
Internet-scale collection of human-reviewed data
Enterprise and web data processing and content aggregation systems often require extensive use of human-reviewed data (e.g. for training and monitoring machine learning-based appl...
Qi Su, Dmitry Pavlov, Jyh-Herng Chow, Wendell C. B...
OTM
2009
Springer
14 years 2 months ago
A Model for Semantic Equivalence Discovery for Harmonizing Master Data
IT projects often face the challenge of harmonizing metadata and data so as to have a “single” version of the truth. Determining equivalency of multiple data instances against ...
Baba Piprani
WWW
2007
ACM
14 years 8 months ago
Web page classification with heterogeneous data fusion
Web pages are more than text and they contain much contextual and structural information, e.g., the title, the meta data, the anchor text, etc., each of which can be seen as a dat...
Zenglin Xu, Irwin King, Michael R. Lyu