Sciweavers

910 search results - page 106 / 182
» Testbed for information extraction from deep web
Sort
View
WWW
2007
ACM
14 years 8 months ago
Integrating web directories by learning their structures
Documents in the Web are often organized using category trees by information providers (e.g. CNN, BBC) or search engines (e.g. Google, Yahoo!). Such category trees are commonly kn...
Christopher C. Yang, Jianfeng Lin
KDD
2002
ACM
170views Data Mining» more  KDD 2002»
14 years 8 months ago
Web site mining: a new way to spot competitors, customers and suppliers in the world wide web
When automatically extracting information from the world wide web, most established methods focus on spotting single HTMLdocuments. However, the problem of spotting complete web s...
Martin Ester, Hans-Peter Kriegel, Matthias Schuber...
CIKM
2009
Springer
14 years 2 months ago
Identifying comparable entities on the web
Web search engines are often presented with user queries that involve comparisons of real-world entities. Thus far, this interaction has typically been captured by users submittin...
Alpa Jain, Patrick Pantel
CORIA
2011
12 years 11 months ago
Mining the Web for lists of Named Entities
Named entities play an important role in Information Extraction. They represent unitary namable information within text. In this work, we focus on groups of named entities of the s...
Arlind Kopliku, Mohand Boughanem, Karen Pinel-Sauv...
IAT
2007
IEEE
13 years 8 months ago
Similarity-Based Fuzzy Clustering for User Profiling
User profiling is a fundamental task in Web personalization. Fuzzy clustering is a valid approach to derive user profiles by capturing similar user interests from web usage data a...
Giovanna Castellano, Anna Maria Fanelli, Corrado M...