Sciweavers

1042 search results - page 104 / 209
» Logic-based Web Information Extraction
Sort
View
CIKM
2008
Springer
13 years 10 months ago
A densitometric approach to web page segmentation
Web Page segmentation is a crucial step for many applications in Information Retrieval, such as text classification, de-duplication and full-text search. In this paper we describe...
Christian Kohlschütter, Wolfgang Nejdl
LREC
2008
102views Education» more  LREC 2008»
13 years 10 months ago
Unsupervised Learning-based Anomalous Arabic Text Detection
The growing dependence of modern society on the Web as a vital source of information and communication has become inevitable. However, the Web has become an ideal channel for vari...
Nasser Abouzakhar, Ben Allison, Louise Guthrie
KDD
2008
ACM
147views Data Mining» more  KDD 2008»
14 years 9 months ago
Extracting shared subspace for multi-label classification
Multi-label problems arise in various domains such as multitopic document categorization and protein function prediction. One natural way to deal with such problems is to construc...
Shuiwang Ji, Lei Tang, Shipeng Yu, Jieping Ye
CORIA
2011
13 years 12 days ago
Mining the Web for lists of Named Entities
Named entities play an important role in Information Extraction. They represent unitary namable information within text. In this work, we focus on groups of named entities of the s...
Arlind Kopliku, Mohand Boughanem, Karen Pinel-Sauv...
SIGMOD
1998
ACM
127views Database» more  SIGMOD 1998»
14 years 1 months ago
ARIADNE: A System for Constructing Mediators for Internet Sources
The Web is based on a browsing paradigm that makes it di cult to retrieve and integrate data from multiple sites. Today, the only way to achieve this integration is by building sp...
José Luis Ambite, Naveen Ashish, Greg Baris...