Sciweavers

1947 search results - page 142 / 390
» On the Automatic Extraction of Data from the Hidden Web
Sort
View
GFKL
2005
Springer
125views Data Mining» more  GFKL 2005»
14 years 2 months ago
Towards Structure-sensitive Hypertext Categorization
Abstract. Hypertext categorization is the task of automatically assigning category labels to hypertext units. Comparable to text categorization it stays in the area of function lea...
Alexander Mehler, Rüdiger Gleim, Matthias Deh...
CORR
2010
Springer
219views Education» more  CORR 2010»
13 years 9 months ago
Finding Sequential Patterns from Large Sequence Data
Data mining is the task of discovering interesting patterns from large amounts of data. There are many data mining tasks, such as classification, clustering, association rule mini...
Mahdi Esmaeili, Fazekas Gabor
ACL
2006
13 years 10 months ago
Learning Transliteration Lexicons from the Web
This paper presents an adaptive learning framework for Phonetic Similarity Modeling (PSM) that supports the automatic construction of transliteration lexicons. The learning algori...
Jin-Shea Kuo, Haizhou Li, Ying-Kuei Yang
JIIS
2006
147views more  JIIS 2006»
13 years 9 months ago
Mining sequential patterns from data streams: a centroid approach
In recent years, emerging applications introduced new constraints for data mining methods. These constraints are typical of a new kind of data: the data streams. In data stream pro...
Alice Marascu, Florent Masseglia
WWW
2006
ACM
14 years 9 months ago
Using graph matching techniques to wrap data from PDF documents
Wrapping is the process of navigating a data source, semiautomatically extracting data and transforming it into a form suitable for data processing applications. There are current...
Tamir Hassan, Robert Baumgartner