Sciweavers

368 search results - page 10 / 74
» Template-Based Information Mining from HTML Documents
Sort
View
CIDM
2007
IEEE
14 years 1 months ago
Measuring the Validity of Document Relations Discovered from Frequent Itemset Mining
— The extension approach of frequent itemset mining can be applied to discover the relations among documents. Several schemes, i.e., n-gram, stemming, stopword removal and term w...
Kritsada Sriphaew, Thanaruk Theeramunkong
DAGSTUHL
2006
13 years 8 months ago
Information Access to Historical Documents from the Early New High German Period
With the new interest in historical documents insight grew that electronic access to these texts causes many specific problems. In the first part of the paper we survey the presen...
Andreas Hauser, Markus Heller, Elisabeth Leiss, Kl...
AAAI
2000
13 years 8 months ago
A Mutually Beneficial Integration of Data Mining and Information Extraction
Text mining concerns applying data mining techniques to unstructured text. Information extraction (IE) is a form of shallow text understanding that locates specific pieces of data...
Un Yong Nahm, Raymond J. Mooney
WEBNET
1998
13 years 8 months ago
Categorisation by Context
Assistance in retrieving of documents on the World Wide Web is provided either by search engines, through keyword based queries, or by catalogues, which organise documents into hi...
Giuseppe Attardi, Sergio Di Marco, Davide Salvi
COOPIS
1999
IEEE
13 years 11 months ago
Looking at the Web through XML Glasses
The Web so far has been incredibly successful at delivering information to human users. So successful actually, that there is now an urgent need to go beyond a browsing human and ...
Arnaud Sahuguet, Fabien Azavant