Search Sciweavers | Sciweavers

FLAIRS
2001

131 views, Artificial Intelligence

Extracting Partial Structures from HTML Documents

15 years 8 months ago

A new wrapper model for extracting text data from HTML documents is introduced. Kushmerick's wrapper class (Kushmerick 2000) may be unsuccessful in the case that suff...

Hiroshi Sakamoto, Yoshitsugu Murakami, Hiroki Arim...

DRR
2011

280 views, Document Analysis

Improved document image segmentation algorithm using multiresolution morphology

14 years 7 months ago

Download www.dfki.uni-kl.de

Page segmentation into text and non-text components is an essential preprocessing step before OCR operation. If this is not done properly, an OCR classification engine produces g...

Syed Saqib Bukhari, Faisal Shafait, Thomas M. Breu...

AIMSA
2008
Springer

118 views, Artificial Intelligence

Using Text Segmentation to Enhance the Cluster Hypothesis

16 years 1 month ago

Download www.info.univ-angers.fr

An alternative way to tackle Information Retrieval, called Passage Retrieval, considers text fragments independently rather than assessing global relevance of documents. In such a ...

Sylvain Lamprier, Tassadit Amghar, Bernard Levrat,...

PAKM
1998

114 views, Knowledge Management

Knowledge Management: A Text Mining Approach

15 years 8 months ago

Download liawww.epfl.ch

Knowledge Discovery in Databases (KDD), also known as data mining, focuses on the computerized exploration of large amounts of data and on the discovery of interesting patterns wi...

Ronen Feldman, Moshe Fresko, Haym Hirsh, Yonatan A...

EMNLP
2007

116 views, Natural Language Processing

Topic Segmentation with Hybrid Document Indexing

15 years 9 months ago

Download people.cs.uchicago.edu

We present a domain-independent unsupervised topic segmentation approach based on hybrid document indexing. Lexical chains have been successfully employed to evaluate lexical cohe...

Irina Matveeva, Gina-Anne Levow
