Sciweavers

95 search results - page 9 / 19
ACL
2010
13 years 5 months ago
Unsupervised Ontology Induction from Text
Extracting knowledge from unstructured text is a long-standing goal of NLP. Although learning approaches to many of its subtasks have been developed (e.g., parsing, taxonomy induc...
Hoifung Poon, Pedro Domingos
CIKM
2008
Springer
13 years 9 months ago
Scalable community discovery on textual data with relations
Every piece of textual data is generated as a method to convey its authors' opinion regarding specific topics. Authors deliberately organize their writings and create links, ...
Huajing Li, Zaiqing Nie, Wang-Chien Lee, C. Lee Gi...
IJCAI
2003
13 years 8 months ago
Web Page Cleaning for Web Mining through Feature Weighting
Unlike conventional data or text, Web pages typically contain a large amount of information that is not part of the main contents of the pages, e.g., banner ads, navigation bars, ...
Lan Yi, Bing Liu
KDD
2007
ACM
14 years 1 month ago
Hierarchical mixture models: a probabilistic analysis
Mixture models form one of the most widely used classes of generative models for describing structured and clustered data. In this paper we develop a new approach for the analysis...
Mark Sandler
ICDAR
1995
IEEE
13 years 11 months ago
Visual inter-word relations and their use in OCR postprocessing
A technique is presented that uses visual relationships between word images in a document to improve the recognition of the text it contains. This technique takes advantage of the...
Tao Hong, Jonathan J. Hull