Sciweavers

806 search results - page 102 / 162
» Extracting knowledge from evaluative text
Sort
View
NAACL
2003
13 years 11 months ago
A Generative Probabilistic OCR Model for NLP Applications
In this paper, we introduce a generative probabilistic optical character recognition (OCR) model that describes an end-to-end process in the noisy channel framework, progressing f...
Okan Kolak, William J. Byrne, Philip Resnik
SAC
2009
ACM
14 years 5 months ago
Combining statistics and semantics via ensemble model for document clustering
Incorporating background knowledge into data mining algorithms is an important but challenging problem. Current approaches in semi-supervised learning require explicit knowledge p...
Samah Jamal Fodeh, William F. Punch, Pang-Ning Tan
DEXAW
2003
IEEE
136views Database» more  DEXAW 2003»
14 years 3 months ago
Ontology Based Semantic Similarity Comparison of Documents
In this work we consider ontologies as knowledge structures that specify terms, their properties and relations among them to enable knowledge extraction from texts. We represent o...
Vladimir A. Oleshchuk, Asle Pedersen
SIGSOFT
2005
ACM
14 years 3 months ago
PR-Miner: automatically extracting implicit programming rules and detecting violations in large software code
Programs usually follow many implicit programming rules, most of which are too tedious to be documented by programmers. When these rules are violated by programmers who are unawar...
Zhenmin Li, Yuanyuan Zhou
RIAO
2000
13 years 11 months ago
Combining linguistic and spatial information for document analysis
We present a framework to analyze color documents of complex layout. In addition, no assumption is made on the layout. Our framework combines in a content-driven bottom-up approac...
Marco Aiello, Christof Monz, Leon Todoran