Sciweavers

1353 search results - page 100 / 271
» Text Indexing with Errors
Sort
View
SSDBM
2010
IEEE
220views Database» more  SSDBM 2010»
14 years 5 days ago
Prefix Tree Indexing for Similarity Search and Similarity Joins on Genomic Data
Similarity search and similarity join on strings are important for applications such as duplicate detection, error detection, data cleansing, or comparison of biological sequences....
Astrid Rheinländer, Martin Knobloch, Nicky Ho...
COLING
1992
13 years 9 months ago
Towards Robust PATR
We report on the initial stages of development of a robust parsing system, to be used as part of The Editor's Assistant, a program that detects and corrects textual errors an...
Shona Douglas, Robert Dale
ICDE
2003
IEEE
133views Database» more  ICDE 2003»
14 years 9 months ago
Text Joins for Data Cleansing and Integration in an RDBMS
An organization's data records are often noisy because of transcription errors, incomplete information, lack of standard formats for textual data or combinations thereof. A f...
Luis Gravano, Panagiotis G. Ipeirotis, Nick Koudas...
NLDB
2007
Springer
14 years 2 months ago
Large-Scale Knowledge Acquisition from Botanical Texts
Free text botanical descriptions contained in printed floras can provide a wealth of valuable scientific information. In spite of this richness, these texts have seldom been anal...
François Role, Milagros Fernandez Gavilanes...
AAAI
2006
13 years 9 months ago
Corpus-based and Knowledge-based Measures of Text Semantic Similarity
This paper presents a method for measuring the semantic similarity of texts, using corpus-based and knowledge-based measures of similarity. Previous work on this problem has focus...
Rada Mihalcea, Courtney Corley, Carlo Strapparava