Sciweavers

1353 search results - page 100 / 271
» Text Indexing with Errors
SSDBM
2010
IEEE
15 years 8 months ago
Prefix Tree Indexing for Similarity Search and Similarity Joins on Genomic Data
Similarity search and similarity join on strings are important for applications such as duplicate detection, error detection, data cleansing, or comparison of biological sequences....
Astrid Rheinländer, Martin Knobloch, Nicky Ho...
COLING
1992
15 years 4 months ago
Towards Robust PATR
We report on the initial stages of development of a robust parsing system, to be used as part of The Editor's Assistant, a program that detects and corrects textual errors an...
Shona Douglas, Robert Dale
ICDE
2003
IEEE
16 years 5 months ago
Text Joins for Data Cleansing and Integration in an RDBMS
An organization's data records are often noisy because of transcription errors, incomplete information, lack of standard formats for textual data or combinations thereof. A f...
Luis Gravano, Panagiotis G. Ipeirotis, Nick Koudas...
NLDB
2007
Springer
15 years 9 months ago
Large-Scale Knowledge Acquisition from Botanical Texts
Free text botanical descriptions contained in printed floras can provide a wealth of valuable scientific information. In spite of this richness, these texts have seldom been anal...
François Role, Milagros Fernandez Gavilanes...
AAAI
2006
15 years 5 months ago
Corpus-based and Knowledge-based Measures of Text Semantic Similarity
This paper presents a method for measuring the semantic similarity of texts, using corpus-based and knowledge-based measures of similarity. Previous work on this problem has focus...
Rada Mihalcea, Courtney Corley, Carlo Strapparava