Sciweavers

313 search results - page 16 / 63
» Using Recon for Data Cleaning
Sort
View
LREC
2010
156views Education» more  LREC 2010»
13 years 9 months ago
Data-Driven and Ontological Analysis of FrameNet for Natural Language Reasoning
This paper focuses on the improvement of the conceptual structure of FrameNet for the sake of applying this resource to knowledgeintensive NLP tasks requiring reasoning, such as q...
Ekaterina Ovchinnikova, Laure Vieu, Alessandro Olt...
KDD
2002
ACM
183views Data Mining» more  KDD 2002»
14 years 8 months ago
E-CAST: A Data Mining Algorithm for Gene Expression Data
Data clustering methods have been proven to be a successful data mining technique in the analysis of gene expression data. The Cluster affinity search technique (CAST) developed b...
Abdelghani Bellaachia, David Portnoy, Yidong Chen,...
WISE
2005
Springer
14 years 1 months ago
Identifying Value Mappings for Data Integration: An Unsupervised Approach
The Web is a distributed network of information sources where the individual sources are autonomously created and maintained. Consequently, syntactic and semantic heterogeneity of ...
Jaewoo Kang, Dongwon Lee, Prasenjit Mitra
NAACL
2010
13 years 5 months ago
Training Paradigms for Correcting Errors in Grammar and Usage
This paper proposes a novel approach to the problem of training classifiers to detect and correct grammar and usage errors in text by selectively introducing mistakes into the tra...
Alla Rozovskaya, Dan Roth
ICDE
2005
IEEE
108views Database» more  ICDE 2005»
14 years 1 months ago
Robust Identification of Fuzzy Duplicates
Detecting and eliminating fuzzy duplicates is a critical data cleaning task that is required by many applications. Fuzzy duplicates are multiple seemingly distinct tuples which re...
Surajit Chaudhuri, Venkatesh Ganti, Rajeev Motwani