Sciweavers

Search results for "Using Recon for Data Cleaning"
WSDM
2010
ACM
215 views, Data Mining
14 years 5 months ago
Boilerplate Detection using Shallow Text Features
In addition to the actual content, Web pages consist of navigational elements, templates, and advertisements. This boilerplate text typically is not related to the main content, ma...
Christian Kohlschütter, Peter Fankhauser, Wol...
SIGMOD
2007
ACM
192 views, Database
14 years 7 months ago
Benchmarking declarative approximate selection predicates
Declarative data quality has been an active research topic. The fundamental principle behind a declarative approach to data quality is the use of declarative statements to realize...
Amit Chandel, Oktie Hassanzadeh, Nick Koudas, Moha...
EUROSYS
2009
ACM
13 years 11 months ago
Effective and efficient compromise recovery for weakly consistent replication
Weakly consistent replication of data has become increasingly important both for loosely-coupled collections of personal devices and for large-scale infrastructure services. Unfor...
Prince Mahajan, Ramakrishna Kotla, Catherine C. Ma...
AUSDM
2006
Springer
139 views, Data Mining
13 years 11 months ago
Integrated Scoring For Spelling Error Correction, Abbreviation Expansion and Case Restoration in Dirty Text
An increasing number of language and speech applications are gearing towards the use of texts from online sources as input. Despite such rise, not much work can be found in the as...
Wilson Wong, Wei Liu, Mohammed Bennamoun
ICMCS
2007
IEEE
76 views, Multimedia
14 years 1 month ago
Whispering Speaker Identification
This paper describes a study of automatically identifying whispering speakers. People usually whisper in order to avoid being identified or overheard by lowering their voices. Th...
Qin Jin, Szu-Chen Stan Jou, Tanja Schultz