Sciweavers

620 search results - page 16 / 124
» Computing with words for text processing: An approach to the...
Sort
View
SIGMOD
2004
ACM
150views Database» more  SIGMOD 2004»
14 years 9 months ago
When one Sample is not Enough: Improving Text Database Selection Using Shrinkage
Database selection is an important step when searching over large numbers of distributed text databases. The database selection task relies on statistical summaries of the databas...
Panagiotis G. Ipeirotis, Luis Gravano
ACL
2009
13 years 6 months ago
A Novel Word Segmentation Approach for Written Languages with Word Boundary Markers
Most NLP applications work under the assumption that a user input is error-free; thus, word segmentation (WS) for written languages that use word boundary markers (WBMs), such as ...
Han-Cheol Cho, Do-Gil Lee, Jung-Tae Lee, Pontus St...
ICPR
2006
IEEE
14 years 10 months ago
A Robust Split-and-Merge Text Segmentation Approach for Images
In this paper we describe a robust approach to segment text from color images. The proposed approach mainly includes four steps. Firstly, a preprocessing step is utilized to enhan...
Weiqiang Wang, Wen Gao, Yaowen Zhan
CAIP
2009
Springer
211views Image Analysis» more  CAIP 2009»
14 years 3 months ago
Contextual-Guided Bag-of-Visual-Words Model for Multi-class Object Categorization
Abstract. Bag-of-words model (BOW) is inspired by the text classification problem, where a document is represented by an unsorted set of contained words. Analogously, in the objec...
Mehdi Mirza-Mohammadi, Sergio Escalera, Petia Rade...
KDD
2003
ACM
146views Data Mining» more  KDD 2003»
14 years 9 months ago
Style mining of electronic messages for multiple authorship discrimination: first results
This paper considers the use of computational stylistics for performing authorship attribution of electronic messages, addressing categorization problems with as many as 20 differ...
Shlomo Argamon, Marin Saric, Sterling Stuart Stein