Sciweavers

421 search results - page 58 / 85
» Cross-Lingual Text Categorization
Sort
View
ICDAR
2009
IEEE
14 years 2 months ago
Finding Images and Line-Drawings in Document-Scanning Systems
The system presented in this paper finds images and line-drawings in scanned pages; it is a crucial processing step in the creation of a large-scale system to detect and index ima...
Shumeet Baluja, Michele Covell
VL
1994
IEEE
164views Visual Languages» more  VL 1994»
13 years 11 months ago
Similarity Patterns in Language
Dotplot is a technique for visualizing patterns of string matches in millions of lines of text and code. Patterns may be explored interactively or detected automatically. Applicat...
Jonathan Helfman
AI
2008
Springer
13 years 9 months ago
A Statistical Model for Topic Segmentation and Clustering
This paper presents a statistical model for discovering topical clusters of words in unstructured text. The model uses a hierarchical Bayesian structure and it is also able to iden...
M. Mahdi Shafiei, Evangelos E. Milios
IJON
2006
78views more  IJON 2006»
13 years 7 months ago
Improving self-organization of document collections by semantic mapping
In text management tasks, the dimensionality reduction becomes necessary to computation and interpretability of the results generated by machine learning algorithms. This paper de...
Renato Fernandes Corrêa, Teresa Bernarda Lud...
SIGIR
2008
ACM
13 years 7 months ago
On document splitting in passage detection
Passages can be hidden within a text to circumvent their disallowed transfer. Such release of compartmentalized information is of concern to all corporate and governmental organiz...
Nazli Goharian, Saket S. R. Mengle