Sciweavers

466 search results - page 44 / 94
» Scalable Feature Extraction from Noisy Documents
Sort
View
SIGIR
2005
ACM
14 years 3 months ago
Using term informativeness for named entity detection
Informal communication (e-mail, bulletin boards) poses a difficult learning environment because traditional grammatical and lexical information are noisy. Other information is nec...
Jason D. M. Rennie, Tommi Jaakkola
SDM
2003
SIAM
134views Data Mining» more  SDM 2003»
13 years 11 months ago
Hierarchical Document Clustering using Frequent Itemsets
A major challenge in document clustering is the extremely high dimensionality. For example, the vocabulary for a document set can easily be thousands of words. On the other hand, ...
Benjamin C. M. Fung, Ke Wang, Martin Ester
DRR
2010
14 years 9 days ago
Biomedical article retrieval using multimodal features and image annotations in region-based CBIR
Biomedical images are invaluable in establishing diagnosis, acquiring technical skills, and implementing best practices in many areas of medicine. At present, images needed for in...
Daekeun You, Sameer Antani, Dina Demner-Fushman, M...
ICDAR
2009
IEEE
14 years 4 months ago
Locally Developable Constraint for Document Surface Reconstruction
This article presents a global optimization approach to reconstruct surfaces from a single document image. Instead of assuming globally developable in previous works which restric...
Yuanlong Shao, Xinguo Liu, Xueying Qin, Yi Xu, Huj...
HICSS
2008
IEEE
105views Biometrics» more  HICSS 2008»
14 years 4 months ago
Using Visual Features for Fine-Grained Genre Classification of Web Pages
The field of automatic genre classification has primarily focused on extracting textual features from documents. The goal of this research is to investigate whether visual feature...
Ryan Levering, Michal Cutler, Lei Yu