Sciweavers

90 search results - page 8 / 18
» Extracting and Ranking Product Features in Opinion Documents
Sort
View
PAMI
2002
94views more  PAMI 2002»
13 years 7 months ago
Imaged Document Text Retrieval Without OCR
: We propose a method for text retrieval from document images without the use of OCR. Documents are segmented into character objects. Image features, namely the Vertical Traverse D...
Chew Lim Tan, Weihua Huang, Zhaohui Yu, Yi Xu
JCDL
2005
ACM
100views Education» more  JCDL 2005»
14 years 1 months ago
Automatic extraction of titles from general documents using machine learning
In this paper, we propose a machine learning approach to title extraction from general documents. By general documents, we mean documents that can belong to any one of a number of...
Yunhua Hu, Hang Li, Yunbo Cao, Dmitriy Meyerzon, Q...
IR
2010
13 years 6 months ago
LETOR: A benchmark collection for research on learning to rank for information retrieval
LETOR is a benchmark collection for the research on learning to rank for information retrieval, released by Microsoft Research Asia. In this paper, we describe the details of the L...
Tao Qin, Tie-Yan Liu, Jun Xu, Hang Li
ICIP
2001
IEEE
14 years 9 months ago
Similarity measure for CCITT Group 4 compressed document images
Similarity measure of document images acts a crucial role in the area of document image retrieval. A method of measuring the similarity of CCITT Group 4 compressed document images...
Yue Lu, Chew Lim Tan, Liying Fan, Weihua Huang
WWW
2010
ACM
14 years 2 months ago
Sampling high-quality clicks from noisy click data
Click data captures many users’ document preferences for a query and has been shown to help significantly improve search engine ranking. However, most click data is noisy and of...
Adish Singla, Ryen W. White