Sciweavers

» Document frequency and term specificity
WWW
2008
ACM
14 years 11 months ago
As we may perceive: finding the boundaries of compound documents on the web
This paper considers the problem of identifying on the Web compound documents (cDocs): groups of web pages that in aggregate constitute semantically coherent information entities...
Pavel Dmitriev
DEBU
2010
13 years 11 months ago
GeoSIM: A Geospatial Data Collection System for Participatory Urban Texture Documentation
Participatory texture documentation (PTD) is a geospatial data collection process in which a group of users (dedicated individuals and/or general public) with camera-equipped mobi...
Farnoush Banaei Kashani, Houtan Shirani-Mehr, Bei ...
TREC
2004
14 years 10 days ago
Feature Generation, Feature Selection, Classifiers, and Conceptual Drift for Biomedical Document Triage
We approached the problem of classifying papers for the TREC 2004 Genomics Track triage task as a four step process: feature generation, feature selection, classifier training, an...
Aaron M. Cohen, Ravi Teja Bhupatiraju, William R. ...
KDD
2005
ACM
14 years 11 months ago
Determining an author's native language by mining a text for errors
In this paper, we show that stylistic text features can be exploited to determine an anonymous author's native language with high accuracy. Specifically, we first use automat...
Moshe Koppel, Jonathan Schler, Kfir Zigdon
CIVR
2008
Springer
14 years 23 days ago
Web-based information content and its application to concept-based video retrieval
Semantic similarity between words or phrases is frequently used to find matching correlations between search queries and documents when straightforward matching of terms fails. Th...
Alexander Haubold, Apostol Natsev