Sciweavers

187 search results - page 12 / 38
» Entity categorization over large document collections
Sort
View
CIKM
2011
Springer
12 years 7 months ago
Integrating and querying web databases and documents
There exist many interrelated information sources on the Internet that can be categorized into structured (database) and semistructured (documents). A key challenge is to integrat...
Carlos Garcia-Alvarado, Carlos Ordonez
ICWSM
2009
13 years 5 months ago
A Categorical Model for Discovering Latent Structure in Social Annotations
The advent of social tagging systems has enabled a new community-based view of the Web in which objects like images, videos, and Web pages are annotated by thousands of users. Und...
Said Kashoob, James Caverlee, Ying Ding
WWW
2005
ACM
14 years 8 months ago
Hubble: an advanced dynamic folder system for XML
Organizing large document collections for finding information easily and quickly has always been an important user requirement. This paper describes a flexible and powerful dynami...
Ning Li, Joshua Hui, Hui-I Hsiao, Kevin S. Beyer
WWW
2005
ACM
14 years 8 months ago
An experimental study on large-scale web categorization
Taxonomies of the Web typically have hundreds of thousands of categories and skewed category distribution over documents. It is not clear whether existing text classification tech...
Tie-Yan Liu, Yiming Yang, Hao Wan, Qian Zhou, Bin ...
ERSHOV
2006
Springer
13 years 11 months ago
On the Importance of Parameter Tuning in Text Categorization
Abstract. Text Categorization algorithms have a large number of parameters that determine their behaviour, whose effect is not easily predicted objectively or intuitively and may v...
Cornelis H. A. Koster, Jean Beney