Sciweavers

1863 search results - page 65 / 373
» Automatic Collection of Related Terms from the Web
Sort
View
CEAS
2007
Springer
16 years 7 days ago
Characterizing Web Spam Using Content and HTTP Session Analysis
Web spam research has been hampered by a lack of statistically significant collections. In this paper, we perform the first large-scale characterization of web spam using conten...
Steve Webb, James Caverlee, Calton Pu
HT
2009
ACM
16 years 17 days ago
How are web characteristics evolving?
The Web is a hypertextual environment in permanent evolution. There are new technologies and Web publishing behaviors emerging everyday. This study presents trends on the evolutio...
João Miranda, Daniel Gomes
CIKM
2011
Springer
14 years 6 months ago
Integrating and querying web databases and documents
There exist many interrelated information sources on the Internet that can be categorized into structured (database) and semistructured (documents). A key challenge is to integrat...
Carlos Garcia-Alvarado, Carlos Ordonez
KDD
2006
ACM
143views Data Mining» more  KDD 2006»
16 years 6 months ago
Mining long-term search history to improve search accuracy
Long-term search history contains rich information about a user's search preferences. In this paper, we study statistical language modeling based methods to mine contextual i...
Bin Tan, Xuehua Shen, ChengXiang Zhai
ACL
2008
15 years 7 months ago
Solving Relational Similarity Problems Using the Web as a Corpus
We present a simple linguistically-motivated method for characterizing the semantic relations that hold between two nouns. The approach leverages the vast size of the Web in order...
Preslav Nakov, Marti A. Hearst