Sciweavers

» Unweaving a web of documents
CLEIEJ
2008
13 years 10 months ago
Measuring Contribution of HTML Features in Web Document Clustering
Documents in HTML format have many features to analyze, from the terms in special sections to the phrases that appear in the whole document. However, it is important to decide whi...
Esteban Meneses, Oldemar Rodríguez-Rojas
FAST
2009
13 years 7 months ago
The Case for Browser Provenance
In our increasingly networked world, web browsers are important applications. Originally an interface tool for accessing distributed documents, browsers have become ubiquitous, in...
Daniel W. Margo, Margo I. Seltzer
AUSDM
2008
Springer
14 years 3 days ago
Structure-Based Document Model with Discrete Wavelet Transforms and Its Application to Document Classification
Term signal is an existing text representation that depicts a term as a vector of frequencies of occurrences in a number of user-defined partitions of a document. Although term si...
Supphachai Thaicharoen, Tom Altman, Krzysztof J. C...
EEE
2004
IEEE
14 years 1 months ago
Model-Driven Web Services Development
Web service technologies are becoming increasingly important for integrating systems and services. There is much activity and interest around standardization and usage of web serv...
Roy Grønmo, David Skogan, Ida Solheim, Jon ...
TREC
2004
13 years 11 months ago
Language Models for Searching in Web Corpora
We describe our participation in the TREC 2004 Web and Terabyte tracks. For the web track, we employ mixture language models based on document full-text, incoming anchortext, and...
Jaap Kamps, Gilad Mishne, Maarten de Rijke