Sciweavers

8316 search results - page 4 / 1664
» Web Document Modeling
Sort
View
TREC
2004
13 years 11 months ago
Language Models for Searching in Web Corpora
: We describe our participation in the TREC 2004 Web and Terabyte tracks. For the web track, we employ mixture language models based on document full-text, incoming anchortext, and...
Jaap Kamps, Gilad Mishne, Maarten de Rijke
RIAO
2007
13 years 11 months ago
From Layout to Semantic: a Reranking Model for Mapping Web Documents to Mediated XML Representations
Many documents on the Web are formated in a weakly structured format. Because of their weak semantic and because of the heterogeneity of their formats, the information conveyed by...
Guillaume Wisniewski, Patrick Gallinari
RIVF
2007
13 years 11 months ago
Disambiguation of People in Web Search Using a Knowledge Base
— Results of queries by personal names often contain documents related to several people because of the namesake problem. In order to differentiate documents related to different...
Quang Minh Vu, Tomonari Masada, Atsuhiro Takasu, J...
ERCIMDL
2008
Springer
122views Education» more  ERCIMDL 2008»
13 years 11 months ago
Improving Temporal Language Models for Determining Time of Non-timestamped Documents
Taking the temporal dimension into account in searching, i.e., using time of content creation as part of the search condition, is now gaining increasingly interest. However, in the...
Nattiya Kanhabua, Kjetil Nørvåg
AUSDM
2008
Springer
243views Data Mining» more  AUSDM 2008»
14 years 1 days ago
Structure-Based Document Model with Discrete Wavelet Transforms and Its Application to Document Classification
Term signal is an existing text representation that depicts a term as a vector of frequencies of occurrences in a number of user-defined partitions of a document. Although term si...
Supphachai Thaicharoen, Tom Altman, Krzysztof J. C...