Sciweavers

DOCENG
2009
ACM
14 years 3 months ago
Creation and maintenance of multi-structured documents
In this article, we introduce a new problem: the construction of multi-structured documents. We first offer an overview of existing solutions to the representation of such docum...
Pierre-Edouard Portier, Sylvie Calabretto
DIS
2007
Springer
14 years 2 months ago
Unsupervised Spam Detection Based on String Alienness Measures
We propose an unsupervised method for detecting spam documents from Web page data, based on equivalence relations on strings. We propose 3 measures for quantifying the alienness (...
Kazuyuki Narisawa, Hideo Bannai, Kohei Hatano, Mas...
ICS
2003
Tsinghua U.
14 years 1 month ago
Estimating cache misses and locality using stack distances
Cache behavior modeling is an important part of modern optimizing compilers. In this paper we present a method to estimate the number of cache misses, at compile time, using a mac...
Calin Cascaval, David A. Padua
SIGIR
2009
ACM
14 years 3 months ago
Incorporating prior knowledge into a transductive ranking algorithm for multi-document summarization
This paper presents a transductive approach to learn ranking functions for extractive multi-document summarization. At the first stage, the proposed approach identifies topic th...
Massih-Reza Amini, Nicolas Usunier
CORR
2008
Springer
13 years 8 months ago
Expressing OLAP operators with the TAX XML algebra
With the rise of XML as a standard for representing business data, XML data warehouses appear as suitable solutions for Web-based decision-support applications. In this context, i...
Marouane Hachicha, Hadj Mahboubi, Jérô...