Sciweavers

ECIR
2008
Springer
14 years 1 months ago
Clustering Template Based Web Documents
More and more documents on the World Wide Web are based on templates. On a technical level this causes those documents to have a quite similar source code and DOM tree structure. G...
Thomas Gottron
ECIR
2008
Springer
14 years 1 months ago
Utilizing Passage-Based Language Models for Document Retrieval
Abstract. We show that several previously proposed passage-based document ranking principles, along with some new ones, can be derived from the same probabilistic model. We use lan...
Michael Bendersky, Oren Kurland
EMNLP
2007
14 years 1 months ago
Enhancing Single-Document Summarization by Combining RankNet and Third-Party Sources
We present a new approach to automatic summarization based on neural nets, called NetSum. We extract a set of features from each sentence that helps identify its importance in the...
Krysta Marie Svore, Lucy Vanderwende, Christopher ...
ECIR
2007
Springer
14 years 1 months ago
Incorporating Diversity and Density in Active Learning for Relevance Feedback
Abstract. Relevance feedback, which uses the terms in relevant documents to enrich the user’s initial query, is an effective method for improving retrieval performance. An assoc...
Zuobing Xu, Ram Akella, Yi Zhang 0001
DIMVA
2007
14 years 1 months ago
A Study of Malcode-Bearing Documents
By exploiting the object-oriented dynamic composability of modern document applications and formats, malcode hidden in otherwise inconspicuous documents can reach third-party appli...
Wei-Jen Li, Salvatore J. Stolfo, Angelos Stavrou, ...
AVI
2010
14 years 1 months ago
Deep Diffs: visually exploring the history of a document
Software tools are used to compare multiple versions of a textual document to help a reader understand the evolution of that document over time. These tools generally support the ...
Ross Shannon, Aaron J. Quigley, Paddy Nixon
ESWS
2008
Springer
14 years 2 months ago
Contextual and Metadata-based Approach for the Semantic Annotation of Heterogeneous Documents
We present SHIRI-Annot an automatic ontology-driven and unsupervised approach for the semantic annotation of documents which contain well structured parts and not well structured o...
Mouhamadou Thiam, Nathalie Pernelle, Nacéra...
ESWS
2008
Springer
14 years 2 months ago
Improving Interoperability Using Query Interpretation in Semantic Vector Spaces
Abstract. In semantic web applications where query initiators and information providers do not necessarily share the same ontology, semantic interoperability generally relies on on...
Anthony Ventresque, Sylvie Cazalens, Philippe Lama...
ESWS
2008
Springer
14 years 2 months ago
IVEA: An Information Visualization Tool for Personalized Exploratory Document Collection Analysis
Knowledge work in many fields requires examining several aspects of a collection of documents to attain meaningful understanding that is not explicitly available. Despite recent ad...
VinhTuan Thai, Siegfried Handschuh, Stefan Decker
DOCENG
2008
ACM
14 years 2 months ago
Merging changes in XML documents using reliable context fingerprints
Different dialects of XML have emerged as ubiquitous document exchange formats. For effective collaboration based on such documents, the capability to propagate edit operations pe...
Sebastian Rönnau, Christian Pauli, Uwe M. Bor...