Sciweavers

LREC
2010
187views Education» more  LREC 2010»
14 years 27 days ago
Belgisch Staatsblad Corpus: Retrieving French-Dutch Sentences from Official Documents
We describe the compilation of a large corpus of French-Dutch sentence pairs from official Belgian documents which are available in the online version of the publication Belgisch ...
Tom Vanallemeersch
LREC
2010
194views Education» more  LREC 2010»
14 years 27 days ago
Building a Gold Standard for Event Detection in Croatian
This paper describes the process of building a newspaper corpus annotated with events described in specific documents. The main difference to the corpora built as part of the TDT ...
Nikola Ljubesic, Tomislava Lauc, Damir Boras
ECIR
2007
Springer
14 years 27 days ago
Entropy-Based Authorship Search in Large Document Collections
The purpose of authorship search is to identify documents written by a particular author or in a particular style in large document collections. Standard search engines match docum...
Ying Zhao, Justin Zobel
ECIR
2007
Springer
14 years 27 days ago
Model Tree Learning for Query Term Weighting in Question Answering
Question answering systems rely on retrieval components to identify documents that contain an answer to a user’s question. The formulation of queries that are used for retrieving...
Christof Monz
LREC
2010
150views Education» more  LREC 2010»
14 years 27 days ago
A Corpus for Evaluating Semantic Multilingual Web Retrieval Systems: The Sense Folder Corpus
In this paper, we present the multilingual Sense Folder Corpus. After the analysis of different corpora, we describe the requirements that have to be satisfied for evaluating sema...
Ernesto William De Luca
AMW
2010
14 years 27 days ago
Generating XML/GML Schemas from Geographic Conceptual Schemas
Abstract. A large volume of data with complex structures is currently represented in GML (Geography Markup Language) for storing and exchanging geographic information. As the size ...
André C. Hora, Clodoveu A. Davis Jr., Mirel...
AAAI
2010
14 years 28 days ago
Utilizing Context in Generative Bayesian Models for Linked Corpus
In an interlinked corpus of documents, the context in which a citation appears provides extra information about the cited document. However, associating terms in the context to th...
Saurabh Kataria, Prasenjit Mitra, Sumit Bhatia
EKAW
2008
Springer
14 years 1 months ago
Ontological Profiles in Enterprise Search
Ontology-driven search applications use ontological concepts either to index documents or to guide and understand the users. Since ontologies by nature are domain-dependent and app...
Geir Solskinnsbakk, Jon Atle Gulla
EDBT
2008
ACM
136views Database» more  EDBT 2008»
14 years 1 months ago
Dealing with P2P semantic heterogeneity through query expansion and interpretation
In P2P systems where query initiators and information providers do not necessarily share the same ontology, semantic interoperability generally relies on ontology matching or sche...
Anthony Ventresque, Sylvie Cazalens, Philippe Lama...
DEXAW
2008
IEEE
136views Database» more  DEXAW 2008»
14 years 1 months ago
Segmentation of Legislative Documents Using a Domain-Specific Lexicon
The amount of legal information is continuously growing. New legislative documents appear everyday in the Web. Legal documents are produced on a daily basis in briefingformat, cont...
Ismael Hasan, Javier Parapar, Roi Blanco