Abstract . In this paper we propose an original method to automatically summarize Web pages. This method is based on statistics rather than linguistics. It differs from most other ...
Random indexing is a method for constructing vector spaces at a reduced dimensionality. Previously, the method has been proposed using Kanerva’s sparse distributed memory model. ...
When real applications are working with automatic speech transcription, the first source of error does not originate from the incoherence in the analysis of the application but fr...
Abstract. This paper presents a method for the generation of structured data sources for music recommendation using information extracted from unstructured text sources. The propos...
Abstract. Statistical methods have shown a remarkable ability to capture semantics. The word2vec method is a frequently cited method for capturing meaningful semantic relations bet...
Abstract. We examine the combination of pattern-based and distributional similarity for the induction of semantic categories. Pattern-based methods are precise and sparse while dis...
Question answering over linked data has emerged in the past years as an important topic of research in order to provide natural language access to a growing body of linked open dat...
Sherzod Hakimov, Christina Unger, Sebastian Walter...
We present REDEN, a tool for graph-based Named Entity Linking that allows for the disambiguation of entities using domainspecific Linked Data sources and different configuration...
Francesca Frontini, Carmen Brando, Jean-Gabriel Ga...
We deal with the task of generating a query that retrieves a given set of documents. In its abstract form, this can be seen as a “compression” of the document set to a short qu...
With ever increasing information available in social networks, the number of businesses attempting to exploit it is on the rise, particularly by keeping track of their customers’...
Christian Colombo, Jean-Paul Grech, Gordon J. Pace