Sciweavers

SPIRE
2005
Springer
14 years 27 days ago
Stemming Arabic Conjunctions and Prepositions
Abdusalam F. A. Nwesri, Seyed M. M. Tahaghoghi, Fa...
SPIRE
2005
Springer
14 years 27 days ago
Classifying Sentences Using Induced Structure
In this article we will introduce a new approach (and several implementations) to the task of sentence classification, where pre-defined classes are assigned to sentences. This a...
Menno van Zaanen, Luiz Augusto Sangoi Pizzato, Die...
SPIRE
2005
Springer
14 years 27 days ago
XML Retrieval with a Natural Language Interface
Effective information retrieval in XML documents requires the user to have good knowledge of document structure and of some formal query language. XML query languages like XPath a...
Xavier Tannier, Shlomo Geva
SPIRE
2005
Springer
14 years 27 days ago
Counting Suffix Arrays and Strings
Klaus-Bernd Schürmann, Jens Stoye
SPIRE
2005
Springer
14 years 27 days ago
Counting Lumps in Word Space: Density as a Measure of Corpus Homogeneity
This paper introduces a measure of corpus homogeneity that indicates the amount of topical dispersion in a corpus. The measure is based on the density of neighborhoods in semantic ...
Magnus Sahlgren, Jussi Karlgren
SPIRE
2005
Springer
14 years 27 days ago
Faster Generation of Super Condensed Neighbourhoods Using Finite Automata
We present a new algorithm for generating super condensed neighbourhoods. Super condensed neighbourhoods have recently been presented as the minimal set of words that represent a p...
Luís M. S. Russo, Arlindo L. Oliveira
SPIRE
2005
Springer
14 years 27 days ago
Using the k-Nearest Neighbor Graph for Proximity Searching in Metric Spaces
Proximity searching consists in retrieving from a database, objects that are close to a query. For this type of searching problem, the most general model is the metric space, where...
Rodrigo Paredes, Edgar Chávez
SPIRE
2005
Springer
14 years 27 days ago
Fast Plagiarism Detection System
Maxim Mozgovoy, Kimmo Fredriksson, Daniel R. White...
SPIRE
2005
Springer
14 years 27 days ago
Lydia: A System for Large-Scale News Analysis
Levon Lloyd, Dimitrios Kechagias, Steven Skiena
SPIRE
2005
Springer
14 years 27 days ago
Linear Time Algorithm for the Generalised Longest Common Repeat Problem
Given a set of strings U = {T1, T2, . . . , T }, the longest common repeat problem is to find the longest common substring that appears at least twice in each string of U, conside...
Inbok Lee, Yoan José Pinzón Ardila