Sciweavers

» Building Documentation Generators
ACL
2009
A Generative Blog Post Retrieval Model that Uses Query Expansion based on External Collections
User generated content is characterized by short, noisy documents, with many spelling errors and unexpected language usage. To bridge the vocabulary gap between the user's in...
Wouter Weerkamp, Krisztian Balog, Maarten de Rijke
KDD
1999
ACM
On the Merits of Building Categorization Systems by Supervised Clustering
This paper investigates the use of supervised clustering in order to create sets of categories for classification of documents. We use information from a pre-existing taxonomy in o...
Charu C. Aggarwal, Stephen C. Gates, Philip S. Yu
CASCON
2007
Removing manually generated boilerplate from electronic texts: experiments with Project Gutenberg e-books
Collaborative work on unstructured or semistructured documents, such as in literature corpora or source code, often involves agreed upon templates containing metadata. These templ...
Owen Kaser, Daniel Lemire
ISMIR
2005
Springer
Using the Gamera Framework for Building a Lute Tablature Recognition System
In this article we describe an optical recognition system for historic lute tablature prints that we have built with the aid of the Gamera toolkit for document analysis and recogn...
Christophe Dalitz, Thomas Karsten
COLING
2010
A Method for Automatically Generating a Mediatory Summary to Verify Credibility of Information on the Web
In this paper, we propose a method for mediatory summarization, which is a novel technique for facilitating users' assessments of the credibility of information on the Web. A...
Hideyuki Shibuki, Takahiro Nagai, Masahiro Nakano,...