Search Sciweavers | Sciweavers

35 search results - page 6 / 7

» Mining reference tables for automatic text segmentation

281

Voted

DAS
2010
Springer

177views Document Analysis» more DAS 2010»

IAMonDo-database: an online handwritten document database with non-uniform contents

15 years 11 months ago

Download www.iam.unibe.ch

In this paper we present a new database of online handwritten documents with diﬀerent contents such as text, drawings, diagrams, formulas, tables, lists, and markings. It was de...

Emanuel Indermühle, Marcus Liwicki, Horst Bun...

claim paper

Read More »

175

Voted

KDD
2007
ACM

136views Data Mining» more KDD 2007»

Information genealogy: uncovering the flow of ideas in non-hyperlinked document databases

16 years 7 months ago

Download www.benyah.net

We now have incrementally-grown databases of text documents ranging back for over a decade in areas ranging from personal email, to news-articles and conference proceedings. While...

Benyah Shaparenko, Thorsten Joachims

claim paper

Read More »

168

click to vote

BMCBI
2005

106views more BMCBI 2005»

Thesaurus-based disambiguation of gene symbols

15 years 6 months ago

Download www.biomedcentral.com

Background: Massive text mining of the biological literature holds great promise of relating disparate information and discovering new knowledge. However, disambiguation of gene s...

Bob J. A. Schijvenaars, Barend Mons, Marc Weeber, ...

claim paper

Read More »

189

click to vote

COLING
2010

236views Computational Linguistics» more COLING 2010»

Resolving Surface Forms to Wikipedia Topics

15 years 1 months ago

Download aclweb.org

Ambiguity of entity mentions and concept references is a challenge to mining text beyond surface-level keywords. We describe an effective method of disambiguating surface forms an...

Yiping Zhou, Lan Nie, Omid Rouhani-Kalleh, Flavian...

claim paper

Read More »

157

click to vote

LREC
2008

141views Education» more LREC 2008»

New Resources for Document Classification, Analysis and Translation Technologies

15 years 8 months ago

Download www.lrec-conf.org

The goal of the DARPA MADCAT (Multilingual Automatic Document Classification Analysis and Translation) Program is to automatically convert foreign language text images into Englis...

Stephanie Strassel, Lauren Friedman, Safa Ismael, ...

claim paper

Read More »

« Prev « First page 6 / 7 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers