Search Sciweavers | Sciweavers

290 search results - page 21 / 58

» Document normalization revisited

COLING
2000

107 views, Computational Linguistics

A Method of Measuring Term Representativeness - Baseline Method Using Co-occurrence Distribution

15 years 8 months ago

Download acl.ldc.upenn.edu

This paper introduces a scheme, which we call the baseline method, to define a measure of term representativeness and measures defined by using the scheme. The representativeness ...

Toru Hisamitsu, Yoshiki Niwa, Jun-ichi Tsujii

SIGIR
2011
ACM

259 views, Information Technology

When documents are very long, BM25 fails!

14 years 10 months ago

Download sifaka.cs.uiuc.edu

We reveal that the Okapi BM25 retrieval function tends to overly penalize very long documents. To address this problem, we present a simple yet effective extension of BM25, namel...

Yuanhua Lv, ChengXiang Zhai

ICPR
2002
IEEE

139 views, Computer Vision

Robust Text Detection from Binarized Document Images

16 years 8 months ago

Download www.ee.oulu.fi

Many document images are rich in color and have complex background. To detect text from them, a standard approach utilizes both color and binary information. This often leads to t...

Oleg Okun, Yu Yan, Matti Pietikäinen

ICPR
2010
IEEE

209 views, Computer Vision

Text Separation from Mixed Documents Using a Tree-Structured Classifier

15 years 5 months ago

Download www.visionopen.com

In this paper, we propose a tree-structured multiclass classifier to identify annotations and overlapping text from machine printed documents. Each node of the tree-structured cla...

Xujun Peng, Srirangaraj Setlur, Venu Govindaraju, ...

DOCENG
2010
ACM

203 views, Document Analysis

Diffing, patching and merging XML documents: toward a generic calculus of editing deltas

15 years 4 months ago

Download www.xrce.xerox.com

This work addresses what we believe to be a central issue in the field of XML diff and merge computation: the mathematical modeling of so-called editing deltas and the study of their ...

Jean-Yves Vion-Dury
