Sciweavers

37 search results - page 7 / 8
» Extending Page Segmentation Algorithms for Mixed-Layout Docu...
JCDL
2004
ACM
14 years 23 days ago
Realistic books: a bizarre homage to an obsolete medium?
For many readers, handling a physical book is an enjoyably exquisite part of the information seeking process. Many physical characteristics of a book—its size, heft, the patina...
Yi-Chun Chu, David Bainbridge, Matt Jones, Ian H. ...
BMCBI
2006
13 years 7 months ago
MAGIC-SPP: a database-driven DNA sequence processing package with associated management tools
Background: Processing raw DNA sequence data is an especially challenging task for relatively small laboratories and core facilities that produce as many as 5000 or more DNA seque...
Chun Liang, Feng Sun, Haiming Wang, Junfeng Qu, Ro...
WSDM
2010
ACM
14 years 4 months ago
Boilerplate Detection using Shallow Text Features
In addition to the actual content, Web pages consist of navigational elements, templates, and advertisements. This boilerplate text typically is not related to the main content, ma...
Christian Kohlschütter, Peter Fankhauser, Wol...
CIKM
2008
Springer
13 years 9 months ago
Efficient and effective link analysis with precomputed SALSA maps
SALSA is a link-based ranking algorithm that takes the result set of a query as input, extends the set to include additional neighboring documents in the web graph, and performs a...
Marc Najork, Nick Craswell
ICIP
2010
IEEE
13 years 5 months ago
High quality scanned book compression using pattern matching
This paper proposes a hybrid approximate pattern matching/transform-based compression engine. The idea is to use regular video interframe prediction as a pattern matching algorit...
Alexandre Zaghetto, Ricardo L. de Queiroz