Search Sciweavers | Sciweavers

90 search results - page 15 / 18

» The lifecycle of a digital historical document: structure an...

click to vote

CIKM
2004
Springer

137views Information Technology» more CIKM 2004»

Hierarchical document categorization with support vector machines

14 years 24 days ago

Download www.cs.brown.edu

Automatically categorizing documents into pre-deﬁned topic hierarchies or taxonomies is a crucial step in knowledge and content management. Standard machine learning techniques ...

Lijuan Cai, Thomas Hofmann

claim paper

Read More »

click to vote

GIR
2006
ACM

189views Information Technology» more GIR 2006»

Associating spatial patterns to text-units for summarizing geographic information

14 years 1 months ago

Download www.geo.unizh.ch

Retrieving data based not only on key words is a challenge. We worked on semi-structured data (cultural heritage corpora). Our project aimed at getting the most relevant text-unit...

Julien Lesbegueries, Christian Sallaberry, Mauro G...

claim paper

Read More »

click to vote

ERCIMDL
2007
Springer

115views Education» more ERCIMDL 2007»

The Semantic GrowBag Algorithm: Automatically Deriving Categorization Systems

14 years 1 months ago

Download www.l3s.de

Using keyword search to ﬁnd relevant objects in digital libraries often results in way too large result sets. Based on the metadata associated with such objects, the faceted sear...

Jörg Diederich, Wolf-Tilo Balke

claim paper

Read More »

click to vote

CIKM
2009
Springer

183views Information Technology» more CIKM 2009»

Completing wikipedia's hyperlink structure through dimensionality reduction

14 years 2 months ago

Download www.cs.mcgill.ca

Wikipedia is the largest monolithic repository of human knowledge. In addition to its sheer size, it represents a new encyclopedic paradigm by interconnecting articles through hyp...

Robert West, Doina Precup, Joelle Pineau

claim paper

Read More »

click to vote

JCDL
2006
ACM

167views Education» more JCDL 2006»

Combining DOM tree and geometric layout analysis for online medical journal article segmentation

14 years 1 months ago

Download lhncbc.nlm.nih.gov

We describe an HTML web page segmentation algorithm, which is applied to segment online medical journal articles (regular HTML and PDF-Converted-HTML files). The web page content ...

Jie Zou, Daniel X. Le, George R. Thoma

claim paper

Read More »

« Prev « First page 15 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers