Search Sciweavers | Sciweavers

263 search results - page 30 / 53

» Re-engineering structures from Web documents

161

Voted

JCDL
2006
ACM

167views Education» more JCDL 2006»

Combining DOM tree and geometric layout analysis for online medical journal article segmentation

16 years 20 days ago

Download lhncbc.nlm.nih.gov

We describe an HTML web page segmentation algorithm, which is applied to segment online medical journal articles (regular HTML and PDF-Converted-HTML files). The web page content ...

Jie Zou, Daniel X. Le, George R. Thoma

claim paper

Read More »

210

Voted

JCDL
2010
ACM

188views Education» more JCDL 2010»

Exposing the hidden web for chemical digital libraries

15 years 11 months ago

Download www.ifis.cs.tu-bs.de

In recent years, the vast amount of digitally available content has lead to the creation of many topic-centered digital libraries. Also in the domain of chemistry more and more di...

Sascha Tönnies, Benjamin Köhncke, Oliver...

claim paper

Read More »

241

click to vote

WEBI
2005
Springer

216views Internet Technology» more WEBI 2005»

A Semi-Supervised Document Clustering Algorithm Based on EM

16 years 6 days ago

Download www.dii.unisi.it

Document clustering is a very hard task in Automatic Text Processing since it requires to extract regular patterns from a document collection without a priori knowledge on the cat...

Leonardo Rigutini, Marco Maggini

claim paper

Read More »

201

Voted

ADC
2006
Springer

130views Database» more ADC 2006»

A two-phase rule generation and optimization approach for wrapper generation

16 years 21 days ago

Download crpit.com

Web information extraction is a fundamental issue for web information management and integrations. A common approach is to use wrappers to extract data from web pages or documents...

Yanan Hao, Yanchun Zhang

claim paper

Read More »

181

click to vote

DAGSTUHL
2006

210views Software Engineering» more DAGSTUHL 2006»

Are we Ready to Embrace the Semantic Web?

15 years 8 months ago

Download drops.dagstuhl.de

action from low level features to high level semantics. Owing to the proliferation of multimedia content in the internet, there is widespread interest in the semantic web community...

Shankar Vembu, Stephan Baumann

claim paper

Read More »

« Prev « First page 30 / 53 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers