Search Sciweavers | Sciweavers

587 search results - page 21 / 118

» Categorisation of web documents using extraction ontologies

223

click to vote

DEXA
2005
Springer

109views Database» more DEXA 2005»

An XML Approach to Semantically Extract Data from HTML Tables

16 years 18 days ago

Download www.cis.unisa.edu.au

Abstract. Data intensive information is often published on the internet in the format of HTML tables. Extracting some of the information that is of users’ interest from the inter...

Jixue Liu, Zhuoyun Ao, Ho-Hyun Park, Yongfeng Chen

claim paper

Read More »

202

click to vote

ASWC
2006
Springer

150views Internet Technology» more ASWC 2006»

Finding Important Vocabulary Within Ontology

15 years 10 months ago

Download iws.seu.edu.cn

In current Semantic Web community, some researches have been done on ranking ontologies, while very little is paid to ranking vocabularies within ontology. However, finding importa...

Xiang Zhang, Hongda Li, Yuzhong Qu

claim paper

Read More »

206

click to vote

WWW
2003
ACM

130views Internet Technology» more WWW 2003»

DOM-based content extraction of HTML documents

16 years 7 months ago

Download www.psl.cs.columbia.edu

Web pages often contain clutter (such as pop-up ads, unnecessary images and extraneous links) around the body of an article that distracts a user from actual content. Extraction o...

Suhit Gupta, Gail E. Kaiser, David Neistadt, Peter...

claim paper

Read More »

223

click to vote

RIVF
2007

140views Internet Technology» more RIVF 2007»

Disambiguation of People in Web Search Using a Knowledge Base

15 years 8 months ago

Download www.adl.nii.ac.jp

— Results of queries by personal names often contain documents related to several people because of the namesake problem. In order to differentiate documents related to different...

Quang Minh Vu, Tomonari Masada, Atsuhiro Takasu, J...

claim paper

Read More »

224

click to vote

BMCBI
2006

153views more BMCBI 2006»

Automatic document classification of biological literature

15 years 7 months ago

Download www.biomedcentral.com

Background: Document classification is a wide-spread problem with many applications, from organizing search engine snippets to spam filtering. We previously described Textpresso, ...

David Chen, Hans-Michael Müller, Paul W. Ster...

claim paper

Read More »

« Prev « First page 21 / 118 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers