Search Sciweavers | Sciweavers

591 search results - page 36 / 119

» Extracting Route Directions from Web Pages

223

click to vote

WWW
2005
ACM

188views Internet Technology» more WWW 2005»

Hybrid semantic tagging for information extraction

16 years 8 months ago

Download www.www2005.org

The semantic web is expected to have an impact at least as big as that of the existing HTML based web, if not greater. However, the challenge lays in creating this semantic web an...

Ronen Feldman, Binyamin Rosenfeld, Moshe Fresko, B...

claim paper

Read More »

228

Voted

IJSI
2008

115views more IJSI 2008»

Towards Knowledge Acquisition from Semi-Structured Content

15 years 7 months ago

Download www.ijsi.org

Abstract A rich family of generic Information Extraction (IE) techniques have been developed by researchers nowadays. This paper proposes WebKER, a system for automatically extract...

Xi Bai, Jigui Sun, Haiyan Che, Lian Shi

claim paper

Read More »

214

click to vote

WWW
2005
ACM

150views Internet Technology» more WWW 2005»

Extracting context to improve accuracy for HTML content extraction

16 years 8 months ago

Download www1.cs.columbia.edu

Web pages contain clutter (such as ads, unnecessary images and extraneous links) around the body of an article, which distracts a user from actual content. Extraction of "use...

Suhit Gupta, Gail E. Kaiser, Salvatore J. Stolfo

claim paper

Read More »

248

click to vote

WWW
2010
ACM

188views Internet Technology» more WWW 2010»

Exploiting content redundancy for web information extraction

15 years 7 months ago

Download www.comp.nus.edu.sg

We propose a novel extraction approach that exploits content redundancy on the web to extract structured data from template-based web sites. We start by populating a seed database...

Pankaj Gulhane, Rajeev Rastogi, Srinivasan H. Seng...

claim paper

Read More »

202

Voted

ICTAI
2000
IEEE

88views Artificial Intelligence» more ICTAI 2000»

Reverse mapping of referral links from storage hierarchy for Web documents

15 years 12 months ago

Download www.scs.ryerson.ca

In world wide web, a document is usually made up of multiple pages, each one of which has a unique URL address and links to each other by hyperlink pointers. Related documents are...

Chen Ding, Chi-Hung Chi, Vincent Tam

claim paper

Read More »

« Prev « First page 36 / 119 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers