Search Sciweavers | Sciweavers

2677 search results - page 19 / 536

» Extracting Structured Data from Web Pages

178

click to vote

WSE
2003
IEEE

95views Internet Technology» more WSE 2003»

Using Keyword Extraction for Web Site Clustering

16 years 1 days ago

Download tcc.itc.it

Reverse engineering techniques have the potential to support Web site understanding, by providing views that show the organization of a site and its navigational structure. Howeve...

Paolo Tonella, Filippo Ricca, Emanuele Pianta, Chr...

claim paper

Read More »

161

click to vote

WWW
2006
ACM

104views Internet Technology» more WWW 2006»

GoGetIt!: a tool for generating structure-driven web crawlers

16 years 7 months ago

Download www2006.org

We present GoGetIt!, a tool for generating structure-driven crawlers that requires a minimum effort from the users. The tool takes as input a sample page and an entry point to a W...

Altigran Soares da Silva, Edleno Silva de Moura, J...

claim paper

Read More »

172

click to vote

AAAI
2010

147views Intelligent Agents» more AAAI 2010»

Prioritization of Domain-Specific Web Information Extraction

15 years 8 months ago

Download www.eecs.umich.edu

It is often desirable to extract structured information from raw web pages for better information browsing, query answering, and pattern mining. Many such Information Extraction (...

Jian Huang, Cong Yu

claim paper

Read More »

189

click to vote

AAAI
2006

123views Intelligent Agents» more AAAI 2006»

Table Extraction Using Spatial Reasoning on the CSS2 Visual Box Model

15 years 8 months ago

Download www.aaai.org

Tables on web pages contain a huge amount of semantically explicit information, which makes them a worthwhile target for automatic information extraction and knowledge acquisition...

Wolfgang Gatterbauer, Paul Bohunsky

claim paper

Read More »

179

click to vote

CIKM
2005
Springer

104views Information Technology» more CIKM 2005»

Retrieving answers from frequently asked questions pages on the web

16 years 9 days ago

Download staff.science.uva.nl

We address the task of answering natural language questions by using the large number of Frequently Asked Questions (FAQ) pages available on the web. The task involves three steps...

Valentin Jijkoun, Maarten de Rijke

claim paper

Read More »

« Prev « First page 19 / 536 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers