Search Sciweavers | Sciweavers

368 search results - page 5 / 74

» Template-Based Information Mining from HTML Documents

ICDAR
1997
IEEE

143 views · Document Analysis

Representing OCRed documents in HTML

15 years 11 months ago

Download www.cedar.buffalo.edu

ABSTRACT: OCR is an error-prone process. It is time-consuming and expensive to manually proofread OCR results. The errors remaining in OCRed texts can cause serious problems in rea...

Tao Hong, Sargur N. Srihari

WOA
2001

131 views · Intelligent Agents

Object Oriented Mapping for HTML Documents

15 years 8 months ago

Download lia.deis.unibo.it

Emerging distributed technologies aim to provide simple and powerful tools for web services design and implementation. Main vendors provide modern frameworks so that a good coordi...

Francesco Garelli, Carlo Ferrari

ICTAI
1999
IEEE

101 views · Artificial Intelligence

A New Study on Using HTML Structures to Improve Retrieval

15 years 11 months ago

Download www.cs.binghamton.edu

Locating useful information effectively from the World Wide Web (WWW) is of wide interest. This paper presents new results on a methodology of using the structures and hyperlinks ...

Michal Cutler, H. Deng, S. Maniccam, Weiyi Meng

DMKD
2003
ACM

114 views · Data Mining

Deriving link-context from HTML tag tree

15 years 12 months ago

Download dollar.biz.uiowa.edu

HTML anchors are often surrounded by text that seems to describe the destination page appropriately. The text surrounding a link or the link-context is used for a variety of tasks...

Gautam Pant

IJCAI
2003

102 views · Artificial Intelligence

Information Extraction from Web Documents Based on Local Unranked Tree Automaton Inference

15 years 8 months ago

Download dli.iiit.ac.in

Information extraction (IE) aims at extracting specific information from a collection of documents. A lot of previous work on IE from semi-structured documents (in XML or HTML) us...

Raymond Kosala, Maurice Bruynooghe, Jan Van den Bu...
