Search Sciweavers | Sciweavers

61

LREC
2008

70views Education» more LREC 2008»

An Approach to Modeling Heterogeneous Resources for Information Extraction

15 years 3 months ago

In this paper, we describe an approach that aims to model heterogeneous resources for information extraction. Document is modeled in graph representation that enables better under...

Lei Xia, José Iria

claim paper

Read More »

99

Voted

WSDM
2010
ACM

215views Data Mining» more WSDM 2010»

Boilerplate Detection using Shallow Text Features

15 years 11 months ago

Download www.wsdm-conference.org

In addition to the actual content Web pages consist of navigational elements, templates, and advertisements. This boilerplate text typically is not related to the main content, ma...

Christian Kohlschütter, Peter Fankhauser, Wol...

claim paper

Read More »

151

Voted

IADIS
2004

127views Internet Technology» more IADIS 2004»

A conceptual modeling of multimedia documents

15 years 3 months ago

Download www.iadis.net

Our research works are interested in the identification and the representation of the semantic structures of multimedia documents. The semantic structure of a multimedia document ...

Mohamed Mbarki, Chantal Soulé-Dupuy

claim paper

Read More »

127

click to vote

DAGSTUHL
2003

110views Software Engineering» more DAGSTUHL 2003»

Interactive Mathematical Documents on the Web

15 years 3 months ago

Download www.win.tue.nl

This paper deals with our work on interactive mathematical documents. These documents accomodate various sources, users, and mathematical services. Communication of mathematics bet...

Arjeh M. Cohen, Hans Cuypers, Ernesto Reinaldo Bar...

claim paper

Read More »

120

Voted

IJCAI
2003

102views Artificial Intelligence» more IJCAI 2003»

Information Extraction from Web Documents Based on Local Unranked Tree Automaton Inference

15 years 3 months ago

Download dli.iiit.ac.in

Information extraction (IE) aims at extracting specific information from a collection of documents. A lot of previous work on 10 from semi-structured documents (in XML or HTML) us...

Raymond Kosala, Maurice Bruynooghe, Jan Van den Bu...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers