Sciweavers

8316 search results - page 91 / 1664
» Web Document Modeling
Sort
View
LREC
2008
70views Education» more  LREC 2008»
13 years 11 months ago
An Approach to Modeling Heterogeneous Resources for Information Extraction
In this paper, we describe an approach that aims to model heterogeneous resources for information extraction. Document is modeled in graph representation that enables better under...
Lei Xia, José Iria
WSDM
2010
ACM
215views Data Mining» more  WSDM 2010»
14 years 7 months ago
Boilerplate Detection using Shallow Text Features
In addition to the actual content Web pages consist of navigational elements, templates, and advertisements. This boilerplate text typically is not related to the main content, ma...
Christian Kohlschütter, Peter Fankhauser, Wol...
IADIS
2004
13 years 11 months ago
A conceptual modeling of multimedia documents
Our research works are interested in the identification and the representation of the semantic structures of multimedia documents. The semantic structure of a multimedia document ...
Mohamed Mbarki, Chantal Soulé-Dupuy
DAGSTUHL
2003
13 years 11 months ago
Interactive Mathematical Documents on the Web
This paper deals with our work on interactive mathematical documents. These documents accomodate various sources, users, and mathematical services. Communication of mathematics bet...
Arjeh M. Cohen, Hans Cuypers, Ernesto Reinaldo Bar...
IJCAI
2003
13 years 11 months ago
Information Extraction from Web Documents Based on Local Unranked Tree Automaton Inference
Information extraction (IE) aims at extracting specific information from a collection of documents. A lot of previous work on 10 from semi-structured documents (in XML or HTML) us...
Raymond Kosala, Maurice Bruynooghe, Jan Van den Bu...