Sciweavers

587 search results - page 18 / 118
» Categorisation of web documents using extraction ontologies
SIGIR
2005
ACM
14 years 1 month ago
Title extraction from bodies of HTML documents and its application to web page retrieval
This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...
Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...
AIME
2005
Springer
14 years 1 month ago
Web Mining Techniques for Automatic Discovery of Medical Knowledge
In this paper, we propose an automatic and autonomous methodology to discover taxonomies of terms from the Web and represent retrieved web documents into a meaningful organization....
David Sánchez, Antonio Moreno
NLDB
2004
Springer
14 years 1 month ago
A Flexible Workbench for Document Analysis and Text Mining
Abstract: Document analysis and text mining techniques are used to preprocess documents in information retrieval systems, to extract concepts in ontology construction processes, an...
Jon Atle Gulla, Terje Brasethvik, Harald Kaada
EON
2007
13 years 9 months ago
Characterizing Knowledge on the Semantic Web with Watson
Abstract. Watson is a gateway to the Semantic Web: it collects, analyzes and gives access to ontologies and semantic data available online with the objective of supporting their dy...
Mathieu d'Aquin, Claudio Baldassarre, Laurian Grid...
WWW
2005
ACM
14 years 8 months ago
Interactive web-wrapper construction for extracting relational information from web documents
In this paper, we propose a new user interface to interactively specify Web wrappers to extract relational information from Web documents. In this study, we focused on improving u...
Tsuyoshi Sugibuchi, Yuzuru Tanaka