Sciweavers

368 search results - page 9 / 74
» Template-Based Information Mining from HTML Documents
Sort
View
AIED
2007
Springer
14 years 1 months ago
Helping Courseware Authors to Build Ontologies: The Case of TM4L
The authors of topic map-based learning resources face major difficulties in constructing the underlying ontologies. In this paper we propose two approaches to address this problem...
Darina Dicheva, Christo Dichev
COOPIS
1998
IEEE
13 years 11 months ago
Wrapper Generation for Web Accessible Data Sources
There is an increase in the number of data sources that can be queried across the WWW. Such sources typically support HTML forms-based interfaces and search engines query collecti...
Jean-Robert Gruser, Louiqa Raschid, Maria-Esther V...
ICDM
2006
IEEE
164views Data Mining» more  ICDM 2006»
14 years 1 months ago
Unsupervised Learning of Tree Alignment Models for Information Extraction
We propose an algorithm for extracting fields from HTML search results. The output of the algorithm is a database table– a data structure that better lends itself to high-level...
Philip Zigoris, Damian Eads, Yi Zhang
SGAI
2004
Springer
14 years 21 days ago
Neighbourhood Exploitation in Hypertext Categorization
As the web expands exponentially, the need to put some order to its content becomes apparent. Hypertext categorization, that is the automatic classification of web documents into ...
Houda Benbrahim, Max Bramer
WWW
2004
ACM
14 years 8 months ago
Hearsay: enabling audio browsing on hypertext content
In this paper we present HearSay, a system for browsing hypertext Web documents via audio. The HearSay system is based on our novel approach to automatically creating audio browsa...
I. V. Ramakrishnan, Amanda Stent, Guizhen Yang