Sciweavers

» Extracting Web Data Using Instance-Based Learning
SEMWEB
2012
Springer
12 years 4 months ago
How to deal with massively heterogeneous cultural heritage data - lessons learned in CultureSampo
Abstract. This paper presents the CultureSampo system from the viewpoint of publishing heterogeneous linked data as a service. Discussed are the problems of converting legacy data ...
Eetu Mäkelä, Eero Hyvönen, Tuukka R...
GFKL
2005
Springer
14 years 2 months ago
A Hybrid Machine Learning Approach for Information Extraction from Free Text
Abstract. We present a hybrid machine learning approach for information extraction from unstructured documents by integrating a learned classifier based on the Maximum Entropy Mod...
Günter Neumann
IJCAI
1997
13 years 10 months ago
Using Case-Based Reasoning in Interpreting Unsupervised Inductive Learning Results
The objective of this work is to interpret inductive results obtained by the unsupervised learning method OSHAM. We briefly introduce the learning process of OSHAM, that extracts ...
Tu Bao Ho, Chi Mai Luong
ACL
2011
13 years 15 days ago
Can Document Selection Help Semi-supervised Learning? A Case Study On Event Extraction
Annotating training data for event extraction is tedious and labor-intensive. Most current event extraction tasks rely on hundreds of annotated documents, but this is often not en...
Shasha Liao, Ralph Grishman
SIGMOD
2009
ACM
14 years 3 months ago
Robust web extraction: an approach based on a probabilistic tree-edit model
On script-generated web sites, many documents share common HTML tree structure, allowing wrappers to effectively extract information of interest. Of course, the scripts and thus ...
Nilesh N. Dalvi, Philip Bohannon, Fei Sha