Sciweavers

2137 search results - page 67 / 428
» Extraction of Structural Information from the Web
NAACL
2010
15 years 1 month ago
Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment
The quality of a statistical machine translation (SMT) system is heavily dependent upon the amount of parallel sentences used in training. In recent years, there have been several...
Jason R. Smith, Chris Quirk, Kristina Toutanova
ER
2007
Springer
15 years 10 months ago
Automatic Hidden-Web Table Interpretation by Sibling Page Comparison
The longstanding problem of automatic table interpretation still eludes us. Its solution would not only be an aid to table processing applications such as large volume table conve...
Cui Tao, David W. Embley
AINA
2009
IEEE
15 years 11 months ago
Learning to Extract Content from News Webpages
We consider the problem of content extraction from online news webpages. To explore to what extent the syntactic markup and the visual structure of a webpage facilitate the extrac...
Alex Spengler, Patrick Gallinari
MIR
2004
ACM
15 years 9 months ago
Repeating pattern discovery and structure analysis from acoustic music data
Music and songs usually have repeating patterns and prominent structure. The automatic extraction of such repeating patterns and structure is useful for further music summarizatio...
Lie Lu, Muyuan Wang, HongJiang Zhang
ISI
2004
Springer
15 years 9 months ago
Generating Concept Hierarchies from Text for Intelligence Analysis
It is important to automatically extract key information from sensitive text documents for intelligence analysis. Text documents are usually unstructured and information extraction...
Jenq-Haur Wang, Chien-Chung Huang, Jei-Wen Teng, L...