Sciweavers

2677 search results - page 79 / 536
» Extracting Structured Data from Web Pages
Sort
View
BTW
2003
Springer
140views Database» more  BTW 2003»
14 years 2 months ago
An Ontology for Domain-oriented Semantic Similarity Search on XML Data
Abstract: Query languages for XML such as XPath or XQuery support Boolean retrieval where a query result is a (possibly restructured) subset of XML elements or entire documents tha...
Anja Theobald
WWW
2006
ACM
14 years 9 months ago
Bilingual web page and site readability assessment
Readability assessment is a method to measure the difficulty of a piece of text material, and it is widely used in educational field to assist instructors to prepare appropriate m...
Tak Pang Lau, Irwin King
PAMI
2007
107views more  PAMI 2007»
13 years 8 months ago
Recognition of Pornographic Web Pages by Classifying Texts and Images
—With the rapid development of the World Wide Web, people benefit more and more from the sharing of information. However, Web pages with obscene, harmful, or illegal content can ...
Weiming Hu, Ou Wu, Zhouyao Chen, Zhouyu Fu, Stephe...
NAACL
2007
13 years 10 months ago
Multilingual Structural Projection across Interlinear Text
This paper explores the potential for annotating and enriching data for low-density languages via the alignment and projection of syntactic structure from parsed data for resource...
Fei Xia, William Lewis
KDD
2008
ACM
195views Data Mining» more  KDD 2008»
14 years 9 months ago
Learning from multi-topic web documents for contextual advertisement
Contextual advertising on web pages has become very popular recently and it poses its own set of unique text mining challenges. Often advertisers wish to either target (or avoid) ...
Yi Zhang, Arun C. Surendran, John C. Platt, Mukund...