Sciweavers

543 search results - page 6 / 109
» Exploiting content redundancy for web information extraction
WWW
2005
ACM
14 years 8 months ago
Extracting semantic structure of web documents using content and visual information
This work aims to provide a page segmentation algorithm which uses both visual and content information to extract the semantic structure of a web page. The visual information is u...
Rupesh R. Mehta, Pabitra Mitra, Harish Karnick
CIKM
2009
Springer
13 years 8 months ago
OfCourse: web content discovery, classification and information extraction for online course materials
OfCourse: Web Content Discovery, Classification and Information Extraction for Online Course Materials Yuhong Xiong, Ping Luo, Yong Zhao, Fen Lin, Shicong Feng, Baoyao Zhou, Liw...
Yuhong Xiong, Ping Luo, Yong Zhao, Fen Lin, Shicon...
APCCM
2009
13 years 8 months ago
Extracting and Modeling the Semantic Information Content of Web Documents to Support Semantic Document Retrieval
Existing HTML markup is used only to indicate the structure and layout of documents, but not the document semantics. As a result web documents are difficult to be semantically p...
Shahrul Azman Noah, Lailatulqadri Zakaria, Arifah ...
WWW
2009
ACM
14 years 8 months ago
Exploiting web search engines to search structured databases
Web search engines often federate many user queries to relevant structured databases. For example, a product related query might be federated to a product database containing thei...
Arnd Christian König, Dong Xin, Kaushik Chakr...
WWW
2003
ACM
14 years 8 months ago
Web-R: a Tool to Record & Replay Personal Web Navigation
This poster presents a useful tool to capture the content of browsing sessions. Web-R saves systematically all the components sufficient and necessary to visualize offline the pag...
Jean-Daniel Kant, Alain Lifchitz