Sciweavers

Extraction of Structural Information from the Web
IWPC
2009
IEEE
15 years 11 months ago
Natural language parsing for fact extraction from source code
We present a novel approach to extract structural information from source code using state-of-the-art parser technologies for natural languages. The parser technology is robust in...
Jens Nilsson, Welf Löwe, Johan Hall, Joakim N...
AIIA
2007
Springer
15 years 10 months ago
Harvesting Relational and Structured Knowledge for Ontology Building in the WPro Architecture
We present two algorithms for supporting semi-automatic ontology building, integrated in WPro, a new architecture for ontology learning from Web documents. The first algorithm auto...
Daniele Bagni, Marco Cappella, Maria Teresa Pazien...
AAAI
2006
15 years 5 months ago
Phoebus: A System for Extracting and Integrating Data from Unstructured and Ungrammatical Sources
With the proliferation of online classifieds and auctions comes a new need to meaningfully search and organize the items for sale. However, since the seller's item descriptio...
Matthew Michelson, Craig A. Knoblock
ESWS
2007
Springer
15 years 10 months ago
A Unified Approach to Retrieving Web Documents and Semantic Web Data
The Semantic Web seems to be evolving into a property-linked web of RDF data, conceptually divorced from (but physically housed in) the hyperlinked web of HTML documents. We discus...
Trivikram Immaneni, Krishnaprasad Thirunarayan
CIKM
2008
Springer
15 years 6 months ago
Predicting web spam with HTTP session information
Web spam is a widely-recognized threat to the quality and security of the Web. Web spam pages pollute search engine indexes, burden Web crawlers and Web mining services, and expos...
Steve Webb, James Caverlee, Calton Pu