Sciweavers

2677 search results - page 94 / 536
» Extracting Structured Data from Web Pages
Sort
View
VLDB
1999
ACM
140views Database» more  VLDB 1999»
14 years 2 months ago
Distributed Hypertext Resource Discovery Through Examples
We describe the architecture of a hypertext resource discovery system using a relational database. Such a system can answer questions that combine page contents, metadata, and hyp...
Soumen Chakrabarti, Martin van den Berg, Byron Dom
WSDM
2010
ACM
265views Data Mining» more  WSDM 2010»
14 years 7 months ago
Data-oriented Content Query System: Searching for Data into Text on the Web
As the Web provides rich data embedded in the immense contents inside pages, we witness many ad-hoc efforts for exploiting fine granularity information across Web text, such as We...
Kevin Chen-Chuan Chang, Mianwei Zhou, Tao Cheng
ACL
2006
13 years 11 months ago
Extractive Summarization using Inter- and Intra- Event Relevance
Event-based summarization attempts to select and organize the sentences in a summary with respect to the events or the sub-events that the sentences describe. Each event has its o...
Wenjie Li, Mingli Wu, Qin Lu, Wei Xu, Chunfa Yuan
FLAIRS
2001
13 years 11 months ago
Extracting Partial Structures from HTML Documents
The new wrapper model for extractiong text data from HTML documents is introduced. The Kushmerick's wrapper class (Kusshmerick 2000) may be unsuccessful in the case that suff...
Hiroshi Sakamoto, Yoshitsugu Murakami, Hiroki Arim...
CIT
2005
Springer
13 years 9 months ago
Simple Classification into Large Topic Ontology of Web Documents
The paper presents an approach to classifying Web documents into large topic ontology. The main emphasis is on having a simple approach appropriate for handling a large ontology an...
Marko Grobelnik, Dunja Mladenic