Search Sciweavers | Sciweavers

2677 search results - page 94 / 536

» Extracting Structured Data from Web Pages

211

click to vote

VLDB
1999
ACM

140views Database» more VLDB 1999»

Distributed Hypertext Resource Discovery Through Examples

15 years 11 months ago

Download www.cse.iitb.ac.in

We describe the architecture of a hypertext resource discovery system using a relational database. Such a system can answer questions that combine page contents, metadata, and hyp...

Soumen Chakrabarti, Martin van den Berg, Byron Dom

claim paper

Read More »

194

click to vote

WSDM
2010
ACM

265views Data Mining» more WSDM 2010»

Data-oriented Content Query System: Searching for Data into Text on the Web

16 years 4 months ago

Download www.ews.uiuc.edu

As the Web provides rich data embedded in the immense contents inside pages, we witness many ad-hoc efforts for exploiting fine granularity information across Web text, such as We...

Kevin Chen-Chuan Chang, Mianwei Zhou, Tao Cheng

claim paper

Read More »

209

click to vote

ACL
2006

121views Computational Linguistics» more ACL 2006»

Extractive Summarization using Inter- and Intra- Event Relevance

15 years 8 months ago

Download acl.ldc.upenn.edu

Event-based summarization attempts to select and organize the sentences in a summary with respect to the events or the sub-events that the sentences describe. Each event has its o...

Wenjie Li, Mingli Wu, Qin Lu, Wei Xu, Chunfa Yuan

claim paper

Read More »

191

click to vote

FLAIRS
2001

131views Artificial Intelligence» more FLAIRS 2001»

Extracting Partial Structures from HTML Documents

15 years 8 months ago

Download qir.kyushu-u.ac.jp

The new wrapper model for extractiong text data from HTML documents is introduced. The Kushmerick's wrapper class (Kusshmerick 2000) may be unsuccessful in the case that suff...

Hiroshi Sakamoto, Yoshitsugu Murakami, Hiroki Arim...

claim paper

Read More »

218

click to vote

CIT
2005
Springer

226views Information Technology» more CIT 2005»

Simple Classification into Large Topic Ontology of Web Documents

15 years 7 months ago

Download eprints.pascal-network.org

The paper presents an approach to classifying Web documents into large topic ontology. The main emphasis is on having a simple approach appropriate for handling a large ontology an...

Marko Grobelnik, Dunja Mladenic

claim paper

Read More »

« Prev « First page 94 / 536 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers