ICDE 2006, IEEE

Extracting Objects from the Web

Extracting and integrating object information from the Web is of great significance for Web data management. Existing Web information extraction techniques cannot provide a satisfactory solution to the Web object extraction task, because objects of the same type are distributed across diverse Web sources whose structures are highly heterogeneous. In this paper, we propose a novel approach called Object-Level Information Extraction (OLIE) to extract Web objects. This approach extends a classic information extraction algorithm, Conditional Random Fields (CRF), by adding Web-specific information. The experimental results show that OLIE can significantly improve Web object extraction accuracy.

Zaiqing Nie, Fei Wu, Ji-Rong Wen, Wei-Ying Ma

Database | ICDE 2006 | Object Extraction Accuracy | Web Information Extraction | Web Object Extraction

Post Info

Added	01 Nov 2009
Updated	01 Nov 2009
Type	Conference
Year	2006
Where	ICDE
Authors	Zaiqing Nie, Fei Wu, Ji-Rong Wen, Wei-Ying Ma
