Search Sciweavers | Sciweavers

144 search results - page 6 / 29

» Methods for Domain-Independent Information Extraction from t...

click to vote

CIKM
2011
Springer

200views Information Technology» more CIKM 2011»

Semi-supervised multi-task learning of structured prediction models for web information extraction

12 years 7 months ago

Download www.keerthis.com

Extracting information from web pages is an important problem; it has several applications such as providing improved search results and construction of databases to serve user qu...

Paramveer S. Dhillon, Sundararajan Sellamanickam, ...

claim paper

Read More »

click to vote

SIGIR
2005
ACM

156views Information Technology» more SIGIR 2005»

Title extraction from bodies of HTML documents and its application to web page retrieval

14 years 1 months ago

Download research.microsoft.com

This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...

Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...

claim paper

Read More »

click to vote

IJSI
2008

115views more IJSI 2008»

Towards Knowledge Acquisition from Semi-Structured Content

13 years 7 months ago

Download www.ijsi.org

Abstract A rich family of generic Information Extraction (IE) techniques have been developed by researchers nowadays. This paper proposes WebKER, a system for automatically extract...

Xi Bai, Jigui Sun, Haiyan Che, Lian Shi

claim paper

Read More »

click to vote

WWW
2009
ACM

209views Internet Technology» more WWW 2009»

Incorporating site-level knowledge to extract structured data from web forums

14 years 8 months ago

Download www2009.eprints.org

Web forums have become an important data resource for many web applications, but extracting structured data from unstructured web forum pages is still a challenging task due to bo...

Jiang-Ming Yang, Rui Cai, Yida Wang, Jun Zhu, Lei ...

claim paper

Read More »

click to vote

CIKM
2010
Springer

225views Information Technology» more CIKM 2010»

Automatic metadata extraction from multilingual enterprise content

13 years 6 months ago

Download www.cngl.ie

Enterprises provide professionally authored content about their products/services in different languages for use in web sites and customer care. For customer care, personalization...

Melike Sah, Vincent Wade

claim paper

Read More »

« Prev « First page 6 / 29 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers