Search Sciweavers | Sciweavers

2337 search results - page 43 / 468

» Extracting Sequences from the Web

144

click to vote

CICLING
2009
Springer

140views Natural Language Processing» more CICLING 2009»

Business Specific Online Information Extraction from German Websites

16 years 4 months ago

Download www.cis.uni-muenchen.de

This paper presents a system that uses the domain name of a German business website to locate its information pages (e.g. company profile, contact page, imprint) and then identifi...

Yeong Su Lee, Michaela Geierhos

claim paper

Read More »

144

Voted

INLG
2010
Springer

128views Natural Language Processing» more INLG 2010»

Extracting Parallel Fragments from Comparable Corpora for Data-to-text Generation

15 years 2 months ago

Download www.aclweb.org

Building NLG systems, in particular statistical ones, requires parallel data (paired inputs and outputs) which do not generally occur naturally. In this paper, we investigate the ...

Anja Belz, Eric Kow

claim paper

Read More »

124

click to vote

POLICY
2007
Springer

125views Computer Networks» more POLICY 2007»

Adaptive Web Data Extraction Policies

15 years 10 months ago

Download cab.unime.it

Web data extraction is concerned, among other things, with routine data accessing and downloading from continuously-updated dynamic Web pages. There is a relevant trade-off between...

Giacomo Fiumara, Massimo Marchi, Alessandro Provet...

claim paper

Read More »

161

Voted

IJSI
2008

115views more IJSI 2008»

Towards Knowledge Acquisition from Semi-Structured Content

15 years 4 months ago

Download www.ijsi.org

Abstract A rich family of generic Information Extraction (IE) techniques have been developed by researchers nowadays. This paper proposes WebKER, a system for automatically extract...

Xi Bai, Jigui Sun, Haiyan Che, Lian Shi

claim paper

Read More »

131

click to vote

WISE
2002
Springer

178views Internet Technology» more WISE 2002»

Topic Extraction from News Archive Using TF*PDF Algorithm

15 years 9 months ago

Download www.miv.t.u-tokyo.ac.jp

Busy and no time to digest the news archive .... ? Ever since the Web wide-spreading, the amount of electronically available information online, especially news archive proliferat...

Khoo Khyou Bun, Mitsuru Ishizuka

claim paper

Read More »

« Prev « First page 43 / 468 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers