Search Sciweavers | Sciweavers

1947 search results - page 68 / 390

» On the Automatic Extraction of Data from the Hidden Web

177

click to vote

SIBGRAPI
2000
IEEE

186views Computer Graphics» more SIBGRAPI 2000»

An Off-Line Signature Verification System using Hidden Markov Model and Cross-Validation

15 years 10 months ago

Download www.livia.etsmtl.ca

This work has as main objective to present an off-line signature verification system. It is basically divided into three parts. The first one demonstrates a pre-processing process,...

Edson J. R. Justino, Abdenaim El Yacoubi, Fl&aacut...

claim paper

Read More »

183

click to vote

PVLDB
2008

141views more PVLDB 2008»

WebTables: exploring the power of tables on the web

15 years 5 months ago

Download turing.cs.washington.edu

The World-Wide Web consists of a huge number of unstructured documents, but it also contains structured data in the form of HTML tables. We extracted 14.1 billion HTML tables from...

Michael J. Cafarella, Alon Y. Halevy, Daisy Zhe Wa...

claim paper

Read More »

181

click to vote

GFKL
2007
Springer

152views Data Mining» more GFKL 2007»

Supporting Web-based Address Extraction with Unsupervised Tagging

16 years 3 days ago

Download wortschatz.uni-leipzig.de

Abstract. The manual acquisition and modeling of tourist information as e.g. addresses of points of interest is time and, therefore, cost intensive. Furthermore, the encoded inform...

Berenike Loos, Chris Biemann

claim paper

Read More »

168

click to vote

ICADL
2007
Springer

129views Education» more ICADL 2007»

Using Automatic Metadata Extraction to Build a Structured Syllabus Repository

16 years 3 days ago

Download manas.tungare.name

Syllabi are important documents created by instructors for students. Students use syllabi to ﬁnd information and to prepare for class. Instructors often need to ﬁnd similar syl...

Xiaoyan Yu, Manas Tungare, Weiguo Fan, Manuel A. P...

claim paper

Read More »

144

click to vote

ECIR
2006
Springer

143views Information Technology» more ECIR 2006»

Automatic Acquisition of Chinese-English Parallel Corpus from the Web

15 years 7 months ago

Download research.microsoft.com

Parallel corpora are a valuable resource for tasks such as cross-language information retrieval and data-driven natural language processing systems. Previously only small scale cor...

Ying Zhang, Ke Wu, Jianfeng Gao, Phil Vines

claim paper

Read More »

« Prev « First page 68 / 390 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers