Search Sciweavers | Sciweavers

43 search results - page 5 / 9

» Scalable Attribute-Value Extraction from Semi-structured Tex...

DL
2000
Springer

162 views, Digital Library

Snowball: extracting relations from large plain-text collections

15 years 11 months ago

Download www.cs.columbia.edu

Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use...

Eugene Agichtein, Luis Gravano

EDBT
2009
ACM

123 views, Database

High-performance information extraction with AliBaba

16 years 1 month ago

Download www.informatik.hu-berlin.de

A wealth of information is available only in web pages, patents, publications etc. Extracting information from such sources is challenging, both due to the typically complex langu...

Peter Palaga, Long Nguyen, Ulf Leser, Jörg Ha...

PAKDD
2009
ACM

116 views, Data Mining

Scalable Web Mining with Newistic

16 years 1 month ago

Download www.horatiumocian.com

Abstract. Newistic is a web mining platform that collects and analyses documents crawled from the Internet. Although it currently processes news articles, it can be easily adapted ...

Ovidiu Dan, Horatiu Mocian

DL
2000
Springer

164 views, Digital Library

Scalable browsing for large collections: a case study

15 years 11 months ago

Download www.cs.waikato.ac.nz

Phrase browsing techniques use phrases extracted automatically from a large information collection as a basis for browsing and accessing it. This paper describes a case study that...

Gordon W. Paynter, Ian H. Witten, Sally Jo Cunning...

CLEF
2010
Springer

191 views, Information Technology

A Textual-Based Similarity Approach for Efficient and Scalable External Plagiarism Analysis - Lab Report for PAN at CLEF 2010

15 years 8 months ago

Download www.uni-weimar.de

In this paper we present an approach to detect external plagiarism based on textual similarity. This is an efficient and precise method that can be applied over large sets of docum...

Daniel Micol, Óscar Ferrández, Ferna...
