Search Sciweavers | Sciweavers

81 search results - page 7 / 17

» Unsupervised named-entity extraction from the Web: An experi...

CIKM
2007
ACM

134 views, Information Technology

The role of documents vs. queries in extracting class attributes from text

14 years 3 months ago

Download www.cs.jhu.edu

Challenging the implicit reliance on document collections, this paper discusses the pros and cons of using query logs rather than document collections, as self-contained sources o...

Marius Pasca, Benjamin Van Durme, Nikesh Garera

CIKM
2009
ACM

132 views, Information Technology

Helping editors choose better seed sets for entity set expansion

14 years 3 months ago

Download www.patrickpantel.com

Sets of named entities are used heavily at commercial search engines such as Google, Yahoo and Bing. Acquiring sets of entities typically consists of combining semi-supervised exp...

Vishnu Vyas, Patrick Pantel, Eric Crestan

WWW
2009
ACM

152 views, Internet Technology

Bootstrapped extraction of class attributes

14 years 3 months ago

Download www2009.eprints.org

As an alternative to previous studies on extracting class attributes from unstructured text, which consider either Web documents or query logs as the source of textual data, a boo...

Joseph Reisinger, Marius Pasca

SIGMOD
2010
ACM

201 views, Database

I4E: interactive investigation of iterative information extraction

13 years 9 months ago

Download i.stanford.edu

Information extraction systems are increasingly being used to mine structured information from unstructured text documents. A commonly used unsupervised technique is to build iter...

Anish Das Sarma, Alpa Jain, Divesh Srivastava

WWW
2010
ACM

188 views, Internet Technology

Exploiting content redundancy for web information extraction

13 years 9 months ago

Download www.comp.nus.edu.sg

We propose a novel extraction approach that exploits content redundancy on the web to extract structured data from template-based web sites. We start by populating a seed database...

Pankaj Gulhane, Rajeev Rastogi, Srinivasan H. Seng...
