Search Sciweavers | Sciweavers

543 search results - page 80 / 109

» Exploiting content redundancy for web information extraction

130

click to vote

MM
2010
ACM

174views Multimedia» more MM 2010»

Image classification using the web graph

15 years 5 months ago

Download research.yahoo.com

Image classification is a well-studied and hard problem in computer vision. We extend a proven solution for classifying web spam to handle images. We exploit the link structure of...

Dhruv Kumar Mahajan, Malcolm Slaney

claim paper

Read More »

142

click to vote

WWW
2005
ACM

116views Internet Technology» more WWW 2005»

A search engine for natural language applications

16 years 5 months ago

Download turing.cs.washington.edu

Many modern natural language-processing applications utilize search engines to locate large numbers of Web documents or to compute statistics over the Web corpus. Yet Web search e...

Michael J. Cafarella, Oren Etzioni

claim paper

Read More »

123

click to vote

LREC
2008

142views Education» more LREC 2008»

RACAI's Linguistic Web Services

15 years 6 months ago

Download www.lrec-conf.org

Nowadays, there are hundreds of Natural Language Processing applications and resources for different languages that are developed and/or used, almost exclusively with a few but no...

Dan Tufis, Radu Ion, Alexandru Ceausu, Dan Stefane...

claim paper

Read More »

175

click to vote

COLING
2010

130views Computational Linguistics» more COLING 2010»

Enhancing Cross Document Coreference of Web Documents with Context Similarity and Very Large Scale Text Categorization

15 years 4 days ago

Download clgiles.ist.psu.edu

Cross Document Coreference (CDC) is the task of constructing the coreference chain for mentions of a person across a set of documents. This work offers a holistic view of using do...

Jian Huang 0002, Pucktada Treeratpituk, Sarah M. T...

claim paper

Read More »

155

click to vote

RIAO
2007

167views Information Technology» more RIAO 2007»

From Layout to Semantic: a Reranking Model for Mapping Web Documents to Mediated XML Representations

15 years 6 months ago

Download eprints.pascal-network.org

Many documents on the Web are formated in a weakly structured format. Because of their weak semantic and because of the heterogeneity of their formats, the information conveyed by...

Guillaume Wisniewski, Patrick Gallinari

claim paper

Read More »

« Prev « First page 80 / 109 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers