Search Sciweavers | Sciweavers

498 search results - page 32 / 100

» Robust web content extraction

189

click to vote

ICDAR
2003
IEEE

127views Document Analysis» more ICDAR 2003»

Identifying Story and Preview Images in News Web Pages

16 years 13 days ago

Download www.cse.salford.ac.uk

The World Wide Web provides an increasingly powerful and popular publication mechanism. Web documents often contain a large number of images serving various different purposes. Th...

Jianying Hu, Amit Bagga

claim paper

Read More »

230

click to vote

AIRWEB
2008
Springer

166views Internet Technology» more AIRWEB 2008»

A few bad votes too many?: towards robust ranking in social media

15 years 9 months ago

Download airweb.cse.lehigh.edu

Online social media draws heavily on active reader participation, such as voting or rating of news stories, articles, or responses to a question. This user feedback is invaluable ...

Jiang Bian, Yandong Liu, Eugene Agichtein, Hongyua...

claim paper

Read More »

207

click to vote

WSDM
2010
ACM

204views Data Mining» more WSDM 2010»

Learning URL patterns for webpage de-duplication

16 years 2 months ago

Download www.wsdm-conference.org

Presence of duplicate documents in the World Wide Web adversely aﬀects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...

Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...

claim paper

Read More »

214

Voted

CLEF
2008
Springer

175views Information Technology» more CLEF 2008»

Overview of VideoCLEF 2008: Automatic Generation of Topic-Based Feeds for Dual Language Audio-Visual Content

15 years 9 months ago

Download ilps.science.uva.nl

The VideoCLEF track, introduced in 2008, aims to develop and evaluate tasks related to analysis of and access to multilingual multimedia content. In its first year, VideoCLEF pilo...

Martha Larson, Eamonn Newman, Gareth J. F. Jones

claim paper

Read More »

181

click to vote

COLING
1992

90views Computational Linguistics» more COLING 1992»

Knowledge Extraction From Texts By Sintesi

15 years 8 months ago

Download acl.ldc.upenn.edu

In this paper we present SINTESI, a system for the knowledge extraction from Italian inputs, currently under development in our re,search centre. It is used on short descriptive d...

Fabio Ciravegna, Paolo Campia, Alberto Colognese

claim paper

Read More »

« Prev « First page 32 / 100 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers