Search Sciweavers | Sciweavers

103 search results - page 15 / 21

» Visual Web Information Extraction with Lixto

174

click to vote

MM
2010
ACM

199views Multimedia» more MM 2010»

TOP-SURF: a visual words toolkit

15 years 7 months ago

Download www.liacs.nl

TOP-SURF is an image descriptor that combines interest points with visual words, resulting in a high performance yet compact descriptor that is designed with a wide range of conte...

Bart Thomee, Erwin M. Bakker, Michael S. Lew

claim paper

Read More »

208

click to vote

WEBDB
1999
Springer

196views Database» more WEBDB 1999»

Web Ecology: Recycling HTML Pages as XML Documents Using W4F

15 years 11 months ago

Download db.cis.upenn.edu

In this paper we present the World-Wide Web Wrapper Factory (W4F), a Java toolkit to generate wrappers for Web data sources. Some key features of W4F are an expressive language to...

Arnaud Sahuguet, Fabien Azavant

claim paper

Read More »

224

click to vote

CSE
2009
IEEE

192views Theoretical Computer Science» more CSE 2009»

Web Science 2.0: Identifying Trends through Semantic Social Network Analysis

16 years 1 months ago

Download www.ickn.org

—We introduce a novel set of social network analysis based algorithms for mining the Web, blogs, and online forums to identify trends and find the people launching these new tren...

Peter A. Gloor, Jonas Krauss, Stefan Nann, Kai Fis...

claim paper

Read More »

193

click to vote

ICASSP
2011
IEEE

179views Signal Processing» more ICASSP 2011»

Horror video scene recognition via Multiple-Instance learning

14 years 10 months ago

Download mirlab.org

Along with the ever-growing Web comes the proliferation of objectionable content, such as pornography, violence, horror information, etc. Horror videos, whose threat to childrens ...

Jianchao Wang, Bing Li, Weiming Hu, Ou Wu

claim paper

Read More »

191

click to vote

DOCENG
2009
ACM

166views Document Analysis» more DOCENG 2009»

Object-level document analysis of PDF files

16 years 1 months ago

Download www.dbai.tuwien.ac.at

The PDF format is commonly used for the exchange of documents on the Web and there is a growing need to understand and extract or repurpose data held in PDF documents. Many system...

Tamir Hassan

claim paper

Read More »

« Prev « First page 15 / 21 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers