Sciweavers

608 search results - page 81 / 122
» Extracting Partition Statistics from Semistructured Data
Sort
View
IDA
2007
Springer
13 years 9 months ago
WWW traffic measure and its properties
Abstract. We present a method to extract a time series (Number of Active Requests (NAR)) from web cache logs which serves as a transport level measurement of internet traffic. This...
Marcus R. Keogh-Brown, Barbara Bogacka
PR
2007
84views more  PR 2007»
13 years 9 months ago
Attention-based similarity
A similarity measure is described that does not require the prior specification of features or the need for training sets of representative data. Instead large numbers of feature...
Fred Stentiford
WWW
2007
ACM
14 years 10 months ago
Combining classifiers to identify online databases
We address the problem of identifying the domain of online databases. More precisely, given a set F of Web forms automatically gathered by a focused crawler and an online database...
Luciano Barbosa, Juliana Freire
PVLDB
2008
141views more  PVLDB 2008»
13 years 9 months ago
WebTables: exploring the power of tables on the web
The World-Wide Web consists of a huge number of unstructured documents, but it also contains structured data in the form of HTML tables. We extracted 14.1 billion HTML tables from...
Michael J. Cafarella, Alon Y. Halevy, Daisy Zhe Wa...
ICIP
2007
IEEE
14 years 4 months ago
Modeling Gabor Coefficients via Generalized Gaussian Distributions for Face Recognition
Gabor filters are biologically motivated convolution kernels that have been widely used in the field of computer vision and, specially, in face recognition during the last decad...
Daniel González-Jiménez, Fernando P&...