Sciweavers

219 search results - page 29 / 44
» Web page language identification based on URLs
Sort
View
DIS
2007
Springer
15 years 10 months ago
Unsupervised Spam Detection Based on String Alienness Measures
We propose an unsupervised method for detecting spam documents from Web page data, based on equivalence relations on strings. We propose 3 measures for quantifying the alienness (...
Kazuyuki Narisawa, Hideo Bannai, Kohei Hatano, Mas...
JOT
2008
136views more  JOT 2008»
15 years 3 months ago
The Stock Statistics Parser
This paper describes how use the HTMLEditorKit to perform web data mining on stock statistics for listed firms. Our focus is on making use of the web to get information about comp...
Douglas Lyon
143
Voted
VEE
2009
ACM
246views Virtualization» more  VEE 2009»
15 years 10 months ago
Tracing for web 3.0: trace compilation for the next generation web applications
Today’s web applications are pushing the limits of modern web browsers. The emergence of the browser as the platform of choice for rich client-side applications has shifted the ...
Mason Chang, Edwin W. Smith, Rick Reitmaier, Micha...
SEMWEB
2010
Springer
15 years 1 months ago
I18n of Semantic Web Applications
Recently, the use of semantic technologies has gained quite some traction. With increased use of these technologies, their maturation not only in terms of performance, robustness b...
Sören Auer, Matthias Weidl, Jens Lehmann, Amr...
SOCIALCOM
2010
15 years 1 months ago
Using Text Analysis to Understand the Structure and Dynamics of the World Wide Web as a Multi-Relational Graph
A representation of the World Wide Web as a directed graph, with vertices representing web pages and edges representing hypertext links, underpins the algorithms used by web search...
Harish Sethu, Alexander Yates