Sciweavers

25 search results - page 3 / 5
» Automatic Identification of Home Pages on the Web
INTR
2002
13 years 10 months ago
Methodologies for crawler based Web surveys
There have been many attempts to study the content of the web, either through human or automatic agents. Five different previously used web survey methodologies are described and ...
Mike Thelwall
CICLING
2009
Springer
14 years 2 months ago
Language Identification on the Web: Extending the Dictionary Method
Abstract. Automated language identification of written text is a well-established research domain that has received considerable attention in the past. By now, efficient and effecti...
Radim Rehurek, Milan Kolkus
CMS
2010
13 years 11 months ago
Throwing a MonkeyWrench into Web Attackers Plans
Abstract. Client-based attacks on internet users with malicious web pages represent a serious and rising threat. Internet Browsers with enabled active content technologies such as ...
Armin Büscher, Michael Meier, Ralf Benzmü...
LREC
2008
14 years 9 days ago
Babylon Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages
This paper describes BABYLON, a system that attempts to overcome the shortage of parallel texts in low-density languages by supplementing existing parallel texts with texts gather...
Michael Mohler, Rada Mihalcea
ACL
2006
14 years 8 days ago
A Collaborative Framework for Collecting Thai Unknown Words from the Web
We propose a collaborative framework for collecting Thai unknown words found on Web pages over the Internet. Our main goal is to design and construct a Web-based system which allow...
Choochart Haruechaiyasak, Chatchawal Sangkeettraka...