Sciweavers

48 search results - page 4 / 10
» Language Based Crawling: Crawling the Arabic Content of the ...
NIPS
2000
13 years 8 months ago
The Missing Link - A Probabilistic Model of Document Content and Hypertext Connectivity
We describe a joint probabilistic model for modeling the contents and inter-connectivity of document collections such as sets of web pages or research paper archives. The model is...
David A. Cohn, Thomas Hofmann
SIGMOD
2010
ACM
13 years 7 months ago
Optimizing content freshness of relations extracted from the web using keyword search
An increasing number of applications operate on data obtained from the Web. These applications typically maintain local copies of the web data to avoid network latency in data acc...
Mohan Yang, Haixun Wang, Lipyeow Lim, Min Wang
WWW
2005
ACM
14 years 7 months ago
The infocious web search engine: improving web searching through linguistic analysis
In this paper we present the Infocious Web search engine [23]. Our goal in creating Infocious is to improve the way people find information on the Web by resolving ambiguities pre...
Alexandros Ntoulas, Gerald Chao, Junghoo Cho
ISF
2011
13 years 1 month ago
A multi-region empirical study on the internet presence of global extremist organizations
Extremist organizations are heavily utilizing Internet technologies to increase their abilities to influence the world. Studying those global extremist organizations’ In...
Jialun Qin, Yilu Zhou, Hsinchun Chen
WWW
2003
ACM
14 years 6 days ago
AnswerBus News Engine
AnswerBus News Engine is a question answering system using the contents of CNN Web site as its knowledge base. Comparing to other question answering systems including its previo...
Zhiping Zheng