Sciweavers

300 search results - page 33 / 60
» Extracting Patterns and Relations from the World Wide Web
Sort
View
DSN
2006
IEEE
14 years 2 months ago
A Contribution Towards Solving the Web Workload Puzzle
World Wide Web, the biggest distributed system ever built, experiences tremendous growth and change in Web sites, users, and technology. A realistic and accurate characterization ...
Katerina Goseva-Popstojanova, Fengbin Li, Xuan Wan...
CIKM
2003
Springer
14 years 3 months ago
Multi-resolution disambiguation of term occurrences
We describe a system for extracting mentions of terms such as company and product names, in a large and noisy corpus of documents, such as the World Wide Web. Since natural langua...
Einat Amitay, Rani Nelken, Wayne Niblack, Ron Siva...
WWW
2009
ACM
14 years 5 months ago
Near real time information mining in multilingual news
This paper presents a near real-time multilingual news monitoring and analysis system that forms the backbone of our research work. The system integrates technologies to address t...
Martin Atkinson, Erik Van der Goot
WIDM
2004
ACM
14 years 3 months ago
Probabilistic models for focused web crawling
A Focused crawler must use information gleaned from previously crawled page sequences to estimate the relevance of a newly seen URL. Therefore, good performance depends on powerfu...
Hongyu Liu, Evangelos E. Milios, Jeannette Janssen
ITSSA
2006
129views more  ITSSA 2006»
13 years 10 months ago
A MultiAgent System for Classifying Bioinformatics Publications
Abstract. A growing amounts of information are currently being generated and stored in the World Wide Web (WWW), in particular, researchers in any field can find a lot of publicati...
Eloisa Vargiu, Andrea Addis