Sciweavers

304 search results - page 16 / 61
» Web Page Downloading and Classification
Sort
View
ADBIS
2007
Springer
110views Database» more  ADBIS 2007»
14 years 1 months ago
Design of Web Agents Inspired by Brain Research
The paper presents an approach to combine knowledge from memory and brain sciences with information retrieval research in the design of Web agents. An information retrieval agent f...
Maya Dimitrova, Hiroaki Wagatsuma, Yoko Yamaguchi
HICSS
2009
IEEE
150views Biometrics» more  HICSS 2009»
14 years 2 months ago
An N-Gram Based Approach to Automatically Identifying Web Page Genre
The research reported in this paper is the first phase of a larger project on the automatic classification of web pages by their genres, using ngram representations of the web pag...
Jane E. Mason, Michael A. Shepherd, Jack Duffy
WWW
2002
ACM
14 years 8 months ago
Parallel crawlers
In this paper we study how we can design an effective parallel crawler. As the size of the Web grows, it becomes imperative to parallelize a crawling process, in order to finish d...
Junghoo Cho, Hector Garcia-Molina
KDD
2009
ACM
172views Data Mining» more  KDD 2009»
14 years 8 months ago
Towards combining web classification and web information extraction: a case study
: ? Towards Combining Web Classification and Web Information Extraction: a Case Study Ping Luo, Fen Lin, Yuhong Xiong, Yong Zhao, Zhongzhi Shi HP Laboratories HPL-2009-86 Classific...
Ping Luo, Fen Lin, Yuhong Xiong, Yong Zhao, Zhongz...
CIKM
2005
Springer
14 years 1 months ago
Fast webpage classification using URL features
We demonstrate the usefulness of the uniform resource locator (URL) alone in performing web page classification. This approach is magnitudes faster than typical web page classific...
Min-Yen Kan, Hoang Oanh Nguyen Thi