Sciweavers

» A Learning Algorithm for Web Page Scoring Systems
TREC
2004
13 years 8 months ago
Experiments with Web QA System and TREC 2004 Questions
We describe our first participation in TREC. We only competed in the Question Answering (QA) category and limited our runs to factoids. Our approach was to use our open domain QA ...
Dmitri Roussinov, Yin Ding, Jose Antonio Robles-Fl...
SIGMOD
2003
ACM
14 years 22 days ago
Extracting Structured Data from Web Pages
Many web sites contain large sets of pages generated using a common template or layout. For example, Amazon lays out the author, title, comments, etc. in the same way in all its b...
Arvind Arasu, Hector Garcia-Molina
ICPR
2010
IEEE
13 years 5 months ago
Enhancing Web Page Classification via Local Co-training
In this paper we propose a new multi-view semi-supervised learning algorithm called Local Co-Training (LCT). The proposed algorithm employs a set of local models with vect...
Youtian Du, Xiaohong Guan, Zhongmin Cai
WWW
2005
ACM
14 years 1 months ago
Finding the boundaries of information resources on the web
In recent years, many algorithms for the Web have been developed that work with information units distinct from individual web pages. These include segments of web pages or aggreg...
Pavel Dmitriev, Carl Lagoze, Boris Suchkov
KDD
2009
ACM
14 years 2 days ago
Intelligent file scoring system for malware detection from the gray list
Currently, the most significant line of defense against malware is anti-virus products, which focus on authenticating valid software from a white list, blocking invalid software f...
Yanfang Ye, Tao Li, Qingshan Jiang, Zhixue Han, Li...