Sciweavers

81 search results - page 9 / 17
» Estimating web site readability using content extraction
Sort
View
WWW
2009
ACM
14 years 8 months ago
Incorporating site-level knowledge to extract structured data from web forums
Web forums have become an important data resource for many web applications, but extracting structured data from unstructured web forum pages is still a challenging task due to bo...
Jiang-Ming Yang, Rui Cai, Yida Wang, Jun Zhu, Lei ...
NAR
2002
132views more  NAR 2002»
13 years 7 months ago
euGenes: a eukaryote genome information system
euGenes is a genome information system and database that provides a common summary of eukaryote genes and genomes, at web site http://iubio.bio.indiana.edu/eugenes/. Seven popular...
Donald G. Gilbert
SIGMOD
2009
ACM
140views Database» more  SIGMOD 2009»
14 years 2 months ago
Robust web extraction: an approach based on a probabilistic tree-edit model
On script-generated web sites, many documents share common HTML tree structure, allowing wrappers to effectively extract information of interest. Of course, the scripts and thus ...
Nilesh N. Dalvi, Philip Bohannon, Fei Sha
WWW
2009
ACM
14 years 8 months ago
Learning to recognize reliable users and content in social media with coupled mutual reinforcement
Community Question Answering (CQA) has emerged as a popular forum for users to pose questions for other users to answer. Over the last few years, CQA portals such as Naver and Yah...
Jiang Bian, Yandong Liu, Ding Zhou, Eugene Agichte...
HCI
2009
13 years 5 months ago
User Reputation Evaluation Using Co-occurrence Feature and Collective Intelligence
It becomes more difficult to find valuable contents in the Web 2.0 environment since lots of inexperienced users provide many unorganized contents. In the previous researches, peop...
Jeong-Won Cha, Hyun-woo Lee, Yo-Sub Han, Laehyun K...