Sciweavers

8316 search results - page 51 / 1664
» Web Document Modeling
Sort
View
114
Voted
ICWE
2007
Springer
15 years 8 months ago
Fixing Weakly Annotated Web Data Using Relational Models
In this paper, we present a fast and scalable Bayesian model for improving weakly annotated data – which is typically generated by a (semi) automated information extraction (IE) ...
Fatih Gelgi, Srinivas Vadrevu, Hasan Davulcu
AIRS
2006
Springer
15 years 6 months ago
Learning to Separate Text Content and Style for Classification
Many text documents naturally have two kinds of labels. For example, we may label web pages from universities according to their categories, such as "student" or "fa...
Dell Zhang, Wee Sun Lee
MEDINFO
2007
15 years 3 months ago
Text Categorization Models for Identifying Unproven Cancer Treatments on the Web
The nature of the internet as a non-peer-reviewed (and more generally largely unregulated) publication medium has allowed wide-spread promotion of inaccurate and unproven medical ...
Yin Aphinyanaphongs, Constantin F. Aliferis
WWW
2008
ACM
16 years 3 months ago
Representing a web page as sets of named entities of multiple types: a model and some preliminary applications
As opposed to representing a document as a "bag of words" in most information retrieval applications, we propose a model of representing a web page as sets of named enti...
Nan Di, Conglei Yao, Mengcheng Duan, Jonathan J. H...
119
Voted
WSC
1997
15 years 3 months ago
Model-Driven Simulation of World-Wide-Web Cache Policies
The World Wide Web (WWW) has experienced a dramatic increase in popularity since 1993. Many reports indicate that its growth will continue at an exponential rate. This growth has ...
Ying Shi, Edward Watson, Ye-Sho Chen