Sciweavers

8316 search results - page 51 / 1664
» Web Document Modeling
Sort
View
ICWE
2007
Springer
14 years 4 months ago
Fixing Weakly Annotated Web Data Using Relational Models
In this paper, we present a fast and scalable Bayesian model for improving weakly annotated data – which is typically generated by a (semi) automated information extraction (IE) ...
Fatih Gelgi, Srinivas Vadrevu, Hasan Davulcu
AIRS
2006
Springer
14 years 2 months ago
Learning to Separate Text Content and Style for Classification
Many text documents naturally have two kinds of labels. For example, we may label web pages from universities according to their categories, such as "student" or "fa...
Dell Zhang, Wee Sun Lee
MEDINFO
2007
13 years 11 months ago
Text Categorization Models for Identifying Unproven Cancer Treatments on the Web
The nature of the internet as a non-peer-reviewed (and more generally largely unregulated) publication medium has allowed wide-spread promotion of inaccurate and unproven medical ...
Yin Aphinyanaphongs, Constantin F. Aliferis
WWW
2008
ACM
14 years 11 months ago
Representing a web page as sets of named entities of multiple types: a model and some preliminary applications
As opposed to representing a document as a "bag of words" in most information retrieval applications, we propose a model of representing a web page as sets of named enti...
Nan Di, Conglei Yao, Mengcheng Duan, Jonathan J. H...
WSC
1997
13 years 11 months ago
Model-Driven Simulation of World-Wide-Web Cache Policies
The World Wide Web (WWW) has experienced a dramatic increase in popularity since 1993. Many reports indicate that its growth will continue at an exponential rate. This growth has ...
Ying Shi, Edward Watson, Ye-Sho Chen