Sciweavers

2030 search results - page 382 / 406
» Estimation of Chaotic Probabilities
Sort
View
SIGMOD
2009
ACM
140views Database» more  SIGMOD 2009»
14 years 3 months ago
Robust web extraction: an approach based on a probabilistic tree-edit model
On script-generated web sites, many documents share common HTML tree structure, allowing wrappers to effectively extract information of interest. Of course, the scripts and thus ...
Nilesh N. Dalvi, Philip Bohannon, Fei Sha
WSDM
2009
ACM
187views Data Mining» more  WSDM 2009»
14 years 3 months ago
Speeding up algorithms on compressed web graphs
A variety of lossless compression schemes have been proposed to reduce the storage requirements of web graphs. One successful approach is virtual node compression [7], in which of...
Chinmay Karande, Kumar Chellapilla, Reid Andersen
DEXAW
2009
IEEE
113views Database» more  DEXAW 2009»
14 years 3 months ago
Identification of Surgery Indicators by Mining Hospital Data: A Preliminary Study
—The management of patient referrals is an interesting issue when it comes to predicting future patient demand to increase hospital productivity. In general, a patient is referre...
Marie Persson, Niklas Lavesson
ICDM
2009
IEEE
164views Data Mining» more  ICDM 2009»
14 years 3 months ago
iTopicModel: Information Network-Integrated Topic Modeling
—Document networks, i.e., networks associated with text information, are becoming increasingly popular due to the ubiquity of Web documents, blogs, and various kinds of online da...
Yizhou Sun, Jiawei Han, Jing Gao, Yintao Yu
AIRS
2009
Springer
14 years 3 months ago
A Latent Dirichlet Framework for Relevance Modeling
Relevance-based language models operate by estimating the probabilities of observing words in documents relevant (or pseudo relevant) to a topic. However, these models assume that ...
Viet Ha-Thuc, Padmini Srinivasan