Sciweavers

1082 search results - page 158 / 217
» Applying Pattern Mining to Web Information Extraction
Sort
View
KDD
2010
ACM
250views Data Mining» more  KDD 2010»
13 years 10 months ago
On community outliers and their efficient detection in information networks
Linked or networked data are ubiquitous in many applications. Examples include web data or hypertext documents connected via hyperlinks, social networks or user profiles connected...
Jing Gao, Feng Liang, Wei Fan, Chi Wang, Yizhou Su...
ICDM
2009
IEEE
164views Data Mining» more  ICDM 2009»
14 years 3 months ago
iTopicModel: Information Network-Integrated Topic Modeling
—Document networks, i.e., networks associated with text information, are becoming increasingly popular due to the ubiquity of Web documents, blogs, and various kinds of online da...
Yizhou Sun, Jiawei Han, Jing Gao, Yintao Yu
KDD
2008
ACM
183views Data Mining» more  KDD 2008»
14 years 9 months ago
De-duping URLs via rewrite rules
A large fraction of the URLs on the web contain duplicate (or near-duplicate) content. De-duping URLs is an extremely important problem for search engines, since all the principal...
Anirban Dasgupta, Ravi Kumar, Amit Sasturkar
ALMOB
2007
74views more  ALMOB 2007»
13 years 9 months ago
Evaluating deterministic motif significance measures in protein databases
Background: Assessing the outcome of motif mining algorithms is an essential task, as the number of reported motifs can be very large. Significance measures play a central role in...
Pedro Gabriel Ferreira, Paulo J. Azevedo
KDD
2003
ACM
133views Data Mining» more  KDD 2003»
14 years 9 months ago
Interactive Analysis of Gene Interactions Using Graphical gaussian model
DNA microarray provides a powerful basis for analysis of gene expression. Data mining methods such as clustering have been widely applied to microarray data to link genes that sho...
Xintao Wu, Yong Ye, Kalpathi R. Subramanian