Sciweavers

450 search results - page 82 / 90
» On Integrating Data Mining into Business Processes
Sort
View
BMCBI
2007
171views more  BMCBI 2007»
13 years 8 months ago
An open source infrastructure for managing knowledge and finding potential collaborators in a domain-specific subset of PubMed,
Background: Identifying relevant research in an ever-growing body of published literature is becoming increasingly difficult. Establishing domain-specific knowledge bases may be a...
Wei Yu, Ajay Yesupriya, Anja Wulf, Junfeng Qu, Mui...
PVLDB
2008
99views more  PVLDB 2008»
13 years 8 months ago
Industry-scale duplicate detection
Duplicate detection is the process of identifying multiple representations of a same real-world object in a data source. Duplicate detection is a problem of critical importance in...
Melanie Weis, Felix Naumann, Ulrich Jehle, Jens Lu...
JMLR
2010
125views more  JMLR 2010»
13 years 3 months ago
On utility of gene set signatures in gene expression-based cancer class prediction
Machine learning methods that can use additional knowledge in their inference process are central to the development of integrative bioinformatics. Inclusion of background knowled...
Minca Mramor, Marko Toplak, Gregor Leban, Tomaz Cu...
KDD
2010
ACM
259views Data Mining» more  KDD 2010»
14 years 13 days ago
A probabilistic model for personalized tag prediction
Social tagging systems have become increasingly popular for sharing and organizing web resources. Tag recommendation is a common feature of social tagging systems. Social tagging ...
Dawei Yin, Zhenzhen Xue, Liangjie Hong, Brian D. D...
SIGMOD
2007
ACM
160views Database» more  SIGMOD 2007»
14 years 8 months ago
Supporting ranking and clustering as generalized order-by and group-by
The Boolean semantics of SQL queries cannot adequately capture the "fuzzy" preferences and "soft" criteria required in non-traditional data retrieval applicati...
Chengkai Li, Min Wang, Lipyeow Lim, Haixun Wang, K...