Sciweavers

250 search results - page 19 / 50
» Clustering Documents Using a Wikipedia-Based Concept Represe...
Sort
View
BMCBI
2006
153views more  BMCBI 2006»
13 years 8 months ago
Automatic document classification of biological literature
Background: Document classification is a wide-spread problem with many applications, from organizing search engine snippets to spam filtering. We previously described Textpresso, ...
David Chen, Hans-Michael Müller, Paul W. Ster...
SIGIR
2008
ACM
13 years 8 months ago
Knowledge transformation from word space to document space
In most IR clustering problems, we directly cluster the documents, working in the document space, using cosine similarity between documents as the similarity measure. In many real...
Tao Li, Chris H. Q. Ding, Yi Zhang 0005, Bo Shao
SIGIR
2008
ACM
13 years 8 months ago
Enhancing text clustering by leveraging Wikipedia semantics
Most traditional text clustering methods are based on "bag of words" (BOW) representation based on frequency statistics in a set of documents. BOW, however, ignores the ...
Jian Hu, Lujun Fang, Yang Cao, Hua-Jun Zeng, Hua L...
AUSDM
2007
Springer
112views Data Mining» more  AUSDM 2007»
14 years 2 months ago
Measuring Data-Driven Ontology Changes using Text Mining
Most current ontology management systems concentrate on detecting usage-driven changes and representing changes formally in order to maintain the consistency. In this paper, we pr...
Majigsuren Enkhsaikhan, Wilson Wong, Wei Liu, Mark...
ICDE
2009
IEEE
114views Database» more  ICDE 2009»
14 years 10 months ago
XOntoRank: Ontology-Aware Search of Electronic Medical Records
As the use of Electronic Medical Records (EMRs) becomes more widespread, so does the need for effective information discovery within them. Recently proposed EMR standards are XML-b...
Fernando Farfán, Vagelis Hristidis, Anand R...