Code clones in software increase maintenance cost and lower software quality. We have devised a new algorithm to detect duplicated parts of source code in large software. Our algo...
In the Linked Open Data cloud one of the largest data sets, comprising of 2.5 billion triples, is derived from the Life Science domain. Yet this represents a small fraction of the ...
agroXML is a standardized language for data exchange in agriculture. It is based on the eXtensible Markup Language (XML) using XML Schema as its definition language. agroXML is us...
Mario Schmitz, Daniel Martini, Martin Kunisch, Han...
This paper discusses several data mining algorithms and techniques that we have developed at the University of Arizona Artificial Intelligence Lab. We have implemented these algori...
Andrea Houston, Hsinchun Chen, Susan Molloy Hubbar...
Background: Feature selection techniques are critical to the analysis of high dimensional datasets. This is especially true in gene selection from microarray data which are common...
Pengyi Yang, Bing Bing Zhou, Zili Zhang, Albert Y....