Sciweavers

960 search results - page 81 / 192
» There's no Data like More Data
Sort
View
ICML
2005
IEEE
14 years 10 months ago
Modeling word burstiness using the Dirichlet distribution
Multinomial distributions are often used to model text documents. However, they do not capture well the phenomenon that words in a document tend to appear in bursts: if a word app...
Rasmus Elsborg Madsen, David Kauchak, Charles Elka...
WWW
2006
ACM
14 years 9 months ago
Evaluating structural summaries as access methods for XML
Structural summaries are data structures that preserve all structural features of XML documents in a compact form. We investigate the applicability of the most popular summaries a...
Mirella Moura Moro, Zografoula Vagena, Vassilis J....
ICSE
2004
IEEE-ACM
14 years 9 months ago
Mining Version Histories to Guide Software Changes
We apply data mining to version histories in order to guide programmers along related changes: "Programmers who changed these functions also changed...." Given a set of e...
Andreas Zeller, Peter Weißgerber, Stephan Di...
ADC
2007
Springer
108views Database» more  ADC 2007»
14 years 3 months ago
Distributed Text Retrieval From Overlapping Collections
In standard text retrieval systems, the documents are gathered and indexed on a single server. In distributed information retrieval (DIR), the documents are held in multiple colle...
Milad Shokouhi, Justin Zobel, Yaniv Bernstein
SETN
2004
Springer
14 years 2 months ago
A Meta-classifier Approach for Medical Diagnosis
Abstract. Single classifiers, such as Neural Networks, Support Vector Machines, Decision Trees and other, can be used to perform classification of data for relatively simple proble...
George L. Tsirogiannis, Dimitrios S. Frossyniotis,...