Sciweavers

SISAP
2008
IEEE
98views Data Mining» more  SISAP 2008»
14 years 3 months ago
On Reinsertions in M-tree
In this paper we introduce a new M-tree building method, utilizing the classic idea of forced reinsertions. In case a leaf is about to split, some distant objects are removed from...
Jakub Lokoc, Tomás Skopal
IPPS
2008
IEEE
14 years 3 months ago
Multi-threaded data mining of EDGAR CIKs (Central Index Keys) from ticker symbols
This paper describes how use the Java Swing HTMLEditorKit to perform multi-threaded web data mining on the EDGAR system (Electronic DataGathering, Analysis, and Retrieval system)....
Dougal A. Lyon
ICEIS
2008
IEEE
14 years 3 months ago
Internal Fraud Risk Reduction - Results of a Data Mining Case Study
Corporate fraud these days represents a huge cost to our economy. Academic literature already concentrated on how data mining techniques can be of value in the fight against frau...
Mieke Jans, Nadine Lybaert, Koen Vanhoof
ICDM
2008
IEEE
143views Data Mining» more  ICDM 2008»
14 years 3 months ago
Exploiting Data Semantics to Discover, Extract, and Model Web Sources
We describe DEIMOS, a system that automatically discovers and models new sources of information. The system exploits four core technologies developed by our group that makes an en...
José Luis Ambite, Craig A. Knoblock, Kristi...
ICDM
2008
IEEE
118views Data Mining» more  ICDM 2008»
14 years 3 months ago
Extension of Partitional Clustering Methods for Handling Mixed Data
Clustering is an active research topic in data mining and different methods have been proposed in the literature. Most of these methods are based on the use of a distance measure ...
Yosr Naïja, Salem Chakhar, Kaouthar Blibech, ...
ICDM
2008
IEEE
117views Data Mining» more  ICDM 2008»
14 years 3 months ago
Semantic Full-Text Search with ESTER: Scalable, Easy, Fast
We present a demo of ESTER, a search engine that combines the ease of use, speed and scalability of full-text search with the powerful semantic capabilities of ontologies. ESTER s...
Holger Bast, Fabian M. Suchanek, Ingmar Weber
ICDM
2008
IEEE
190views Data Mining» more  ICDM 2008»
14 years 3 months ago
Simultaneous Co-segmentation and Predictive Modeling for Large, Temporal Marketing Data
Several marketing problems involve prediction of customer purchase behavior and forecasting future preferences. We consider predictive modeling of large scale, bi-modal or multimo...
Meghana Deodhar, Joydeep Ghosh
ICDM
2008
IEEE
156views Data Mining» more  ICDM 2008»
14 years 3 months ago
Mining Allocating Patterns in One-Sum Weighted Items
An Association Rule (AR) is a common knowledge model in data mining that describes an implicative cooccurring relationship between two disjoint sets of binary-valued transaction d...
Yanbo J. Wang, Xinwei Zheng, Frans Coenen, Cindy Y...
ICDM
2008
IEEE
127views Data Mining» more  ICDM 2008»
14 years 3 months ago
Word Sense Discovery for Web Information Retrieval
Word meaning disambiguation has always been an important problem in many computer science tasks, such as information retrieval and extraction. One of the problems, faced in automa...
Tomasz Nykiel, Henryk Rybinski