Sciweavers

SDM
2007
SIAM
195views Data Mining» more  SDM 2007»
13 years 10 months ago
On Anonymization of String Data
String data is especially important in the privacy preserving data mining domain because most DNA and biological data is coded as strings. In this paper, we will discuss a new met...
Charu C. Aggarwal, Philip S. Yu
SDM
2007
SIAM
118views Data Mining» more  SDM 2007»
13 years 10 months ago
On Privacy-Preservation of Text and Sparse Binary Data with Sketches
In recent years, privacy preserving data mining has become very important because of the proliferation of large amounts of data on the internet. Many data sets are inherently high...
Charu C. Aggarwal, Philip S. Yu
SDM
2007
SIAM
133views Data Mining» more  SDM 2007»
13 years 10 months ago
On Point Sampling Versus Space Sampling for Dimensionality Reduction
In recent years, random projection has been used as a valuable tool for performing dimensionality reduction of high dimensional data. Starting with the seminal work of Johnson and...
Charu C. Aggarwal
SDM
2007
SIAM
73views Data Mining» more  SDM 2007»
13 years 10 months ago
Sketching Landscapes of Page Farms
The Web is a very large social network. It is important and interesting to understand the “ecology” of the Web: the general relations of Web pages to their environment. The un...
Bin Zhou 0002, Jian Pei
SDM
2007
SIAM
98views Data Mining» more  SDM 2007»
13 years 10 months ago
An incremental data-stream sketch using sparse random projections
We propose the use of random projections with a sparse matrix to maintain a sketch of a collection of high-dimensional data-streams that are updated asynchronously. This sketch al...
Aditya Krishna Menon, Gia Vinh Anh Pham, Sanjay Ch...
SDM
2007
SIAM
126views Data Mining» more  SDM 2007»
13 years 10 months ago
Scalable Name Disambiguation using Multi-level Graph Partition
When non-unique values are used as the identifier of entities, due to their homonym, confusion can occur. In particular, when (part of) “names” of entities are used as their ...
Byung-Won On, Dongwon Lee
SEMWIKI
2008
168views Data Mining» more  SEMWIKI 2008»
13 years 10 months ago
AceWiki: Collaborative Ontology Management in Controlled Natural Language
AceWiki is a prototype that shows how a semantic wiki using controlled natural language -- Attempto Controlled English (ACE) in our case -- can make ontology management easy for ev...
Tobias Kuhn
SEMWIKI
2008
154views Data Mining» more  SEMWIKI 2008»
13 years 10 months ago
Flyspeck in a Semantic Wiki
Abstract. Semantic wikis have been successfully applied to many problems in knowledge management and collaborative authoring. They are particularly appropriate for scientific and m...
Christoph Lange 0002, Sean McLaughlin, Florian Rab...
SEMWIKI
2008
156views Data Mining» more  SEMWIKI 2008»
13 years 10 months ago
A Generic Corporate Ontology Lifecycle
Abstract. Weaving the Semantic Web the research community is working on publishing publicly available data sources as RDF data on the Web. To facilitate the adoption of Semantic We...
Markus Luczak-Rösch, Ralf Heese
SEMWIKI
2008
140views Data Mining» more  SEMWIKI 2008»
13 years 10 months ago
Next-Generation Wikis: What Users Expect; How RDF Helps
Even though wikis helped start the web 2.0 phenomenon, they currently run the risk of becoming outdated. In order to find out what aspects of wikis will survive and how wikis might...
Axel Rauschmayer