Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

189

KDD
2004
ACM

164views Data Mining» more KDD 2004»

Cluster-based concept invention for statistical relational learning

16 years 7 months ago

Cluster-based concept invention for statistical relational learning

Download www.cs.umd.edu

We use clustering to derive new relations which augment database schema used in automatic generation of predictive features in statistical relational learning. Clustering improves scalability through dimensionality reduction. More importantly, entities derived from clusters increase the expressivity of feature spaces by creating new first-class concepts which contribute to the creation of new features. For example, in CiteSeer, papers can be clustered based on words or citations giving "topics", and authors can be clustered based on documents they co-author giving "communities". Such cluster-derived concepts become part of more complex feature expressions. Out of the large number of generated features, those which improve predictive accuracy are kept in the model, as decided by statistical feature selection criteria. We present results demonstrating improved accuracy and scalability when predicting publication venues using CiteSeer data.

Alexandrin Popescul, Lyle H. Ungar

Real-time Traffic

Complex Feature Expressions | Data Mining | KDD 2004 | Predictive Accuracy | Statistical Feature Selection |

claim paper

Related Content

» Statistical predicate invention

» Change of Representation for Statistical Relational Learning

» Kernel methods and the exponential family

» Learning taxonomic relations from a set of text documents

» Computational Lexicons the Neat Examples and the Odd Exemplars

» AxiomBased Feedback Cycle for Relation Extraction in Ontology Learning from Text

» A Statistical Learning Approach to Spatial Context Exploitation for Semantic Image Analysi...

» Learning from Uncertain Data

» Inductive Concept Retrieval and Query Answering with Semantic Knowledge Bases Through Kern...

Post Info
More Details (n/a)

Added	30 Nov 2009
Updated	30 Nov 2009
Type	Conference
Year	2004
Where	KDD
Authors	Alexandrin Popescul, Lyle H. Ungar

Comments (0)