Learning taxonomic relations from a set of text documents

15 years 5 months ago

Download proceedings2010.imcsit.org

This paper presents a methodology for learning taxonomic relations from a set of documents that each explain one of the concepts. Three different feature extraction approaches with varying degree of language independence are compared in this study. The first feature extraction scheme is a languageindependent approach based on statistical keyphrase extraction, and the second one is based on a combination of rule-based stemming and fuzzy logic-based feature weighting and selection. The third approach is the traditional tf-idf weighting scheme with commonly used rule-based stemming. The concept hierarchy is obtained by combining Self-Organizing Map clustering with agglomerative hierarchical clustering. Experiments are conducted for both English and Finnish. The results show that concept hierarchies can be constructed automatically also by using statistical methods without heavy language-specific preprocessing.

Mari-Sanna Paukkeri, Alberto Pérez Garc&iac

Real-time Traffic

Feature Extraction | Feature Extraction Scheme | IMCSIT 2010 | Information Technology | Logic-based Feature Weighting |

claim paper

» Automated ontology construction for unstructured text documents

» AxiomBased Feedback Cycle for Relation Extraction in Ontology Learning from Text

» Combining Statistical Techniques and Lexicosyntactic Patterns for Semantic Relations Extra...

» Learning a Distance Metric from Relative Comparisons

» Learning GeneralizationSpecialization Relations between Concepts Application for Automati...

» Semisupervised Learning with WeaklyRelated Unlabeled Data Towards Better Text Categorizati...

» Mining protein function from text using termbased support vector machines

» Mining relational data from text From strictly supervised to weakly supervised learning

Post Info
More Details (n/a)

Added	13 Feb 2011
Updated	13 Feb 2011
Type	Journal
Year	2010
Where	IMCSIT
Authors	Mari-Sanna Paukkeri, Alberto Pérez García-Plaza, Sini Pessala, Timo Honkela

Comments (0)

Sciweavers

Learning taxonomic relations from a set of text documents

Feature Extraction | Feature Extraction Scheme | IMCSIT 2010 | Information Technology | Logic-based Feature Weighting |

Explore & Download

Productivity Tools

Sciweavers