Sciweavers

FCT
2001
Springer

Polynomial Time Algorithms for Finding Unordered Tree Patterns with Internal Variables

14 years 3 months ago
Polynomial Time Algorithms for Finding Unordered Tree Patterns with Internal Variables
Many documents such as Web documents or XML files have tree structures. A term tree is an unordered tree pattern consisting of internal variables and tree structures. In order to extract meaningful and hidden knowledge from such tree structured documents, we consider a minimal language (MINL) problem for term trees. The MINL problem for term trees is to find a term tree t such that the language generated by t is minimal among languages, generated by term trees, which contain all given tree structured data. Firstly, we show that the MINL problem for regular term trees is computable in polynomial time if the number of edge labels is infinite. Next, we show that the MINL problems with optimizing the size of an output term tree are NP-complete. Finally, in order to show that our polynomial time algorithm for the MINL problem can be applied to data mining from real-world Web documents, we show that regular term tree languages are polynomial time inductively inferable from positive data i...
Takayoshi Shoudai, Tomoyuki Uchida, Tetsuhiro Miya
Added 28 Jul 2010
Updated 28 Jul 2010
Type Conference
Year 2001
Where FCT
Authors Takayoshi Shoudai, Tomoyuki Uchida, Tetsuhiro Miyahara
Comments (0)