TY - JOUR
T1 - Approximation and parameterized algorithms for common subtrees and edit distance between unordered trees
AU - Akutsu, Tatsuya
AU - Fukagawa, Daiji
AU - Halldórsson, Magnús M.
AU - Takasu, Atsuhiro
AU - Tanaka, Keisuke
PY - 2013/1/28
Y1 - 2013/1/28
N2 - Given two rooted, labeled, unordered trees, the common subtree problem is to find a bijective matching between subsets of nodes of the trees of maximum cardinality which preserves labels and ancestry relationship. The tree edit distance problem is to determine the least cost sequence of insertions, deletions and substitutions that converts a tree into another given tree. Both problems are known to be hard to approximate within some constant factor in general. We tackle these problems from two perspectives: giving exact algorithms, either for special cases or in terms of some parameters; and approximation algorithms and hardness of approximation. We present a parameterized algorithm in terms of the number of branching nodes that solves both problems and yields polynomial algorithms for several special classes of trees. This is complemented with a tighter APX-hardness proof that holds when the trees are of height one and two, respectively. Furthermore, we present the first approximation algorithms for both problems. In particular, for the common subtree problem for t trees, we present an algorithm achieving a tlog 2(bOPT+1) ratio, where bOPT is the number of branching nodes in the optimal solution. We also present constant factor approximation algorithms for both problems in the case of bounded height trees.
AB - Given two rooted, labeled, unordered trees, the common subtree problem is to find a bijective matching between subsets of nodes of the trees of maximum cardinality which preserves labels and ancestry relationship. The tree edit distance problem is to determine the least cost sequence of insertions, deletions and substitutions that converts a tree into another given tree. Both problems are known to be hard to approximate within some constant factor in general. We tackle these problems from two perspectives: giving exact algorithms, either for special cases or in terms of some parameters; and approximation algorithms and hardness of approximation. We present a parameterized algorithm in terms of the number of branching nodes that solves both problems and yields polynomial algorithms for several special classes of trees. This is complemented with a tighter APX-hardness proof that holds when the trees are of height one and two, respectively. Furthermore, we present the first approximation algorithms for both problems. In particular, for the common subtree problem for t trees, we present an algorithm achieving a tlog 2(bOPT+1) ratio, where bOPT is the number of branching nodes in the optimal solution. We also present constant factor approximation algorithms for both problems in the case of bounded height trees.
KW - Approximation algorithms
KW - Dynamic programming
KW - Parameterized algorithms
KW - Tree edit distance
KW - Unordered trees
UR - https://www.scopus.com/pages/publications/84872007061
U2 - 10.1016/j.tcs.2012.11.017
DO - 10.1016/j.tcs.2012.11.017
M3 - Article
SN - 0304-3975
VL - 470
SP - 10
EP - 22
JO - Theoretical Computer Science
JF - Theoretical Computer Science
ER -