Abstract. A fundamental problem in computational biology is the determination of the correct species tree for a set of taxa given a set of possibly contradictory gene trees. In recent literature, the Duplication Loss model has received considerable attention. Here one measures the similarity dissimilarity between a set of gene trees by counting the number of paralogous gene duplications and subsequent gene losses which need to be postulated in order to explain in an evolutionarily meaningful way how the gene trees could have arisen with respect to the species tree. Here we count the number of multiple gene duplication events duplication events in the genome of the organism involving one or more genes without regard to gene losses. Multiple Gene Duplication asks to nd the species tree S which requires the fewest number of multiple gene duplication events to be postulated in order to explain a set of gene trees G1;G2;:::;Gk. We also examine the related problem which assumes the species t...
Michael R. Fellows, Michael T. Hallett, Ulrike Ste