Hierarchical genome assembly can be divided into three distinct stages: sequencing and assembling shotgun reads for each of a series of selected BAC clones; assembling the resulting fragments into BAC consensus sequences; and mapping and orienting the BAC consensus according to external positional information. We report a new approach for BAC consensus assembly that relies on iterative layouts of overlapping sequence, with no need for prior masking of repetitive sequence. The approach includes major steps of quality filtering and an iterative screening algorithm within and between clusters of overlapping BAC fragments. Each step includes numerous minor steps designed to detect false overlaps at minimal expense in true overlaps. In contrast to dynamic algorithms, our approach attempts to minimize false overlaps before attempting to form BAC consensus sequences. We show that false overlaps are reduced to a degree that final BAC consensus assembly is straightforward under a coordinate sy...
Daolong Wang, Mario Lauria, Bo Yuan, Fred A. Wrigh