Abstract— In previous work, we have shown how an evolutionary algorithm with a clustered population can be used to concurrently discover multiple regulatory motifs present within the promoter sequences of co-expressed genes. In this paper, we extend the algorithm by co-evolving a population of Boolean classification rules in parallel with the motif population. Results using synthetic data suggest that this approach allows poorly conserved motifs to be identified in promoter sequences a magnitude longer than using population clustering alone, whilst results using muscle-specific data suggest the algorithm is able to evolve meaningful sequence classifiers in parallel with motifs.
Michael A. Lones, Andy M. Tyrrell