The desynchronization approach combines a traditional synchronous specification style with a robust asynchronous implementation model. The main contribution of this paper is the description of two optimizations that decrease the overhead of desynchronization. First, we investigate the use of clustering to vary the granularity of desynchronization. Second, by applying temporal analysis on a formal execution model of the desynchronized design, we uncover significant amounts of timing slack. These methods are successfully applied to industrial RTL designs. Categories and Subject Descriptors J.6 [Computer-Aided Engineering]: Computer-aided design (CAD) General Terms Algorithms, Performance, Design, Experimentation Keywords Desynchronization, Separation Analysis, Clustering