Here we investigate an extension of a previously suggested "automatic amide I frequency selection procedure" where we introduce an additional criterion utilizing empirical knowledge on regions within the amide I band (1600-1700 cm-1 ) found to be particularly sensitive to protein secondary structure. We show that the genetic algorithm provides a solution with good protein secondary structure prediction accuracy. Based on an evaluation set of 13 protein infrared spectra from proteins not contained in the reference set, it is demonstrated that our method is capable of making good predictions for proteins it has never seen before during training. In the present study, where the genetic algorithm is guided towards a solution with a higher number of empirically determined, structure sensitive amide I frequencies selected, minor improvement in prediction accuracy for -helix and -sheet structure could be achieved compared to our previous study, where no such knowledge has been prov...
Joachim A. Hering, Peter R. Innocent, Parvez I. Ha