Prediction potential of candidate biomarker sets identified and validated on gene expression data from multiple datasets