A summarization approach for Affymetrix GeneChip data using a reference training set from a large, biologically diverse database