Radiologists disagree with each other over the characteristics and features of what constitutes a normal mammogram and the terminology to use in the associated radiology report. Recently, the focus has been on classifying abnormal or suspicious reports, but even this process needs further layers of clustering and gradation, so that individual lesions can be more effectively classified. Using a genetic algorithm, the approach described here successfully learns phrase patterns for two distinct classes of radiology reports (normal and abnormal). These patterns can then be used as a basis for automatically analyzing, categorizing, clustering, or retrieving relevant radiology reports for the user. Categories and Subject Descriptors I.2.6 [Learning]: analogies, concept learning, connectionism and neural nets, induction, knowledge acquisition, language acquisition, parameter learning. H.3.3 [Information Search and Retrieval]: clustering, information filtering, query formulation, relevance fe...
Robert M. Patton, Thomas E. Potok, Barbara G. Beck