Automatic medical coding of patient records via weighted ridge regression

In this paper, we apply weighted ridge regression to tackle the highly unbalanced data issue in automatic large-scale ICD-9 coding of medical patient records. Since most of the ICD-9 codes are unevenly represented in the medical records, a weighted scheme is employed to balance positive and negative examples. The weights turn out to be associated with the instance priors from a probabilistic interpretation, and an efficient EM algorithm is developed to automatically update both the weights and the regularization parameter. Experiments on a large-scale real patient database suggest that the weighted ridge regression outperforms the conventional ridge regression and linear support vector machines (SVM).
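
As a rough illustration of the weighting idea in the abstract, the sketch below solves a weighted ridge regression in closed form for a single binary code. It is not the paper's method: the abstract says the weights and the regularization parameter are updated by an EM algorithm, whose details are not given here. Instead, the hypothetical helper balancing_weights uses a simple class-balancing heuristic, and the function names, the toy data, and the fixed value of lam are all assumptions made for the example.

```python
import numpy as np


def weighted_ridge_fit(X, y, weights, lam):
    """Minimize sum_i weights[i] * (y[i] - X[i] @ w)**2 + lam * ||w||**2."""
    n_features = X.shape[1]
    # Normal equations of the weighted problem: (X^T A X + lam I) w = X^T A y,
    # where A is the diagonal matrix of per-record weights.
    gram = X.T @ (weights[:, None] * X) + lam * np.eye(n_features)
    rhs = X.T @ (weights * y)
    return np.linalg.solve(gram, rhs)


def balancing_weights(y):
    """Hypothetical heuristic (NOT the paper's EM update): give the positive
    and the negative class the same total weight, 0.5 each."""
    pos = y > 0
    return np.where(pos, 0.5 / pos.sum(), 0.5 / (~pos).sum())


# Toy data standing in for one rare code: 10 positives among 200 records.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = -np.ones(200)
y[:10] = 1.0
X[:10] += 1.0  # shift the positive records so the classes differ in mean

lam = 1.0  # assumed fixed here; the paper tunes it automatically
a = balancing_weights(y)
w_weighted = weighted_ridge_fit(X, y, a, lam)
w_plain = weighted_ridge_fit(X, y, np.ones_like(y), lam)  # conventional ridge
```

With all weights equal to one, the same function reduces to conventional ridge regression, which is the baseline the abstract compares against. The sketch omits an intercept term, and because the heuristic weights sum to one, the effective strength of lam differs between the weighted and the unweighted fit.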

Jian-Wu Xu, Shipeng Yu, Jinbo Bi, Lucian Vlad Lita, Radu Stefan Niculescu, R. Bharat Rao

Automatic Large-scale ICD-9 | Conventional Ridge Regression | ICMLA 2007 | Machine Learning | Weighted Ridge Regression

Post Info

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2007
Where	ICMLA
Authors	Jian-Wu Xu, Shipeng Yu, Jinbo Bi, Lucian Vlad Lita, Radu Stefan Niculescu, R. Bharat Rao

Sciweavers
