Sciweavers

DRR
2008

Hybrid approach combining contextual and statistical information for identifying MEDLINE citation terms

Demand is strong for automated tools that extract pertinent information from the biomedical literature, a rich, complex, and rapidly growing resource that is increasingly accessed via the web. This paper presents a hybrid method, based on contextual and statistical information, that automatically identifies two MEDLINE citation terms, NIH grant numbers and databank accession numbers, in HTML-formatted online biomedical documents. Detecting these terms is challenging because their formats vary widely and inconsistently (although recommended formats exist) and because they resemble other technical or biological terms. The proposed method first extracts potential candidates for these terms using a rule-based method. The candidates are then scored, and the final candidates are submitted to a human operator for verification. The confidence score for each term is calculated from statistical, morphological, and contextual information. Experiments c...
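The candidate-extraction and scoring steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the regular expression, the cue-word list, the context window, and the score weights are all assumptions chosen for illustration, and real NIH grant numbers vary far more than this pattern allows.

```python
import re

# Hypothetical pattern for NIH grant numbers such as "R01 CA123456"
# or "5R01GM012345-02". Assumed for illustration only.
GRANT_RE = re.compile(r"\b\d?[A-Z]\d{2}[ -]?[A-Z]{2}[ -]?\d{6}(?:-\d{2})?\b")

# Hypothetical contextual cue words that tend to precede a grant number.
CUE_WORDS = ("grant", "supported", "funded", "nih")


def find_candidates(text, window=80):
    """Return (candidate, score) pairs for grant-like strings in text.

    Rule-based step: the regex proposes candidates.
    Scoring step: a candidate gets a base score of 0.5 for matching the
    pattern, plus 0.5 if a cue word appears in the preceding window.
    The weights are illustrative, not taken from the paper.
    """
    results = []
    for match in GRANT_RE.finditer(text):
        start = max(0, match.start() - window)
        context = text[start:match.start()].lower()
        has_cue = any(word in context for word in CUE_WORDS)
        score = 0.5 + (0.5 if has_cue else 0.0)
        results.append((match.group(0), score))
    return results


if __name__ == "__main__":
    for sample in (
        "Supported by NIH grant R01 CA123456.",
        "Plasmid B12 XY654321 was used.",
    ):
        print(find_candidates(sample))
```

In this sketch a string that only looks like a grant number scores lower than one preceded by funding-related wording, which mirrors the idea of combining pattern rules with context before handing candidates to a human operator.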
In-Cheol Kim, Daniel X. Le, George R. Thoma
Added 29 Oct 2010
Updated 29 Oct 2010
Type Conference
Year 2008
Where DRR
Authors In-Cheol Kim, Daniel X. Le, George R. Thoma