Information available in the Internet is frequently supplied simply as plain ascii text, structured according to orthographic and semantic conventions. Traditional document classi...
In this work we try to bridge the gap often encountered by researchers who find themselves with few or no labeled examples from their desired target domain, yet still have access ...
Abstract. Current text classification systems typically use term stems for representing document content. Semantic Web technologies allow the usage of features on a higher semantic...
Transfer learning addresses the problem of how to utilize plenty of labeled data in a source domain to solve related but different problems in a target domain, even when the train...
Mapping concepts from medical terminologies, such as the UMLS, to medical documents is a prerequisite for many tasks of (automatically) processing documents. Due to the nature of ...
Katharina Kaiser, Theresia Gschwandtner, Patrick M...