When publishing documents on the web, the user needs to describe and classify her documents for the benefit of later retrieval and use. This paper presents an approach to semantic document classification and retrieval based on Natural Language Processing and Conceptual Modeling. The Referent Model language is used in combination with a lexical analysis tool to define a controlled vocabulary for classifying and indexing documents. Documents are classified using simple sentences …. Classification is done by selecting the sentences that contain the highest-frequency words in the document that also occur in the domain model. These sentences are parsed using a DCG-like grammar, mapped into a Referent Model fragment, and stored along with the document in RDF-XML syntax. The model fragment represents the connection between the document and the domain model and serves as a document index. The approach is being implemented for a document collection published by the Norwegian Center for Medical Informatic...