Extracting scientific articles from a large digital archive: BioStor and the Biodiversity Heritage Library