Gene and protein names follow few, if any, true naming conventions and are subject to great variation in different occurrences of the same name. This gives rise to two important p...
The Protein Data Bank (PDB) is the world-wide repository of macromolecular structure information. We present a series of databases that run parallel to the PDB. Each database hold...
Robbie P. Joosten, Tim A. H. te Beek, Elmar Kriege...
Background: Nonnegative matrix factorization (NMF) is a feature extraction method that has the property of intuitive part-based representation of the original features. This uniqu...
A core area of phonology is the study of phonotactics, or how sounds are linearly combined. Recent cross-linguistic analyses have shown that the phonology determines not only phon...
Background: With the exponential increase in genomic sequence data there is a need to develop automated approaches to deducing the biological functions of novel sequences with hig...