Identification and characterization of subfamily-specific signatures in a large protein superfamily by a hidden Markov model app