Biological sequences come in families
If we know some sequences belonging to a family, for a new query sequence we can directly try to get the distance score through pairwise searching.
you can search with either one sample from the family or
all the sequences in the family
Rather than this sort of method, a better one would be to look at the statistical features of all the sequences