Homology-based method for identification of protein repeats using statistical significance estimates