Clustering of Large-Scale Protein Datasets