Technical aspects - tahiri-lab/KMeansPhyloTreesClustering GitHub Wiki

Technical aspects of the project

⚙️ Technical Aspects

This section presents the core algorithms used in the project, from a general perspective to their application in bioinformatics, specifically for phylogenetic trees.

Robinson-Foulds (RF) Distance

K-means Clustering

Cluster Validity Indices


🔗 Summary

  • RF distance → measures similarity between trees
  • K-means → groups similar trees
  • CH / BH → select the optimal number of clusters

Together, they form a complete clustering pipeline.