Given thousands of MD snapshots, how do we divide them into clusters and pick representative snapshots for each cluster?
From the clustering, we need to know:
- the centroids
- weight
Defining distance
Pros:
- Fast
- Intuitive
- Can visually identify number of clusters
Cons:
- Use partial information
- Less accurate, may miss-place elements