Hierarchical Clustering: Step 2
Next, we average the log-transformed expression levels for the clustered genes (in this case, B and D) and recalculate the similarity scores:
| | Gene A | Gene C | Gene [BD] |
|---|---|---|---|
| Gene A | 1 | −0.633 | 0.564 |
| Gene C | −0.633 | 1 | −0.305 |
| Gene [BD] | 0.564 | −0.305 | 1 |
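As a rough illustration of this step, the following Python sketch averages two expression profiles and then selects the most similar pair from the recalculated scores. The scores are copied from the table above; the function names `average_profiles` and `best_pair`, and the label-joining convention, are assumptions made for this example, not part of any particular clustering library.

```python
# Similarity scores from the table above. The matrix is symmetric,
# so each off-diagonal pair is stored only once.
scores = {
    ("A", "C"): -0.633,
    ("A", "BD"): 0.564,
    ("C", "BD"): -0.305,
}


def average_profiles(x, y):
    """Element-wise mean of two equal-length log-expression profiles (hypothetical helper)."""
    if len(x) != len(y):
        raise ValueError("profiles must have the same length")
    return [(a + b) / 2 for a, b in zip(x, y)]


def best_pair(pair_scores):
    """Return the pair of clusters with the highest similarity score (hypothetical helper)."""
    return max(pair_scores, key=pair_scores.get)


pair = best_pair(scores)
# Join the member letters of both clusters into one sorted label, e.g. "A" + "BD" -> "ABD".
merged_label = "".join(sorted("".join(pair)))
print(pair, scores[pair], merged_label)
```

In a fuller implementation, `average_profiles` would be applied to the profiles of the two clusters being merged, and the similarity scores against the remaining genes would then be recomputed before calling `best_pair` again.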
We pick the next highest score, A and [BD] (0.564), to form the cluster [ABD]. That leaves a single pair, [ABD] and C, so with only four genes we are done. With more genes, we would repeat the averaging and rescoring until only one pair remained. The end product is a dendrogram, a graphic representation of the clusters:
