Explore Courses Blog Tutorials Interview Questions
0 votes
in Machine Learning by (19k points)

How can I plot a dendrogram right on top of a matrix of values, reordered appropriately to reflect the clustering, in Python?

I use scipy.cluster.dendrogram to make my dendrogram and perform hierarchical clustering on a matrix of data. How can I then plot the data as a matrix where the rows have been reordered to reflect a clustering induced by the cutting the dendrogram at a particular threshold, and have the dendrogram plotted alongside the matrix? I know how to plot the dendrogram in scipy, but not how to plot the intensity matrix of data with the right scale bar next to it.

Any help on this would be greatly appreciated.

1 Answer

0 votes
by (33.1k points)

A Dendrogram is a type of Hierarchical clustering that illustrates the arrangement of the clusters produced by the corresponding analyses.

Code for Dendrogram implementation using SciPy:

import scipy

import pylab

import scipy.cluster.hierarchy as sch

from scipy.spatial.distance import squareform

# Generate random features and distance matrix.

x = scipy.rand(40)

D = scipy.zeros([40,40])

for i in range(40):

    for j in range(40):

        D[i,j] = abs(x[i] - x[j])

condensedD = squareform(D)

# Compute and plot first dendrogram

fig = pylab.figure(figsize=(8,8))

ax1 = fig.add_axes([0.09,0.1,0.2,0.6])

Y = sch.linkage(condensedD, method='centroid')

Z1 = sch.dendrogram(Y, orientation='left')



# Compute and plot second dendrogram.

ax2 = fig.add_axes([0.3,0.71,0.6,0.2])

Y = sch.linkage(condensedD, method='single')

Z2 = sch.dendrogram(Y)



# Plot distance matrix.

axmatrix = fig.add_axes([0.3,0.1,0.6,0.6])

idx1 = Z1['leaves']

idx2 = Z2['leaves']

D = D[idx1,:]

D = D[:,idx2]

im = axmatrix.matshow(D, aspect='auto', origin='lower',



# Plot colorbar.

axcolor = fig.add_axes([0.91,0.1,0.02,0.6])

pylab.colorbar(im, cax=axcolor)




Hope this answer helps.

If you wish to Learn more about Python, Visit this Python Tutorial.

Browse Categories