To study and implement Hierarchical Clustering for data grouping
Agglomerative Clustering
Single Linkage
- Calculate the Euclidean distance for the highlighted cell using the provided calculator (take X₁ & X₂ from the table). Next, click on the corresponding cell in the matrix and enter the calculated distance.
- Repeat the step 1 until you've completed the matrix. (Mirrored values will be automatically filled)
- Identify the two clusters with the smallest maximum pairwise distance by clicking on the MIN button.
- Click the MERGE button to combine the two clusters from step 3 into a single cluster & update the matrix by recalculating pairwise distances, considering the single linkage criterion
- Repeat step 3-4 until only one cluster remains.
- Now, click on the NEXT button to plot the dendrogram.
- Finally, click on the PLOT button.
Complete Linkage
- Enter a value for each attribute in the ATTRIBUTES section and then click the SUBMIT button to incorporate the values into the table.
- Repeat the above step until you've completed this process for 5 rows.
- Click on the NEXT button to navigate to the COMPUTE page.
- Calculate the Euclidean distance for the highlighted cell using the provided calculator (take X & Y from the table). Next, click on the corresponding cell in the matrix and enter the calculated distance.
- Repeat the step 4 until you've completed the matrix. (Mirrored values will be automatically filled)
- Find the minimum element in distance matrix by clicking on the MIN button.
- Now, click on the MERGE button. This will form clusters of elements corresponding to the minimum value and update the matrix by recalculating pairwise distances, considering the complete linkage criterion.
- Repeat step 6-7 until only one cluster remains.
- Now, click on the NEXT button to plot the dendrogram.
- Finally, click on the PLOT button.
Divisive Clustering
Select the number of nodes for your graph and click on the SUBMIT button.
- If nodes equal 4, follow the below steps:
-
Create the complete graph by adding edges between every pair of nodes.
- Enter the values for the source node, destination node, and the weight/cost of the edge. Click on the SUBMIT button.
- Repeat the above step until you have added edges between every pair of nodes.
- Finally, click on the NEXT button to initiate the computation phase for the divisive algorithm.
- Now, jump to step 2.
- If nodes equal 5, follow the below steps:
- Click on the START button.
- Now, click on the highlighted cell of the table (EDGE-COST) and enter the required value in the pop-up box.
- Repeat the above step for every cell of the table.
- Now, jump to step 2.
- If nodes equal 4, follow the below steps:
Compute the Minimum Spanning Tree (MST) using Kruskal's algo. for the given adjacency matrix by clicking on the DRAW MST button.
Enter the edge name from the sorted list of edge For example, 1-2 and then repeat this step until the MST is created.
Progress to the next step by clicking on the NEXT button to identify "Singleton Clusters".
Form a new cluster by breaking the link associated with the highest cost. Click on the BREAK button to proceed.
Click on the NEXT button to navigate to the CLUSTERS page.