Agglomerative Clustering Communities
An important enterprise, after detecting a major overlap of work and subsequent redundant effort, has begun analyzing intra-organizational collaboration to impose a more coherent structure across its teams. You, as an avid network scientist, have decided to collect data from ten teams, labeled A through J, forming an undirected link between any pair of teams that share at least one substantive project dependency, documented code module, or operational handoff.
With the built graph, you could then find which teams have the most in common, modeling possible cooperations as communities within this graph. After testing the main agglomerative clustering algorithms, you have decided on the following partition, leading to the communities {A, B, C, D, E, G} and {F, I, J, H}, obtained by analysing the resulting dendogram.
The chosen similarity scores between teams u and v were simply 1 if there was a link between them, and 0 otherwise.
a) By choosing 0/1 similarity and the average linkage, stable communities could have been formed. This is evidenced by the strong communities produced when the distance is cut to 2.0.
b) The chosen cut is the only one from the set of all possible partitions induced by cuts in the dendrogram that produces strong communities.
c) The clustering algorithm that produced this dendrogram cannot have used a complete linkage, as I and J would have always been in different communities.
d) An agglomerative clustering with 0/1 similarity and single linkage could never produce this dendrogram.
e) None of the above
Original idea by: Daniel Gardin

Comments
Post a Comment