In the first line: "Dots are close to each other if they have a lot of common st...

HellsMaddy · on Dec 15, 2024

That explains why they are "close to each other" but not what determines which nodes are connected by an edge.

tux3 · on Dec 15, 2024

I think it's the other way around. The similarity metric determines which repos have edges (possibly weighted?)

And then some clustering algorithm makes sense of this giant graph by laying out sets of nodes that have a lot of edges to each other, close to each other

The closeness is just layout, the edges is the data structure that determines closeness.

anvaka · on Dec 16, 2024

This is correct!