Similarity measures between u and v are inverses of distance metrics.
Based on paths and random walks
. Matrix of scores, .
if the sum converges. Use \(A = U\EW U^{}\): EW decomposition, with ordered in descending order. \(\sum_{i=1}^{\infty} \gb^{i}A^{i} = U(\sum_{i=1}^{\infty} \gb^{i}\EW^{i})U^{}\) does not converge and multiplication by is not well defined. Condition for convergence: .
But, for the intuition of weighting longer paths less does not hold.
Similarly can use .
Truncated Katz usually used: : op instead of inverse finding.
; normed by stationary distribution: : to take care of skewing of hitting time due to large . - Commute time: ; stationary distr normed: .
Random walk can get lost in parts of graph away from u and v; so do random resets and return to u with probability a at each step.
A recursive definition: 2 nodes are similar to the extant that they are joined to similar nodes. .
Common neighbors: : same as taking inner product of rows in adj matrix M corresponding to u and v. (Jaccard): : pick a feature at random, see probability that it is a feature of both u and v. (Adamic/ Adar): : greater wt to rare features present in both.