Vishvas's notes

Intro

Graph G=(V,E)
E annotated with weights. If G unweighted, all weights are 1.

Vertex properties

Degree of v: $d e g (i) = \sum_{j} e_{i, j}$ . Neighbors of v: $Γ (v)$ .

Measure size of $A {V}$ : $| A |$ or $v o l (A) = \sum_{i \in A} d e g (i)$ .

Associated objects

Special vertex sets

Node cover of G=(V,E): subset of V which touches all e in E.

Proper vertex coloring; chromatic number.

Independent set of verteces: don’t share edge: a clique in $\bar{G}$ .

Walks

A sequence of edges $(e_{i})$ such that $e_{i}$ shares an end-point with $e_{i + 1}$ .

A walk may have a cycle. If it does not have a cycle, you have a path.

Also see random walks.

Alternating walks are defined and studied in combinatorial optimization problems over graphs; in these sequences even and odd edges are colored differently.

Cut

k-way cut: $c u t ((V_{i}))$ is a partitioning of V into k parts. Cutsets is a set of edges which, when removed from G, divides V into k partitions.

Weight of a cut

Weight of a cut is the sum of weights of edges in the cutset.

Minimum 2-way cut is a cut with the minimum weight. This is useful in partitioning nodes in a graph.

Subgraphs

A general subgraph: G’=(V’, E’). Subgraph induced by $A {V}$ . Connected components of a graph. Clique: a complete subgraph.

Spanning trees

Spannning tree. MST spanning a certain node set N: Aka Steiner tree. MST spanning with support over node set N: Aka Group Steiner tree.

Subtypes

Multigraphs: multiple edges allowed.

Tree

G sans cycle. Forest F: set of trees.

Biconnected graph

2 paths between any node pair.

Based on degree

d-regular G: $\forall v : | N (v) | = d$ . Complete graph $K_{n}$ .

Perfect graph G

Chromatic number = size of the largest clique in G.

A graph is perfect iff its complement is perfect.

Bipartite graph

E = cutset. Complete bipartite graph: All v in A has edge to all u in B. Complete bipartite graph $K_{i, j}$ . Can have A vs B adj matrix M. Thence get usual adj matrix for G: $[\begin{matrix} 0 & M \M^{T} & 0 \end{matrix}]$ .

Chordal graph G

Aka triangulated graph. Every cycle in G has a chord. G has a junction tree iff it is chordal.

Planar graphs

(Kuratowski) G planar iff $K_{5}$ and $K_{3, 3}$ are in G.

Directred graph, networks

Digraph or directed graph. Networks: digraph with edge weights.

Directed acyclic graphs (DAG)

Very useful in designing algorithms: can do recursion easily.

Topological numbering t of nodes

Number nodes so that if there is a path from u to v, then $p (u) < p (v)$ . Cyclic graphs don’t have a topological numbering.

Singly connected directed graph

Useful in probabilistic graphical models.

A tree underlies the graph: only 1 undirected path between any node pair.

Generalizations

Hypergraphs

Edges connect k-sets of verices, not just pairs.

Properties

Hop plot

For h, let g(h) be number of node pairs with path $\leq h$ . Hop plot plots this.

Diameter

Diameter of G. q effective diameter: q fraction of $V \times V$ have path length $\leq d$ .

Single source (s) shortest paths

Consider a weighted graph with weight $e (a, b)$ between two nodes. Suppose that you want to find the shortest path to every vertex $v \in V$ from $s$ .

One can use a bottom-up programming approach. (Dijkstra) Start with current node $c = s$ . Initially set $d (s) = 0$ and, $\forall v \neq s : d (v) = \infty$ .

Define $f (c)$ : For every $v \in Γ (c)$ , do the update: $d_{t + 1} (v) := min (d_{t} (c) + e (c, v), d_{t} (v))$ . Also, simultaneously update the ‘backpointer’ to point to $c$ (representing the optimal subsolution) if necessary.

Do $f (c) \forall v \neq s$ .

Time: $O (n^{2})$ in case of a complete graph.

Associated matreces

Edge incidence matrix J ( $m \times n$ ); for weighted G: edge $e_{i, j}$ terminii (i, j) are marked $\pm \sqrt{e_{i, j}}$ .

Vertex incidence/ adjacency matrix W: presence of edge $e_{i, j}$ indicated by weight at $W_{i, j}, W_{j, i}$ .

Connectivity matrix $A^{\infty}$ .

Degree matrix: D = diag(deg(i)).

Graph Laplacian of no-self-loop undirected graph

L = (D-W). $L = J J^{T}$ . As L is $| V | * | V |$ , it is an operator on the functions with domain V.

+ve semidefiniteness

L is symmetric, +ve semidefinite: $x^{T} L x = x^{T} J J^{T} x$ . So $λ \geq 0$ .

Smoothness of vectors from quadratic form

$x^{T} L x = x^{T} D x - x^{T} W x = \sum_{i, j} W_{i, j} (x_{i}^{2} - x_{j} x_{i}) = 2^{- 1} \sum_{i, j} W_{i, j} (x_{i} - x_{j})^{2} = \sum_{e_{i, j} \in E} e_{i, j} (x_{i} - x_{j})^{2}$ . This is a measure of the degree of oscillations/ smoothness among x, where edges occur.

Eigenvectors

$L 1 = 01, ∴ λ_{1} (L) = 0$ ; so L singular.

If G has c connected components: $λ_{1} = . . λ_{c} = 0$ : construct ev with 1 in the appropriate spots!

Smoothness of ev

Take ev x. Then $x^{T} L x = \sum_{e_{i, j} \in E} e_{i, j} (x_{i} - x_{j})^{2}$ measures smoothness of x where edges occur in the graph. But, ew are stationary points of $R (x) = x^{T} L x / (x^{T} x)$ . So, ev corresponding to lower ew tend to be smoother.

Smoothening functions

Consider the subspace spanned by the p bottom (smooth) ev. Any function on V, ie a $| V |$ -dim vector can be projected on to this subspace in order to smoothen it according to the graph structure. Labelling of nodes is an example of such a function.

Applications

This property is useful when using ev x for classification of nodes - one doesn’t want neighboring nodes to have disparate values in x. This is useful in both spectral clustering and label propogation in semisupervised learning.

Normalized graph Laplacian

Take $N = I - D^{- 1 / 2} W D^{- 1 / 2}$ : this is the normalized version, $D^{- 1 / 2} L D^{- 1 / 2}$ of L, using the normalized adjacency matrix $D^{- 1 / 2} W D^{- 1 / 2}$ .

$N ⪰ 0$ as $L ⪰ 0$ , ie $x^{T} D x - x^{T} W x \geq 0 \forall x$ : taking $D^{1 / 2} x = y$ , see that $\forall y : y^{T} N y \geq 0$ .

Normalized adjacency matrix has norm 1.

As $y^{T} y - y^{T} D^{- 1 / 2} W D^{- 1 / 2} y \geq 0 $, s e e t h a t $ 1 \geq {∥ D^{- 1 / 2} W D^{- 1 / 2} ∥}_{2} $; A l s o, u s i n g $ y = D^{1 / 2} 1$ , get \ ${∥ D^{- 1 / 2} W D^{- 1 / 2} ∥}_{2} = 1$ .

Another way to see this: $D^{- 1 / 2} W D^{- 1 / 2}$ is obtained by a similarity transformation to $D^{- 1} W$ , which has ew in the range [-1, 1] due to Gerschgorin thm, and which has $σ_{m} a x = | λ_{m} a x | = 1$ using the ev 1.

Quadratic form: Normalized smoothness measure

$y^{T} N y = y^{T} D^{- 1 / 2} L D^{- 1 / 2} y = \sum_{(i, j) \in E} W_{i j} (\frac{y_{i}}{\sqrt{d_{i i}}} - \frac{y_{j}}{\sqrt{d_{j j}}})^{2} $: f r o m t h e f o r m $ x^{T} L x $b e i n g a s m o o t h n e s s m e a s u r e . P u n i s h e s d e v i a t i o n b e t w e e n $ {y_{i}}$ corresponding to edges emanating from high degree vertices less.

Expanders

A sparse graph with high connectivity properties. Connectivity quantified as edge expansion or vertex expansion. Let $E^{'} (S)$ : edges with exactly one end point in S.

Edge expansion of G

Aka isoparametric number. \ $h (G) = min_{1 \leq | S | \leq n / 2} \frac{| E^{'} (S) |}{| S |}$ .

$α$ vertex expansion of G

$g_{α} (G) = min_{1 \leq | S | \leq n α} \frac{| Γ (S) |}{| S |}$ .

Other Examples

Most graphs are expanders. $K_{n}$ has good expansion properties, but it is not sparse.

Random graphs

Common Varieties: $G_{n, p}, G_{n, | E |}$

G(n,p)

A static model: every edge is present or absent independent of other edges. p controls edge density. As $n \to \infty d i a m (G) \to 2$ . Size of the largest cluster is $O (\log n)$ .