The EMD Under Transformations : The Optimal Tranformation Problem

The Optimal Transformation Problem

The optimal transformation problem in step (2)

(2)

g^(k+1)

arg

min_{g in G}

WORK(F^(k),x,g(y)).

can be written explicitly in terms of the ground distance function d as

(5) arg min_{g in G} sum_{i=1..m, j=1..,n} f_ij d(x_i,g(y_j)),

where the flow F=(f_ij) is fixed. The table below shows some cases in which (5) can be solved.

Transformation Set G	g in G	Ground Distance d
translation	t	L₂, L₁, (L₂)²
Euclidean	(R,t)	(L₂)²
similarity	(s,R,t)	(L₂)²
linear	A	(L₂)²
affine	(A,t)	(L₂)²

If we let

[c₁ ... c_N]	=	[f₁₁ f₁₂ ... f_1n \| f₂₁ f₂₂ ... f_2n \| ... \| f_m1 f_m2 ... f_mn],
[a₁ ... a_N]	=	[x₁ x₁ ... x₁ \| x₂ x₂ ... x₂ \| ... \| x_m x_m ... x_m], and
[b₁ ... b_N]	=	[y₁ y₂ ... y_n \| y₁ y₂ ... y_n \| ... \| y₁ y₂ ... y_n],

where N=mn, then (5) can be rewritten as a single index summation

(6) min_{g in G} sum_r=1..N c_r d(a_r,g(b_r)),

In this form, the optimal transformation problem asks for the transformation of one point set which minimizes a sum of weighted distances to corresponding points in another set.

We briefly consider only the easiest case listed in the above table : d=(L₂)² and G=T, the group of translations. In this case, (6) becomes

(7) min_{t in T} sum_r=1..N c_r ||a_r-(b_r+t)||².

It is well known (and easily proven using standard calculus) that the unique optimal translation in the least squares problem (7) is the translation that lines up the centroids of the weighted point sets {(c₁,a₁), ..., (c_N,a_N)} and {(c₁,b₁), ..., (c_N,b_N)} :

t^*

centroid(c,a) - centroid(c,b)

where

centroid(c,a)	=	sum_r=1..N c_r a_r / c^S,
centroid(c,b)	=	sum_r=1..N c_r b_r / c^S, and
c^S	=	sum_r=1..N c_r.

In terms of the original flow variables f_ij and distributions x and y, this solution becomes

(8) t^* = sum_{i=1..m, j=1..n} f_ij x_i / min(w^S,u^S) - sum_{i=1..m, j=1..n} f_ij y_j / min(w^S,u^S),

where we have used the fact that sum_{i=1..m, j=1..n} f_ij = min(w^S,u^S) for any feasible flow F between x and y. In the next section, we shall discuss an interesting property of this solution when the distributions x and y have the same total weight.

top Title, Table of Contents, The EMD
prev A Convergent Iteration
next Two Specific Cases

The ideas and results contained in this document are part of my thesis, which will be published as a Stanford computer science technical report in June 1999.

S. Cohen. Finding Color and Shape Patterns in Images. Thesis Technical Report STAN-CS-TR-99-?. To be published June 1999.

Email comments to scohen@cs.stanford.edu.

top	Title, Table of Contents, The EMD
prev	A Convergent Iteration
next	Two Specific Cases