PatchMatch

Flowers at right bottom corner are removed using PatсhMatсh

The core PatchMatch algorithm quickly finds correspondences between small square regions (or patches) of an image. The algorithm can be used in various applications such as object removal from images, reshuffling or moving contents of images, or retargeting or changing aspect ratios of images, optical flow estimation, or stereo correspondence.

Algorithm

The goal of the algorithm is to find the patch correspondence by defining a nearest-neighbor field (NNF) as a function ${\displaystyle f:\mathbb {R} ^{2}\to \mathbb {R} ^{2}}$ of offsets, which is over all possible matches of patch (location of patch centers) in image A, for some distance function of two patches ${\displaystyle D}$. So, for a given patch coordinate ${\displaystyle a}$ in image ${\displaystyle A}$ and its corresponding nearest neighbor ${\displaystyle b}$ in image ${\displaystyle B}$, ${\displaystyle f(a)}$ is simply ${\displaystyle b-a}$. However, if we search for every point in image ${\displaystyle B}$, the work will be too hard to complete. So the following algorithm is done in a randomized approach in order to accelerate the calculation speed. The algorithm has three main components. Initially, the nearest-neighbor field is filled with either random offsets or some prior information. Next, an iterative update process is applied to the NNF, in which good patch offsets are propagated to adjacent pixels, followed by random search in the neighborhood of the best offset found so far. Independent of these three components, the algorithm also use a coarse-to-fine approach by building an image pyramid to obtain the better result.

Initialization

When initializing with random offsets, we use independent uniform samples across the full range of image ${\displaystyle B}$. This algorithm avoids using an initial guess from the previous level of the pyramid because in this way the algorithm can avoid being trapped in local minima[clarification needed].

Iteration

After initialization, the algorithm attempted to perform iterative process of improving the ${\displaystyle NNF}$. The iterations examine the offsets in scan order (from left to right, top to bottom), and each undergoes propagation followed by random search.

Propagation

We attempt to improve ${\displaystyle f(x,y)}$ using the known offsets of ${\displaystyle f(x-1,y)}$ and ${\displaystyle f(x,y-1)}$, assuming that the patch offsets are likely to be the same. That is, the algorithm will take new value for ${\displaystyle f(x,y)}$ to be ${\displaystyle \arg \min \limits _{(x,y)}{D(f(x,y)),D(f(x-1,y)),D(f(x,y-1))}}$. So if ${\displaystyle f(x,y)}$ has a correct mapping and is in a coherent region ${\displaystyle R}$, then all of ${\displaystyle R}$ below and to the right of ${\displaystyle f(x,y)}$ will be filled with the correct mapping. Alternatively, on even iterations, the algorithm search for different direction, fill the new value to be ${\displaystyle \arg \min \limits _{(x,y)}\{D(f(x,y)),D(f(x+1,y)),D(f(x,y+1))\}}$.

Random search

Let ${\displaystyle v_{0}=f(x,y)}$, we attempt to improve${\displaystyle f(x,y)}$ by testing a sequence of candidate offsets at an exponentially decreasing distance from ${\displaystyle v_{0}}$

${\displaystyle u_{i}=v_{0}+w\alpha ^{i}R_{i}}$

where ${\displaystyle R_{i}}$ is a uniform random in ${\displaystyle [-1,1]\times [-1,1]}$, ${\displaystyle w}$ is a large window search radius which will be set to maximum picture size, and ${\displaystyle \alpha }$ is a fixed ratio often assigned as 1/2. This part of the algorithm allows the ${\displaystyle f(x,y)}$ to jump out of local minimum through random process.

Halting criteria

The often used halting criteria is set the iteration times to be about 4~5. Even with low iteration, the algorithm works well.

Conclusion

This is an efficient algorithm since it only takes a few second on a testing computer with Intel Core i5 CPU and Photoshop CS5.