Connected Components & Small Region Merging

High-Level Algorithm

Labeling / connected-component flood-fill: iterate pixels, perform a breadth-first (queue) flood-fill whenever an unlabeled pixel is found. Two pixels are considered connected if they are 4-neighbours (up, down, left, right) and their RGBA values are exactly equal.

Center pixel (being flood-filled)

Other pixels (unlabeled neighbor candidates)

Other pixels (unlabeled non-neighbor candidates)

While flood-filling, Region metadata is collected (size, minX, maxX, minY, maxY) and labeled according to an index (labels).
After all components are labeled and regions metadata computed, iterate pixels again. For a pixel whose region is considered small (fails isBigEnough(minArea,minWidth,minHeight)), check its four immediate neighbors. If any neighbor belongs to a big region, copy that neighbor's RGBA into the small pixel (effectively assigning the small pixel to the big region; over time, the small region is consumed by bigger neighboring regions).

note

This merges only small-region pixels which are adjacent to large regions. The order of iteration means small pixels near large regions are captured first — the implementation stops at first qualifying neighbour.

Mathematical Background

Connected components

The algorithm computes connected components on a planar grid using 4-connectivity. Formally, we can describe the image as a function:

\begin{align*} I &: \mathbb{Z}^2 \to \mathcal{C} \\ (x, y) &\mapsto I(x, y) \end{align*}

important

$I$ is the image function.
$\mathbb{Z}^2$ is the set of all integer pairs $(x, y)$ representing pixel coordinates.
$\mathcal{C}$ is the set of all possible RGBA values: $\mathcal{C} = \{ (R, G, B, A) \mid R,G,B,A \in [0,255] \}$
$I(x, y) \in \mathcal{C}$ is the color of the pixel at coordinates $(x, y)$ .

In simple terms, each pixel at position $(x, y)$ has a color given by $I(x, y)$ . :::

Two pixels, $p=(x,y)$ and $q=(x',y')$ , are 4-adjacent if $|x-x'| + |y-y'| = 1$ . A connected component is a maximal set of pixels, $S$ , such that any two pixels in $S$ are connected by a path of 4-adjacent pixels with identical colors.

A flood-fill (BFS / DFS) computes these components exactly.

Bounding box & geometric heuristics

For each component we compute an axis-aligned bounding box with integer coordinates:

[minX,maxX]\times[minY,maxY]

The bounding-box width and height are:

\begin{align*} W &= maxX - minX + 1 \\ H &= maxY - minY + 1 \end{align*}

tip

See the source code in the Region struct to understand how this is used.

The area (component size) is simply the number of pixels in the component, $|S|$ . The heuristics used to classify small vs big regions rely on thresholds on both area and bounding box dimensions. This avoids keeping long thin noise (e.g. a long 1-pixel-wide arm) even if its area is above minArea.

Why 4-connectivity, not 8?

4-connectivity treats diagonally touching pixels as disconnected. This is stricter and avoids connecting components that only meet at a corner. Depending on the data, 8-connectivity (connect diagonally too) may be preferred — see the Variants & Improvements page.

What does merging mean here?

The code does not merge region graphs with union operations. Instead, it performs a pixel-wise recoloring of small-region pixels to the color of an adjacent big region. That has the practical effect of attaching each small pixel to a neighboring large region. This is a cheap and local merge — it won’t always choose the most semantically correct neighbor if multiple are present.

High-Level Algorithm​

Mathematical Background​

Connected components​

Bounding box & geometric heuristics​

Why 4-connectivity, not 8?​

What does merging mean here?​