Why

img-aligner

Align 32-bit linear images with grid warping optimization
Latest Release · Documentation · Report Bug

img-aligner lets you warp an image (called the base image) to look like another, already similar image (the target image).

Why

My personal motivation for this project was to perspective-align two or more photos with different exposure durations taken from the same scene and almost the same camera angle. This can be used in HDR fusion where we mix photos from the same scene with different exposure levels to get a clean, noiseless linear image that we can further work on (e.g. by applying a view transform, commonly called a tone mapper).

We can do this by using the darker images to capture details in the highlights (sky, headlights, etc.) since the image is dark enough to prevent overexposure and clipping, and we can use the brighter photos to get clean and noise-free shadows. Again, darker images prevent overexposure or clipping in bright areas, and brighter images prevent noise in dark areas. By combining them, we can get the best of both worlds. I usually do this with a custom shader node setup I've made in Blender.

HDR fusion requires both the scene and the camera to be fully stationary because there's a delay between each shot (not to mention motion blur in images with higher exposure times). Using a tripod can help, but it's not always an option. Sometimes, I just need to pull out my phone and, using Open Camera, press capture in the "Exposure Bracket" mode which will automatically take 5 RAW (DNG) images with varying exposure levels. Obviously, this introduces tiny movements simply from my hands.

Despite being tiny, these movements are still large enough to ruin the images for HDR fusion by introducing artifacts, so what I needed was a program that aligns these "Exposure Bracket" images. Adobe Photoshop has a similar feature, but it doesn't seem to work for images in linear color spaces with 32-bit floating-point data, and I don't wanna rely on paid Adobe software (or any Adobe software for that matter).

If you're curious, I use darktable to convert DNG files from my phone to OpenEXR images in Linear BT.709 while also applying corrections and minimal processing.

How

The program uses grid warping to distort the base image to match the target image. An evolution algorithm is used for optimization to minimize the "cost". To calculate the cost, we render the per-pixel logarithmic difference (between the warped base image and the target image) into what we call the difference image which we then downscale to a really tiny resolution like 8x6 (called the cost resolution) and store it in the cost image. The downscaling is done in a smooth way and samples all necessary pixels to avoid aliasing.

Finally, we find the maximum and average value in the pixels of the cost image. The average (called the average difference) becomes our cost value, which is what we're trying to minimize. The maximum (called the maximum local difference), however, is just there to make sure we don't introduce local differences while decreasing the average difference. If the cost resolution is 1x1, this will have no effect, but if the cost resolution is too high, it can slow down the optimization.

Apart from warping, a basic linear transform can be applied to the grid by the user. When optimization starts, it will jitter this transform around to potentially lower the cost before warp optimization.

The Algorithm

Here's what the algorithm looks like in every iteration:

Do one of the following based on the number of iterations.
- Generate a random grid transform jittered around the initial transform set by the user.
- Warp the grid vertices using a gaussian distribution with a random center, radius, direction, and strength. The ranges of the random values are calculated based on parameters (warp strength, grid resolution, etc.).
Recalculate the cost (average difference) and the maximum local difference (max value in the cost image).
If (cost > previous iteration's cost) or (max local diff. > initial max) then undo the warping.
Break the loop if stop conditions are met.

Note that every iteration, we compare the cost after warping to the cost before warping (stored in the previous iteration). However, for the maximum local difference, we only compare it to the initial maximum local difference that we got in the beginning, and not the one from the previous iteration.

Performance

For increased performance and efficiency, grid warping and cost calculation are performed at a lower resolution (called the intermediate resolution) on the graphics processing unit (GPU) using the Vulkan API.

Color Spaces & Image Formats

Unlike typical images you might see on the internet which can only store RGB (red, green, blue) values in the [0, 1] range, linear images allow any real number (even negative) for the RGB values in their pixels. img-aligner performs its calculations in a linear color space using 32-bit floating point values.

img-aligner supports the OpenEXR image format for linear images and PNG and JPEG for nonlinear images. All linear images are assumed to be in Linear BT.709 I-D65 (AKA Linear Rec. 709) or something similar like Linear BT.2020 I-E. Nonlinear images will go through an sRGB to Linear BT.709 conversion upon loading and the opposite when exporting.

img-aligner always assumes your display device uses the sRGB standard. If you're using a P3 or BT.2020 device, linear images that were originally intended to work in BT.709 might look overly vibrant on your display. This only affects how you view images and not how they're processed or stored in memory.

If you're curious, I really tried adding the OpenColorIO and OpenImageIO libraries for proper color management and image IO (like in RealBloom), but they were painfully hard to configure and build with CMake, and I got errors after errors.

How to Use (Tutorial)

This step-by-step tutorial contains most of what you need to know to get started with img-aligner.

Command Line Interface

To batch-process multiple images, you can call img-aligner from a terminal or another program with the --cli argument to enable command line mode. If no other arguments are provided, a help text will be printed.

img-aligner --cli

Note

Most command line arguments still have an effect in GUI mode. For example, you can use --gpu -1 to manually choose a physical device (GPU) at the start of the program, or --silent to disable logging. The help message explains every option and flag.

Check out demo/exposure-bracket-batch-processing for a number of exposure-bracketed images, a Python script that uses img-aligner's CLI to align those images, and a Blender file for fusing the aligned images into a single 32-bit linear image and minor post processing.

How to Run

You can find the latest release in Releases.

How It's Made

This project is written in C++20 with Visual Studio Code and uses mainly the following libraries.

Library	Used for
CLI11	Command line interface
GLFW	Window management
Dear ImGui	Graphical user interface
NFD Extended	Native file dialogs
beva	Vulkan wrapper
OpenEXR	Reading and writing OpenEXR images
stb_image	Reading images
GLM	Math
nlohmann/json	JSON serialization
fmt	String formatting

How to Build

This project uses CMake as its build system (if it works it works).

Tools

Make sure you've installed Git (version control), CMake, the Ninja build system, a text editor or an IDE (like Visual Studio Code), and proper C++ compilers.

On Windows, you can use MSYS2 which comes with the MinGW compilers and a whole lot of other useful tools and libraries.

Warning

You need the g++ compiler to build this project. This is mainly because the OpenEXR library doesn't compile with clang. The CMakeLists.txt already sets CMAKE_CXX_COMPILER to g++ so you just need to have g++ installed.

Building

Note

A basic knowledge of Linux commands is required (cd, mkdir, rm, relative paths, etc.).

Open up a terminal and switch to your development directory. On Windows, you can use Git Bash which is automatically installed when you install Git for Windows.
Clone the repository and switch the working directory to it.

git clone https://github.com/bean-mhm/img-aligner.git
cd img-aligner

Create a build directory and cd to it.

# delete if it already exists
rm -rf ./build

mkdir build
cd build

Generate CMake configuration files with Ninja.

# debug mode
cmake -G "Ninja" -DCMAKE_BUILD_TYPE=Debug ..

# or release mode
cmake -G "Ninja" -DCMAKE_BUILD_TYPE=Release ..

Make sure you have a stable internet connection so that unavailable packages can be fetched online. You only need to regenerate this in certain cases, like when you add or remove source files, modify CMakeLists.txt, or switch between debug and release builds.

Build & Run.

cmake --build .
./bin/img-aligner

Name		Name	Last commit message	Last commit date
Latest commit History 262 Commits
demo		demo
docs		docs
fonts		fonts
images		images
licenses		licenses
resources		resources
shaders		shaders
src		src
thirdparty		thirdparty
.gitignore		.gitignore
.gitmodules		.gitmodules
CMakeLists.txt		CMakeLists.txt
LICENSE.txt		LICENSE.txt
README.md		README.md
app.rc		app.rc
imgui.ini		imgui.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

img-aligner

Why

How

The Algorithm

Performance

Color Spaces & Image Formats

How to Use (Tutorial)

Command Line Interface

How to Run

How It's Made

How to Build

Tools

Building

About

Uh oh!

Releases 2

Contributors 2

Uh oh!

Languages

License

bean-mhm/img-aligner

Folders and files

Latest commit

History

Repository files navigation

img-aligner

Why

How

The Algorithm

Performance

Color Spaces & Image Formats

How to Use (Tutorial)

Command Line Interface

How to Run

How It's Made

How to Build

Tools

Building

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Contributors 2

Uh oh!

Languages