
How to save and load parameters of an augmentation pipeline

Reproducibility is crucial in deep learning. Data scientists and machine learning engineers need a way to save all parameters of a deep learning pipeline, such as the model, the optimizer, the input datasets, and the augmentation parameters, so that they can later recreate the same pipeline. Albumentations has built-in functionality to serialize augmentation parameters and save them to a file. You can then use those saved parameters to recreate an identical augmentation pipeline.

Import the required libraries

Python
import random

import numpy as np
import cv2
import matplotlib.pyplot as plt
import albumentations as A

Define the visualization function

Python
def visualize(image):
    plt.figure(figsize=(6, 6))
    plt.axis('off')
    plt.imshow(image)

Load an image from the disk

Python
image = cv2.imread('images/parrot.jpg')
image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)
visualize(image)


Define an augmentation pipeline that we want to serialize

Python
transform = A.Compose([
    A.Perspective(),
    A.RandomCrop(height=768, width=768),
    A.OneOf([
        A.RGBShift(),
        A.HueSaturationValue()
    ]),
])

We can pass a pipeline instance to the print function to see its string representation.

Python
print(transform)
Compose([
  Perspective(always_apply=False, p=0.5, scale=(0.05, 0.1), keep_size=True, pad_mode=0, pad_val=0, mask_pad_val=0, fit_output=False, interpolation=1),
  RandomCrop(always_apply=False, p=1.0, height=768, width=768),
  OneOf([
    RGBShift(always_apply=False, p=0.5, r_shift_limit=(-20, 20), g_shift_limit=(-20, 20), b_shift_limit=(-20, 20)),
    HueSaturationValue(always_apply=False, p=0.5, hue_shift_limit=(-20, 20), sat_shift_limit=(-30, 30), val_shift_limit=(-20, 20)),
  ], p=0.5),
], p=1.0, bbox_params=None, keypoint_params=None, additional_targets={}, is_check_shapes=True)

Next, we will fix the random seed to make augmentation reproducible for visualization purposes and augment an example image.

Python
random.seed(42)
np.random.seed(42)
transformed = transform(image=image)
visualize(transformed['image'])


Serializing an augmentation pipeline to a JSON or YAML file

To save the serialized representation of an augmentation pipeline to a JSON file, use the save function from Albumentations.

Python
A.save(transform, '/tmp/transform.json')

To load a serialized representation from a JSON file, use the load function from Albumentations.

Python
loaded_transform = A.load('/tmp/transform.json')
Python
print(loaded_transform)
Compose([
  Perspective(always_apply=False, p=0.5, scale=(0.05, 0.1), keep_size=True, pad_mode=0, pad_val=0, mask_pad_val=0, fit_output=False, interpolation=1),
  RandomCrop(always_apply=False, p=1.0, height=768, width=768),
  OneOf([
    RGBShift(always_apply=False, p=0.5, r_shift_limit=(-20, 20), g_shift_limit=(-20, 20), b_shift_limit=(-20, 20)),
    HueSaturationValue(always_apply=False, p=0.5, hue_shift_limit=(-20, 20), sat_shift_limit=(-30, 30), val_shift_limit=(-20, 20)),
  ], p=0.5),
], p=1.0, bbox_params=None, keypoint_params=None, additional_targets={}, is_check_shapes=True)

Next, we will use the same random seed as before and apply the loaded augmentation pipeline to the same image.

Python
random.seed(42)
np.random.seed(42)
transformed_from_loaded_transform = loaded_transform(image=image)
visualize(transformed_from_loaded_transform['image'])


Python
assert np.array_equal(transformed['image'], transformed_from_loaded_transform['image'])

As the assertion confirms, the loaded pipeline produced the same result.

Using YAML instead of JSON

You can also use YAML instead of JSON to serialize and deserialize augmentation pipelines. To do that, pass the data_format='yaml' argument to the save and load functions.

Python
A.save(transform, '/tmp/transform.yml', data_format='yaml')
loaded_transform = A.load('/tmp/transform.yml', data_format='yaml')
Python
print(loaded_transform)
Compose([
  Perspective(always_apply=False, p=0.5, scale=(0.05, 0.1), keep_size=True, pad_mode=0, pad_val=0, mask_pad_val=0, fit_output=False, interpolation=1),
  RandomCrop(always_apply=False, p=1.0, height=768, width=768),
  OneOf([
    RGBShift(always_apply=False, p=0.5, r_shift_limit=(-20, 20), g_shift_limit=(-20, 20), b_shift_limit=(-20, 20)),
    HueSaturationValue(always_apply=False, p=0.5, hue_shift_limit=(-20, 20), sat_shift_limit=(-30, 30), val_shift_limit=(-20, 20)),
  ], p=0.5),
], p=1.0, bbox_params=None, keypoint_params=None, additional_targets={}, is_check_shapes=True)

Serializing an augmentation pipeline to a Python dictionary

If you need more control over a serialized pipeline, e.g., you want to save it to a database or send it to a server, you can use the to_dict and from_dict functions. to_dict returns a Python dictionary that describes the pipeline. The dictionary contains only primitive data types: dictionaries, lists, strings, integers, and floats. To reconstruct a pipeline from such a dictionary, call from_dict.

Python
transform_dict = A.to_dict(transform)
loaded_transform = A.from_dict(transform_dict)
Python
print(loaded_transform)
Compose([
  Perspective(always_apply=False, p=0.5, scale=(0.05, 0.1), keep_size=True, pad_mode=0, pad_val=0, mask_pad_val=0, fit_output=False, interpolation=1),
  RandomCrop(always_apply=False, p=1.0, height=768, width=768),
  OneOf([
    RGBShift(always_apply=False, p=0.5, r_shift_limit=(-20, 20), g_shift_limit=(-20, 20), b_shift_limit=(-20, 20)),
    HueSaturationValue(always_apply=False, p=0.5, hue_shift_limit=(-20, 20), sat_shift_limit=(-30, 30), val_shift_limit=(-20, 20)),
  ], p=0.5),
], p=1.0, bbox_params=None, keypoint_params=None, additional_targets={}, is_check_shapes=True)
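Since the dictionary returned by to_dict contains only primitive types, it can be serialized with the standard json module and stored wherever a string fits, such as a database column or an HTTP request body. A minimal stdlib-only sketch, using a hand-written dictionary in the same shape as the one shown later in this tutorial (abridged for illustration):

```python
import json

# A dictionary in the shape that to_dict produces (abridged for illustration)
transform_dict = {
    "__version__": "1.4.0",
    "transform": {
        "__class_fullname__": "Compose",
        "p": 1.0,
        "transforms": [
            {"__class_fullname__": "RandomCrop", "p": 1.0, "height": 768, "width": 768},
        ],
    },
}

# Serialize to a JSON string, e.g. before writing it to a database
payload = json.dumps(transform_dict)

# Later, parse the string back into a dictionary and pass it to A.from_dict
restored = json.loads(payload)
assert restored == transform_dict
```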

Serializing and deserializing Lambda transforms

Lambda transforms use custom transformation functions provided by a user. For those types of transforms, Albumentations saves only the name and the position in the augmentation pipeline. To deserialize an augmentation pipeline with Lambda transforms, you need to manually provide all Lambda transform instances using the nonserializable argument.

Let's define a function that we will use to transform an image.

Python
def hflip_image(image, **kwargs):
    return cv2.flip(image, 1)
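
cv2.flip(image, 1) mirrors the image around the vertical axis. If you prefer to avoid the OpenCV dependency inside a custom function, plain NumPy slicing produces the same horizontal flip; a small sketch:

```python
import numpy as np

# A tiny 2x3 "image" to make the flip easy to inspect
image = np.array([[1, 2, 3],
                  [4, 5, 6]])

# Horizontal flip: reverse the column axis, equivalent to cv2.flip(image, 1)
flipped = image[:, ::-1]

# Flipping twice restores the original image
assert np.array_equal(flipped[:, ::-1], image)
```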

Next, we create a Lambda transform that will apply the hflip_image function to input images. Note that to make the transform serializable, you need to pass the name argument.

Python
hflip = A.Lambda(name='hflip_image', image=hflip_image, p=0.5)
transform = A.Compose([hflip])
print(transform)
Compose([
  Lambda(name='hflip_image', image=<function hflip_image at 0x2a7e34d30>, mask=<function noop at 0x13a7c0d30>, keypoint=<function noop at 0x13a7c0d30>, bbox=<function noop at 0x13a7c0d30>, always_apply=False, p=0.5),
], p=1.0, bbox_params=None, keypoint_params=None, additional_targets={}, is_check_shapes=True)

To check that the transform is working, we will apply it to an image.

Python
random.seed(7)
transformed = transform(image=image)
visualize(transformed['image'])


To serialize a pipeline with a Lambda transform, use the save function as before.

Python
A.save(transform, '/tmp/lambda_transform.json')

To deserialize a pipeline that contains Lambda transforms, you need to pass the names and instances of all Lambda transforms in the pipeline through the nonserializable argument.

Python
loaded_transform = A.load('/tmp/lambda_transform.json', nonserializable={'hflip_image': hflip})
Python
print(loaded_transform)
Compose([
  Lambda(name='hflip_image', image=<function hflip_image at 0x2a7e34d30>, mask=<function noop at 0x13a7c0d30>, keypoint=<function noop at 0x13a7c0d30>, bbox=<function noop at 0x13a7c0d30>, always_apply=False, p=0.5),
], p=1.0, bbox_params=None, keypoint_params=None, additional_targets={}, is_check_shapes=True)

Verify that the deserialized pipeline produces the same output.

Python
random.seed(7)
transformed_from_loaded_transform = loaded_transform(image=image)
assert np.array_equal(transformed['image'], transformed_from_loaded_transform['image'])

To serialize and deserialize pipelines with Lambda transforms to and from dictionaries, use to_dict and from_dict. Note that from_dict also requires the nonserializable argument.

Python
transform_dict = A.to_dict(transform)
print(transform_dict)
{'__version__': '1.4.0', 'transform': {'__class_fullname__': 'Compose', 'p': 1.0, 'transforms': [{'__class_fullname__': 'Lambda', '__name__': 'hflip_image'}], 'bbox_params': None, 'keypoint_params': None, 'additional_targets': {}, 'is_check_shapes': True}}
Python
loaded_transform = A.from_dict(transform_dict, nonserializable={'hflip_image': hflip})