Stay updated

Pipelines: Composing Multiple Augmentations 🔗

While individual Transforms are useful, data augmentation often involves applying a sequence of different operations. Albumentations makes this easy using Pipelines.

A pipeline, defined using albumentations.Compose, chains multiple transforms together. When the pipeline is called, it applies the contained transforms sequentially to the input data.

Quick Reference 🔗

For experienced users, here are the key concepts covered in this guide:

Basic Pipeline Creation: A.Compose([transform1, transform2, ...])
Dynamic Modification: pipeline + transform, transform + pipeline, pipeline - transform
Composition Utilities: OneOf, SomeOf, Sequential, RandomOrder, etc.
Parameter Configuration: bbox_params, keypoint_params, additional_targets, seed, etc.

Defining a Simple Pipeline 🔗

You define a pipeline by passing a list of instantiated transforms to A.Compose.

import albumentations as A
import cv2
import numpy as np

# Assume 'image' is loaded as a NumPy array (e.g., 100x100x3)
image = np.random.randint(0, 256, (100, 100, 3), dtype=np.uint8) # Dummy image

# 1. Define the pipeline
pipeline = A.Compose([
    A.HorizontalFlip(p=0.5), # 50% chance to flip
    A.RandomBrightnessContrast(p=0.8), # 80% chance to adjust brightness/contrast
    A.GaussianBlur(p=0.3), # 30% chance to blur
])

# 2. Apply the pipeline
transformed_data = pipeline(image=image)
transformed_image = transformed_data['image']

print(f"Original shape: {image.shape}, Transformed shape: {transformed_image.shape}")
# Note: Shape usually remains the same unless a spatial transform like Resize is used.

Compose Parameters 🔗

While the transforms list is the only mandatory argument, A.Compose accepts several other parameters to configure its behavior, especially when dealing with multiple data targets or needing fine-grained control:

compose = A.Compose(
    transforms,
    bbox_params=None,
    keypoint_params=None,
    additional_targets=None,
    p=1.0,
    is_check_shapes=True,
    strict=False,
    mask_interpolation=None,
    seed=None,
    save_applied_params=False,
)

Core Parameters 🔗

transforms (list): (Required) The list of augmentation instances to apply sequentially.
p (float): The probability that the entire composition of transforms is applied. If a random check (< p) fails, the input data is returned unchanged. Note this differs from the individual p of transforms inside the list. Default: 1.0 (always apply the sequence).

Target-Specific Parameters 🔗

bbox_params (A.BboxParams | dict | None): Configuration for handling bounding boxes. If provided, the pipeline can accept a bboxes argument during the call. See the Object Detection Guide for details. Default: None.
keypoint_params (A.KeypointParams | dict | None): Configuration for handling keypoints. If provided, the pipeline can accept a keypoints argument during the call. See the Keypoint Augmentation Guide for details. Default: None.
additional_targets (dict[str, str] | None): Defines how to handle additional input arrays beyond the standard image, mask, etc. Maps a custom input name (e.g., 'image2') to a standard target type ('image' or 'mask'). See the Using Additional Targets Guide for details. Default: None.

Validation and Quality Control 🔗

is_check_shapes (bool): If True, performs shape consistency checks for all input arrays (image, mask, masks, volume, mask3d, etc.) before applying transforms.

Disable (False) only if you're certain about data consistency and need maximum speed, as it can catch potential errors early. Default: True.
strict (bool): If True, enables strict validation mode during the transform call:
1. Checks that all keyword arguments passed (e.g., image=..., mask=..., my_target=...) are recognized (either standard targets or defined in additional_targets).
2. Validates that transforms are not called with unsupported arguments (though this is less common with the standard structure).
3. Raises a ValueError if any validation fails.
If False, unknown arguments are ignored. Useful for debugging pipeline configuration. Default: False.

Advanced Configuration 🔗

mask_interpolation (int | None): If set to an OpenCV interpolation flag (e.g., cv2.INTER_LINEAR), this value overrides the interpolation method used for masks in all applicable geometric transforms within the pipeline. If None, each transform uses its default mask interpolation (usually cv2.INTER_NEAREST). Default: None.
seed (int | None): Controls the reproducibility of random augmentations within this specific Compose instance. When provided, this sets the random seed for all transforms in the pipeline, ensuring that the same sequence of random parameters is generated each time the pipeline is called with identical inputs. Default: None (no fixed seed).

When seed is set (int):
- Creates a fixed internal random state
- Two Compose instances with the same seed and transforms will produce identical sequences of augmentations
- Each call to the same Compose instance still produces random augmentations, but these sequences are reproducible
When seed is None (default):
- Generates a new internal random state on each Compose creation
- Different Compose instances will produce different sequences of augmentations
Important: Albumentations uses its own internal random state generators (self.py_random, self.random_generator), completely independent from global seeds (random.seed(), np.random.seed()). Setting global seeds has no effect on augmentations. See the Creating Custom Transforms Guide for how to use the seeded generators in custom transforms. Default: None.
save_applied_params (bool): If True, after processing each transform, Compose saves the actual parameters used by that transform in the result dictionary under the key applied_params. This includes both static parameters and any random values that were sampled. Useful for debugging or when you need to know exactly what augmentations were applied. Default: False.

Grayscale Image Preprocessing 🔗

Compose includes automatic preprocessing for grayscale images to ensure compatibility with all transforms:

Transform requirement: All individual transforms in Albumentations require grayscale images to have a channel dimension (H, W, 1)
Compose convenience: Automatically handles both (H, W) and (H, W, 1) formats
Channel dimension handling: If you pass a grayscale image without a channel dimension (e.g., shape (H, W)), Compose automatically adds one during preprocessing, making it (H, W, 1)
Postprocessing cleanup: If a channel dimension was added, it's automatically removed in the output, maintaining the original format
Transparent to users: This happens automatically - you can pass grayscale images in either format

This preprocessing ensures that all transforms work consistently regardless of whether you provide grayscale images with or without an explicit channel dimension. However, if you use transforms directly without Compose, you must ensure grayscale images have the channel dimension. See the Grayscale Image Handling section for more details.

Dynamic Pipeline Modification 🔗

Albumentations supports dynamic pipeline modification after initialization using mathematical operators. This allows you to add or remove transforms from existing pipelines without recreating them from scratch. All operations preserve the original pipeline's parameters (such as bbox_params, keypoint_params, additional_targets, etc.) and return new instances without modifying the original pipeline.

Adding Transforms to Pipelines 🔗

Use the + operator to add transforms to the end of a pipeline, or use a transform before + to add to the beginning:

import albumentations as A

# Create a base pipeline
base_pipeline = A.Compose([
    A.RandomCrop(height=256, width=256),
    A.HorizontalFlip(p=0.5),
], bbox_params=A.BboxParams(format='pascal_voc', label_fields=['labels']))

# Add a single transform to the end
extended_pipeline = base_pipeline + A.RandomBrightnessContrast(p=0.3)
# Result: [RandomCrop, HorizontalFlip, RandomBrightnessContrast]

# Add multiple transforms to the end
extended_pipeline = base_pipeline + [A.GaussianBlur(p=0.2), A.Rotate(limit=15)]
# Result: [RandomCrop, HorizontalFlip, GaussianBlur, Rotate]

# Add transforms to the beginning
extended_pipeline = A.Resize(height=512, width=512) + base_pipeline
# Result: [Resize, RandomCrop, HorizontalFlip]

# Add multiple transforms to the beginning
extended_pipeline = [A.Resize(height=512, width=512), A.Normalize()] + base_pipeline
# Result: [Resize, Normalize, RandomCrop, HorizontalFlip]

# All bbox_params and other settings are preserved in the new pipeline
print(f"bbox_params preserved: {extended_pipeline.processors.get('bboxes') is not None}")

Removing Transforms from Pipelines 🔗

Use the - operator to remove specific transform instances from a pipeline:

import albumentations as A

# Create transform instances
flip_transform = A.HorizontalFlip(p=0.5)
brightness_transform = A.RandomBrightnessContrast(p=0.3)
blur_transform = A.GaussianBlur(p=0.2)

# Create pipeline with these specific instances
pipeline = A.Compose([
    A.RandomCrop(height=256, width=256),
    flip_transform,
    brightness_transform,
    blur_transform,
])

# Remove transforms by class type
reduced_pipeline = pipeline - A.HorizontalFlip
# Result: [RandomCrop, RandomBrightnessContrast, GaussianBlur]

# Note: Removal works by class type, parameters don't matter
another_flip = A.HorizontalFlip(p=0.8)  # Different parameters
pipeline_with_another = A.Compose([A.RandomCrop(224, 224), another_flip])
reduced_again = pipeline_with_another - A.HorizontalFlip  # Works! Class type matches

# Chain removal operations
further_reduced = reduced_pipeline - A.RandomBrightnessContrast - A.GaussianBlur

⚠️ Important: Class-Based Removal

The - operator removes transforms by class type, not by instance object. This means:

✅ Pass the transform class (e.g., A.HorizontalFlip) to remove any instance of that class
✅ No need to store transform instances - just use the class name
⚠️ Only the first occurrence of that class type is removed if multiple exist

# ✅ CORRECT: Remove by class type
pipeline = A.Compose([A.Resize(224, 224), A.HorizontalFlip(p=0.5)])
modified = pipeline - A.HorizontalFlip  # Works! Removes the HorizontalFlip transform

# ✅ WORKS FROM COMPLEX PIPELINES: Remove from anywhere in the pipeline
complex_pipeline = A.Compose([
    A.RandomCrop(256, 256),
    A.HorizontalFlip(p=0.5),
    A.RandomBrightnessContrast(p=0.3),
    A.GaussianBlur(p=0.2)
])
without_flip = complex_pipeline - A.HorizontalFlip  # Removes HorizontalFlip from middle
# Result: [RandomCrop, RandomBrightnessContrast, GaussianBlur]

# ⚠️ Multiple instances: Only first occurrence removed
pipeline = A.Compose([
    A.HorizontalFlip(p=0.3),
    A.VerticalFlip(p=0.5),
    A.HorizontalFlip(p=0.8)
])
modified = pipeline - A.HorizontalFlip  # Removes first HorizontalFlip (p=0.3)
# Result: [VerticalFlip(p=0.5), HorizontalFlip(p=0.8)]

Important Notes about Pipeline Modification 🔗

Class-Based Removal: The - operator removes transforms based on class type using type(transform) is TargetClass. The specific parameters or instance identity don't matter.
Parameter Preservation: All pipeline modification operations preserve the original configuration including bbox_params, keypoint_params, additional_targets, seed, and other settings.
New Instance Creation: All operators (+, -) return new Compose instances without modifying the original pipeline. This ensures thread safety and prevents unintended side effects.
Type Restrictions: You can only add or remove BasicTransform instances (individual transforms), not BaseCompose instances (composition classes like OneOf, SomeOf, etc.).
Error Handling:
- Adding non-BasicTransform instances raises TypeError
- Removing a transform not found in the pipeline raises ValueError: Transform not found
- Removing non-BasicTransform instances raises TypeError
Random State Inheritance: New pipeline instances inherit the random state from the original pipeline, ensuring consistent behavior.

Practical Examples 🔗

Building Pipelines Incrementally:

import albumentations as A

# Start with basic geometric transforms
geometric_pipeline = A.Compose([
    A.RandomCrop(height=256, width=256),
    A.HorizontalFlip(p=0.5),
])

# Add color augmentations for training
training_pipeline = geometric_pipeline + [
    A.RandomBrightnessContrast(p=0.3),
    A.HueSaturationValue(p=0.3),
]

# Add noise and blur for robustness testing
robust_pipeline = training_pipeline + [
    A.GaussNoise(p=0.2),
    A.GaussianBlur(p=0.2),
]

Conditional Pipeline Construction:

import albumentations as A

# Base pipeline
pipeline = A.Compose([A.RandomCrop(height=256, width=256)])

# Add transforms based on conditions
if use_geometric_augmentations:
    pipeline = pipeline + [A.HorizontalFlip(p=0.5), A.Rotate(limit=15)]

if use_color_augmentations:
    pipeline = pipeline + A.RandomBrightnessContrast(p=0.3)

if add_noise:
    pipeline = pipeline + A.GaussNoise(p=0.1)

Transform Replacement:

import albumentations as A

# Original pipeline with specific transform instance
old_blur = A.GaussianBlur(blur_limit=3, p=0.2)
pipeline = A.Compose([
    A.RandomCrop(height=256, width=256),
    A.HorizontalFlip(p=0.5),
    old_blur,
])

# Replace the blur transform with a different configuration
new_blur = A.GaussianBlur(blur_limit=7, p=0.4)
updated_pipeline = pipeline - old_blur + new_blur

How Probabilities Work in Pipelines 🔗

When a pipeline created with A.Compose is called:

It iterates through the transforms in the list in the order they were provided.
For each transform in the list, its individual p value is checked.
If the random check for a specific transform passes (based on its p), that transform is applied to the current state of the data (which might have been modified by previous transforms in the pipeline).
If the check fails, that specific transform is skipped, and the pipeline moves to the next one.

This means that on any given call to the pipeline, a different subset of the defined transforms might actually be applied, controlled by their individual probabilities.

Important: A.Compose itself does not have a top-level p parameter to control whether the entire pipeline runs or not. It always executes and iterates through its contained transforms, letting their individual p values determine if they activate.

For grouping transforms under a single probability, use Sequential instead.

Advanced Composition Utilities 🔗

Beyond the basic sequential application in Compose, Albumentations offers several utilities to build more complex pipelines with conditional or altered execution flow.

`OneOf`: Apply Exactly One Transform 🔗

A.OneOf takes a list of transforms and applies exactly one of them, chosen randomly based on their individual p values (which are normalized within the OneOf block). The OneOf block itself also has a p parameter determining if any transform within it gets applied.

import albumentations as A

pipeline = A.Compose([
    # Apply either GaussianBlur or MotionBlur, but not both.
    # The choice between them depends on their p values (here, equal chance).
    # This entire OneOf block has a 90% chance of executing.
    A.OneOf([
        A.GaussianBlur(p=1.0), # p=1.0 within OneOf means it's considered
        A.MotionBlur(p=1.0),   # p=1.0 within OneOf means it's considered
    ], p=0.9), # 90% chance to apply *one* of the above
    A.HorizontalFlip(p=0.5)
])

# When pipeline(image=...) is called:
# - 90% chance: Either GaussianBlur OR MotionBlur is applied (50/50 chance each).
# - 10% chance: Neither GaussianBlur nor MotionBlur is applied.
# - Independently, 50% chance HorizontalFlip is applied.

`SomeOf`: Apply a Random Subset of Transforms 🔗

A.SomeOf takes a list of transforms and a number n. If the SomeOf block itself is activated (based on its main p value), it randomly selects n transforms from the list and attempts to apply them sequentially.

Key Features:

Number of transforms (n): This can be a fixed integer or a tuple (min_n, max_n). If it's a tuple, the number of transforms selected will be a random integer chosen uniformly from the range [min_n, max_n] (inclusive) on each call.
Selection Method (replace): By default (replace=False), transforms are selected without replacement. If replace=True, transforms are selected with replacement.
Uniform Selection: Transforms are selected uniformly at random from the list - each transform has an equal chance of being chosen, regardless of their individual p values.
Main Probability (p): Controls the probability that the SomeOf block executes at all. If this check fails, none of the transforms inside are considered.
Individual Probabilities: Crucially, after SomeOf selects the transforms, it attempts to apply each one. Each selected transform is only applied if its own individual p value passes. The p values inside the list are not ignored after selection.

import albumentations as A

pipeline = A.Compose([
    # Attempt to apply between 1 and 3 of the following transforms (inclusive).
    # Selection is without replacement (default).
    # This SomeOf block has a 90% chance of being entered.
    A.SomeOf([
        A.GaussianBlur(p=0.5), # If selected, 50% chance to apply
        A.RandomBrightnessContrast(p=1.0), # If selected, 100% chance to apply
        A.HueSaturationValue(p=0.8), # If selected, 80% chance to apply
        A.Sharpen(p=0.3), # If selected, 30% chance to apply
    ], n=(1, 3), p=0.9), # Select 1 to 3 transforms, 90% of the time
    A.HorizontalFlip(p=0.5)
])

# When pipeline(image=...) is called:
# - 90% chance:
#     1. SomeOf selects N transforms (N between 1 and 3) randomly from the list.
#     2. It then iterates through these N selected transforms.
#     3. For each selected transform, it applies it *only if* that transform's individual `p` value passes its random check.
# - 10% chance: None of the transforms in SomeOf are considered.
# - Independently, 50% chance HorizontalFlip is applied.

`OneOrOther`: Apply One of Two Transforms/Blocks 🔗

A.OneOrOther applies either its first transform (or composed block) or its second transform (or composed block), chosen randomly based on their p values. It provides a clearer structure for simple A/B choices compared to OneOf with two elements.

import albumentations as A

pipeline = A.Compose([
    # 50% chance apply first (Flip), 50% chance apply second (Rotate)
    A.OneOrOther(
        first=A.HorizontalFlip(p=1.0), # p=1.0 means always apply if chosen
        second=A.Rotate(limit=30, p=1.0), # p=1.0 means always apply if chosen
        p=0.5 # Probability of applying the `first` transform
    ),
    A.RandomBrightnessContrast(p=0.8)
])

# When pipeline(image=...) is called:
# - 50% chance HorizontalFlip is applied.
# - 50% chance Rotate is applied.
# - Independently, 80% chance RandomBrightnessContrast is applied.

`RandomOrder`: Apply Transforms in Random Sequence 🔗

A.RandomOrder takes a list of transforms and applies them sequentially, but shuffles the order of application randomly on each call. The RandomOrder block itself does not have a p parameter; it always executes and shuffles its children.

import albumentations as A

pipeline = A.Compose([
    # Apply Flip and BrightnessContrast, but in a random order each time.
    A.RandomOrder([
        A.HorizontalFlip(p=0.5),
        A.RandomBrightnessContrast(p=0.8),
    ]),
    A.GaussianBlur(p=0.3)
])

# When pipeline(image=...) is called:
# - Order 1: Flip (if p=0.5 passes) -> BrightnessContrast (if p=0.8 passes)
# - Order 2: BrightnessContrast (if p=0.8 passes) -> Flip (if p=0.5 passes)
# The order is chosen randomly.
# - Independently, 30% chance GaussianBlur is applied *after* the RandomOrder block.

`SelectiveChannelTransform`: Apply Transforms to Specific Channels 🔗

A.SelectiveChannelTransform applies its contained transforms only to specific channels of the input image. You specify the channels (by index) to affect.

A common use case is working with multi-channel data like RGBA images where you might want to apply certain augmentations only to the RGB channels (indices 0, 1, 2) and leave the Alpha channel (index 3) untouched.

import albumentations as A
import numpy as np

# Assume a 4-channel image (RGBA)
image_rgba = np.random.randint(0, 256, (100, 100, 4), dtype=np.uint8)

# Apply GaussianBlur only to the first three channels (RGB)
pipeline = A.Compose([
    A.SelectiveChannelTransform(
        A.GaussianBlur(p=1.0), # Transform to apply
        channels=[0, 1, 2], # Apply only to RGB channels
        p=1.0 # Probability of applying this selective transform
    ),
    A.HorizontalFlip(p=0.5) # Applied to all channels
])

transformed_data = pipeline(image=image_rgba)
transformed_image = transformed_data['image']

# When pipeline(image=...) is called:
# - GaussianBlur is applied *only* to the R, G, B channels.
# - Independently, 50% chance the *entire* RGBA image is flipped.

`Sequential`: Group Transforms with a Single Probability 🔗

A.Sequential applies a list of transforms sequentially, just like A.Compose. The key difference is that Sequential itself has a top-level p parameter. If the random check for Sequential's p fails, none of the transforms inside it are applied.

This contrasts with A.Compose, which always executes and iterates through its contained transforms, letting their individual p values determine if they activate. Compose also handles the setup for applying augmentations consistently across multiple targets (images, masks, bounding boxes), while Sequential is a simpler wrapper primarily for grouping transforms under a single probability.

Use Sequential when you want to treat a whole sequence of operations as a single augmentation block that either runs entirely or not at all, based on one probability value.

import albumentations as A

pipeline = A.Compose([
    # This whole sequence runs only 40% of the time.
    A.Sequential([
        A.HorizontalFlip(p=1.0), # Always apply *if* Sequential runs
        A.RandomBrightnessContrast(p=1.0) # Always apply *if* Sequential runs
    ], p=0.4), # 40% chance to run the Flip -> BrightnessContrast sequence
    A.GaussNoise(p=0.5)
])

# When pipeline(image=...) is called:
# - 40% chance: HorizontalFlip is applied, then RandomBrightnessContrast is applied.
# - 60% chance: Neither Flip nor BrightnessContrast is applied.
# - Independently, 50% chance GaussNoise is applied.

Nested Compositions 🔗

The real power of these composition utilities (OneOf, SomeOf, Sequential, etc.) comes from nesting them within each other or within a main A.Compose block. This allows you to build highly customized and complex augmentation pipelines.

Here's an example combining several concepts:

import albumentations as A
import cv2
import numpy as np

image = np.random.randint(0, 256, (100, 100, 3), dtype=np.uint8)

complex_pipeline = A.Compose([
    # --- Block 1: Basic Transform ---
    A.HorizontalFlip(p=0.5), # Applied 50% of the time

    # --- Block 2: Choose ONE of two sequences ---
    A.OneOf([
        # Sequence A: Blur then Sharpen (runs as a unit if chosen)
        A.Sequential([
            A.GaussianBlur(p=1.0), # Always applies if Sequence A is chosen
            A.Sharpen(p=1.0)       # Always applies if Sequence A is chosen
        ], p=1.0), # p=1.0 ensures it's considered by OneOf

        # Sequence B: Emboss (runs if chosen)
        A.Emboss(p=1.0) # Always applies if Sequence B is chosen

    ], p=0.8), # 80% chance to run EITHER Sequence A OR Sequence B (50/50 split)

    # --- Block 3: Apply SOME of these color transforms ---
    A.SomeOf([
        A.RandomBrightnessContrast(p=0.7), # If selected, 70% chance to apply
        A.HueSaturationValue(p=0.6),       # If selected, 60% chance to apply
        A.ColorJitter(p=0.5)               # If selected, 50% chance to apply
    ], n=(1, 2), p=0.9), # 90% chance to select 1 or 2, then apply based on their p

    # --- Block 4: Noise, maybe ---
    A.GaussNoise(p=0.2) # Applied 20% of the time, independently
])

# Applying the complex pipeline:
transformed_data = complex_pipeline(image=image)
transformed_image = transformed_data['image']

# --- Execution Flow Example ---
# When complex_pipeline(image=...) is called:
# 1. HorizontalFlip runs based on its p=0.5.
# 2. The OneOf block runs based on its p=0.8.
#    - If it runs, it randomly chooses either Sequential or Emboss (50/50).
#    - If Sequential is chosen, GaussianBlur and Sharpen are both applied.
#    - If Emboss is chosen, Emboss is applied.
# 3. The SomeOf block runs based on its p=0.9.
#    - If it runs, it selects 1 or 2 transforms randomly from its list.
#    - For each selected transform, it runs based on *that transform's* p value.
# 4. GaussNoise runs based on its p=0.2.

This demonstrates how you can chain and nest different composition logic blocks to control the probability and flow of your augmentations precisely.

Where to Go Next? 🔗

After learning about pipelines, you now understand how to:

Create basic and complex augmentation pipelines using A.Compose
Dynamically modify pipelines using + and - operators
Control probability and execution flow with composition utilities
Configure pipelines for different data types (images, masks, bboxes, keypoints)
Handle advanced scenarios like nested compositions

Next steps to explore:

Transforms: Explore the individual augmentation operations you can include in your pipelines.
Setting Probabilities: Get a deeper understanding of how the p parameter works for both individual transforms and composition blocks.
Working with Targets: Learn how pipelines consistently apply augmentations to images, masks, bounding boxes, and keypoints.
Basic Usage Examples: See complete code examples of pipelines applied to common computer vision tasks.
Advanced Guides: Discover techniques like serialization or creating custom transforms to integrate into your pipelines.
Visually Explore Transforms: Experiment with combining transforms in the interactive tool.