Stay updated

albumentations.augmentations.pixel.functional

Functional implementations of image augmentation operations. This module contains low-level functions for various image augmentation techniques including color transformations, blur effects, tone curve adjustments, noise additions, and other visual modifications. These functions form the foundation for the transform classes and provide the core functionality for manipulating image data during the augmentation process.

Members

functionshift_hsv
functionshift_hsv_images
functionsolarize
functionposterize
functionequalize
functionevaluate_bez
functionmove_tone_curve
functionlinear_transformation_rgb
functionclahe
functionimage_compression
functionadd_snow_bleach
functiongenerate_snow_textures
functionadd_snow_texture
functionadd_rain
functionget_fog_particle_radiuses
functionadd_fog
functionadd_sun_flare_overlay
functionadd_sun_flare_physics_based
functionadd_shadow
functionadd_gravel
functioninvert
functionchannel_shuffle
functionvolume_channel_shuffle
functionvolumes_channel_shuffle
functiongamma_transform
functioniso_noise
functioniso_noise_images
functionto_gray_weighted_average
functionto_gray_from_lab
functionto_gray_desaturation
functionto_gray_average
functionto_gray_max
functionto_gray_pca
functionto_gray
functiongrayscale_to_multichannel
functiondownscale
functionnoop
functionfancy_pca
functionadjust_brightness_torchvision
functionadjust_contrast_torchvision
functionadjust_saturation_torchvision
functionadjust_hue_torchvision
functionapply_brightness_contrast_torchvision
functionsuperpixels
functionunsharp_mask_images
functionunsharp_mask
functionpixel_dropout
functionspatter_rain
functionspatter_mud
functionchromatic_aberration
functionplanckian_jitter
functionadd_noise
functionslic
functionshot_noise
functionget_safe_brightness_contrast_params
functiongenerate_constant_noise_with_py_random
functiongenerate_spatial_noise
functiongenerate_per_pixel_noise
functionsample_noise
functionsample_uniform
functionsample_gaussian
functionsample_laplace
functionsample_beta
functiongenerate_shared_noise
functionsharpen_gaussian
functionapply_salt_and_pepper
functiongenerate_plasma_pattern
functionapply_plasma_brightness_contrast
functionapply_plasma_shadow
functioncreate_directional_gradient
functioncreate_illumination_gradient
functionapply_linear_illumination
functionapply_corner_illumination
functionapply_gaussian_illumination
functionauto_contrast
functioncreate_contrast_lut
functionget_histogram_bounds
functionget_drop_mask
functiongenerate_random_values
functionprepare_drop_values
functionget_mask_array
functionget_rain_params
functionget_mud_params
functionrgb_to_optical_density
functionnormalize_vectors
functionget_normalizer
classStainNormalizer
classSimpleNMF
functionorder_stains_combined
classVahadaneNormalizer
classMacenkoNormalizer
functionget_tissue_mask
functionapply_he_stain_augmentation
functionconvolve
functionseparable_convolve
functionapply_vignette
functionapply_film_grain
functionapply_halftone
functionapply_lens_flare
functiongenerate_water_displacement_maps
functionapply_atmospheric_fog

shift_hsvfunction

shift_hsv(
    img: ImageType,
    hue_shift: float,
    sat_shift: float,
    val_shift: float
)

Shift hue, saturation, and value in HSV space. hue_shift, sat_shift, val_shift control amount; grayscale gets value only. uint8 I/O. Args: img (ImageType): The image to shift. hue_shift (float): The amount to shift the hue. sat_shift (float): The amount to shift the saturation. val_shift (float): The amount to shift the value. Returns: ImageType: The shifted image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
hue_shift	float	-	-
sat_shift	float	-	-
val_shift	float	-	-

shift_hsv_imagesfunction

shift_hsv_images(
    images: np.ndarray,
    hue_shift: float,
    sat_shift: float,
    val_shift: float
)

Apply HSV shift to a batch of images (N, H, W, C) or (N, H, W). Uses pre-allocated output and per-frame shift_hsv; faster than stack loop for video batches. Uses a pre-allocated output array with per-frame shift_hsv calls. This is faster than the base-class np.stack loop because it avoids N intermediate allocations and the final stack copy. A reshape-to-tall-image approach (N,H,W,3) -> (N*H,W,3) was benchmarked but was slower for images >= 512x512 due to cv2.cvtColor and cv2.LUT cache thrashing on the large intermediate arrays.

Parameters

Name	Type	Default	Description
images	np.ndarray	-	-
hue_shift	float	-	-
sat_shift	float	-	-
val_shift	float	-	-

solarizefunction

solarize(
    img: ImageType,
    threshold: float
)

Invert pixel values above a normalized threshold (solarization). threshold in [0, 1]; works for uint8 and float32; pixels below threshold unchanged. Args: img (ImageType): The image to solarize. Can be uint8 or float32. threshold (float): Normalized threshold value in range [0, 1]. For uint8 images: pixels above threshold * 255 are inverted For float32 images: pixels above threshold are inverted Returns: ImageType: Solarized image. Note: The threshold is normalized to [0, 1] range for both uint8 and float32 images. For uint8 images, the threshold is internally scaled by 255.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
threshold	float	-	-

posterizefunction

posterize(
    img: ImageType,
    bits: Literal[1, 2, 3, 4, 5, 6, 7] | list[Literal[1, 2, 3, 4, 5, 6, 7]]
)

Reduce bit depth by keeping only the highest N bits per channel. bits: 1-7 or list per channel; LUT-based. uint8 I/O, clipped. Args: img (ImageType): Input image. Can be single or multi-channel. bits (Literal[1, 2, 3, 4, 5, 6, 7] | list[Literal[1, 2, 3, 4, 5, 6, 7]]): Number of high bits to keep.. Can be either: - A single value to apply the same bit reduction to all channels - A list of values to apply different bit reduction per channel. Length of list must match number of channels in image. Returns: ImageType: Image with reduced bit depth. Has same shape and dtype as input. Note: - The transform keeps the N highest bits and sets all other bits to 0 - For example, if bits=3: - Original value: 11010110 (214) - Keep 3 bits: 11000000 (192) - The number of unique colors per channel will be 2^bits - Higher bits values = more colors = more subtle effect - Lower bits values = fewer colors = more dramatic posterization Examples: >>> import numpy as np >>> image = np.random.randint(0, 256, (100, 100, 3), dtype=np.uint8) >>> # Same posterization for all channels >>> result = posterize(image, bits=3) >>> # Different posterization per channel >>> result = posterize(image, bits=[3, 4, 5]) # RGB channels

Parameters

Name	Type	Default	Description
img	ImageType	-	-
bits	One of: 1 2 3 4 5 6 7] \| list[Literal[1, 2, 3, 4, 5, 6, 7]	-	-

equalizefunction

equalize(
    img: ImageType,
    mask: np.ndarray | None,
    mode: Literal['cv', 'pil'] = cv,
    by_channels: bool = True
)

Apply histogram equalization to the input image. Enhances contrast; supports grayscale and color, mode cv/pil, optional mask and by_channels. This function enhances the contrast of the input image by equalizing its histogram. It supports both grayscale and color images, and can operate on individual channels or on the luminance channel of the image. Args: img (ImageType): Input image. Can be grayscale (2D array) or RGB (3D array). mask (np.ndarray | None): Optional mask to apply the equalization selectively. If provided, must have the same shape as the input image. Default: None. mode (Literal['cv', 'pil']): The backend to use for equalization. Can be either "cv" for OpenCV or "pil" for Pillow-style equalization. Default: "cv". by_channels (bool): If True, applies equalization to each channel independently. If False, converts the image to YCrCb color space and equalizes only the luminance channel. Only applicable to color images. Default: True. Returns: ImageType: Equalized image. The output has the same dtype as the input. Raises: ValueError: If the input image or mask have invalid shapes or types. Note: - If the input image is not uint8, it will be temporarily converted to uint8 for processing and then converted back to its original dtype. - For color images, when by_channels=False, the image is converted to YCrCb color space, equalized on the Y channel, and then converted back to RGB. - The function preserves the original number of channels in the image. Examples: >>> import numpy as np >>> import albumentations as A >>> image = np.random.randint(0, 256, (100, 100, 3), dtype=np.uint8) >>> equalized = A.equalize(image, mode="cv", by_channels=True) >>> assert equalized.shape == image.shape >>> assert equalized.dtype == image.dtype

Parameters

Name	Type	Default	Description
img	ImageType	-	-
mask	One of: np.ndarray None	-	-
mode	One of: 'cv' 'pil'	cv	-
by_channels	bool	True	-

evaluate_bezfunction

evaluate_bez(
    low_y: float | np.ndarray,
    high_y: float | np.ndarray
)

Evaluate the Bezier curve at the given t values. Used for tone-curve control points; returns y coordinates for input t in [0, 1]. Args: t (np.ndarray): The t values to evaluate the Bezier curve at. low_y (float | np.ndarray): The low y values to evaluate the Bezier curve at. high_y (float | np.ndarray): The high y values to evaluate the Bezier curve at. Returns: np.ndarray: The Bezier curve values.

Parameters

Name	Type	Default	Description
low_y	One of: float np.ndarray	-	-
high_y	One of: float np.ndarray	-	-

move_tone_curvefunction

move_tone_curve(
    img: ImageType,
    low_y: float | np.ndarray,
    high_y: float | np.ndarray,
    num_channels: int
)

Rescale bright/dark via Bezier tone curve. low_y, high_y (per-channel or scalar), num_channels for per-channel curves. uint8 I/O. Args: img (ImageType): Any number of channels low_y (float | np.ndarray): per-channel or single y-position of a Bezier control point used to adjust the tone curve, must be in range [0, 1] high_y (float | np.ndarray): per-channel or single y-position of a Bezier control point used to adjust image tone curve, must be in range [0, 1] num_channels (int): The number of channels in the input image. Returns: ImageType: Image with adjusted tone curve

Parameters

Name	Type	Default	Description
img	ImageType	-	-
low_y	One of: float np.ndarray	-	-
high_y	One of: float np.ndarray	-	-
num_channels	int	-	-

linear_transformation_rgbfunction

linear_transformation_rgb(
    img: ImageType,
    transformation_matrix: np.ndarray
)

3x3 linear transformation to RGB. transformation_matrix (or batch) multiplies channel vector. Supports (H,W,3), (B,H,W,3), (B,D,H,W,3). This function applies a 3x3 linear transformation matrix (or batch of matrices) to the RGB channels of either a single image or a batch of images. Args: img (ImageType): A single RGB image of shape (H, W, 3), or a batch of images (B, H, W, 3), or a batch of volumes (B, D, H, W, 3). transformation_matrix (np.ndarray): A 3x3 matrix Returns: ImageType: Transformed image or batch of images, matching the input shape and dtype. Raises: ValueError: If input shapes do not conform to the supported configurations.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
transformation_matrix	np.ndarray	-	-

clahefunction

clahe(
    img: ImageType,
    clip_limit: float,
    tile_grid_size: tuple[int, int]
)

CLAHE (Contrast Limited Adaptive Histogram Equalization) per tile. clip_limit, tile_grid_size. For color, applied in LAB to L channel. uint8 I/O. This function enhances the contrast of the input image using CLAHE. For color images, it converts the image to the LAB color space, applies CLAHE to the L channel, and then converts the image back to RGB. Args: img (ImageType): Input image. Can be grayscale (2D array) or RGB (3D array). clip_limit (float): Threshold for contrast limiting. Higher values give more contrast. tile_grid_size (tuple[int, int]): Size of grid for histogram equalization. Width and height of the grid. Returns: ImageType: Image with CLAHE applied. The output has the same dtype as the input. Note: - If the input image is float32, it's temporarily converted to uint8 for processing and then converted back to float32. - For color images, CLAHE is applied only to the luminance channel in the LAB color space. Raises: ValueError: If the input image is not 2D or 3D. Examples: >>> import numpy as np >>> img = np.random.randint(0, 256, (100, 100, 3), dtype=np.uint8) >>> result = clahe(img, clip_limit=2.0, tile_grid_size=(8, 8)) >>> assert result.shape == img.shape >>> assert result.dtype == img.dtype

Parameters

Name	Type	Default	Description
img	ImageType	-	-
clip_limit	float	-	-
tile_grid_size	tuple[int, int]	-	-

image_compressionfunction

image_compression(
    img: ImageType,
    quality: int,
    image_type: Literal['.jpg', '.webp']
)

Compress image with JPEG or WebP to simulate artifacts. quality, image_type (.jpg/.webp). Lower quality increases blockiness. uint8 I/O. Args: img (ImageType): Input image quality (int): Quality of compression in range [1, 100] image_type (Literal['.jpg', '.webp']): Type of compression to use Returns: ImageType: Compressed image

Parameters

Name	Type	Default	Description
img	ImageType	-	-
quality	int	-	-
image_type	One of: '.jpg' '.webp'	-	-

add_snow_bleachfunction

add_snow_bleach(
    img: ImageType,
    snow_point: float,
    brightness_coeff: float
)

Add a simple snow effect by bleaching out pixels. Brightness increase and optional mask; used as a building block for more complex snow augmentations. This function simulates a basic snow effect by increasing the brightness of pixels that are above a certain threshold (snow_point). It operates in the HLS color space to modify the lightness channel. Args: img (ImageType): Input image. Can be either RGB uint8 or float32. snow_point (float): A float in the range [0, 1], scaled and adjusted to determine the threshold for pixel modification. Higher values result in less snow effect. brightness_coeff (float): Coefficient applied to increase the brightness of pixels below the snow_point threshold. Larger values lead to more pronounced snow effects. Should be greater than 1.0 for a visible effect. Returns: ImageType: Image with simulated snow effect. The output has the same dtype as the input. Note: - This function converts the image to the HLS color space to modify the lightness channel. - The snow effect is created by selectively increasing the brightness of pixels. - This method tends to create a 'bleached' look, which may not be as realistic as more advanced snow simulation techniques. - The function automatically handles both uint8 and float32 input images. The snow effect is created through the following steps: 1. Convert the image from RGB to HLS color space. 2. Adjust the snow_point threshold. 3. Increase the lightness of pixels below the threshold. 4. Convert the image back to RGB. Mathematical Formulation: Let L be the lightness channel in HLS space. For each pixel (i, j): If L[i, j] < snow_point: L[i, j] = L[i, j] * brightness_coeff Examples: >>> import numpy as np >>> import albumentations as A >>> image = np.random.randint(0, 256, [100, 100, 3], dtype=np.uint8) >>> snowy_image = A.functional.add_snow_v1(image, snow_point=0.5, brightness_coeff=1.5) References: - HLS Color Space: https://en.wikipedia.org/wiki/HSL_and_HSV - Original implementation: https://github.com/UjjwalSaxena/Automold--Road-Augmentation-Library

Parameters

Name	Type	Default	Description
img	ImageType	-	-
snow_point	float	-	-
brightness_coeff	float	-	-

generate_snow_texturesfunction

generate_snow_textures(
    img_shape: tuple[int, int],
    random_generator: np.random.Generator
)

Generate snow texture and sparkle mask for add_snow_texture. Returns texture and mask arrays; uses random generator for reproducibility. Args: img_shape (tuple[int, int]): Image shape. random_generator (np.random.Generator): Random generator to use. Returns: tuple[np.ndarray, np.ndarray]: Tuple of (snow_texture, sparkle_mask) arrays.

Parameters

Name	Type	Default	Description
img_shape	tuple[int, int]	-	-
random_generator	np.random.Generator	-	-

add_snow_texturefunction

add_snow_texture(
    img: ImageType,
    snow_point: float,
    brightness_coeff: float,
    snow_texture: np.ndarray,
    sparkle_mask: np.ndarray
)

Add snow effect: texture overlay, sparkle, depth gradient, blue tint. snow_point, brightness_coeff; takes precomputed snow_texture and sparkle_mask. uint8 I/O. This function simulates snowfall by applying multiple visual effects to the image, including brightness adjustment, snow texture overlay, depth simulation, and color tinting. The result is a more natural-looking snow effect compared to simple pixel bleaching methods. Args: img (ImageType): Input image in RGB format. snow_point (float): Coefficient that controls the amount and intensity of snow. Should be in the range [0, 1], where 0 means no snow and 1 means maximum snow effect. brightness_coeff (float): Coefficient for brightness adjustment to simulate the reflective nature of snow. Should be in the range [0, 1], where higher values result in a brighter image. snow_texture (np.ndarray): Snow texture. sparkle_mask (np.ndarray): Sparkle mask. Returns: ImageType: Image with added snow effect. The output has the same dtype as the input. Note: - The function first converts the image to HSV color space for better control over brightness and color adjustments. - A snow texture is generated using Gaussian noise and then filtered for a more natural appearance. - A depth effect is simulated, with more snow at the top of the image and less at the bottom. - A slight blue tint is added to simulate the cool color of snow. - Random sparkle effects are added to simulate light reflecting off snow crystals. The snow effect is created through the following steps: 1. Brightness adjustment in HSV space 2. Generation of a snow texture using Gaussian noise 3. Application of a depth effect to the snow texture 4. Blending of the snow texture with the original image 5. Addition of a cool blue tint 6. Addition of sparkle effects Examples: >>> import numpy as np >>> import albumentations as A >>> image = np.random.randint(0, 256, [100, 100, 3], dtype=np.uint8) >>> snowy_image = A.functional.add_snow_v2(image, snow_coeff=0.5, brightness_coeff=0.2) Note: This function works with both uint8 and float32 image types, automatically handling the conversion between them. References: - Perlin Noise: https://en.wikipedia.org/wiki/Perlin_noise - HSV Color Space: https://en.wikipedia.org/wiki/HSL_and_HSV

Parameters

Name	Type	Default	Description
img	ImageType	-	-
snow_point	float	-	-
brightness_coeff	float	-	-
snow_texture	np.ndarray	-	-
sparkle_mask	np.ndarray	-	-

add_rainfunction

add_rain(
    img: ImageType,
    slant: float,
    drop_length: int,
    drop_width: int,
    drop_color: tuple[int, int, int],
    blur_value: int,
    brightness_coefficient: float,
    rain_drops: np.ndarray
)

Add rain streaks. slant, drop_length, drop_width, drop_color, blur_value, brightness_coefficient, rain_drops. Polylines; optional blur. uint8 I/O. This function adds rain to an image by drawing rain drops on the image. The rain drops are drawn using the OpenCV function cv2.polylines. Args: img (ImageType): The image to add rain to. slant (float): The slant of the rain drops. drop_length (int): The length of the rain drops. drop_width (int): The width of the rain drops. drop_color (tuple[int, int, int]): The color of the rain drops. blur_value (int): The blur value of the rain drops. brightness_coefficient (float): The brightness coefficient of the rain drops. rain_drops (np.ndarray): The rain drops to draw on the image. Returns: ImageType: The image with rain added.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
slant	float	-	-
drop_length	int	-	-
drop_width	int	-	-
drop_color	tuple[int, int, int]	-	-
blur_value	int	-	-
brightness_coefficient	float	-	-
rain_drops	np.ndarray	-	-

get_fog_particle_radiusesfunction

get_fog_particle_radiuses(
    img_shape: tuple[int, int],
    num_particles: int,
    fog_intensity: float,
    random_generator: np.random.Generator
)

Generate per-particle radius list for add_fog. num_particles, fog_intensity, image size; random_generator samples. Returns list[int]. Args: img_shape (tuple[int, int]): Image shape. num_particles (int): Number of fog particles. fog_intensity (float): Intensity of the fog effect, between 0 and 1. random_generator (np.random.Generator): Random generator to use. Returns: list[int]: List of radiuses for each fog particle.

Parameters

Name	Type	Default	Description
img_shape	tuple[int, int]	-	-
num_particles	int	-	-
fog_intensity	float	-	-
random_generator	np.random.Generator	-	-

add_fogfunction

add_fog(
    img: ImageType,
    fog_intensity: float,
    alpha_coef: float,
    fog_particle_positions: list[tuple[int, int]],
    fog_particle_radiuses: list[int]
)

Add fog with circular particles and alpha blending. fog_intensity, alpha_coef, positions, radiuses (lists from get_fog_particle_radiuses). uint8 I/O, clipped. This function adds fog to an image by drawing fog particles on the image. The fog particles are drawn using the OpenCV function cv2.circle. Args: img (ImageType): The image to add fog to. fog_intensity (float): The intensity of the fog effect, between 0 and 1. alpha_coef (float): The coefficient for the alpha blending. fog_particle_positions (list[tuple[int, int]]): The positions of the fog particles. fog_particle_radiuses (list[int]): The radiuses of the fog particles. Returns: ImageType: The image with fog added.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
fog_intensity	float	-	-
alpha_coef	float	-	-
fog_particle_positions	list[tuple[int, int]]	-	-
fog_particle_radiuses	list[int]	-	-

add_sun_flare_overlayfunction

add_sun_flare_overlay(
    img: ImageType,
    flare_center: tuple[float, float],
    src_radius: int,
    src_color: tuple[int, ...],
    circles: list[Any]
)

Add a sun flare effect using a simple overlay. Params: src_radius, num_flare_circles; used as helper for physics-based sun flare. This function creates a basic sun flare effect by overlaying multiple semi-transparent circles of varying sizes and intensities on the input image. The effect simulates a simple lens flare caused by bright light sources. Args: img (ImageType): The input image. flare_center (tuple[float, float]): (x, y) coordinates of the flare center in pixel coordinates. src_radius (int): The radius of the main sun circle in pixels. src_color (tuple[int, ...]): The color of the sun, represented as a tuple of RGB values. circles (list[Any]): A list of tuples, each representing a circle that contributes to the flare effect. Each tuple contains: - alpha (float): The transparency of the circle (0.0 to 1.0). - center (tuple[int, int]): (x, y) coordinates of the circle center. - radius (int): The radius of the circle. - color (tuple[int, int, int]): RGB color of the circle. Returns: ImageType: The output image with the sun flare effect added. Note: - This function uses a simple alpha blending technique to overlay flare elements. - The main sun is created as a gradient circle, fading from the center outwards. - Additional flare circles are added along an imaginary line from the sun's position. - This method is computationally efficient but may produce less realistic results compared to more advanced techniques. The flare effect is created through the following steps: 1. Create an overlay image and output image as copies of the input. 2. Add smaller flare circles to the overlay. 3. Blend the overlay with the output image using alpha compositing. 4. Add the main sun circle with a radial gradient. Examples: >>> import numpy as np >>> import albumentations as A >>> image = np.random.randint(0, 256, [100, 100, 3], dtype=np.uint8) >>> flare_center = (50, 50) >>> src_radius = 20 >>> src_color = (255, 255, 200) >>> circles = [ ... (0.1, (60, 60), 5, (255, 200, 200)), ... (0.2, (70, 70), 3, (200, 255, 200)) ... ] >>> flared_image = A.functional.add_sun_flare_overlay( ... image, flare_center, src_radius, src_color, circles ... ) References: - Alpha compositing: https://en.wikipedia.org/wiki/Alpha_compositing - Lens flare: https://en.wikipedia.org/wiki/Lens_flare

Parameters

Name	Type	Default	Description
img	ImageType	-	-
flare_center	tuple[float, float]	-	-
src_radius	int	-	-
src_color	tuple[int, ...]	-	-
circles	list[Any]	-	-

add_sun_flare_physics_basedfunction

add_sun_flare_physics_based(
    img: ImageType,
    flare_center: tuple[int, int],
    src_radius: int,
    src_color: tuple[int, int, int],
    circles: list[Any]
)

Physics-based sun flare: circle, spikes, ghosts, chromatic aberration, screen blend. flare_center, src_radius, src_color, circles. This function creates a complex sun flare effect by simulating various optical phenomena that occur in real camera lenses when capturing bright light sources. The result is a more realistic and physically plausible lens flare effect. Args: img (ImageType): Input image. flare_center (tuple[int, int]): (x, y) coordinates of the sun's center in pixels. src_radius (int): Radius of the main sun circle in pixels. src_color (tuple[int, int, int]): Color of the sun in RGB format. circles (list[Any]): List of tuples, each representing a flare circle with parameters: (alpha, center, size, color) - alpha (float): Transparency of the circle (0.0 to 1.0). - center (tuple[int, int]): (x, y) coordinates of the circle center. - size (float): Size factor for the circle radius. - color (tuple[int, int, int]): RGB color of the circle. Returns: ImageType: Image with added sun flare effect. Note: This function implements several techniques to create a more realistic flare: 1. Separate flare layer: Allows for complex manipulations of the flare effect. 2. Lens diffraction spikes: Simulates light diffraction in camera aperture. 3. Radial gradient mask: Creates natural fading of the flare from the center. 4. Gaussian blur: Softens the flare for a more natural glow effect. 5. Chromatic aberration: Simulates color fringing often seen in real lens flares. 6. Screen blending: Provides a more realistic blending of the flare with the image. The flare effect is created through the following steps: 1. Create a separate flare layer. 2. Add the main sun circle and diffraction spikes to the flare layer. 3. Add additional flare circles based on the input parameters. 4. Apply Gaussian blur to soften the flare. 5. Create and apply a radial gradient mask for natural fading. 6. Simulate chromatic aberration by applying different blurs to color channels. 7. Blend the flare with the original image using screen blending mode. Examples: >>> import numpy as np >>> import albumentations as A >>> image = np.random.randint(0, 256, [1000, 1000, 3], dtype=np.uint8) >>> flare_center = (500, 500) >>> src_radius = 50 >>> src_color = (255, 255, 200) >>> circles = [ ... (0.1, (550, 550), 10, (255, 200, 200)), ... (0.2, (600, 600), 5, (200, 255, 200)) ... ] >>> flared_image = A.functional.add_sun_flare_physics_based( ... image, flare_center, src_radius, src_color, circles ... ) References: - Lens flare: https://en.wikipedia.org/wiki/Lens_flare - Diffraction: https://en.wikipedia.org/wiki/Diffraction - Chromatic aberration: https://en.wikipedia.org/wiki/Chromatic_aberration - Screen blending: https://en.wikipedia.org/wiki/Blend_modes#Screen

Parameters

Name	Type	Default	Description
img	ImageType	-	-
flare_center	tuple[int, int]	-	-
src_radius	int	-	-
src_color	tuple[int, int, int]	-	-
circles	list[Any]	-	-

add_shadowfunction

add_shadow(
    img: ImageType,
    vertices_list: list[np.ndarray],
    intensities: np.ndarray
)

Darken polygonal regions to simulate shadows. vertices_list and intensities per polygon. Use for outdoor or synthetic shadow augmentation. uint8 I/O. Args: img (ImageType): Input image. Multichannel images are supported. vertices_list (list[np.ndarray]): List of vertices for shadow polygons. intensities (np.ndarray): Array of shadow intensities. Range is [0, 1]. Returns: ImageType: Image with shadows added. References: Automold--Road-Augmentation-Library: https://github.com/UjjwalSaxena/Automold--Road-Augmentation-Library

Parameters

Name	Type	Default	Description
img	ImageType	-	-
vertices_list	list[np.ndarray]	-	-
intensities	np.ndarray	-	-

add_gravelfunction

add_gravel(
    img: ImageType,
    gravels: list[Any]
)

Add gravel: write HLS saturation in rectangular regions. gravels: list of (min_y, max_y, min_x, max_x, sat). RGB only; uint8 I/O. This function adds gravel to an image by drawing gravel particles on the image. The gravel particles are drawn using the OpenCV function cv2.circle. Args: img (ImageType): The image to add gravel to. gravels (list[Any]): The gravel particles to draw on the image. Returns: ImageType: The image with gravel added.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
gravels	list[Any]	-	-

invertfunction

invert(
    img: ImageType
)

Produce the negative image: each pixel becomes max_val - pixel. uint8/float32, any channels. Use for inversion augmentation or visualization. This function inverts the colors of an image by subtracting each pixel value from the maximum possible value. The result is a negative of the original image. Args: img (ImageType): The image to invert. Returns: ImageType: The inverted image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-

channel_shufflefunction

channel_shuffle(
    img: ImageType,
    channels_shuffled: list[int]
)

Shuffle image channels via cv2.mixChannels. channels_shuffled gives new order; supports (H, W, C) or batch (N, H, W, C). This function shuffles the channels of an image by using the cv2.mixChannels function. The channels are shuffled according to the channels_shuffled array. Args: img (ImageType): The image to shuffle. channels_shuffled (list[int]): The array of channels to shuffle. Returns: ImageType: The shuffled image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
channels_shuffled	list[int]	-	-

volume_channel_shufflefunction

volume_channel_shuffle(
    volume: np.ndarray,
    channels_shuffled: Sequence[int]
)

Shuffle channels of volume (D, H, W, C) or (D, H, W). Same as channel_shuffle along last axis. channels_shuffled is new order. Used for 3D volume augmentation. Args: volume (np.ndarray): Input volume. channels_shuffled (Sequence[int]): New channel order. Returns: np.ndarray: Volume with channels shuffled.

Parameters

Name	Type	Default	Description
volume	np.ndarray	-	-
channels_shuffled	Sequence[int]	-	-

volumes_channel_shufflefunction

volumes_channel_shuffle(
    volumes: np.ndarray,
    channels_shuffled: Sequence[int]
)

Shuffle channels of a batch of volumes (B, D, H, W, C) or (B, D, H, W). Per-volume shuffle; used for 3D batch augmentation. Args: volumes (np.ndarray): Input batch of volumes. channels_shuffled (Sequence[int]): New channel order. Returns: np.ndarray: Batch of volumes with channels shuffled.

Parameters

Name	Type	Default	Description
volumes	np.ndarray	-	-
channels_shuffled	Sequence[int]	-	-

gamma_transformfunction

gamma_transform(
    img: ImageType,
    gamma: float
)

Apply gamma transformation: pixel^gamma to brighten or darken. gamma > 1 brightens; gamma < 1 darkens. Supports uint8 and float32. This function applies gamma transformation to an image by raising each pixel value to the power of gamma. The result is a non-linear transformation that can enhance or reduce the contrast of the image. Args: img (ImageType): The image to apply gamma transformation to. gamma (float): The gamma value to apply. Returns: ImageType: The gamma transformed image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
gamma	float	-	-

iso_noisefunction

iso_noise(
    image: np.ndarray,
    color_shift: float,
    intensity: float,
    random_generator: np.random.Generator
)

Apply Poisson noise to simulate camera sensor noise. color_shift, intensity; approximates shot noise. float32 I/O, clipped. Args: image (np.ndarray): Input image. Currently, only RGB images are supported. color_shift (float): The amount of color shift to apply. intensity (float): Multiplication factor for noise values. Values of ~0.5 produce a noticeable, yet acceptable level of noise. random_generator (np.random.Generator): If specified, this will be random generator used for noise generation. Returns: np.ndarray: The noised image. Image types: uint8, float32 Number of channels: 3

Parameters

Name	Type	Default	Description
image	np.ndarray	-	-
color_shift	float	-	-
intensity	float	-	-
random_generator	np.random.Generator	-	-

iso_noise_imagesfunction

iso_noise_images(
    images: np.ndarray,
    color_shift: float,
    intensity: float,
    random_generator: np.random.Generator
)

Apply ISO noise to an image batch with vectorized noise. One noise field broadcast to all images; matches apply() with same seed. Use for batch augmentation. Noise is generated once and broadcast across all images, matching the behavior of calling apply() per image with the same random seed. Args: images (np.ndarray): (N, H, W, 3) RGB images. color_shift (float): Amount of color hue shift. intensity (float): Noise intensity multiplier. random_generator (np.random.Generator): Numpy RNG seeded for reproducibility. Returns: np.ndarray: (N, H, W, 3) noised images. Image types: uint8, float32 Number of channels: 3

Parameters

Name	Type	Default	Description
images	np.ndarray	-	-
color_shift	float	-	-
intensity	float	-	-
random_generator	np.random.Generator	-	-

to_gray_weighted_averagefunction

to_gray_weighted_average(
    img: ImageType
)

Convert RGB to grayscale with weighted average (0.299*R+0.587*G+0.114*B). Single or batch. BT.601. Matches OpenCV perceptual luminance. This function uses OpenCV's cvtColor function with COLOR_RGB2GRAY conversion, which applies the following formula: Y = 0.299*R + 0.587*G + 0.114*B The function efficiently handles batches and volumes by reshaping them into a tall 2D image for processing, then restoring the original shape structure. Args: img (ImageType): Input RGB image(s) as a numpy array. Supported shapes: - Single image: (H, W, 3) - Batch of images: (N, H, W, 3) - Volume: (D, H, W, 3) - Batch of volumes: (N, D, H, W, 3) Returns: ImageType: Grayscale image as a 2D numpy array. Image types: uint8, float32 Number of channels: 3

Parameters

Name	Type	Default	Description
img	ImageType	-	-

to_gray_from_labfunction

to_gray_from_lab(
    img: ImageType
)

Convert RGB to grayscale using the LAB L channel; perceived brightness matches human vision better than a simple average. Single image or batch. uint8 I/O. This function converts RGB images to grayscale by first converting to LAB color space and then extracting the L (lightness) channel. It uses albucore's reshape utilities to efficiently handle batches/volumes by processing them as a single tall image. Implementation Details: The function uses albucore's reshape_for_channel and restore_from_channel functions: - reshape_for_channel: Flattens batches/volumes to 2D format for OpenCV processing - restore_from_channel: Restores the original shape after processing This enables processing all images in a single OpenCV call Args: img (ImageType): Input RGB image(s) as a numpy array. Must have 3 channels in the last dimension. Supported shapes: - Single image: (H, W, 3) - Batch of images: (N, H, W, 3) - Volume: (D, H, W, 3) - Batch of volumes: (N, D, H, W, 3) Supported dtypes: - np.uint8: Values in range [0, 255] - np.float32: Values in range [0, 1] Returns: ImageType: - Single image: (H, W) - Batch of images: (N, H, W) - Volume: (D, H, W) - Batch of volumes: (N, D, H, W) The output dtype matches the input dtype. For float inputs, the L channel is normalized to [0, 1] by dividing by 100. Raises: ValueError: If the last dimension is not 3 (RGB channels) Examples: >>> # Single image >>> img = np.random.randint(0, 256, (100, 100, 3), dtype=np.uint8) >>> gray = to_gray_from_lab(img) >>> assert gray.shape == (100, 100) >>> # Batch of images - efficiently processed without loops >>> batch = np.random.randint(0, 256, (10, 100, 100, 3), dtype=np.uint8) >>> gray_batch = to_gray_from_lab(batch) >>> assert gray_batch.shape == (10, 100, 100) >>> # Volume (e.g., video frames or 3D medical data) >>> volume = np.random.randint(0, 256, (16, 100, 100, 3), dtype=np.uint8) >>> gray_volume = to_gray_from_lab(volume) >>> assert gray_volume.shape == (16, 100, 100) >>> # Float32 input >>> img_float = img.astype(np.float32) / 255.0 >>> gray_float = to_gray_from_lab(img_float) >>> assert 0 <= gray_float.min() <= gray_float.max() <= 1.0 Note: The LAB color space provides perceptually uniform grayscale conversion, where the L (lightness) channel represents human perception of brightness better than simple RGB averaging or other methods.

Parameters

Name	Type	Default	Description
img	ImageType	-	-

to_gray_desaturationfunction

to_gray_desaturation(
    img: ImageType
)

Convert to grayscale with desaturation (max + min) / 2 per pixel. Any channels; single or batch. uint8 and float32. uint8 and float32. Args: img (ImageType): Input image as a numpy array. Returns: ImageType: Grayscale image as a 2D numpy array. Image types: uint8, float32 Number of channels: any

Parameters

Name	Type	Default	Description
img	ImageType	-	-

to_gray_averagefunction

to_gray_average(
    img: ImageType
)

Convert to grayscale using per-pixel mean across channels. Simple average; single image or batch. Any channel count. uint8 and float32. This function computes the arithmetic mean across all channels for each pixel, resulting in a grayscale representation of the image. Key aspects of this method: 1. It treats all channels equally, regardless of their perceptual importance. 2. Works with any number of channels, making it versatile for various image types. 3. Simple and fast to compute, but may not accurately represent perceived brightness. 4. For RGB images, the formula is: Gray = (R + G + B) / 3 Note: This method may produce different results compared to weighted methods (like RGB weighted average) which account for human perception of color brightness. It may also produce unexpected results for images with alpha channels or non-color data in additional channels. Args: img (ImageType): Input image as a numpy array. Can be any number of channels. Returns: ImageType: Grayscale image as a 2D numpy array. The output data type matches the input data type. Image types: uint8, float32 Number of channels: any

Parameters

Name	Type	Default	Description
img	ImageType	-	-

to_gray_maxfunction

to_gray_max(
    img: ImageType
)

Convert to grayscale using max across channels per pixel. Equivalent to HSV V for RGB. Any channel count. Single or batch. uint8 and float32. This function takes the maximum value across all channels for each pixel, resulting in a grayscale image that preserves the brightest parts of the original image. Key aspects of this method: 1. Works with any number of channels, making it versatile for various image types. 2. For 3-channel (e.g., RGB) images, this method is equivalent to extracting the V (Value) channel from the HSV color space. 3. Preserves the brightest parts of the image but may lose some color contrast information. 4. Simple and fast to compute. Note: - This method tends to produce brighter grayscale images compared to other conversion methods, as it always selects the highest intensity value from the channels. - For RGB images, it may not accurately represent perceived brightness as it doesn't account for human color perception. Args: img (ImageType): Input image as a numpy array. Can be any number of channels. Returns: ImageType: Grayscale image as a 2D numpy array. The output data type matches the input data type. Image types: uint8, float32 Number of channels: any

Parameters

Name	Type	Default	Description
img	ImageType	-	-

to_gray_pcafunction

to_gray_pca(
    img: ImageType
)

Reduce to one channel via PCA; captures max variance in color. Single or batch; uint8 or float32. Clipped. Use when simple averaging loses info. This function applies PCA to reduce a multi-channel image to a single channel, effectively creating a grayscale representation that captures the maximum variance in the color data. Args: img (ImageType): Input image as a numpy array. Can be: - Single multi-channel image: (H, W, C) - Batch of multi-channel images: (N, H, W, C) - Single multi-channel volume: (D, H, W, C) - Batch of multi-channel volumes: (N, D, H, W, C) Returns: ImageType: Grayscale image with the same spatial dimensions as input. If input is uint8, output is uint8 in range [0, 255]. If input is float32, output is float32 in range [0, 1]. Note: This method can potentially preserve more information from the original image compared to standard weighted average methods, as it accounts for the correlations between color channels. Image types: uint8, float32 Number of channels: any

Parameters

Name	Type	Default	Description
img	ImageType	-	-

to_grayfunction

to_gray(
    img: ImageType,
    num_output_channels: int,
    method: Literal['weighted_average', 'from_lab', 'desaturation', 'average', 'max', 'pca']
)

Convert image to grayscale using a specified method. Choices: weighted_average, from_lab, desaturation, average, max, pca. This function converts an image to grayscale using a specified method. The method can be one of the following: - "weighted_average": Use the weighted average method. - "from_lab": Use the L channel from the LAB color space. - "desaturation": Use the desaturation method. - "average": Use the average method. - "max": Use the maximum channel value method. - "pca": Use the Principal Component Analysis method. Args: img (ImageType): Input image as a numpy array. num_output_channels (int): The number of channels in the output image. method (Literal['weighted_average', 'from_lab', 'desaturation', 'average', 'max', 'pca']): The method to use for grayscale conversion. Returns: ImageType: Grayscale image as a 2D numpy array.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
num_output_channels	int	-	-
method	One of: 'weighted_average' 'from_lab' 'desaturation' 'average' 'max' 'pca'	-	-

grayscale_to_multichannelfunction

grayscale_to_multichannel(
    grayscale_image: np.ndarray,
    num_output_channels: int = 3
)

Convert grayscale to multi-channel by repeating. num_output_channels (default 3). For blending gray with color in saturation/hue adjustments. This function takes a 2D grayscale image or a 3D image with a single channel and converts it to a multi-channel image by repeating the grayscale data across the specified number of channels. Args: grayscale_image (np.ndarray): Input grayscale image. Can be 2D (height, width) or 3D (height, width, 1). num_output_channels (int, optional): Number of channels in the output image. Defaults to 3. Returns: np.ndarray: Multi-channel image with shape (height, width, num_channels)

Parameters

Name	Type	Default	Description
grayscale_image	np.ndarray	-	-
num_output_channels	int	3	-

downscalefunction

downscale(
    img: ImageType,
    scale: float,
    down_interpolation: int,
    up_interpolation: int
)

Simulate resolution loss: downscale then upscale. down/up_interpolation control quality. Use for low-res or compression-style augmentation. uint8 I/O. This function downscales and upscales an image using the specified interpolation methods. The downscaling and upscaling are performed using albucore.resize. Args: img (ImageType): Input image as a numpy array. scale (float): The scale factor for the downscaling and upscaling. down_interpolation (int): The interpolation method for the downscaling. up_interpolation (int): The interpolation method for the upscaling. Returns: ImageType: The downscaled and upscaled image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
scale	float	-	-
down_interpolation	int	-	-
up_interpolation	int	-	-

noopfunction

noop(
    input_obj: Any,
    **params: Any
)

No-op: return input unchanged. Used to satisfy type checker and pipeline placeholders; accepts any input and optional kwargs. This function is a no-op and returns the input object unchanged. It is used to satisfy the type checker requirements for the `noop` function. Args: input_obj (Any): The input object to return unchanged. **params (Any): Additional keyword arguments. Returns: Any: The input object unchanged.

Parameters

Name	Type	Default	Description
input_obj	Any	-	-
**params	Any	-	-

fancy_pcafunction

fancy_pca(
    img: ImageType,
    alpha_vector: np.ndarray
)

Fancy PCA color augmentation: add noise along principal components. alpha_vector; any channel count. float32 I/O, clipped. Args: img (ImageType): Input image alpha_vector (np.ndarray): Vector of scale factors for each principal component. Should have the same length as the number of channels in the image. Returns: ImageType: Augmented image of the same shape, type, and range as the input. Image types: uint8, float32 Number of channels: Any Note: - This function generalizes the Fancy PCA augmentation to work with any number of channels. - It preserves the original range of the image ([0, 255] for uint8, [0, 1] for float32). - For single-channel images, the augmentation is applied as a simple scaling of pixel intensity variation. - For multi-channel images, PCA is performed on the entire image, treating each pixel as a point in N-dimensional space (where N is the number of channels). - The augmentation preserves the correlation between channels while adding controlled noise. - Computation time may increase significantly for images with a large number of channels. References: ImageNet classification with deep convolutional neural networks: Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012): In Advances in neural information processing systems (pp. 1097-1105).

Parameters

Name	Type	Default	Description
img	ImageType	-	-
alpha_vector	np.ndarray	-	-

adjust_brightness_torchvisionfunction

adjust_brightness_torchvision(
    img: ImageType,
    factor: np.ndarray
)

Adjust brightness by multiplying pixels by factor. Torchvision-compatible; uint8 and float32. factor scalar or per-channel array. This function adjusts the brightness of an image by multiplying each pixel value by a factor. The brightness is adjusted by multiplying the image by the factor. Args: img (ImageType): Input image as a numpy array. factor (np.ndarray): The factor to adjust the brightness by. Returns: ImageType: The adjusted image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
factor	np.ndarray	-	-

adjust_contrast_torchvisionfunction

adjust_contrast_torchvision(
    img: ImageType,
    factor: float
)

Adjust contrast by multiplying by factor (relative to grayscale mean). Torchvision-compatible; uint8 and float32. factor 0 yields flat gray. This function adjusts the contrast of an image by multiplying each pixel value by a factor. The contrast is adjusted by multiplying the image by the factor. Args: img (ImageType): Input image as a numpy array. factor (float): The factor to adjust the contrast by. Returns: ImageType: The adjusted image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
factor	float	-	-

adjust_saturation_torchvisionfunction

adjust_saturation_torchvision(
    img: ImageType,
    factor: float,
    gamma: float = 0
)

Adjust saturation by blending with grayscale. Factor in [0, inf]; uses to_gray (weighted_average for RGB). Torchvision-compatible. Uses to_gray for conversion: weighted_average for RGB (matches OpenCV), average for arbitrary channels. Works on batches (4D) and volumes (5D). Args: img (ImageType): Input image as a numpy array. factor (float): The factor to adjust the saturation by. gamma (float): Unused, kept for API compatibility. Returns: ImageType: The adjusted image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
factor	float	-	-
gamma	float	0	-

adjust_hue_torchvisionfunction

adjust_hue_torchvision(
    img: ImageType,
    factor: float
)

Adjust hue by shifting in HSV. factor in [-0.5, 0.5]. Torchvision-compatible; RGB only. LUT for uint8; in-place mod for float32. This function adjusts the hue of an image by adding a factor to the hue value. Args: img (ImageType): Input image. factor (float): The factor to adjust the hue by. Returns: ImageType: The adjusted image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
factor	float	-	-

apply_brightness_contrast_torchvisionfunction

apply_brightness_contrast_torchvision(
    img: ImageType,
    brightness_factor: float,
    contrast_factor: float,
    brightness_first: bool
)

Fused brightness and contrast: single LUT (uint8) or two passes (float32). clip(a*x+b); torchvision-compatible; order via brightness_first. Both operations are `clip(a*x + b)`. The image grayscale mean is computed once and propagated analytically through the pipeline: if brightness comes first, `mean_at_contrast = mean * brightness_factor` (clipped to valid range). This avoids re-reading the image after brightness is applied. For uint8 images the composition is pre-computed over all 256 input values into a single 256-entry LUT applied in one `cv2.LUT` call. For float32, two sequential clipped passes are used. Args: img (ImageType): Input image (uint8 or float32). brightness_factor (float): Brightness multiplicative factor. contrast_factor (float): Contrast multiplicative factor. brightness_first (bool): Whether brightness is applied before contrast. Returns: ImageType: Adjusted image with the same dtype as input.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
brightness_factor	float	-	-
contrast_factor	float	-	-
brightness_first	bool	-	-

superpixelsfunction

superpixels(
    image: np.ndarray,
    n_segments: int,
    replace_samples: Sequence[bool],
    max_size: int | None,
    interpolation: int
)

Apply superpixels using SLIC: replace pixels with segment mean. n_segments, replace_samples, max_size, interpolation. uint8 I/O. This function applies superpixels to an image using the SLIC algorithm. The superpixels are applied by replacing the pixels in the image with the mean intensity of the superpixel. Args: image (np.ndarray): Input image as a numpy array. n_segments (int): The number of segments to use for the superpixels. replace_samples (Sequence[bool]): The samples to replace. max_size (int | None): The maximum size of the superpixels. interpolation (int): The interpolation method to use. Returns: np.ndarray: The superpixels applied to the image.

Parameters

Name	Type	Default	Description
image	np.ndarray	-	-
n_segments	int	-	-
replace_samples	Sequence[bool]	-	-
max_size	One of: int None	-	-
interpolation	int	-	-

unsharp_mask_imagesfunction

unsharp_mask_images(
    images: np.ndarray,
    ksize: int,
    sigma: float,
    alpha: float,
    threshold: int
)

Apply unsharp mask to batch (N, H, W, C). Sharpen via blur-subtract-add. ksize, sigma, alpha, threshold. Pre-allocated output for batch path. Processes a batch of images (N, H, W, C) by applying the unsharp mask to each image and writing directly into a pre-allocated output array. Args: images (np.ndarray): Batch of images with shape (N, H, W, C) or (N, H, W). ksize (int): The kernel size for Gaussian blur. sigma (float): The sigma value for Gaussian blur. alpha (float): The alpha value for the unsharp mask. threshold (int): The threshold value for the unsharp mask. Returns: np.ndarray: Batch of unsharp masked images with same shape and dtype as input. Note: we intentionally avoid @float32_io/@clipped decorators here. Those decorators convert the entire batch at once, which is ~2x slower for uint8 than per-image conversion due to poor cache locality on large 4D arrays.

Parameters

Name	Type	Default	Description
images	np.ndarray	-	-
ksize	int	-	-
sigma	float	-	-
alpha	float	-	-
threshold	int	-	-

unsharp_maskfunction

unsharp_mask(
    image: np.ndarray,
    ksize: int,
    sigma: float,
    alpha: float,
    threshold: int
)

Apply unsharp mask to a single image. Sharpen via blur-subtract-add; backward-compatible wrapper around unsharp_mask_images. Backward-compatible wrapper around unsharp_mask_images for a single image. Args: image (np.ndarray): Single image, shape (H, W, C) or (H, W). ksize (int): The kernel size for Gaussian blur. sigma (float): The sigma value for Gaussian blur. alpha (float): The alpha value for the unsharp mask. threshold (int): The threshold value for the unsharp mask. Returns: np.ndarray: Unsharp masked image with same shape and dtype as input.

Parameters

Name	Type	Default	Description
image	np.ndarray	-	-
ksize	int	-	-
sigma	float	-	-
alpha	float	-	-
threshold	int	-	-

pixel_dropoutfunction

pixel_dropout(
    image: np.ndarray,
    drop_mask: np.ndarray,
    drop_values: np.ndarray
)

Replace pixels where drop_mask is True with drop_values. Use get_drop_mask, prepare_drop_values. For coarse dropout or inpainting-style augmentation. Args: image (np.ndarray): Input image drop_mask (np.ndarray): Boolean mask indicating which pixels to drop drop_values (np.ndarray): Values to replace dropped pixels with Returns: np.ndarray: Image with dropped pixels

Parameters

Name	Type	Default	Description
image	np.ndarray	-	-
drop_mask	np.ndarray	-	-
drop_values	np.ndarray	-	-

spatter_rainfunction

spatter_rain(
    img: ImageType,
    rain: np.ndarray
)

Add rain layer using precomputed pattern from get_rain_params. Simulates wet surfaces. Used by Spatter. float32 I/O, clipped. This function applies spatter rain to an image by adding the rain to the image. Args: img (ImageType): Input image as a numpy array. rain (np.ndarray): Rain image as a numpy array. Returns: ImageType: The spatter rain applied to the image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
rain	np.ndarray	-	-

spatter_mudfunction

spatter_mud(
    img: ImageType,
    non_mud: np.ndarray,
    mud: np.ndarray
)

Spatter mud: blend non_mud and mud layers. non_mud, mud from get_mud_params. Simulates dirt on lens/surface. float32 I/O, clipped. This function applies spatter mud to an image by adding the mud to the image. Args: img (ImageType): Input image as a numpy array. non_mud (np.ndarray): Non-mud image as a numpy array. mud (np.ndarray): Mud image as a numpy array. Returns: ImageType: The spatter mud applied to the image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
non_mud	np.ndarray	-	-
mud	np.ndarray	-	-

chromatic_aberrationfunction

chromatic_aberration(
    img: ImageType,
    primary_distortion_red: float,
    secondary_distortion_red: float,
    primary_distortion_blue: float,
    secondary_distortion_blue: float,
    interpolation: int
)

Chromatic aberration: shift R/B channels. primary/secondary_distortion_red/blue, interpolation. Simulates lens R/B shift. uint8 I/O, clipped. This function applies chromatic aberration to an image by distorting the red and blue channels. Args: img (ImageType): Input image as a numpy array. primary_distortion_red (float): The primary distortion of the red channel. secondary_distortion_red (float): The secondary distortion of the red channel. primary_distortion_blue (float): The primary distortion of the blue channel. secondary_distortion_blue (float): The secondary distortion of the blue channel. interpolation (int): The interpolation method to use. Returns: ImageType: The chromatic aberration applied to the image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
primary_distortion_red	float	-	-
secondary_distortion_red	float	-	-
primary_distortion_blue	float	-	-
secondary_distortion_blue	float	-	-
interpolation	int	-	-

planckian_jitterfunction

planckian_jitter(
    img: ImageType,
    temperature: int,
    mode: Literal['blackbody', 'cied']
)

Apply Planckian jitter (color temp shift) to an image. Params: temperature, mode (blackbody/cied). Used for color augmentation; RGB only. This function applies Planckian jitter to an image by linearly interpolating between the two closest temperatures in the PLANCKIAN_COEFFS dictionary. Args: img (ImageType): Input image as a numpy array. temperature (int): The temperature to apply. mode (Literal['blackbody', 'cied']): The mode to use. Returns: ImageType: The Planckian jitter applied to the image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
temperature	int	-	-
mode	One of: 'blackbody' 'cied'	-	-

add_noisefunction

add_noise(
    img: ImageType,
    noise: np.ndarray
)

Add pre-generated noise to an image. Noise shape can match image or be broadcast; supports per-channel (vector) or full array. This function adds noise to an image by adding the noise to the image. Args: img (ImageType): Input image as a numpy array. noise (np.ndarray): Noise as a numpy array. Returns: ImageType: The noise added to the image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
noise	np.ndarray	-	-

slicfunction

slic(
    image: np.ndarray,
    n_segments: int,
    compactness: float = 10.0,
    max_iterations: int = 10
)

SLIC superpixel segmentation. n_segments, compactness, max_iterations. Returns label mask for oversegmentation; used by superpixels() or standalone. Args: image (np.ndarray): Input image (3D numpy array with shape (H, W, C)). n_segments (int): Approximate number of superpixels to generate. compactness (float): Balance between color proximity and space proximity. max_iterations (int): Maximum number of iterations for k-means. Returns: np.ndarray: Segmentation mask where each superpixel has a unique label.

Parameters

Name	Type	Default	Description
image	np.ndarray	-	-
n_segments	int	-	-
compactness	float	10.0	-
max_iterations	int	10	-

shot_noisefunction

shot_noise(
    img: ImageType,
    scale: float,
    random_generator: np.random.Generator
)

Add shot (Poisson) noise in linear light space; scale controls strength. Simulates sensor noise. Use for camera noise augmentation. float32 I/O, clipped. Args: img (ImageType): Input image scale (float): Scale factor for the noise random_generator (np.random.Generator): Random number generator Returns: ImageType: Image with shot noise

Parameters

Name	Type	Default	Description
img	ImageType	-	-
scale	float	-	-
random_generator	np.random.Generator	-	-

get_safe_brightness_contrast_paramsfunction

get_safe_brightness_contrast_params(
    alpha: float,
    beta: float,
    max_value: float
)

Get (alpha, beta) brightness/contrast params clipped to valid LUT range. alpha, beta, max_value. Returns (alpha, beta). For LUT-based brightness/contrast. Args: alpha (float): Contrast factor beta (float): Brightness factor max_value (float): Maximum pixel value Returns: tuple[float, float]: Safe alpha and beta values

Parameters

Name	Type	Default	Description
alpha	float	-	-
beta	float	-	-
max_value	float	-	-

generate_constant_noise_with_py_randomfunction

generate_constant_noise_with_py_random(
    noise_type: Literal['uniform', 'gaussian', 'laplace', 'beta'],
    shape: tuple[int, ...],
    params: dict[str, Any] | None,
    max_value: float,
    py_random: random.Random
)

Generate constant (per-channel) noise using Python's random. Faster for small arrays; used when one value per channel suffices. This function generates constant noise using Python's random generator, which is more efficient for generating a small number of values (one per channel). Args: noise_type (Literal['uniform', 'gaussian', 'laplace', 'beta']): The type of noise to generate. shape (tuple[int, ...]): The shape of the noise to generate. params (dict[str, Any] | None): The parameters of the noise to generate. max_value (float): The maximum value of the noise to generate. py_random (random.Random): Python's random generator to use. Returns: np.ndarray: The noise generated.

Parameters

Name	Type	Default	Description
noise_type	One of: 'uniform' 'gaussian' 'laplace' 'beta'	-	-
shape	tuple[int, ...]	-	-
params	One of: dict[str, Any] None	-	-
max_value	float	-	-
py_random	random.Random	-	-

generate_spatial_noisefunction

generate_spatial_noise(
    noise_type: Literal['uniform', 'gaussian', 'laplace', 'beta'],
    spatial_mode: Literal['per_pixel', 'shared'],
    shape: tuple[int, ...],
    params: dict[str, Any] | None,
    max_value: float,
    approximation: float,
    random_generator: np.random.Generator
)

Generate spatial noise (per_pixel or shared) with optional approximation. approximation < 1 reduces resolution for speed. This function generates spatial noise with optional approximation for speed. Args: noise_type (Literal['uniform', 'gaussian', 'laplace', 'beta']): The type of noise to generate. spatial_mode (Literal['per_pixel', 'shared']): The spatial mode to use. shape (tuple[int, ...]): The shape of the noise to generate. params (dict[str, Any] | None): The parameters of the noise to generate. max_value (float): The maximum value of the noise to generate. approximation (float): The approximation to use for the noise to generate. random_generator (np.random.Generator): The random number generator to use. Returns: np.ndarray: The noise generated.

Parameters

Name	Type	Default	Description
noise_type	One of: 'uniform' 'gaussian' 'laplace' 'beta'	-	-
spatial_mode	One of: 'per_pixel' 'shared'	-	-
shape	tuple[int, ...]	-	-
params	One of: dict[str, Any] None	-	-
max_value	float	-	-
approximation	float	-	-
random_generator	np.random.Generator	-	-

generate_per_pixel_noisefunction

generate_per_pixel_noise(
    noise_type: Literal['uniform', 'gaussian', 'laplace', 'beta'],
    shape: tuple[int, ...],
    params: dict[str, Any],
    max_value: float,
    random_generator: np.random.Generator
)

Generate per-pixel noise from chosen distribution (uniform, gaussian, laplace, beta). Shape and spatial_mode from params. matches image. This function generates per-pixel noise by sampling from the noise distribution. Args: noise_type (Literal['uniform', 'gaussian', 'laplace', 'beta']): The type of noise to generate. shape (tuple[int, ...]): The shape of the noise to generate. params (dict[str, Any]): The parameters of the noise to generate. max_value (float): The maximum value of the noise to generate. random_generator (np.random.Generator): The random number generator to use. Returns: np.ndarray: The per-pixel noise generated.

Parameters

Name	Type	Default	Description
noise_type	One of: 'uniform' 'gaussian' 'laplace' 'beta'	-	-
shape	tuple[int, ...]	-	-
params	dict[str, Any]	-	-
max_value	float	-	-
random_generator	np.random.Generator	-	-

sample_noisefunction

sample_noise(
    noise_type: Literal['uniform', 'gaussian', 'laplace', 'beta'],
    size: tuple[int, ...],
    params: dict[str, Any],
    max_value: float,
    random_generator: np.random.Generator
)

Sample from noise distribution (uniform, gaussian, laplace, beta). noise_type and params. Dispatches to sample_*; returns array of given size. This function samples from a specific noise distribution. Args: noise_type (Literal['uniform', 'gaussian', 'laplace', 'beta']): The type of noise to generate. size (tuple[int, ...]): The size of the noise to generate. params (dict[str, Any]): The parameters of the noise to generate. max_value (float): The maximum value of the noise to generate. random_generator (np.random.Generator): The random number generator to use. Returns: np.ndarray: The noise sampled.

Parameters

Name	Type	Default	Description
noise_type	One of: 'uniform' 'gaussian' 'laplace' 'beta'	-	-
size	tuple[int, ...]	-	-
params	dict[str, Any]	-	-
max_value	float	-	-
random_generator	np.random.Generator	-	-

sample_uniformfunction

sample_uniform(
    size: tuple[int, ...],
    params: dict[str, Any],
    random_generator: np.random.Generator
)

Sample from uniform distribution for spatial noise. params['ranges'][0] per channel. Returns array of given size. Used by noise augmentation. Args: size (tuple[int, ...]): Size of the output array params (dict[str, Any]): Distribution parameters random_generator (np.random.Generator): Random number generator Returns: np.ndarray: Sampled values

Parameters

Name	Type	Default	Description
size	tuple[int, ...]	-	-
params	dict[str, Any]	-	-
random_generator	np.random.Generator	-	-

sample_gaussianfunction

sample_gaussian(
    size: tuple[int, ...],
    params: dict[str, Any],
    random_generator: np.random.Generator
)

Sample from Gaussian distribution. Mean and std from params or uniform range. Uses cv2.randn; returns float32 array of given size. This function samples from a Gaussian distribution. Args: size (tuple[int, ...]): The size of the noise to generate. params (dict[str, Any]): The parameters of the noise to generate. random_generator (np.random.Generator): The random number generator to use. Returns: np.ndarray: The Gaussian noise sampled.

Parameters

Name	Type	Default	Description
size	tuple[int, ...]	-	-
params	dict[str, Any]	-	-
random_generator	np.random.Generator	-	-

sample_laplacefunction

sample_laplace(
    size: tuple[int, ...],
    params: dict[str, Any],
    random_generator: np.random.Generator
)

Sample from Laplace distribution. Location and scale from params. Returns array of given size. Heavier tails than Gaussian. This function samples from a Laplace distribution. Args: size (tuple[int, ...]): The size of the noise to generate. params (dict[str, Any]): The parameters of the noise to generate. random_generator (np.random.Generator): The random number generator to use. Returns: np.ndarray: The Laplace noise sampled.

Parameters

Name	Type	Default	Description
size	tuple[int, ...]	-	-
params	dict[str, Any]	-	-
random_generator	np.random.Generator	-	-

sample_betafunction

sample_beta(
    size: tuple[int, ...],
    params: dict[str, Any],
    random_generator: np.random.Generator
)

Sample from Beta distribution. Alpha, beta, scale from params. Returns values in [-scale, scale]. For structured noise augmentation. This function samples from a Beta distribution. Args: size (tuple[int, ...]): The size of the noise to generate. params (dict[str, Any]): The parameters of the noise to generate. random_generator (np.random.Generator): The random number generator to use. Returns: np.ndarray: The Beta noise sampled.

Parameters

Name	Type	Default	Description
size	tuple[int, ...]	-	-
params	dict[str, Any]	-	-
random_generator	np.random.Generator	-	-

generate_shared_noisefunction

generate_shared_noise(
    noise_type: Literal['uniform', 'gaussian', 'laplace', 'beta'],
    shape: tuple[int, ...],
    params: dict[str, Any],
    max_value: float,
    random_generator: np.random.Generator
)

Generate shared noise (one value per pixel, broadcast across channels). Used by MultiplicativeNoise and generate_spatial_noise when spatial_mode is shared. Args: noise_type (Literal['uniform', 'gaussian', 'laplace', 'beta']): Type of noise to generate shape (tuple[int, ...]): Shape of the output array params (dict[str, Any]): Distribution parameters max_value (float): Maximum value for the noise random_generator (np.random.Generator): Random number generator Returns: np.ndarray: Generated noise

Parameters

Name	Type	Default	Description
noise_type	One of: 'uniform' 'gaussian' 'laplace' 'beta'	-	-
shape	tuple[int, ...]	-	-
params	dict[str, Any]	-	-
max_value	float	-	-
random_generator	np.random.Generator	-	-

sharpen_gaussianfunction

sharpen_gaussian(
    img: ImageType,
    alpha: float,
    kernel_size: int,
    sigma: float
)

Sharpen via unsharp mask: subtract Gaussian blur, add back with alpha. kernel_size, sigma control blur. Use for crisp edges. Clipped. This function sharpens an image using a Gaussian blur. Args: img (ImageType): The image to sharpen. alpha (float): The alpha value to use for the sharpening. kernel_size (int): The kernel size to use for the Gaussian blur. sigma (float): The sigma value to use for the Gaussian blur. Returns: ImageType: The sharpened image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
alpha	float	-	-
kernel_size	int	-	-
sigma	float	-	-

apply_salt_and_pepperfunction

apply_salt_and_pepper(
    img: ImageType,
    salt_mask: np.ndarray,
    pepper_mask: np.ndarray
)

Apply salt and pepper noise using pre-computed masks. Replaces pixels with min or max value; amount controlled by masks. This function applies salt and pepper noise to an image using pre-computed masks. Salt pixels are set to maximum value, pepper pixels are set to 0. Args: img (ImageType): Input image of any dtype and dimensions: - 2D: (H, W) - grayscale - 3D: (H, W, C) - RGB/multi-channel - 4D: (D, H, W, C) - volume with depth salt_mask (np.ndarray): Boolean mask indicating salt pixels (H, W) pepper_mask (np.ndarray): Boolean mask indicating pepper pixels (H, W) Returns: ImageType: The image with salt and pepper noise applied.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
salt_mask	np.ndarray	-	-
pepper_mask	np.ndarray	-	-

generate_plasma_patternfunction

generate_plasma_pattern(
    target_shape: tuple[int, int],
    roughness: float,
    random_generator: np.random.Generator
)

Generate a plasma pattern using diamond-square algorithm. Returns float32 (H, W) or (H, W, 1) for plasma-based brightness/contrast and shadow. This function generates a plasma pattern using the diamond-square algorithm. Args: target_shape (tuple[int, int]): The shape of the plasma pattern to generate. roughness (float): The roughness of the plasma pattern. random_generator (np.random.Generator): The random number generator to use. Returns: np.ndarray: The plasma pattern generated.

Parameters

Name	Type	Default	Description
target_shape	tuple[int, int]	-	-
roughness	float	-	-
random_generator	np.random.Generator	-	-

apply_plasma_brightness_contrastfunction

apply_plasma_brightness_contrast(
    img: ImageType,
    brightness_factor: float,
    contrast_factor: float,
    plasma_pattern: np.ndarray
)

Modulate brightness and contrast with a plasma pattern. brightness_factor, contrast_factor scale effect. Use for spatially varying lighting. This function applies plasma-based brightness and contrast adjustments to an image. Args: img (ImageType): The image to apply the brightness and contrast adjustments to. brightness_factor (float): The brightness factor to apply. contrast_factor (float): The contrast factor to apply. plasma_pattern (np.ndarray): The plasma pattern to use for the brightness and contrast adjustments. Returns: ImageType: The image with the brightness and contrast adjustments applied.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
brightness_factor	float	-	-
contrast_factor	float	-	-
plasma_pattern	np.ndarray	-	-

apply_plasma_shadowfunction

apply_plasma_shadow(
    img: ImageType,
    intensity: float,
    plasma_pattern: np.ndarray
)

Darken image in regions defined by a plasma pattern. intensity controls strength. Use for soft shadows or vignette-like effects. Same dtype and channels. Args: img (ImageType): Input image intensity (float): Shadow intensity plasma_pattern (np.ndarray): Plasma pattern to use Returns: ImageType: Image with plasma shadow

Parameters

Name	Type	Default	Description
img	ImageType	-	-
intensity	float	-	-
plasma_pattern	np.ndarray	-	-

create_directional_gradientfunction

create_directional_gradient(
    height: int,
    width: int,
    angle: float
)

Create a directional gradient in [0, 1] range. Angle and optional smoothing; used for illumination and vignette helpers. Returns (H, W) float32. This function creates a directional gradient in the [0, 1] range. Args: height (int): The height of the image. width (int): The width of the image. angle (float): The angle of the gradient. Returns: np.ndarray: The directional gradient.

Parameters

Name	Type	Default	Description
height	int	-	-
width	int	-	-
angle	float	-	-

create_illumination_gradientfunction

create_illumination_gradient(
    height: int,
    width: int,
    mode: str,
    params: dict[str, Any]
)

Create illumination gradient map (H, W) or (H, W, 1). mode: linear, corner, gaussian. Float32; apply via multiply_by_array. Returns a float32 gradient that can be applied via multiply_by_array. The returned gradient does NOT have a channel dimension.

Parameters

Name	Type	Default	Description
height	int	-	-
width	int	-	-
mode	str	-	-
params	dict[str, Any]	-	-

apply_linear_illuminationfunction

apply_linear_illumination(
    img: ImageType,
    intensity: float,
    angle: float
)

Apply linear illumination gradient to the image. Multiplies by gradient; intensity and angle control direction and strength. float32 I/O. Args: img (ImageType): Input image intensity (float): Illumination intensity angle (float): Illumination angle in radians Returns: ImageType: Image with linear illumination

Parameters

Name	Type	Default	Description
img	ImageType	-	-
intensity	float	-	-
angle	float	-	-

apply_corner_illuminationfunction

apply_corner_illumination(
    img: ImageType,
    intensity: float,
    corner: Literal[0, 1, 2, 3]
)

Apply corner illumination (darkened corners) to the image. Gradient from center; intensity and corner (0-3) control strength and which corner. Clipped. Args: img (ImageType): Input image intensity (float): Illumination intensity corner (Literal[0, 1, 2, 3]): The corner to apply the illumination to. Returns: ImageType: Image with corner illumination applied.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
intensity	float	-	-
corner	One of: 0 1 2 3	-	-

apply_gaussian_illuminationfunction

apply_gaussian_illumination(
    img: ImageType,
    intensity: float,
    center: tuple[float, float],
    sigma: float
)

Add a Gaussian-shaped bright or dark spot; intensity, center, and sigma define the falloff. Use for spotlight or vignette effects. Clipped. Same channel count. Args: img (ImageType): Input image intensity (float): Illumination intensity center (tuple[float, float]): The center of the illumination. sigma (float): The sigma of the illumination.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
intensity	float	-	-
center	tuple[float, float]	-	-
sigma	float	-	-

auto_contrastfunction

auto_contrast(
    img: ImageType,
    cutoff: float,
    ignore: int | None,
    method: Literal['cdf', 'pil']
)

Apply automatic contrast enhancement. Stretches histogram to full range. cutoff, ignore, method (cdf or pil) limit clip. uint8 I/O. Args: img (ImageType): Input image cutoff (float): Cutoff percentage for histogram ignore (int | None): Value to ignore in histogram method (Literal['cdf', 'pil']): Method to use for contrast enhancement Returns: ImageType: Image with enhanced contrast

Parameters

Name	Type	Default	Description
img	ImageType	-	-
cutoff	float	-	-
ignore	One of: int None	-	-
method	One of: 'cdf' 'pil'	-	-

create_contrast_lutfunction

create_contrast_lut(
    hist: np.ndarray,
    min_intensity: int,
    max_intensity: int,
    max_value: int,
    method: Literal['cdf', 'pil']
)

Create lookup table for contrast adjustment. LUT maps [0, max_val] using clip low/high for contrast transforms and auto_contrast. This function creates a lookup table for contrast adjustment. Args: hist (np.ndarray): Histogram of the image. min_intensity (int): Minimum intensity of the histogram. max_intensity (int): Maximum intensity of the histogram. max_value (int): Maximum value of the lookup table. method (Literal['cdf', 'pil']): Method to use for contrast enhancement. Returns: np.ndarray: Lookup table for contrast adjustment.

Parameters

Name	Type	Default	Description
hist	np.ndarray	-	-
min_intensity	int	-	-
max_intensity	int	-	-
max_value	int	-	-
method	One of: 'cdf' 'pil'	-	-

get_histogram_boundsfunction

get_histogram_bounds(
    hist: np.ndarray,
    cutoff: float
)

Get the low and high bounds of the histogram. Percentile-based clipping; returns (low, high) for auto_contrast and create_contrast_lut. This function gets the low and high bounds of the histogram. Args: hist (np.ndarray): Histogram of the image. cutoff (float): Cutoff percentage for histogram. Returns: tuple[int, int]: Low and high bounds of the histogram.

Parameters

Name	Type	Default	Description
hist	np.ndarray	-	-
cutoff	float	-	-

get_drop_maskfunction

get_drop_mask(
    shape: tuple[int, ...],
    per_channel: bool,
    dropout_prob: float,
    random_generator: np.random.Generator
)

Generate dropout mask (boolean or per-pixel drop prob). shape, per_channel, dropout_prob, random_generator. Returns bool or float mask. This function generates a dropout mask. Args: shape (tuple[int, ...]): Shape of the output mask per_channel (bool): Whether to apply dropout per channel dropout_prob (float): Dropout probability random_generator (np.random.Generator): Random number generator Returns: np.ndarray: Dropout mask

Parameters

Name	Type	Default	Description
shape	tuple[int, ...]	-	-
per_channel	bool	-	-
dropout_prob	float	-	-
random_generator	np.random.Generator	-	-

generate_random_valuesfunction

generate_random_values(
    channels: int,
    dtype: np.dtype,
    random_generator: np.random.Generator
)

Generate random values for dropout fill. channels, dtype, random_generator. Returns 1D array per channel when value is None in prepare_drop_values. Args: channels (int): Number of channels dtype (np.dtype): Data type of the output array random_generator (np.random.Generator): Random number generator Returns: np.ndarray: Random values

Parameters

Name	Type	Default	Description
channels	int	-	-
dtype	np.dtype	-	-
random_generator	np.random.Generator	-	-

prepare_drop_valuesfunction

prepare_drop_values(
    array: np.ndarray,
    value: float | Sequence[float] | np.ndarray | None,
    random_generator: np.random.Generator
)

Prepare values to fill dropped pixels. array shape/dtype, value (or None for random), random_generator. Returns fill array matching shape. Args: array (np.ndarray): Input array to determine shape and dtype value (float | Sequence[float] | np.ndarray | None): User-specified drop values or None for random random_generator (np.random.Generator): Random number generator Returns: np.ndarray: Array of values matching input shape

Parameters

Name	Type	Default	Description
array	np.ndarray	-	-
value	One of: float Sequence[float] np.ndarray None	-	-
random_generator	np.random.Generator	-	-

get_mask_arrayfunction

get_mask_array(
    data: dict[str, Any]
)

Get mask array from input data if it exists. Returns data['mask'] or None; helper for transforms that accept optional mask.

Parameters

Name	Type	Default	Description
data	dict[str, Any]	-	-

get_rain_paramsfunction

get_rain_params(
    liquid_layer: np.ndarray,
    color: np.ndarray,
    intensity: float
)

Generate parameters for rain effect. liquid_layer, color, intensity. Returns dict with 'drops' for add_rain/spatter_rain. This function generates parameters for a rain effect. Args: liquid_layer (np.ndarray): Liquid layer of the image. color (np.ndarray): Color of the rain. intensity (float): Intensity of the rain. Returns: dict[str, Any]: Parameters for the rain effect.

Parameters

Name	Type	Default	Description
liquid_layer	np.ndarray	-	-
color	np.ndarray	-	-
intensity	float	-	-

get_mud_paramsfunction

get_mud_params(
    liquid_layer: np.ndarray,
    color: np.ndarray,
    cutout_threshold: float,
    sigma: float,
    intensity: float,
    random_generator: np.random.Generator
)

Generate parameters for mud effect. liquid_layer, color, cutout_threshold, sigma, intensity, random_generator. Returns dict for spatter_mud. This function generates parameters for a mud effect. Args: liquid_layer (np.ndarray): Liquid layer of the image. color (np.ndarray): Color of the mud. cutout_threshold (float): Cutout threshold for the mud. sigma (float): Sigma for the Gaussian blur. intensity (float): Intensity of the mud. random_generator (np.random.Generator): Random number generator. Returns: dict[str, Any]: Parameters for the mud effect.

Parameters

Name	Type	Default	Description
liquid_layer	np.ndarray	-	-
color	np.ndarray	-	-
cutout_threshold	float	-	-
sigma	float	-	-
intensity	float	-	-
random_generator	np.random.Generator	-	-

rgb_to_optical_densityfunction

rgb_to_optical_density(
    img: ImageType,
    eps: float = 1e-06
)

Convert RGB image to optical density (-log10). eps avoids log(0). Expects uint8 or float32 in [0,1]. Returns (N*H*W, 3) float64. For stain normalization. This function converts an RGB image to optical density. Args: img (ImageType): Input image. eps (float): Epsilon value. Returns: np.ndarray: Optical density image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
eps	float	1e-06	-

normalize_vectorsfunction

normalize_vectors(
    vectors: np.ndarray
)

Normalize vectors to unit length (L2). Axis and dtype preserved; 1D or 2D. For stain normalization (e.g. Macenko) stain vector normalization. This function normalizes vectors. Args: vectors (np.ndarray): Vectors to normalize. Returns: np.ndarray: Normalized vectors.

Parameters

Name	Type	Default	Description
vectors	np.ndarray	-	-

get_normalizerfunction

get_normalizer(
    method: Literal['vahadane', 'macenko']
)

Get stain normalizer based on method ('vahadane' or 'macenko'). Returns VahadaneNormalizer or MacenkoNormalizer instance for histology stain norm. This function gets a stain normalizer based on a method. Args: method (Literal['vahadane', 'macenko']): Method to use for stain normalization. Returns: StainNormalizer: Stain normalizer.

Parameters

Name	Type	Default	Description
method	One of: 'vahadane' 'macenko'	-	-

StainNormalizerclass

StainNormalizer()

Base class for stain normalizers. Subclass and implement fit/transform for histology stain normalization (e.g. Vahadane, Macenko).

SimpleNMFclass

SimpleNMF(
    n_iter: int = 100
)

Simple NMF for histology stain separation. Factorizes OD matrix into stain basis and concentrations. Iterative multiplicative updates, non-negativity. This class implements a simplified version of the Non-negative Matrix Factorization algorithm specifically designed for separating Hematoxylin and Eosin (H&E) stains in histopathology images. It is used as part of the Vahadane stain normalization method. The algorithm decomposes optical density values of H&E stained images into stain color appearances (the stain color vectors) and stain concentrations (the density of each stain at each pixel). The implementation uses an iterative multiplicative update approach that preserves non-negativity constraints, which are physically meaningful for stain separation as concentrations and absorption coefficients cannot be negative. This implementation is optimized for stability by: 1. Initializing with standard H&E reference colors from Ruifrok 2. Using normalized projection for initial concentrations 3. Applying careful normalization to avoid numerical issues Args: n_iter (int): Number of iterations for the NMF algorithm. Default: 100 References: - Vahadane, A., et al. (2016): Structure-preserving color normalization and sparse stain separation for histological images. IEEE Transactions on Medical Imaging, 35(8), 1962-1971. - Ruifrok, A. C., & Johnston, D. A. (2001): Quantification of histochemical staining by color deconvolution. Analytical and Quantitative Cytology and Histology, 23(4), 291-299.

Parameters

Name	Type	Default	Description
n_iter	int	100	-

order_stains_combinedfunction

order_stains_combined(
    stain_colors: np.ndarray
)

Order stains using a combination of methods (angular and spectral). Returns ordered stain matrix for consistent H/E ordering. This combines both angular information and spectral characteristics for more robust identification. Args: stain_colors (np.ndarray): Stain colors. Returns: tuple[int, int]: Hematoxylin and eosin indices.

Parameters

Name	Type	Default	Description
stain_colors	np.ndarray	-	-

VahadaneNormalizerclass

VahadaneNormalizer()

Vahadane stain normalizer for histopathology. NMF-based stain separation; fit on reference image, then transform. Used for H&E normalization. This class implements the "Structure-Preserving Color Normalization and Sparse Stain Separation for Histological Images" method proposed by Vahadane et al. The technique uses Non-negative Matrix Factorization (NMF) to separate Hematoxylin and Eosin (H&E) stains in histopathology images and then normalizes them to a target standard. The Vahadane method is particularly effective for histology image normalization because: 1. It maintains tissue structure during color normalization 2. It performs sparse stain separation, reducing color bleeding 3. It adaptively estimates stain vectors from each image 4. It preserves biologically relevant information This implementation uses SimpleNMF as its core matrix factorization algorithm to extract stain color vectors (appearance matrix) and concentration matrices from optical density-transformed images. It identifies the Hematoxylin and Eosin stains by their characteristic color profiles and spatial distribution. References: Vahadane, et al., 2016: Structure-preserving color normalization and sparse stain separation for histological images. IEEE transactions on medical imaging, 35(8), pp.1962-1971. Examples: >>> import numpy as np >>> import albumentations as A >>> from albumentations.augmentations.pixel import functional as F >>> import cv2 >>> >>> # Load source and target images (H&E stained histopathology) >>> source_img = cv2.imread('source_image.png') >>> source_img = cv2.cvtColor(source_img, cv2.COLOR_BGR2RGB) >>> target_img = cv2.imread('target_image.png') >>> target_img = cv2.cvtColor(target_img, cv2.COLOR_BGR2RGB) >>> >>> # Create and fit the normalizer to the target image >>> normalizer = F.VahadaneNormalizer() >>> normalizer.fit(target_img) >>> >>> # Normalize the source image to match the target's stain characteristics >>> normalized_img = normalizer.transform(source_img)

MacenkoNormalizerclass

MacenkoNormalizer(
    angular_percentile: float = 99
)

Macenko stain normalizer with optimized computations. SVD-based stain separation; fit on reference, then transform. Used for H&E normalization.

Parameters

Name	Type	Default	Description
angular_percentile	float	99	-

get_tissue_maskfunction

get_tissue_mask(
    img: ImageType,
    threshold: float = 0.85
)

Get tissue mask from image (exclude background). threshold for intensity-based masking of non-tissue. Returns 1D bool mask. Args: img (ImageType): Input image threshold (float): Threshold for tissue detection. Default: 0.85 Returns: np.ndarray: Binary mask where True indicates tissue regions

Parameters

Name	Type	Default	Description
img	ImageType	-	-
threshold	float	0.85	-

apply_he_stain_augmentationfunction

apply_he_stain_augmentation(
    img: ImageType,
    stain_matrix: np.ndarray,
    scale_factors: np.ndarray,
    shift_values: np.ndarray,
    augment_background: bool
)

Apply HE (hematoxylin-eosin) stain augmentation. Shifts stain concentrations; params control strength. Used for histology augmentation. Returns RGB image. This function applies HE stain augmentation to an image. Args: img (ImageType): Input image. stain_matrix (np.ndarray): Stain matrix. scale_factors (np.ndarray): Scale factors. shift_values (np.ndarray): Shift values. augment_background (bool): Whether to augment the background. Returns: ImageType: Augmented image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
stain_matrix	np.ndarray	-	-
scale_factors	np.ndarray	-	-
shift_values	np.ndarray	-	-
augment_background	bool	-	-

convolvefunction

convolve(
    img: ImageType,
    kernel: np.ndarray
)

Convolve image with 2D kernel via cv2.filter2D. Any channel count. Use for custom blur, sharpen, or edge kernels. Clipped. This function convolves an image with a kernel. Args: img (ImageType): Input image. kernel (np.ndarray): Kernel. Returns: ImageType: Convolved image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
kernel	np.ndarray	-	-

separable_convolvefunction

separable_convolve(
    img: ImageType,
    kernel: np.ndarray
)

Convolve with separable 1D kernel in two passes. Faster than full 2D for large kernels. Use for Gaussian-like blur or custom separable filters. Clipped. This function convolves an image with a separable kernel. Args: img (ImageType): Input image. kernel (np.ndarray): Kernel. Returns: ImageType: Convolved image.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
kernel	np.ndarray	-	-

apply_vignettefunction

apply_vignette(
    img: ImageType,
    intensity: float,
    center_x: float,
    center_y: float
)

Apply vignetting by darkening corners with radial gradient. intensity, center_x, center_y control strength and center. Supports (H,W) or (H,W,C). Args: img (ImageType): Input image of shape (H, W) or (H, W, C). intensity (float): Strength of darkening at corners, in [0, 1]. center_x (float): Horizontal center of the vignette as fraction of width, in [0, 1]. center_y (float): Vertical center of the vignette as fraction of height, in [0, 1]. Returns: ImageType: Image with vignetting applied.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
intensity	float	-	-
center_x	float	-	-
center_y	float	-	-

apply_film_grainfunction

apply_film_grain(
    img: ImageType,
    grain: np.ndarray,
    intensity: float
)

Apply film grain noise to an image (2D or 3D). Luminance-dependent and spatially correlated; params control grain size and intensity. Film grain is luminance-dependent and spatially correlated, unlike simple Gaussian noise. Args: img (ImageType): Input image, shape (H, W, C) or (H, W, 1). grain (np.ndarray): Pre-generated grain pattern, shape (H, W), float32. intensity (float): Grain strength multiplier. Returns: ImageType: Image with film grain applied.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
grain	np.ndarray	-	-
intensity	float	-	-

apply_halftonefunction

apply_halftone(
    img: ImageType,
    dot_size: int,
    blend: float
)

Convert image to halftone dot pattern. Simulates printed halftone; dot size varies by intensity. Params: scale, dot_shape. Args: img (ImageType): Input image (H, W, C), uint8 or float32. dot_size (int): Size of each halftone grid cell in pixels. blend (float): Blend factor between halftone and original. 0 = pure halftone, 1 = original. Returns: ImageType: Image with halftone effect applied.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
dot_size	int	-	-
blend	float	-	-

apply_lens_flarefunction

apply_lens_flare(
    img: ImageType,
    flare_center: tuple[int, int],
    ghosts: list[tuple[int, int, int, float]],
    starburst_angles: np.ndarray,
    starburst_intensity: float,
    bloom_radius: int
)

Apply realistic lens flare with ghosts and starburst. Params control position, intensity, and number of ghosts. RGB input. Args: img (ImageType): Input image (H, W, C), must be 3-channel. flare_center (tuple[int, int]): (x, y) position of the flare source. ghosts (list[tuple[int, int, int, float]]): List of (x, y, radius, alpha) for each ghost circle. starburst_angles (np.ndarray): Array of angles in radians for starburst rays. starburst_intensity (float): Brightness of starburst rays, 0-1. bloom_radius (int): Gaussian blur radius for bloom effect. Returns: ImageType: Image with lens flare applied.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
flare_center	tuple[int, int]	-	-
ghosts	list[tuple[int, int, int, float]]	-	-
starburst_angles	np.ndarray	-	-
starburst_intensity	float	-	-
bloom_radius	int	-	-

generate_water_displacement_mapsfunction

generate_water_displacement_maps(
    image_shape: tuple[int, int],
    amplitude: float,
    wavelength: float,
    num_waves: int,
    random_generator: np.random.Generator
)

Generate displacement maps simulating water refraction. image_shape, amplitude, wavelength, num_waves, random_generator. Returns (map_x, map_y) for cv2.remap. Args: image_shape (tuple[int, int]): (height, width). amplitude (float): Maximum displacement in pixels. wavelength (float): Base wavelength of waves in pixels. num_waves (int): Number of overlaid sine waves. random_generator (np.random.Generator): NumPy random generator. Returns: tuple[np.ndarray, np.ndarray]: Tuple of (map_x, map_y) float32 arrays for cv2.remap.

Parameters

Name	Type	Default	Description
image_shape	tuple[int, int]	-	-
amplitude	float	-	-
wavelength	float	-	-
num_waves	int	-	-
random_generator	np.random.Generator	-	-

apply_atmospheric_fogfunction

apply_atmospheric_fog(
    img: ImageType,
    density: float,
    fog_color: tuple[float, ...],
    depth_map: np.ndarray
)

Apply depth-aware atmospheric fog using standard scattering. Formula: img * exp(-density*depth) + fog_color*(1 - exp(-density*depth)). Formula: result = img * exp(-density * depth) + fog_color * (1 - exp(-density * depth)) Args: img (ImageType): Input image (H, W, C). density (float): Fog density factor. fog_color (tuple[float, ...]): Color of the fog, values in [0, max_val]. depth_map (np.ndarray): (H, W) float32 array with values in [0, 1], where 1 is farthest. Returns: ImageType: Image with fog applied.

Parameters

Name	Type	Default	Description
img	ImageType	-	-
density	float	-	-
fog_color	tuple[float, ...]	-	-
depth_map	np.ndarray	-	-

shift_hsv
shift_hsv_images
solarize
posterize
equalize
evaluate_bez
move_tone_curve
linear_transformation_rgb
clahe
image_compression
add_snow_bleach
generate_snow_textures
add_snow_texture
add_rain
get_fog_particle_radiuses
add_fog
add_sun_flare_overlay
add_sun_flare_physics_based
add_shadow
add_gravel
invert
channel_shuffle
volume_channel_shuffle
volumes_channel_shuffle
gamma_transform
iso_noise
iso_noise_images
to_gray_weighted_average
to_gray_from_lab
to_gray_desaturation
to_gray_average
to_gray_max
to_gray_pca
to_gray
grayscale_to_multichannel
downscale
noop
fancy_pca
adjust_brightness_torchvision
adjust_contrast_torchvision
adjust_saturation_torchvision
adjust_hue_torchvision
apply_brightness_contrast_torchvision
superpixels
unsharp_mask_images
unsharp_mask
pixel_dropout
spatter_rain
spatter_mud
chromatic_aberration
planckian_jitter
add_noise
slic
shot_noise
get_safe_brightness_contrast_params
generate_constant_noise_with_py_random
generate_spatial_noise
generate_per_pixel_noise
sample_noise
sample_uniform
sample_gaussian
sample_laplace
sample_beta
generate_shared_noise
sharpen_gaussian
apply_salt_and_pepper
generate_plasma_pattern
apply_plasma_brightness_contrast
apply_plasma_shadow
create_directional_gradient
create_illumination_gradient
apply_linear_illumination
apply_corner_illumination
apply_gaussian_illumination
auto_contrast
create_contrast_lut
get_histogram_bounds
get_drop_mask
generate_random_values
prepare_drop_values
get_mask_array
get_rain_params
get_mud_params
rgb_to_optical_density
normalize_vectors
get_normalizer
StainNormalizer
SimpleNMF
order_stains_combined
VahadaneNormalizer
MacenkoNormalizer
get_tissue_mask
apply_he_stain_augmentation
convolve
separable_convolve
apply_vignette
apply_film_grain
apply_halftone
apply_lens_flare
generate_water_displacement_maps
apply_atmospheric_fog