albumentations.augmentations.geometric.transforms

View Source on GitHub

Geometric transformation classes for image augmentation. This module provides a collection of transforms that modify the geometric properties of images and associated data (masks, bounding boxes, keypoints). Includes implementations for flipping, transposing, affine transformations, distortions, padding, and more complex transformations like grid shuffling and thin plate splines.

Members

classAffine
classBaseDistortion
classD4
classElasticTransform
classGridDistortion
classGridElasticDeform
classHorizontalFlip
classOpticalDistortion
classPad
classPadIfNeeded
classPerspective
classPiecewiseAffine
classRandomGridShuffle
classShiftScaleRotate
classSquareSymmetry
classThinPlateSpline
classTranspose
classVerticalFlip

Name	Type	Default	Description
scale	One of: tuple[float, float] float dict[str, float \| tuple[float, float]]	(1.0, 1.0)	Scaling factor to use, where ``1.0`` denotes "no change" and ``0.5`` is zoomed out to ``50`` percent of the original size. * If a single number, then that value will be used for all images. * If a tuple ``(a, b)``, then a value will be uniformly sampled per image from the interval ``[a, b]``. That the same range will be used for both x- and y-axis. To keep the aspect ratio, set ``keep_ratio=True``, then the same value will be used for both x- and y-axis. * If a dictionary, then it is expected to have the keys ``x`` and/or ``y``. Each of these keys can have the same values as described above. Using a dictionary allows to set different values for the two axis and sampling will then happen independently per axis, resulting in samples that differ between the axes. Note that when the ``keep_ratio=True``, the x- and y-axis ranges should be the same.
translate_percent	One of: tuple[float, float] float dict[str, float \| tuple[float, float]] None	None	Translation as a fraction of the image height/width (x-translation, y-translation), where ``0`` denotes "no change" and ``0.5`` denotes "half of the axis size". * If ``None`` then equivalent to ``0.0`` unless `translate_px` has a value other than ``None``. * If a single number, then that value will be used for all images. * If a tuple ``(a, b)``, then a value will be uniformly sampled per image from the interval ``[a, b]``. That sampled fraction value will be used identically for both x- and y-axis. * If a dictionary, then it is expected to have the keys ``x`` and/or ``y``. Each of these keys can have the same values as described above. Using a dictionary allows to set different values for the two axis and sampling will then happen independently per axis, resulting in samples that differ between the axes.
translate_px	One of: tuple[int, int] int dict[str, int \| tuple[int, int]] None	None	Translation in pixels. * If ``None`` then equivalent to ``0`` unless `translate_percent` has a value other than ``None``. * If a single int, then that value will be used for all images. * If a tuple ``(a, b)``, then a value will be uniformly sampled per image from the discrete interval ``[a..b]``. That number will be used identically for both x- and y-axis. * If a dictionary, then it is expected to have the keys ``x`` and/or ``y``. Each of these keys can have the same values as described above. Using a dictionary allows to set different values for the two axis and sampling will then happen independently per axis, resulting in samples that differ between the axes.
rotate	One of: tuple[float, float] float	0.0	Rotation in degrees (NOT radians), i.e. expected value range is around ``[-360, 360]``. Rotation happens around the center of the image, not the top left corner as in some other frameworks. * If a number, then that value will be used for all images. * If a tuple ``(a, b)``, then a value will be uniformly sampled per image from the interval ``[a, b]`` and used as the rotation value.
shear	One of: tuple[float, float] float dict[str, float \| tuple[float, float]]	(0.0, 0.0)	Shear in degrees (NOT radians), i.e. expected value range is around ``[-360, 360]``, with reasonable values being in the range of ``[-45, 45]``. * If a number, then that value will be used for all images as the shear on the x-axis (no shear on the y-axis will be done). * If a tuple ``(a, b)``, then two value will be uniformly sampled per image from the interval ``[a, b]`` and be used as the x- and y-shear value. * If a dictionary, then it is expected to have the keys ``x`` and/or ``y``. Each of these keys can have the same values as described above. Using a dictionary allows to set different values for the two axis and sampling will then happen independently per axis, resulting in samples that differ between the axes.
interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	1	OpenCV interpolation flag.
mask_interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	0	OpenCV interpolation flag.
fit_output	bool	False	If True, the image plane size and position will be adjusted to tightly capture the whole image after affine transformation (`translate_percent` and `translate_px` are ignored). Otherwise (``False``), parts of the transformed image may end up outside the image plane. Fitting the output shape can be useful to avoid corners of the image being outside the image plane after applying rotations. Default: False
keep_ratio	bool	False	When True, the original aspect ratio will be kept when the random scale is applied. Default: False.
rotate_method	One of: 'largest_box' 'ellipse'	largest_box	rotation method used for the bounding boxes. Should be one of "largest_box" or "ellipse"[1]. Default: "largest_box"
balanced_scale	bool	False	When True, scaling factors are chosen to be either entirely below or above 1, ensuring balanced scaling. Default: False. This is important because without it, scaling tends to lean towards upscaling. For example, if we want the image to zoom in and out by 2x, we may pick an interval [0.5, 2]. Since the interval [0.5, 1] is three times smaller than [1, 2], values above 1 are picked three times more often if sampled directly from [0.5, 2]. With `balanced_scale`, the function ensures that half the time, the scaling factor is picked from below 1 (zooming out), and the other half from above 1 (zooming in). This makes the zooming in and out process more balanced.
border_mode	One of: cv2.BORDER_CONSTANT cv2.BORDER_REPLICATE cv2.BORDER_REFLECT cv2.BORDER_WRAP cv2.BORDER_REFLECT_101	0	OpenCV border flag.
fill	One of: tuple[float, ...] float	0	The constant value to use when filling in newly created pixels. (E.g. translating by 1px to the right will create a new 1px-wide column of pixels on the left of the image). The value is only used when `mode=constant`. The expected value range is ``[0, 255]`` for ``uint8`` images.
fill_mask	One of: tuple[float, ...] float	0	Same as fill but only for masks.
p	float	0.5	probability of applying the transform. Default: 0.5.

Name	Type	Default	Description
interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	-	Interpolation method to be used for image transformation. Should be one of the OpenCV interpolation types (e.g., cv2.INTER_LINEAR, cv2.INTER_CUBIC).
mask_interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	-	Flag that is used to specify the interpolation algorithm for mask. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4.
keypoint_remapping_method	One of: 'direct' 'mask'	-	Method to use for keypoint remapping. - "mask": Uses mask-based remapping. Faster, especially for many keypoints, but may be less accurate for large distortions. Recommended for large images or many keypoints. - "direct": Uses inverse mapping. More accurate for large distortions but slower. Default: "mask"
p	float	-	Probability of applying the transform.
border_mode	One of: cv2.BORDER_CONSTANT cv2.BORDER_REPLICATE cv2.BORDER_REFLECT cv2.BORDER_WRAP cv2.BORDER_REFLECT_101	0	-
fill	One of: tuple[float, ...] float	0	-
fill_mask	One of: tuple[float, ...] float	0	-

Name	Type	Default	Description
alpha	float	1	Scaling factor for the random displacement fields. Higher values result in more pronounced distortions. Default: 1.0
sigma	float	50	Standard deviation of the Gaussian filter used to smooth the displacement fields. Higher values result in smoother, more global distortions. Default: 50.0
interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	1	Interpolation method to be used for image transformation. Should be one of the OpenCV interpolation types. Default: cv2.INTER_LINEAR
approximate	bool	False	Whether to use an approximate version of the elastic transform. If True, uses a fixed kernel size for Gaussian smoothing, which can be faster but potentially less accurate for large sigma values. Default: False
same_dxdy	bool	False	Whether to use the same random displacement field for both x and y directions. Can speed up the transform at the cost of less diverse distortions. Default: False
mask_interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	0	Flag that is used to specify the interpolation algorithm for mask. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_NEAREST.
noise_distribution	One of: 'gaussian' 'uniform'	gaussian	Distribution used to generate the displacement fields. "gaussian" generates fields using normal distribution (more natural deformations). "uniform" generates fields using uniform distribution (more mechanical deformations). Default: "gaussian".
keypoint_remapping_method	One of: 'direct' 'mask'	mask	Method to use for keypoint remapping. - "mask": Uses mask-based remapping. Faster, especially for many keypoints, but may be less accurate for large distortions. Recommended for large images or many keypoints. - "direct": Uses inverse mapping. More accurate for large distortions but slower. Default: "mask"
border_mode	One of: cv2.BORDER_CONSTANT cv2.BORDER_REPLICATE cv2.BORDER_REFLECT cv2.BORDER_WRAP cv2.BORDER_REFLECT_101	0	-
fill	One of: tuple[float, ...] float	0	-
fill_mask	One of: tuple[float, ...] float	0	-
p	float	0.5	Probability of applying the transform. Default: 0.5

Name	Type	Default	Description
num_steps	int	5	Number of grid cells on each side of the image. Higher values create more granular distortions. Must be at least 1. Default: 5.
distort_limit	One of: tuple[float, float] float	(-0.3, 0.3)	Range of distortion. If a single float is provided, the range will be (-distort_limit, distort_limit). Higher values create stronger distortions. Should be in the range of -1 to 1. Default: (-0.3, 0.3).
interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	1	OpenCV interpolation method used for image transformation. Options include cv2.INTER_LINEAR, cv2.INTER_CUBIC, etc. Default: cv2.INTER_LINEAR.
normalized	bool	True	If True, ensures that the distortion does not move pixels outside the image boundaries. This can result in less extreme distortions but guarantees that no information is lost. Default: True.
mask_interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	0	Flag that is used to specify the interpolation algorithm for mask. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_NEAREST.
keypoint_remapping_method	One of: 'direct' 'mask'	mask	Method to use for keypoint remapping. - "mask": Uses mask-based remapping. Faster, especially for many keypoints, but may be less accurate for large distortions. Recommended for large images or many keypoints. - "direct": Uses inverse mapping. More accurate for large distortions but slower. Default: "mask"
p	float	0.5	Probability of applying the transform. Default: 0.5.
border_mode	One of: cv2.BORDER_CONSTANT cv2.BORDER_REPLICATE cv2.BORDER_REFLECT cv2.BORDER_WRAP cv2.BORDER_REFLECT_101	0	-
fill	One of: tuple[float, ...] float	0	-
fill_mask	One of: tuple[float, ...] float	0	-

Name	Type	Default	Description
num_grid_xy	tuple[int, int]	-	Number of grid cells along the width and height. Specified as (grid_width, grid_height). Each value must be greater than 1.
magnitude	int	-	Maximum pixel-wise displacement for distortion. Must be greater than 0.
interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	1	Interpolation method to be used for the image transformation. Default: cv2.INTER_LINEAR
mask_interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	0	Interpolation method to be used for mask transformation. Default: cv2.INTER_NEAREST
p	float	1.0	Probability of applying the transform. Default: 1.0.

Name	Type	Default	Description
distort_limit	One of: tuple[float, float] float	(-0.05, 0.05)	Range of distortion coefficient. For camera model: recommended range (-0.05, 0.05) For fisheye model: recommended range (-0.3, 0.3) Default: (-0.05, 0.05)
interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	1	Interpolation method used for image transformation. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_LINEAR.
mask_interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	0	Flag that is used to specify the interpolation algorithm for mask. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_NEAREST.
mode	One of: 'camera' 'fisheye'	camera	Distortion model to use: - 'camera': Original camera matrix model - 'fisheye': Fisheye lens model Default: 'camera'
keypoint_remapping_method	One of: 'direct' 'mask'	mask	Method to use for keypoint remapping. - "mask": Uses mask-based remapping. Faster, especially for many keypoints, but may be less accurate for large distortions. Recommended for large images or many keypoints. - "direct": Uses inverse mapping. More accurate for large distortions but slower. Default: "mask"
p	float	0.5	Probability of applying the transform. Default: 0.5.
border_mode	One of: cv2.BORDER_CONSTANT cv2.BORDER_REPLICATE cv2.BORDER_REFLECT cv2.BORDER_WRAP cv2.BORDER_REFLECT_101	0	-
fill	One of: tuple[float, ...] float	0	-
fill_mask	One of: tuple[float, ...] float	0	-

Name	Type	Default	Description
padding	One of: int tuple[int, int] tuple[int, int, int, int]	0	Padding values. Can be: * int - pad all sides by this value * tuple[int, int] - (pad_x, pad_y) to pad left/right by pad_x and top/bottom by pad_y * tuple[int, int, int, int] - (left, top, right, bottom) specific padding per side
fill	One of: tuple[float, ...] float	0	Padding value if border_mode is cv2.BORDER_CONSTANT
fill_mask	One of: tuple[float, ...] float	0	Padding value for mask if border_mode is cv2.BORDER_CONSTANT
border_mode	One of: cv2.BORDER_CONSTANT cv2.BORDER_REPLICATE cv2.BORDER_REFLECT cv2.BORDER_WRAP cv2.BORDER_REFLECT_101	0	OpenCV border mode
p	float	1.0	probability of applying the transform. Default: 1.0.

Name	Type	Default	Description
min_height	One of: int None	1024	Minimum desired height of the image. Ensures image height is at least this value. If not specified, pad_height_divisor must be provided.
min_width	One of: int None	1024	Minimum desired width of the image. Ensures image width is at least this value. If not specified, pad_width_divisor must be provided.
pad_height_divisor	One of: int None	None	If set, pads the image height to make it divisible by this value. If not specified, min_height must be provided.
pad_width_divisor	One of: int None	None	If set, pads the image width to make it divisible by this value. If not specified, min_width must be provided.
position	One of: 'center' 'top_left' 'top_right' 'bottom_left' 'bottom_right' 'random'	center	Position where the image is to be placed after padding. Default is 'center'.
border_mode	One of: cv2.BORDER_CONSTANT cv2.BORDER_REPLICATE cv2.BORDER_REFLECT cv2.BORDER_WRAP cv2.BORDER_REFLECT_101	0	Specifies the border mode to use if padding is required. The default is `cv2.BORDER_CONSTANT`.
fill	One of: tuple[float, ...] float	0	Value to fill the border pixels if the border mode is `cv2.BORDER_CONSTANT`. Default is None.
fill_mask	One of: tuple[float, ...] float	0	Similar to `fill` but used for padding masks. Default is None.
p	float	1.0	Probability of applying the transform. Default is 1.0.

Name	Type	Default	Description
scale	One of: tuple[float, float] float	(0.05, 0.1)	Standard deviation of the normal distributions. These are used to sample the random distances of the subimage's corners from the full image's corners. If scale is a single float value, the range will be (0, scale). Default: (0.05, 0.1).
keep_size	bool	True	Whether to resize image back to its original size after applying the perspective transform. If set to False, the resulting images may end up having different shapes. Default: True.
fit_output	bool	False	If True, the image plane size and position will be adjusted to still capture the whole image after perspective transformation. This is followed by image resizing if keep_size is set to True. If False, parts of the transformed image may be outside of the image plane. This setting should not be set to True when using large scale values as it could lead to very large images. Default: False.
interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	1	Interpolation method to be used for image transformation. Should be one of the OpenCV interpolation types. Default: cv2.INTER_LINEAR
mask_interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	0	Flag that is used to specify the interpolation algorithm for mask. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_NEAREST.
border_mode	One of: cv2.BORDER_CONSTANT cv2.BORDER_REPLICATE cv2.BORDER_REFLECT cv2.BORDER_WRAP cv2.BORDER_REFLECT_101	0	OpenCV border mode used for padding. Default: cv2.BORDER_CONSTANT.
fill	One of: tuple[float, ...] float	0	Padding value if border_mode is cv2.BORDER_CONSTANT. Default: 0.
fill_mask	One of: tuple[float, ...] float	0	Padding value for mask if border_mode is cv2.BORDER_CONSTANT. Default: 0.
p	float	0.5	Probability of applying the transform. Default: 0.5.

Name	Type	Default	Description
scale	One of: tuple[float, float] float	(0.03, 0.05)	Standard deviation of the normal distributions. These are used to sample the random distances of the subimage's corners from the full image's corners. If scale is a single float value, the range will be (0, scale). Recommended values are in the range (0.01, 0.05) for small distortions, and (0.05, 0.1) for larger distortions. Default: (0.03, 0.05).
nb_rows	One of: tuple[int, int] int	(4, 4)	Number of rows of points that the regular grid should have. Must be at least 2. For large images, you might want to pick a higher value than 4. If a single int, then that value will always be used as the number of rows. If a tuple (a, b), then a value from the discrete interval [a..b] will be uniformly sampled per image. Default: 4.
nb_cols	One of: tuple[int, int] int	(4, 4)	Number of columns of points that the regular grid should have. Must be at least 2. For large images, you might want to pick a higher value than 4. If a single int, then that value will always be used as the number of columns. If a tuple (a, b), then a value from the discrete interval [a..b] will be uniformly sampled per image. Default: 4.
interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	1	Flag that is used to specify the interpolation algorithm. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_LINEAR.
mask_interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	0	Flag that is used to specify the interpolation algorithm for mask. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_NEAREST.
absolute_scale	bool	False	If set to True, the value of the scale parameter will be treated as an absolute pixel value. If set to False, it will be treated as a fraction of the image height and width. Default: False.
keypoint_remapping_method	One of: 'direct' 'mask'	mask	Method to use for keypoint remapping. - "mask": Uses mask-based remapping. Faster, especially for many keypoints, but may be less accurate for large distortions. Recommended for large images or many keypoints. - "direct": Uses inverse mapping. More accurate for large distortions but slower. Default: "mask"
p	float	0.5	Probability of applying the transform. Default: 0.5.
border_mode	One of: cv2.BORDER_CONSTANT cv2.BORDER_REPLICATE cv2.BORDER_REFLECT cv2.BORDER_WRAP cv2.BORDER_REFLECT_101	0	-
fill	One of: tuple[float, ...] float	0	-
fill_mask	One of: tuple[float, ...] float	0	-

Name	Type	Default	Description
grid	tuple[int, int]	(3, 3)	Size of the grid for splitting the image into cells. Each cell is shuffled randomly. For example, (3, 3) will divide the image into a 3x3 grid, resulting in 9 cells to be shuffled. Default: (3, 3)
p	float	0.5	Probability that the transform will be applied. Should be in the range [0, 1]. Default: 0.5

Name	Type	Default	Description
shift_limit	One of: tuple[float, float] float	(-0.0625, 0.0625)	shift factor range for both height and width. If shift_limit is a single float value, the range will be (-shift_limit, shift_limit). Absolute values for lower and upper bounds should lie in range [-1, 1]. Default: (-0.0625, 0.0625).
scale_limit	One of: tuple[float, float] float	(-0.1, 0.1)	scaling factor range. If scale_limit is a single float value, the range will be (-scale_limit, scale_limit). Note that the scale_limit will be biased by 1. If scale_limit is a tuple, like (low, high), sampling will be done from the range (1 + low, 1 + high). Default: (-0.1, 0.1).
rotate_limit	One of: tuple[float, float] float	(-45, 45)	rotation range. If rotate_limit is a single int value, the range will be (-rotate_limit, rotate_limit). Default: (-45, 45).
interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	1	flag that is used to specify the interpolation algorithm. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_LINEAR.
border_mode	int	0	flag that is used to specify the pixel extrapolation method. Should be one of: cv2.BORDER_CONSTANT, cv2.BORDER_REPLICATE, cv2.BORDER_REFLECT, cv2.BORDER_WRAP, cv2.BORDER_REFLECT_101. Default: cv2.BORDER_CONSTANT
shift_limit_x	One of: tuple[float, float] float None	None	shift factor range for width. If it is set then this value instead of shift_limit will be used for shifting width. If shift_limit_x is a single float value, the range will be (-shift_limit_x, shift_limit_x). Absolute values for lower and upper bounds should lie in the range [-1, 1]. Default: None.
shift_limit_y	One of: tuple[float, float] float None	None	shift factor range for height. If it is set then this value instead of shift_limit will be used for shifting height. If shift_limit_y is a single float value, the range will be (-shift_limit_y, shift_limit_y). Absolute values for lower and upper bounds should lie in the range [-, 1]. Default: None.
rotate_method	One of: 'largest_box' 'ellipse'	largest_box	rotation method used for the bounding boxes. Should be one of "largest_box" or "ellipse". Default: "largest_box"
mask_interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	0	Flag that is used to specify the interpolation algorithm for mask. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_NEAREST.
fill	One of: tuple[float, ...] float	0	padding value if border_mode is cv2.BORDER_CONSTANT.
fill_mask	One of: tuple[float, ...] float	0	padding value if border_mode is cv2.BORDER_CONSTANT applied for masks.
p	float	0.5	probability of applying the transform. Default: 0.5.

Name	Type	Default	Description
scale_range	tuple[float, float]	(0.2, 0.4)	Range for random displacement of control points. Values should be in [0.0, 1.0]: - 0.0: No displacement (identity transform) - 0.1: Subtle warping - 0.2-0.4: Moderate deformation (recommended range) - 0.5+: Strong warping Default: (0.2, 0.4)
num_control_points	int	4	Number of control points per side. Creates a grid of num_control_points x num_control_points points. - 2: Minimal deformation (affine-like) - 3-4: Moderate flexibility (recommended) - 5+: More local deformation control Must be >= 2. Default: 4
interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	1	OpenCV interpolation flag. Used for image sampling. See also: cv2.INTER_* Default: cv2.INTER_LINEAR
mask_interpolation	One of: cv2.INTER_NEAREST cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4	0	OpenCV interpolation flag. Used for mask sampling. See also: cv2.INTER_* Default: cv2.INTER_NEAREST
keypoint_remapping_method	One of: 'direct' 'mask'	mask	Method to use for keypoint remapping. - "mask": Uses mask-based remapping. Faster, especially for many keypoints, but may be less accurate for large distortions. Recommended for large images or many keypoints. - "direct": Uses inverse mapping. More accurate for large distortions but slower. Default: "mask"
p	float	0.5	Probability of applying the transform. Default: 0.5
border_mode	One of: cv2.BORDER_CONSTANT cv2.BORDER_REPLICATE cv2.BORDER_REFLECT cv2.BORDER_WRAP cv2.BORDER_REFLECT_101	0	-
fill	One of: tuple[float, ...] float	0	-
fill_mask	One of: tuple[float, ...] float	0	-

Navigation

albumentations.augmentations.geometric.transforms

Members

Affineclass

Parameters

References

BaseDistortionclass

Parameters

Notes

D4class

Parameters

Example

Notes

ElasticTransformclass

Parameters

Example

Notes

GridDistortionclass

Parameters

Example

Notes

GridElasticDeformclass

Parameters

Example

Notes

HorizontalFlipclass

Parameters

OpticalDistortionclass

Parameters

Example

Notes

Padclass

Parameters

References

PadIfNeededclass

Parameters

Example

Notes

Perspectiveclass

Parameters

Example

Notes

PiecewiseAffineclass

Parameters

Example

Notes

RandomGridShuffleclass

Parameters

Example

Notes

ShiftScaleRotateclass

Parameters

SquareSymmetryclass

Parameters

Example

Notes

ThinPlateSplineclass

Parameters

Example

Notes

References

Transposeclass

Parameters

Example

Notes

VerticalFlipclass

Parameters

Example

Notes

Table of Contents