Your ad could be here - Reach CV/ML engineers

Interested in

advertising?

distortion flip functional pad resize rotate transforms

transform utils

transforms check_version

Interested in

advertising?

distortion flip functional pad resize rotate transforms

transform utils

transforms check_version

albumentations.augmentations.geometric.resize

View Source on GitHub

Transforms for resizing images and associated data. This module provides transform classes for resizing operations, including uniform resizing, scaling with aspect ratio preservation, and size-constrained transformations.

Members

classLongestMaxSize
classMaxSizeTransform
classRandomScale
classResize
classSmallestMaxSize

LongestMaxSizeclass

Try it on Explore Albumentations

LongestMaxSize(
    max_size: int | Sequence[int] | None = None,
    max_size_hw: tuple[int | None, int | None] | None = None,
    interpolation: Literal[cv2.INTER_NEAREST, cv2.INTER_NEAREST_EXACT, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4, cv2.INTER_LINEAR_EXACT] = 1,
    mask_interpolation: Literal[cv2.INTER_NEAREST, cv2.INTER_NEAREST_EXACT, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4, cv2.INTER_LINEAR_EXACT] = 0,
    p: float = 1
)

Rescale an image so that the longest side is equal to max_size or sides meet max_size_hw constraints, keeping the aspect ratio.

Parameters

Name	Type	Default	Description
max_size	One of: int Sequence[int] None	None	Maximum size of the longest side after the transformation. When using a list or tuple, the max size will be randomly selected from the values provided. Default: None.
max_size_hw	One of: tuple[int \| None, int \| None] None	None	Maximum (height, width) constraints. Supports: - (height, width): Both dimensions must fit within these bounds - (height, None): Only height is constrained, width scales proportionally - (None, width): Only width is constrained, height scales proportionally If specified, max_size must be None. Default: None.
interpolation	One of: cv2.INTER_NEAREST cv2.INTER_NEAREST_EXACT cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4 cv2.INTER_LINEAR_EXACT	1	interpolation method. Default: cv2.INTER_LINEAR.
mask_interpolation	One of: cv2.INTER_NEAREST cv2.INTER_NEAREST_EXACT cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4 cv2.INTER_LINEAR_EXACT	0	flag that is used to specify the interpolation algorithm for mask. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_NEAREST.
p	float	1	probability of applying the transform. Default: 1.

Examples

>>> import albumentations as A
>>> import cv2
>>> # Using max_size
>>> transform1 = A.LongestMaxSize(max_size=1024)
>>> # Input image (1500, 800) -> Output (1024, 546)
>>>
>>> # Using max_size_hw with both dimensions
>>> transform2 = A.LongestMaxSize(max_size_hw=(800, 1024))
>>> # Input (1500, 800) -> Output (800, 427)
>>> # Input (800, 1500) -> Output (546, 1024)
>>>
>>> # Using max_size_hw with only height
>>> transform3 = A.LongestMaxSize(max_size_hw=(800, None))
>>> # Input (1500, 800) -> Output (800, 427)
>>>
>>> # Common use case with padding
>>> transform4 = A.Compose([
...     A.LongestMaxSize(max_size=1024),
...     A.PadIfNeeded(min_height=1024, min_width=1024),
... ])

Notes

- If the longest side of the image is already equal to max_size, the image will not be resized. - This transform will not crop the image. The resulting image may be smaller than specified in both dimensions. - For non-square images, both sides will be scaled proportionally to maintain the aspect ratio. - Bounding boxes and keypoints are scaled accordingly.

MaxSizeTransformclass

Try it on Explore Albumentations

MaxSizeTransform(
    max_size: int | Sequence[int] | None = None,
    max_size_hw: tuple[int | None, int | None] | None = None,
    interpolation: Literal[cv2.INTER_NEAREST, cv2.INTER_NEAREST_EXACT, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4, cv2.INTER_LINEAR_EXACT] = 1,
    mask_interpolation: Literal[cv2.INTER_NEAREST, cv2.INTER_NEAREST_EXACT, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4, cv2.INTER_LINEAR_EXACT] = 0,
    p: float = 1
)

Base class for transforms that resize based on maximum size constraints. This class provides common functionality for derived transforms like LongestMaxSize and SmallestMaxSize that resize images based on size constraints while preserving aspect ratio.

Parameters

Name	Type	Default	Description
max_size	One of: int Sequence[int] None	None	Maximum size constraint. The specific interpretation depends on the derived class. Default: None.
max_size_hw	One of: tuple[int \| None, int \| None] None	None	Maximum (height, width) constraints. Either max_size or max_size_hw must be specified, but not both. Default: None.
interpolation	One of: cv2.INTER_NEAREST cv2.INTER_NEAREST_EXACT cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4 cv2.INTER_LINEAR_EXACT	1	Flag for the interpolation algorithm. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_LINEAR.
mask_interpolation	One of: cv2.INTER_NEAREST cv2.INTER_NEAREST_EXACT cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4 cv2.INTER_LINEAR_EXACT	0	Flag for the mask interpolation algorithm. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_NEAREST.
p	float	1	Probability of applying the transform. Default: 1.

Examples

>>> import numpy as np
>>> import albumentations as A
>>> import cv2
>>>
>>> # Example of creating a custom transform that extends MaxSizeTransform
>>> class CustomMaxSize(A.MaxSizeTransform):
...     def get_params_dependent_on_data(self, params, data):
...         img_h, img_w = params["shape"][:2]
...         # Calculate scale factor - here we scale to make the image area constant
...         target_area = 300 * 300  # Target area of 300x300
...         current_area = img_h * img_w
...         scale = np.sqrt(target_area / current_area)
...         return {"scale": scale}
>>>
>>> # Prepare sample data
>>> image = np.zeros((100, 200, 3), dtype=np.uint8)
>>> # Add a rectangle to visualize the effect
>>> cv2.rectangle(image, (50, 20), (150, 80), (255, 0, 0), -1)
>>>
>>> # Create a mask
>>> mask = np.zeros((100, 200), dtype=np.uint8)
>>> mask[20:80, 50:150] = 1
>>>
>>> # Create bounding boxes and keypoints
>>> bboxes = np.array([[50, 20, 150, 80]])
>>> bbox_labels = [1]
>>> keypoints = np.array([[100, 50]])
>>> keypoint_labels = [0]
>>>
>>> # Apply the custom transform
>>> transform = A.Compose([
...     CustomMaxSize(
...         max_size=None,
...         max_size_hw=(None, None),  # Not used in our custom implementation
...         interpolation=cv2.INTER_LINEAR,
...         mask_interpolation=cv2.INTER_NEAREST,
...         p=1.0
...     )
... ], bbox_params=A.BboxParams(format='pascal_voc', label_fields=['bbox_labels']),
...    keypoint_params=A.KeypointParams(format='xy', label_fields=['keypoint_labels']))
>>>
>>> # Apply the transform
>>> result = transform(
...     image=image,
...     mask=mask,
...     bboxes=bboxes,
...     bbox_labels=bbox_labels,
...     keypoints=keypoints,
...     keypoint_labels=keypoint_labels
... )
>>>
>>> # Get results
>>> transformed_image = result['image']  # Shape will be approximately (122, 245, 3)
>>> transformed_mask = result['mask']    # Shape will be approximately (122, 245)
>>> transformed_bboxes = result['bboxes']  # Bounding boxes are scale invariant
>>> transformed_keypoints = result['keypoints']  # Keypoints scaled proportionally
>>> transformed_bbox_labels = result['bbox_labels']  # Labels remain unchanged
>>> transformed_keypoint_labels = result['keypoint_labels']  # Labels remain unchanged

Notes

- This is a base class that should be extended by concrete resize transforms. - The scaling calculation is implemented in derived classes. - Aspect ratio is preserved by applying the same scale factor to both dimensions.

RandomScaleclass

Try it on Explore Albumentations

RandomScale(
    scale_limit: tuple[float, float] | float = (-0.1, 0.1),
    interpolation: Literal[cv2.INTER_NEAREST, cv2.INTER_NEAREST_EXACT, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4, cv2.INTER_LINEAR_EXACT] = 1,
    mask_interpolation: Literal[cv2.INTER_NEAREST, cv2.INTER_NEAREST_EXACT, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4, cv2.INTER_LINEAR_EXACT] = 0,
    p: float = 0.5
)

Randomly resize the input. Output image size is different from the input image size.

Parameters

Name	Type	Default	Description
scale_limit	One of: tuple[float, float] float	(-0.1, 0.1)	scaling factor range. If scale_limit is a single float value, the range will be (-scale_limit, scale_limit). Note that the scale_limit will be biased by 1. If scale_limit is a tuple, like (low, high), sampling will be done from the range (1 + low, 1 + high). Default: (-0.1, 0.1).
interpolation	One of: cv2.INTER_NEAREST cv2.INTER_NEAREST_EXACT cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4 cv2.INTER_LINEAR_EXACT	1	flag that is used to specify the interpolation algorithm. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_LINEAR.
mask_interpolation	One of: cv2.INTER_NEAREST cv2.INTER_NEAREST_EXACT cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4 cv2.INTER_LINEAR_EXACT	0	flag that is used to specify the interpolation algorithm for mask. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_NEAREST.
p	float	0.5	probability of applying the transform. Default: 0.5.

Examples

>>> import numpy as np
>>> import albumentations as A
>>> import cv2
>>>
>>> # Create sample data for demonstration
>>> image = np.zeros((100, 100, 3), dtype=np.uint8)
>>> # Add some shapes to visualize scaling effects
>>> cv2.rectangle(image, (25, 25), (75, 75), (255, 0, 0), -1)  # Red square
>>> cv2.circle(image, (50, 50), 10, (0, 255, 0), -1)  # Green circle
>>>
>>> # Create a mask for segmentation
>>> mask = np.zeros((100, 100), dtype=np.uint8)
>>> mask[25:75, 25:75] = 1  # Mask covering the red square
>>>
>>> # Create bounding boxes and keypoints
>>> bboxes = np.array([[25, 25, 75, 75]])  # Box around the red square
>>> bbox_labels = [1]
>>> keypoints = np.array([[50, 50]])  # Center of circle
>>> keypoint_labels = [0]
>>>
>>> # Apply RandomScale transform with comprehensive parameters
>>> transform = A.Compose([
...     A.RandomScale(
...         scale_limit=(-0.3, 0.5),     # Scale between 0.7x and 1.5x
...         interpolation=cv2.INTER_LINEAR,
...         mask_interpolation=cv2.INTER_NEAREST,
...         p=1.0                         # Always apply
...     )
... ], bbox_params=A.BboxParams(format='pascal_voc', label_fields=['bbox_labels']),
...    keypoint_params=A.KeypointParams(format='xy', label_fields=['keypoint_labels']))
>>>
>>> # Apply the transform to all targets
>>> result = transform(
...     image=image,
...     mask=mask,
...     bboxes=bboxes,
...     bbox_labels=bbox_labels,
...     keypoints=keypoints,
...     keypoint_labels=keypoint_labels
... )
>>>
>>> # Get the transformed results
>>> scaled_image = result['image']        # Dimensions will be between 70-150 pixels
>>> scaled_mask = result['mask']          # Mask scaled proportionally to image
>>> scaled_bboxes = result['bboxes']      # Bounding boxes adjusted to new dimensions
>>> scaled_bbox_labels = result['bbox_labels']  # Labels remain unchanged
>>> scaled_keypoints = result['keypoints']      # Keypoints adjusted to new dimensions
>>> scaled_keypoint_labels = result['keypoint_labels']  # Labels remain unchanged
>>>
>>> # The image dimensions will vary based on the randomly sampled scale factor
>>> # With scale_limit=(-0.3, 0.5), dimensions could be anywhere from 70% to 150% of original

Notes

- The output image size is different from the input image size. - Scale factor is sampled independently per image side (width and height). - Bounding box coordinates are scaled accordingly. - Keypoint coordinates are scaled accordingly.

Resizeclass

Try it on Explore Albumentations

Resize(
    height: int,
    width: int,
    interpolation: Literal[cv2.INTER_NEAREST, cv2.INTER_NEAREST_EXACT, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4, cv2.INTER_LINEAR_EXACT] = 1,
    mask_interpolation: Literal[cv2.INTER_NEAREST, cv2.INTER_NEAREST_EXACT, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4, cv2.INTER_LINEAR_EXACT] = 0,
    p: float = 1
)

Resize the input to the given height and width.

Parameters

Name	Type	Default	Description
height	int	-	desired height of the output.
width	int	-	desired width of the output.
interpolation	One of: cv2.INTER_NEAREST cv2.INTER_NEAREST_EXACT cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4 cv2.INTER_LINEAR_EXACT	1	flag that is used to specify the interpolation algorithm. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_LINEAR.
mask_interpolation	One of: cv2.INTER_NEAREST cv2.INTER_NEAREST_EXACT cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4 cv2.INTER_LINEAR_EXACT	0	flag that is used to specify the interpolation algorithm for mask. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_NEAREST.
p	float	1	probability of applying the transform. Default: 1.

Examples

>>> import numpy as np
>>> import albumentations as A
>>> import cv2
>>>
>>> # Create sample data for demonstration
>>> image = np.zeros((100, 100, 3), dtype=np.uint8)
>>> # Add some shapes to visualize resize effects
>>> cv2.rectangle(image, (25, 25), (75, 75), (255, 0, 0), -1)  # Red square
>>> cv2.circle(image, (50, 50), 10, (0, 255, 0), -1)  # Green circle
>>>
>>> # Create a mask for segmentation
>>> mask = np.zeros((100, 100), dtype=np.uint8)
>>> mask[25:75, 25:75] = 1  # Mask covering the red square
>>>
>>> # Create bounding boxes and keypoints
>>> bboxes = np.array([[25, 25, 75, 75]])  # Box around the red square
>>> bbox_labels = [1]
>>> keypoints = np.array([[50, 50]])  # Center of circle
>>> keypoint_labels = [0]
>>>
>>> # Resize all data to 224x224 (common input size for many CNNs)
>>> transform = A.Compose([
...     A.Resize(
...         height=224,
...         width=224,
...         interpolation=cv2.INTER_LINEAR,
...         mask_interpolation=cv2.INTER_NEAREST,
...         p=1.0
...     )
... ], bbox_params=A.BboxParams(format='pascal_voc', label_fields=['bbox_labels']),
...    keypoint_params=A.KeypointParams(format='xy', label_fields=['keypoint_labels']))
>>>
>>> # Apply the transform to all targets
>>> result = transform(
...     image=image,
...     mask=mask,
...     bboxes=bboxes,
...     bbox_labels=bbox_labels,
...     keypoints=keypoints,
...     keypoint_labels=keypoint_labels
... )
>>>
>>> # Get the transformed results
>>> resized_image = result['image']        # Shape will be (224, 224, 3)
>>> resized_mask = result['mask']          # Shape will be (224, 224)
>>> resized_bboxes = result['bboxes']      # Bounding boxes scaled to new dimensions
>>> resized_bbox_labels = result['bbox_labels']  # Labels remain unchanged
>>> resized_keypoints = result['keypoints']      # Keypoints scaled to new dimensions
>>> resized_keypoint_labels = result['keypoint_labels']  # Labels remain unchanged
>>>
>>> # Note: When resizing from 100x100 to 224x224:
>>> # - The red square will be scaled from (25-75) to approximately (56-168)
>>> # - The keypoint at (50, 50) will move to approximately (112, 112)
>>> # - All spatial relationships are preserved but coordinates are scaled

SmallestMaxSizeclass

Try it on Explore Albumentations

SmallestMaxSize(
    max_size: int | Sequence[int] | None = None,
    max_size_hw: tuple[int | None, int | None] | None = None,
    interpolation: Literal[cv2.INTER_NEAREST, cv2.INTER_NEAREST_EXACT, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4, cv2.INTER_LINEAR_EXACT] = 1,
    mask_interpolation: Literal[cv2.INTER_NEAREST, cv2.INTER_NEAREST_EXACT, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4, cv2.INTER_LINEAR_EXACT] = 0,
    p: float = 1
)

Rescale an image so that minimum side is equal to max_size or sides meet max_size_hw constraints, keeping the aspect ratio.

Parameters

Name	Type	Default	Description
max_size	One of: int Sequence[int] None	None	Maximum size of smallest side of the image after the transformation. When using a list, max size will be randomly selected from the values in the list. Default: None.
max_size_hw	One of: tuple[int \| None, int \| None] None	None	Maximum (height, width) constraints. Supports: - (height, width): Both dimensions must be at least these values - (height, None): Only height is constrained, width scales proportionally - (None, width): Only width is constrained, height scales proportionally If specified, max_size must be None. Default: None.
interpolation	One of: cv2.INTER_NEAREST cv2.INTER_NEAREST_EXACT cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4 cv2.INTER_LINEAR_EXACT	1	Flag that is used to specify the interpolation algorithm. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_LINEAR.
mask_interpolation	One of: cv2.INTER_NEAREST cv2.INTER_NEAREST_EXACT cv2.INTER_LINEAR cv2.INTER_CUBIC cv2.INTER_AREA cv2.INTER_LANCZOS4 cv2.INTER_LINEAR_EXACT	0	flag that is used to specify the interpolation algorithm for mask. Should be one of: cv2.INTER_NEAREST, cv2.INTER_LINEAR, cv2.INTER_CUBIC, cv2.INTER_AREA, cv2.INTER_LANCZOS4. Default: cv2.INTER_NEAREST.
p	float	1	Probability of applying the transform. Default: 1.

Examples

>>> import numpy as np
>>> import albumentations as A
>>> # Using max_size
>>> transform1 = A.SmallestMaxSize(max_size=120)
>>> # Input image (100, 150) -> Output (120, 180)
>>>
>>> # Using max_size_hw with both dimensions
>>> transform2 = A.SmallestMaxSize(max_size_hw=(100, 200))
>>> # Input (80, 160) -> Output (100, 200)
>>> # Input (160, 80) -> Output (400, 200)
>>>
>>> # Using max_size_hw with only height
>>> transform3 = A.SmallestMaxSize(max_size_hw=(100, None))
>>> # Input (80, 160) -> Output (100, 200)

Notes

- If the smallest side of the image is already equal to max_size, the image will not be resized. - This transform will not crop the image. The resulting image may be larger than specified in both dimensions. - For non-square images, both sides will be scaled proportionally to maintain the aspect ratio. - Bounding boxes and keypoints are scaled accordingly.