Stay updated

News & Insights
utils

albumentations.augmentations.crops.functional


Functional implementations of image cropping operations. This module provides utility functions for performing various cropping operations on images, bounding boxes, and keypoints. It includes functions to calculate crop coordinates, crop images, and handle the corresponding transformations for bounding boxes and keypoints to maintain consistency between different data types during cropping operations.

get_crop_coordsfunction

get_crop_coords(
    image_shape: tuple[int, int],
    crop_shape: tuple[int, int],
    h_start: float,
    w_start: float
)

Get crop coordinates. This function gets the crop coordinates. Args: image_shape (tuple[int, int]): Original image shape. crop_shape (tuple[int, int]): Crop shape. h_start (float): Start height. w_start (float): Start width. Returns: tuple[int, int, int, int]: Crop coordinates.

Parameters

NameTypeDefaultDescription
image_shapetuple[int, int]--
crop_shapetuple[int, int]--
h_startfloat--
w_startfloat--

crop_bboxes_by_coordsfunction

crop_bboxes_by_coords(
    bboxes: np.ndarray,
    crop_coords: tuple[int, int, int, int],
    image_shape: tuple[int, int]
)

Crop bounding boxes based on given crop coordinates. This function adjusts bounding boxes to fit within a cropped image. Supports both HBB (axis-aligned) and OBB (oriented) bounding boxes. Simply shifts the bounding rectangle coordinates - clipping/filtering happens later in Compose via clamping_mode. Args: bboxes (np.ndarray): Array of normalized bounding boxes (Albumentations format) with shape (N, 4+) where each row is [x_min, y_min, x_max, y_max, ...] for HBB or [x_min, y_min, x_max, y_max, angle, ...] for OBB. crop_coords (tuple[int, int, int, int]): Crop coordinates (x_min, y_min, x_max, y_max) in absolute pixel values. image_shape (tuple[int, int]): Original image shape (height, width). Returns: np.ndarray: Array of cropped bounding boxes in normalized coordinates (Albumentations format).

Parameters

NameTypeDefaultDescription
bboxesnp.ndarray--
crop_coordstuple[int, int, int, int]--
image_shapetuple[int, int]--

crop_keypoints_by_coordsfunction

crop_keypoints_by_coords(
    keypoints: np.ndarray,
    crop_coords: tuple[int, int, int, int]
)

Crop keypoints using the provided coordinates of bottom-left and top-right corners in pixels. Args: keypoints (np.ndarray): An array of keypoints with shape (N, 4+) where each row is (x, y, angle, scale, ...). crop_coords (tuple): Crop box coords (x1, y1, x2, y2). Returns: np.ndarray: An array of cropped keypoints with the same shape as the input.

Parameters

NameTypeDefaultDescription
keypointsnp.ndarray--
crop_coordstuple[int, int, int, int]--

get_center_crop_coordsfunction

get_center_crop_coords(
    image_shape: tuple[int, int],
    crop_shape: tuple[int, int]
)

Get center crop coordinates. This function gets the center crop coordinates. Args: image_shape (tuple[int, int]): Original image shape. crop_shape (tuple[int, int]): Crop shape. Returns: tuple[int, int, int, int]: Center crop coordinates.

Parameters

NameTypeDefaultDescription
image_shapetuple[int, int]--
crop_shapetuple[int, int]--

cropfunction

crop(
    img: ImageType,
    x_min: int,
    y_min: int,
    x_max: int,
    y_max: int
)

Crop an image. This function crops an image. Args: img (np.ndarray): Input image. x_min (int): Minimum x coordinate. y_min (int): Minimum y coordinate. x_max (int): Maximum x coordinate. y_max (int): Maximum y coordinate. Returns: np.ndarray: Cropped image.

Parameters

NameTypeDefaultDescription
imgImageType--
x_minint--
y_minint--
x_maxint--
y_maxint--

crop_and_padfunction

crop_and_pad(
    img: ImageType,
    crop_params: tuple[int, int, int, int] | None,
    pad_params: tuple[int, int, int, int] | None,
    pad_value: tuple[float, ...] | float | None,
    image_shape: tuple[int, int],
    interpolation: int,
    pad_mode: int,
    keep_size: bool
)

Crop and pad an image. This function crops and pads an image. Args: img (np.ndarray): Input image. crop_params (tuple[int, int, int, int] | None): Crop parameters. pad_params (tuple[int, int, int, int] | None): Pad parameters. pad_value (tuple[float, ...] | float | None): Pad value. image_shape (tuple[int, int]): Original image shape. interpolation (int): Interpolation method. pad_mode (int): Pad mode. keep_size (bool): Whether to keep the original size. Returns: np.ndarray: Cropped and padded image.

Parameters

NameTypeDefaultDescription
imgImageType--
crop_params
One of:
  • tuple[int, int, int, int]
  • None
--
pad_params
One of:
  • tuple[int, int, int, int]
  • None
--
pad_value
One of:
  • tuple[float, ...]
  • float
  • None
--
image_shapetuple[int, int]--
interpolationint--
pad_modeint--
keep_sizebool--

crop_and_pad_bboxesfunction

crop_and_pad_bboxes(
    bboxes: np.ndarray,
    crop_params: tuple[int, int, int, int] | None,
    pad_params: tuple[int, int, int, int] | None,
    image_shape: tuple[int, int],
    result_shape: tuple[int, int]
)

Crop and pad bounding boxes. This function crops and pads bounding boxes. Supports both HBB and OBB. Simply shifts coordinates - clipping/filtering happens in Compose via clamping_mode. Args: bboxes (np.ndarray): Array of bounding boxes (HBB or OBB) with any number of columns. First 4 columns are [x_min, y_min, x_max, y_max], rest are preserved. crop_params (tuple[int, int, int, int] | None): Crop parameters. pad_params (tuple[int, int, int, int] | None): Pad parameters. image_shape (tuple[int, int]): Original image shape. result_shape (tuple[int, int]): Result image shape. Returns: np.ndarray: Array of cropped and padded bounding boxes.

Parameters

NameTypeDefaultDescription
bboxesnp.ndarray--
crop_params
One of:
  • tuple[int, int, int, int]
  • None
--
pad_params
One of:
  • tuple[int, int, int, int]
  • None
--
image_shapetuple[int, int]--
result_shapetuple[int, int]--

crop_and_pad_keypointsfunction

crop_and_pad_keypoints(
    keypoints: np.ndarray,
    crop_params: tuple[int, int, int, int] | None,
    pad_params: tuple[int, int, int, int] | None,
    image_shape: tuple[int, int] = (0, 0),
    result_shape: tuple[int, int] = (0, 0),
    keep_size: bool = False
)

Crop and pad multiple keypoints simultaneously. Args: keypoints (np.ndarray): Array of keypoints with shape (N, 4+) where each row is (x, y, angle, scale, ...). crop_params (Sequence[int], optional): Crop parameters [crop_x1, crop_y1, ...]. pad_params (Sequence[int], optional): Pad parameters [top, bottom, left, right]. image_shape (Tuple[int, int]): Original image shape (rows, cols). result_shape (Tuple[int, int]): Result image shape (rows, cols). keep_size (bool): Whether to keep the original size. Returns: np.ndarray: Array of transformed keypoints with the same shape as input.

Parameters

NameTypeDefaultDescription
keypointsnp.ndarray--
crop_params
One of:
  • tuple[int, int, int, int]
  • None
--
pad_params
One of:
  • tuple[int, int, int, int]
  • None
--
image_shapetuple[int, int](0, 0)-
result_shapetuple[int, int](0, 0)-
keep_sizeboolFalse-

volume_crop_yxfunction

volume_crop_yx(
    volume: ImageType,
    x_min: int,
    y_min: int,
    x_max: int,
    y_max: int
)

Crop a single volume along Y (height) and X (width) axes only. Args: volume (np.ndarray): Input volume with shape (D, H, W) or (D, H, W, C). x_min (int): Minimum width coordinate. y_min (int): Minimum height coordinate. x_max (int): Maximum width coordinate. y_max (int): Maximum height coordinate. Returns: np.ndarray: Cropped volume (D, H_new, W_new, [C]). Raises: ValueError: If crop coordinates are invalid.

Parameters

NameTypeDefaultDescription
volumeImageType--
x_minint--
y_minint--
x_maxint--
y_maxint--

volumes_crop_yxfunction

volumes_crop_yx(
    volumes: np.ndarray,
    x_min: int,
    y_min: int,
    x_max: int,
    y_max: int
)

Crop a batch of volumes along Y (height) and X (width) axes only. Args: volumes (np.ndarray): Input batch of volumes with shape (B, D, H, W) or (B, D, H, W, C). x_min (int): Minimum width coordinate. y_min (int): Minimum height coordinate. x_max (int): Maximum width coordinate. y_max (int): Maximum height coordinate. Returns: np.ndarray: Cropped batch of volumes (B, D, H_new, W_new, [C]). Raises: ValueError: If crop coordinates are invalid or volumes shape is incorrect.

Parameters

NameTypeDefaultDescription
volumesnp.ndarray--
x_minint--
y_minint--
x_maxint--
y_maxint--

pad_along_axesfunction

pad_along_axes(
    arr: np.ndarray,
    pad_top: int,
    pad_bottom: int,
    pad_left: int,
    pad_right: int,
    h_axis: int,
    w_axis: int,
    border_mode: int,
    pad_value: float | Sequence[float] = 0
)

Pad an array along specified height (H) and width (W) axes using np.pad. Args: arr (np.ndarray): Input array. pad_top (int): Padding added to the top (start of H axis). pad_bottom (int): Padding added to the bottom (end of H axis). pad_left (int): Padding added to the left (start of W axis). pad_right (int): Padding added to the right (end of W axis). h_axis (int): Index of the height axis (Y). w_axis (int): Index of the width axis (X). border_mode (int): OpenCV border mode. pad_value (float | Sequence[float]): Value for constant padding. Returns: np.ndarray: Padded array. Raises: ValueError: If border_mode is unsupported or axis indices are out of bounds.

Parameters

NameTypeDefaultDescription
arrnp.ndarray--
pad_topint--
pad_bottomint--
pad_leftint--
pad_rightint--
h_axisint--
w_axisint--
border_modeint--
pad_value
One of:
  • float
  • Sequence[float]
0-