Stay updated

Transform Library Comparison Guide 🔗

This guide helps you find equivalent transforms between Albumentations and other popular libraries (torchvision and Kornia).

Key Differences 🔗

Compared to TorchVision 🔗

Albumentations operates on numpy arrays (TorchVision uses PyTorch tensors)
More parameters for fine-tuning transformations
Built-in support for mask augmentation
Better handling of bounding boxes and keypoints

Compared to Kornia 🔗

CPU-based numpy operations (Kornia uses GPU tensors)
More comprehensive support for detection/segmentation
Generally better CPU performance
Simpler API for common tasks

Common Transform Mappings 🔗

Basic Geometric Transforms 🔗

TorchVision Transform	Albumentations Equivalent	Notes
Resize	Resize / LongestMaxSize	- TorchVision's `Resize` combines two Albumentations behaviors: 1. When given (h,w): equivalent to Albumentations `Resize` 2. When given single int + max_size: similar to `LongestMaxSize` - Albumentations allows separate interpolation method for masks - TorchVision has `antialias` parameter, Albumentations doesn't
ScaleJitter	OneOf + multiple Resize	- Can be approximated in Albumentations using OneOf container with multiple Resize transforms - Example: `transforms = A.OneOf([` `A.Resize(height=int(target_h * scale), width=int(target_w * scale))` `for scale in np.linspace(0.1, 2.0, num=20)` `])` - Not exactly the same as continuous random scaling, but provides similar functionality
RandomShortestSize	OneOf + SmallestMaxSize	- Can be approximated in Albumentations using: `transforms = A.OneOf([` `A.SmallestMaxSize(max_size=size, max_height=max_size, max_width=max_size)` `for size in [480, 512, 544, 576, 608]` `])` - Randomly selects size for shortest side while maintaining aspect ratio - Optional `max_size` parameter limits longest side - TorchVision has `antialias` parameter, Albumentations doesn't
RandomResize	OneOf + Resize	- TorchVision: randomly selects single size S between `min_size` and `max_size`, sets both `width` and `height` to `S` - No direct equivalent in Albumentations (RandomScale preserves aspect ratio) - Can be approximated using: `transforms = A.OneOf([` `A.Resize(size, size)` `for size in range(min_size, max_size + 1, step)` `])`
RandomCrop	RandomCrop	- Both perform random cropping with similar core functionality - Key differences: 1. TorchVision accepts single int for square crop, Albumentations requires both height and width 2. Padding options differ: - TorchVision: supports padding parameter for pre-padding - Albumentations: offers `pad_position` parameter ('center', 'top_left', etc.) 3. Fill value handling: - TorchVision: supports dict mapping for different types - Albumentations: separate `fill` and `fill_mask` parameters 4. Padding modes: - TorchVision: 'constant', 'edge', 'reflect', 'symmetric' - Albumentations: uses OpenCV border modes
RandomResizedCrop	RandomResizedCrop	- Nearly identical functionality and parameters - Key differences: 1. TorchVision accepts single int for square output, Albumentations requires `(height, width)` tuple 2. Default values are the same `(scale=(0.08, 1.0), ratio=(0.75, 1.3333))` 3. Albumentations adds: - Separate `mask_interpolation` parameter - Probability parameter `p`
RandomIoUCrop	RandomSizedBBoxSafeCrop	- Both ensure safe cropping with respect to bounding boxes - Key differences: 1. TorchVision: - Implements exact SSD paper approach - Uses IoU-based sampling strategy - Requires explicit sanitization of boxes after crop 2. Albumentations: - Simpler approach ensuring bbox safety - Directly specifies target size - Automatically handles bbox cleanup - For exact SSD-style cropping, might need custom implementation in Albumentations
CenterCrop	CenterCrop	- Both crop the center part of the input - Key differences: 1. Size specification: - TorchVision: accepts single int for square crop or (height, width) tuple - Albumentations: requires separate height and width parameters 2. Padding behavior: - TorchVision: always pads with 0 if image is smaller - Albumentations: optional padding with `pad_if_needed` 3. Albumentations adds: - Configurable padding mode and position - Separate fill values for image and mask - Probability parameter `p`
RandomHorizontalFlip	HorizontalFlip	- Identical functionality - Both have default probability p=0.5 - Only naming difference: TorchVision includes "Random" in name
RandomVerticalFlip	VerticalFlip	- Identical functionality - Both have default probability p=0.5 - Only naming difference: TorchVision includes "Random" in name
Pad	Pad	- Similar core padding functionality - Both support: - Single int for all sides - (pad_x, pad_y) for symmetric padding - (left, top, right, bottom) for per-side padding - Key differences: 1. Padding modes: - TorchVision: 'constant', 'edge', 'reflect', 'symmetric' - Albumentations: uses OpenCV border modes 2. Fill value handling: - TorchVision: supports dict mapping for different types - Albumentations: separate `fill` and `fill_mask` parameters 3. Albumentations adds: - Probability parameter `p`
RandomZoomOut	RandomScale + PadIfNeeded	- No direct equivalent in Albumentations - Can be approximated by combining: `A.Compose([` `A.RandomScale(scale_limit=(0.0, 3.0), p=0.5), # scale_limit=(0.0, 3.0) maps to side_range=(1.0, 4.0)` `A.PadIfNeeded(min_height=height, min_width=width, border_mode=cv2.BORDER_CONSTANT, value=fill)` `])` - Key differences: 1. TorchVision implements specific SSD paper approach 2. Albumentations requires composition of two transforms
RandomRotation	Rotate	- Similar core rotation functionality but with different parameters - Key differences: 1. Angle specification: - TorchVision: `degrees` parameter (-degrees, +degrees) or (min, max) - Albumentations: `limit` parameter (-limit, +limit) or (min, max) 2. Output size control: - TorchVision: `expand=True/False` - Albumentations: `crop_border=True/False` 3. Additional Albumentations features: - Separate mask interpolation - Bbox rotation methods ('largest_box' or 'ellipse') - More border modes - Probability parameter `p` 4. Center specification: - TorchVision: supports custom center point - Albumentations: always uses image center
RandomAffine	Affine	- Both support core affine operations (translation, rotation, scale, shear) - Key differences: 1. Parameter specification: - TorchVision: single parameters for each transform - Albumentations: more flexible with dict options for x/y axes 2. Scale handling: - Albumentations adds `keep_ratio` and `balanced_scale` - Albumentations supports independent x/y scaling 3. Translation: - TorchVision: fraction only - Albumentations: both percent and pixels 4. Additional Albumentations features: - `fit_output` to adjust image plane - Separate mask interpolation - More border modes - Bbox rotation methods - Probability parameter `p`
RandomPerspective	Perspective	- Both apply random perspective transformations - Key differences: 1. Distortion control: - TorchVision: single `distortion_scale` (0 to 1) - Albumentations: `scale` tuple for corner movement range 2. Output handling: - Albumentations adds `keep_size` and `fit_output` options - Can control whether to maintain original size 3. Additional Albumentations features: - Separate mask interpolation - More border modes - Better control over output size and fitting
ElasticTransform	ElasticTransform	- Similar core functionality: both apply elastic deformations to images - Key differences: 1. Parameters have opposite meanings: - TorchVision: `alpha` (displacement), `sigma` (smoothness) - Albumentations: `alpha` (smoothness), `sigma` (displacement) 2. Default values reflect this difference: - TorchVision: `alpha=50.0, sigma=5.0` - Albumentations: `alpha=1.0, sigma=50.0` - Note on implementation: - Albumentations follows Simard et al. 2003 paper more closely: - σ should be ~0.05 * image_size - α should be proportional to σ - Additional Albumentations features: - `approximate` mode - `same_dxdy` option - Choice of noise distribution - Separate mask interpolation
ColorJitter	ColorJitter	- Similar core functionality: both randomly adjust brightness, contrast, saturation, and hue - Key similarities: 1. Same parameter names and meanings 2. Same value ranges (e.g., hue should be in [-0.5, 0.5]) 3. Random order of transformations - Key differences: 1. Default values: - TorchVision: all None by default - Albumentations: defaults to (0.8, 1.2) for brightness/contrast/saturation 2. Implementation: - TorchVision: uses Pillow - Albumentations: uses OpenCV (may produce slightly different results) 3. Additional in Albumentations: - Explicit probability parameter `p` - Value saturation instead of uint8 overflow
RandomChannelPermutation	ChannelShuffle	- Both randomly permute image channels - Key similarities: 1. Same core functionality 2. Work on multi-channel images (typically RGB) - Key differences: 1. Naming convention only 2. Albumentations adds: - Probability parameter `p`
RandomPhotometricDistort	RandomOrder + ColorJitter + ChannelShuffle	- TorchVision's transform is from SSD paper, combines: 1. Color jittering (brightness, contrast, saturation, hue) 2. Random channel permutation - Can be replicated in Albumentations using: `A.RandomOrder([` `A.ColorJitter(brightness=(0.875, 1.125),` `contrast=(0.5, 1.5),` `saturation=(0.5, 1.5),` `hue=(-0.05, 0.05),` `p=0.5),` `A.ChannelShuffle(p=0.5)` `])`
Grayscale	ToGray	- Similar core functionality: convert RGB to grayscale - Key differences: 1. Output channels: - TorchVision: only 1 or 3 channels - Albumentations: supports any number of output channels 2. Conversion methods: - TorchVision: single method (weighted RGB) - Albumentations: multiple methods via `method` parameter: • weighted_average (default, same as TorchVision) • from_lab, desaturation, average, max, pca 3. Additional in Albumentations: - Probability parameter `p` - More flexible channel handling
RGB	ToRGB	- Similar core functionality: convert to RGB format - Key differences: 1. Input handling: - TorchVision: accepts 1 or 3 channel inputs - Albumentations: only accepts single-channel inputs 2. Output channels: - TorchVision: always 3 channels - Albumentations: configurable via `num_output_channels` 3. Behavior: - TorchVision: converts to RGB if not already RGB - Albumentations: strictly grayscale to RGB conversion 4. Additional in Albumentations: - Probability parameter `p`
RandomGrayscale	ToGray	- Similar core functionality: convert to grayscale with probability - Key differences: 1. Default probability: - TorchVision: p=0.1 - Albumentations: p=0.5 2. Output handling: - TorchVision: always preserves input channels - Albumentations: configurable output channels 3. Conversion methods: - TorchVision: single method - Albumentations: multiple methods with different channel support: • weighted_average, from_lab: 3-channel only • desaturation, average, max, pca: any number of channels 4. Channel requirements: - TorchVision: works with 1 or 3 channels - Albumentations: depends on method chosen
GaussianBlur	GaussianBlur	- Similar core functionality: apply Gaussian blur with random kernel size - Key similarities: 1. Both support random kernel sizes 2. Both support random sigma values - Key differences: 1. Parameter specification: - TorchVision: `kernel_size` (exact size), `sigma` (range) - Albumentations: `blur_limit` (size range), `sigma_limit` (range) 2. Kernel size constraints: - TorchVision: must specify exact size - Albumentations: can specify range (3, 7) or auto-compute 3. Additional in Albumentations: - Probability parameter `p` - Auto-computation of kernel size from sigma
GaussianNoise	GaussNoise	- Similar core functionality: add Gaussian noise to images - Key similarities: 1. Both support mean and standard deviation parameters - Key differences: 1. Parameter ranges: - TorchVision: fixed values for mean and sigma - Albumentations: ranges for both (`std_range`, `mean_range`) 2. Value handling: - TorchVision: expects float [0,1], has clip option - Albumentations: auto-scales based on dtype 3. Additional in Albumentations: - Per-channel noise option - Noise scale factor for performance - Probability parameter `p`
RandomInvert	InvertImg	- Similar core functionality: invert image colors - Key similarities: 1. Both invert pixel values 2. Both have default probability of 0.5 - Key differences: 1. Value handling: - TorchVision: works with [0,1] float tensors - Albumentations: auto-handles uint8 (255) and float32 (1.0)
RandomPosterize	Posterize	- Similar core functionality: reduce color bits - Key similarities: 1. Both posterize images with probability p=0.5 - Key differences: 1. Bits specification: - TorchVision: single fixed value [0-8] - Albumentations: flexible options with [1-7] (recommended): • Single value for all channels • Range (min_bits, max_bits) • Per-channel values [r,g,b] • Per-channel ranges [(r_min,r_max), ...] 2. Practical range: - TorchVision: includes 0 (black) and 8 (unchanged) - Albumentations: recommended [1-7] for actual posterization
RandomSolarize	Solarize	- Similar core functionality: invert pixels above threshold - Key similarities: 1. Both have default probability p=0.5 2. Both invert values above threshold - Key differences: 1. Threshold specification: - TorchVision: single fixed threshold value - Albumentations: range via `threshold_range` 2. Value handling: - TorchVision: works with raw threshold values - Albumentations: uses normalized [0,1] range: • uint8: multiplied by 255 • float32: multiplied by 1.0
RandomAdjustSharpness	Sharpen	- Similar core functionality: adjust image sharpness - Key similarities: 1. Both have default probability p=0.5 - Key differences: 1. Parameter specification: - TorchVision: single `sharpness_factor` • 0: blurred • 1: original • 2: doubled sharpness - Albumentations: more controls: • `alpha`: effect visibility [0,1] • `lightness`: contrast control 2. Method options: - TorchVision: single method - Albumentations: two methods: • 'kernel': Laplacian operator • 'gaussian': blur interpolation
RandomAutocontrast	AutoContrast	Same core functionality with identical parameters (p=0.5)
RandomEqualize	Equalize	- Similar core functionality: histogram equalization - Key similarities: 1. Both have default probability `p=0.5` - Key differences: 1. Additional Albumentations features: - Choice of algorithm (cv/pil methods) - Per-channel or luminance-based equalization - Optional masking support
Normalize	Normalize	- Similar core functionality: normalize image values - Key similarities: 1. Both support mean/std normalization - Key differences: 1. Normalization options: - TorchVision: only (input - mean) / std - Albumentations: multiple methods: • standard (same as TorchVision) • image (global stats) • image_per_channel • min_max • min_max_per_channel 2. Additional in Albumentations: - `max_pixel_value` parameter - Probability parameter `p`
RandomErasing	Erasing	- Similar core functionality: randomly erase image regions - Key similarities: 1. Both have default probability `p=0.5` 2. Same default scale=(0.02, 0.33) 3. Same default ratio=(0.3, 3.3) - Key differences: 1. Fill value options: - TorchVision: number/tuple or 'random' - Albumentations: additional options: • random_uniform • inpaint_telea • inpaint_ns 2. Additional in Albumentations: - Mask fill value option - Support for masks, bboxes, keypoints
JPEG	ImageCompression	- Similar core functionality: apply JPEG compression - Key similarities: 1. Both use quality range 1-100 2. Both support quality ranges - Key differences: 1. Compression types: - TorchVision: JPEG only - Albumentations: JPEG and WebP 2. Additional in Albumentations: - Probability parameter `p` - Default quality range (99, 100)

Kornia to Albumentations 🔗

Kornia	Albumentations	Notes
ColorJitter	ColorJitter	- Similar core functionality: randomly adjust brightness, contrast, saturation, and hue - Key similarities: 1. Both support same parameters (brightness, contrast, saturation, hue) 2. Both allow float or tuple ranges for parameters - Key differences: 1. Default values: - Albumentations: (0.8, 1.2) for brightness/contrast/saturation - Kornia: 0.0 for all parameters 2. Default probability: - Albumentations: p=0.5 - Kornia: p=1.0 3. Note: Kornia recommends using `ColorJiggle` instead as it follows color theory better
RandomAutoContrast	AutoContrast	- Similar core functionality: enhance image contrast automatically - Key similarities: 1. Both stretch intensity range to use full range 2. Both preserve relative intensities - Key differences: 1. Default probability: - Albumentations: p=0.5 - Kornia: p=1.0 2. Additional in Kornia: - `clip_output` parameter to control value clipping
RandomBoxBlur	Blur	- Similar core functionality: apply box/average blur to images - Key similarities: 1. Both have default probability p=0.5 2. Both apply box/average blur filter - Key differences: 1. Kernel size specification: - Albumentations: `blur_limit` parameter for range (e.g., (3, 7)) - Kornia: fixed `kernel_size` tuple (default (3, 3)) 2. Additional in Kornia: - `border_type` parameter ('reflect', 'replicate', 'circular') - `normalized` parameter for L1 norm control
RandomBrightness	RandomBrightnessContrast	- Different scope: - Kornia: brightness only - Albumentations: combines brightness and contrast - Key differences: 1. Parameter specification: - Kornia: `brightness` tuple (default: (1.0, 1.0)) - Albumentations: `brightness_limit` (default: (-0.2, 0.2)) 2. Default probability: - Kornia: p=1.0 - Albumentations: p=0.5 3. Additional in Albumentations: - `brightness_by_max` parameter for adjustment method - `ensure_safe_range` to prevent overflow/underflow - Combined contrast control 4. Additional in Kornia: - `clip_output` parameter
RandomChannelDropout	ChannelDropout	- Similar core functionality: randomly drop image channels - Key similarities: 1. Both have default probability p=0.5 2. Both allow specifying fill value for dropped channels - Key differences: 1. Channel drop specification: - Kornia: fixed `num_drop_channels` (default: 1) - Albumentations: flexible `channel_drop_range` tuple (default: (1, 1)) 2. Error handling: - Albumentations: explicit checks for single-channel images and invalid ranges - Kornia: simpler parameter validation
RandomChannelShuffle	ChannelShuffle	- Identical core functionality: randomly shuffle image channels
RandomClahe	CLAHE	- Similar core functionality: apply Contrast Limited Adaptive Histogram Equalization - Key similarities: 1. Both have default probability p=0.5 2. Both allow configuring grid size and clip limit - Key differences: 1. Parameter defaults: - Kornia: clip_limit=(40.0, 40.0), grid_size=(8, 8) - Albumentations: clip_limit=(1, 4), tile_grid_size=(8, 8) 2. Additional in Kornia: - `slow_and_differentiable` parameter for implementation choice
RandomContrast	RandomBrightnessContrast	- Different scope: - Kornia: contrast only - Albumentations: combines brightness and contrast - Key differences: 1. Parameter specification: - Kornia: `contrast` tuple (default: (1.0, 1.0)) - Albumentations: `contrast_limit` (default: (-0.2, 0.2)) 2. Default probability: - Kornia: p=1.0 - Albumentations: p=0.5 3. Additional in Albumentations: - `ensure_safe_range` to prevent overflow/underflow - Combined brightness control 4. Additional in Kornia: - `clip_output` parameter
RandomEqualize	Equalize	- Similar core functionality: apply histogram equalization - Key similarities: 1. Both have default probability p=0.5 - Key differences: 1. Additional in Albumentations: - `mode` parameter to choose between 'cv' and 'pil' methods - `by_channels` parameter for per-channel or luminance-based equalization - `mask` parameter to selectively apply equalization - `mask_params` for dynamic mask generation
RandomGamma	RandomGamma	- Similar core functionality: apply random gamma correction - Key differences: 1. Parameter specification: - Kornia: separate `gamma` (1.0, 1.0) and `gain` (1.0, 1.0) tuples - Albumentations: single `gamma_limit` (80, 120) as percentage range 2. Default probability: - Kornia: p=1.0 - Albumentations: p=0.5 3. Additional in Albumentations: - `eps` parameter to prevent numerical errors
RandomGaussianBlur	GaussianBlur	- Similar core functionality: apply Gaussian blur with random parameters - Key similarities: 1. Both have default probability p=0.5 2. Both support kernel size and sigma parameters - Key differences: 1. Parameter specification: - Kornia: requires explicit `kernel_size` and `sigma` range - Albumentations: `blur_limit` (default: (3, 7)) and `sigma_limit` (default: 0) 2. Additional in Kornia: - `border_type` parameter for padding mode - `separable` parameter for 1D convolution optimization
RandomGaussianIllumination	Illumination	- Similar core functionality: apply illumination effects - Key similarities: 1. Both have default probability p=0.5 2. Both support controlling effect intensity and position - Key differences: 1. Scope: - Kornia: Gaussian illumination patterns only - Albumentations: Multiple modes (linear, corner, gaussian) 2. Parameter ranges: - Kornia: gain=(0.01, 0.15), center=(0.1, 0.9), sigma=(0.2, 1.0) - Albumentations: intensity_range=(0.01, 0.2), center_range=(0.1, 0.9), sigma_range=(0.2, 1.0) 3. Additional in Albumentations: - `mode` parameter for different effect types - `effect_type` for brighten/darken control - `angle_range` for linear gradients 4. Additional in Kornia: - `sign` parameter for effect direction
RandomGaussianNoise	GaussNoise	- Similar core functionality: add Gaussian noise to images - Key similarities: 1. Both have default probability p=0.5 - Key differences: 1. Parameter specification: - Kornia: fixed `mean` (default: 0.0) and `std` (default: 1.0) - Albumentations: ranges via `std_range` (0.2, 0.44) and `mean_range` (0.0, 0.0) 2. Additional in Albumentations: - `per_channel` parameter for independent channel noise - `noise_scale_factor` for performance optimization - Automatic value scaling based on image dtype
RandomGrayscale	ToGray	- Similar core functionality: convert images to grayscale - Key differences: 1. Default probability: - Kornia: p=0.1 - Albumentations: p=0.5 2. Conversion options: - Kornia: customizable `rgb_weights` for channel mixing - Albumentations: multiple `method` options (weighted_average, from_lab, desaturation, average, max, pca) 3. Output control: - Kornia: always 3-channel output - Albumentations: configurable `num_output_channels`
RandomHue	ColorJitter (hue parameter)	- Similar core functionality: adjust image hue - Key differences: 1. Scope: - Kornia: hue-only transform - Albumentations: part of ColorJitter with brightness, contrast, and saturation 2. Default values: - Kornia: hue=(0.0, 0.0), p=1.0 - Albumentations: hue=(-0.5, 0.5), p=0.5
RandomInvert	InvertImg	- Similar core functionality: invert image values - Key differences: 1. Maximum value handling: - Kornia: configurable via `max_val` parameter (default: 1.0) - Albumentations: automatically determined by dtype (255 for uint8, 1.0 for float32)
RandomJPEG	ImageCompression	- Similar core functionality: apply image compression - Key differences: 1. Compression options: - Kornia: JPEG only - Albumentations: supports both JPEG and WebP 2. Quality specification: - Kornia: `jpeg_quality` (default: 50.0) - Albumentations: `quality_range` (default: (99, 100)) 3. Default probability: - Kornia: p=1.0 - Albumentations: p=0.5
RandomLinearCornerIllumination	Illumination (corner mode)	- Similar core functionality: apply corner illumination effects - Key differences: 1. Scope: - Kornia: corner illumination only - Albumentations: part of general Illumination transform with multiple modes 2. Parameter specification: - Kornia: `gain` (0.01, 0.2) and `sign` (-1.0, 1.0) - Albumentations: `intensity_range` (0.01, 0.2) and `effect_type` (brighten/darken/both) 3. Additional in Albumentations: - Multiple illumination modes (linear, corner, gaussian) - More control over effect parameters
RandomLinearIllumination	Illumination (linear mode)	- Similar core functionality: apply linear illumination effects - Key differences: 1. Scope: - Kornia: linear illumination only - Albumentations: part of general Illumination transform with multiple modes 2. Parameter specification: - Kornia: `gain` (0.01, 0.2) and `sign` (-1.0, 1.0) - Albumentations: `intensity_range` (0.01, 0.2), `effect_type` (brighten/darken/both), and `angle_range` (0, 360) 3. Additional in Albumentations: - Multiple illumination modes (linear, corner, gaussian) - Explicit angle control for gradient direction
RandomMedianBlur	MedianBlur	- Similar core functionality: apply median blur filter - Key similarities: 1. Both have default probability p=0.5 - Key differences: 1. Kernel size specification: - Kornia: fixed `kernel_size` tuple (default: (3, 3)) - Albumentations: range via `blur_limit` (default: (3, 7)) 2. Kernel constraints: - Albumentations: enforces odd kernel sizes
RandomMotionBlur	MotionBlur	- Similar core functionality: apply directional motion blur - Key similarities: 1. Both have default probability p=0.5 2. Both support angle and direction control - Key differences: 1. Kernel size specification: - Kornia: `kernel_size` as int or tuple - Albumentations: `blur_limit` (default: (3, 7)) 2. Angle control: - Kornia: `angle` parameter with symmetric range (-angle, angle) - Albumentations: `angle_range` (default: (0, 360)) 3. Additional in Albumentations: - `allow_shifted` parameter for kernel position control
RandomPlanckianJitter	PlanckianJitter	- Similar core functionality: apply physics-based color temperature variations - Key similarities: 1. Both have default probability p=0.5 2. Both support 'blackbody' and 'cied' modes - Key differences: 1. Temperature control: - Kornia: `select_from` parameter for discrete jitter selection - Albumentations: `temperature_limit` for continuous range 2. Additional in Albumentations: - `sampling_method` parameter ('uniform' or 'gaussian') - More detailed control over temperature ranges - Better documentation of physics-based effects
RandomPlasmaBrightness	PlasmaBrightnessContrast	- Similar core functionality: apply fractal-based brightness adjustments - Key similarities: 1. Both have default probability p=0.5 2. Both use Diamond-Square algorithm for pattern generation - Key differences: 1. Parameter specification: - Kornia: `roughness` (0.1, 0.7) and `intensity` (0.0, 1.0) - Albumentations: `brightness_range` (-0.3, 0.3), `contrast_range` (-0.3, 0.3), `roughness` (default: 3.0) 2. Additional in Albumentations: - Combined brightness and contrast adjustment - `plasma_size` parameter for pattern detail control - More detailed mathematical formulation and documentation
RandomPlasmaContrast	PlasmaBrightnessContrast	- Similar core functionality: apply fractal-based contrast adjustments - Key similarities: 1. Both have default probability p=0.5 2. Both use Diamond-Square algorithm for pattern generation - Key differences: 1. Parameter specification: - Kornia: `roughness` (0.1, 0.7) only - Albumentations: `contrast_range` (-0.3, 0.3), `roughness` (default: 3.0), `plasma_size` (default: 256) 2. Scope: - Kornia: contrast-only adjustment - Albumentations: combined brightness and contrast adjustment 3. Additional in Albumentations: - More detailed mathematical formulation - Pattern size control via `plasma_size`
RandomPlasmaShadow	PlasmaShadow	- Similar core functionality: apply fractal-based shadow effects - Key similarities: 1. Both have default probability p=0.5 2. Both use Diamond-Square algorithm for pattern generation - Key differences: 1. Parameter specification: - Kornia: `roughness` (0.1, 0.7), `shade_intensity` (-1.0, 0.0), `shade_quantity` (0.0, 1.0) - Albumentations: `shadow_intensity_range` (0.3, 0.7), `plasma_size` (default: 256), `roughness` (default: 3.0) 2. Additional in Albumentations: - Pattern size control via `plasma_size` - More intuitive intensity range (0 to 1) - More detailed mathematical formulation and documentation
RandomPosterize	Posterize	- Similar core functionality: reduce color bits in image - Key similarities: 1. Both have default probability p=0.5 2. Both operate on color bit reduction - Key differences: 1. Bit specification: - Kornia: `bits` parameter (default: 3) with range (0, 8], can be float or tuple - Albumentations: `num_bits` parameter (default: 4) with range [1, 7], supports multiple formats: * Single int for all channels * Tuple for random range * List for per-channel specification * List of tuples for per-channel ranges 2. Additional in Albumentations: - More flexible channel-wise control - More detailed documentation and mathematical background
RandomRain	RandomRain	- Similar core functionality: add rain effects to images - Key similarities: 1. Both have default probability p=0.5 - Key differences: 1. Rain parameter specification: - Kornia: `number_of_drops` (1000, 2000), `drop_height` (5, 20), `drop_width` (-5, 5) - Albumentations: `slant_range` (-10, 10), `drop_length` (20), `drop_width` (1) 2. Additional in Albumentations: - `drop_color` customization - `blur_value` for atmospheric effect - `brightness_coefficient` for lighting adjustment - `rain_type` presets (drizzle, heavy, torrential) 3. Approach: - Kornia: Direct drop placement - Albumentations: More realistic simulation with slant, blur, and brightness effects
RandomRGBShift	AdditiveNoise	- Similar core functionality: add noise/shifts to image channels - Key similarities: 1. Both have default probability p=0.5 2. Both can affect individual channels - Key differences: 1. Approach: - Kornia: Simple RGB channel shifts with individual limits - Albumentations: More sophisticated noise generation with multiple distributions 2. Parameter specification: - Kornia: `r_shift_limit`, `g_shift_limit`, `b_shift_limit` (all default: 0.5) - Albumentations: Flexible noise configuration with: * Multiple noise types (uniform, gaussian, laplace, beta) * Different spatial modes (constant, per_pixel, shared) * Customizable distribution parameters 3. Additional in Albumentations: - Performance optimization options - More detailed control over noise distribution - Spatial application modes
RandomSaltAndPepperNoise	SaltAndPepper	- Similar core functionality: apply salt and pepper noise to images - Key similarities: 1. Both have default probability p=0.5 2. Both use same default parameters: - `amount` (0.01, 0.06) - `salt_vs_pepper` (0.4, 0.6) - Key differences: 1. Parameter flexibility: - Kornia: Supports single float or tuple for parameters - Albumentations: Requires tuples for ranges 2. Documentation: - Albumentations provides: * Detailed mathematical formulation * Clear examples for different noise levels * Implementation notes and edge cases * References to academic sources
RandomSaturation	ColorJitter	- Different scope and functionality: - Key differences: 1. Scope: - Kornia: Saturation-only adjustment - Albumentations: Combined brightness, contrast, saturation, and hue adjustment 2. Default parameters: - Kornia: `saturation` (1.0, 1.0), p=1.0 - Albumentations: `saturation` (0.8, 1.2), p=0.5 3. Implementation: - Kornia: Aligns with PIL/TorchVision implementation - Albumentations: Uses OpenCV with noted differences in HSV conversion 4. Additional in Albumentations: - Brightness adjustment - Contrast adjustment - Hue adjustment - Random order of transformations
RandomSharpness	Sharpen	- Similar core functionality: sharpen images - Key similarities: 1. Both have default probability p=0.5 - Key differences: 1. Parameter specification: - Kornia: Single `sharpness` parameter (default: 0.5) - Albumentations: More detailed control with: * `alpha` (0.2, 0.5) for effect visibility * `lightness` (0.5, 1.0) for contrast * `method` choice ('kernel' or 'gaussian') * `kernel_size` and `sigma` for gaussian method 2. Implementation methods: - Kornia: Single approach - Albumentations: Two methods: * Kernel-based using Laplacian operator * Gaussian interpolation 3. Documentation: - Albumentations provides detailed mathematical formulation and references
RandomSnow	RandomSnow	- Similar core functionality: add snow effects to images - Key differences: 1. Parameter specification: - Kornia: `snow_coefficient` (0.5, 0.5), `brightness` (2, 2), p=1.0 - Albumentations: `snow_point_range` (0.1, 0.3), `brightness_coeff` (2.5), p=0.5 2. Implementation methods: - Kornia: Single approach - Albumentations: Two methods: * "bleach": Simple pixel value thresholding * "texture": Advanced snow texture simulation 3. Additional in Albumentations: - Detailed snow simulation with: * HSV color space manipulation * Gaussian noise for texture * Depth effect simulation * Sparkle effects 4. Documentation: - Albumentations provides detailed mathematical formulation and implementation notes
RandomSolarize	Solarize	- Similar core functionality: invert pixel values above threshold - Key similarities: 1. Both have default probability p=0.5 - Key differences: 1. Parameter specification: - Kornia: Two parameters: * `thresholds` (default: 0.1) for threshold range * `additions` (default: 0.1) for value adjustment - Albumentations: Single parameter: * `threshold_range` (default: (0.5, 0.5)) 2. Threshold handling: - Kornia: Generates from (0.5 - x, 0.5 + x) for float input - Albumentations: Direct range specification, scaled by image type max value 3. Documentation: - Albumentations provides: * Detailed examples for both uint8 and float32 images * Clear mathematical formulation * Image type-specific behavior explanation
CenterCrop	CenterCrop	- Similar core functionality: crop center of image - Key similarities: 1. Both have default probability p=1.0 - Key differences: 1. Size specification: - Kornia: Single `size` parameter (int or tuple) - Albumentations: Separate `height` and `width` parameters 2. Additional features: - Kornia: * `align_corners` for interpolation * `resample` mode selection * `cropping_mode` ('slice' or 'resample') - Albumentations: * `pad_if_needed` for handling small images * `border_mode` for padding method * `fill` and `fill_mask` for padding values * `pad_position` options 3. Target handling: - Kornia: Image tensors only - Albumentations: Supports images, masks, bboxes, and keypoints
PadTo	PadIfNeeded	- Can achieve same core functionality - Key similarities: 1. Both have default probability p=1.0 2. Can pad to exact size: - Kornia: `size=(height, width)` - Albumentations: `min_height=height, min_width=width` - Key differences: 1. Parameter naming: - Kornia: Single `size` tuple - Albumentations: Separate dimension parameters 2. Additional features: - Kornia: * Simple `pad_mode` selection * Single `pad_value` - Albumentations: * Flexible `position` options * Separate `fill` and `fill_mask` * Optional divisibility padding * Multiple target support 3. Target handling: - Kornia: Image tensors only - Albumentations: Images, masks, bboxes, keypoints
RandomAffine	Affine	- Similar core functionality: apply affine transformations - Key similarities: 1. Both have default probability p=0.5 2. Both support rotation, translation, scaling, and shear - Key differences: 1. Parameter specification: - Kornia: * `degrees` for rotation * `translate` as fraction * `scale` as tuple * `shear` in degrees - Albumentations: * More flexible parameter formats * Supports both percent and pixel translation * Dictionary format for independent axis control 2. Additional features in Albumentations: * `fit_output` for automatic size adjustment * `keep_ratio` for aspect ratio preservation * `rotate_method` options * `balanced_scale` for even scale distribution 3. Target handling: - Kornia: Image tensors only - Albumentations: Images, masks, keypoints, bboxes
RandomCrop	RandomCrop	- Similar core functionality: randomly crop image patches - Key similarities: 1. Both have default probability p=1.0 2. Both support padding if needed - Key differences: 1. Size specification: - Kornia: Single `size` tuple (height, width) - Albumentations: Separate `height` and `width` parameters 2. Padding options: - Kornia: * Flexible padding sizes (int, tuple[2], tuple[4]) * Multiple padding modes (constant, reflect, replicate) * Single fill value - Albumentations: * Simpler padding interface * Separate fill values for image and mask * Flexible pad positioning 3. Target handling: - Kornia: Image tensors only - Albumentations: Images, masks, bboxes, keypoints
RandomElasticTransform	ElasticTransform	- Similar core functionality: apply elastic deformations - Key similarities: 1. Both have default probability p=0.5 2. Both use Gaussian smoothing for displacement fields 3. Both support independent control of x/y deformations: - Kornia: via separate values in `sigma`/`alpha` tuples - Albumentations: via `same_dxdy` parameter - Key differences: 1. Parameter specification: - Kornia: * `kernel_size` tuple (63, 63) * `sigma` tuple (32.0, 32.0) * `alpha` tuple (1.0, 1.0) - Albumentations: * Single `sigma` (default: 50.0) * Single `alpha` (default: 1.0) 2. Additional features: - Kornia: * Control over padding mode - Albumentations: * `approximate` mode for faster processing * Choice of noise distribution * Separate mask interpolation 3. Target handling: - Kornia: Image tensors only - Albumentations: Images, masks, bboxes, keypoints
RandomErasing	Erasing	- Similar core functionality: randomly erase rectangular regions - Key similarities: 1. Both have default probability p=0.5 2. Same default parameters: * `scale` (0.02, 0.33) * `ratio` (0.3, 3.3) - Key differences: 1. Fill value options: - Kornia: Simple numeric `value` (default: 0.0) - Albumentations: Rich `fill` options: * Numeric values * "random" per pixel * "random_uniform" per region * "inpaint_telea" method * "inpaint_ns" method 2. Additional features in Albumentations: * Separate `mask_fill` value * Support for masks, bboxes, keypoints * Inpainting options for more natural-looking results 3. Target handling: - Kornia: Image tensors only - Albumentations: Images, masks, bboxes, keypoints
RandomFisheye	OpticalDistortion	- Similar core functionality: apply optical/fisheye distortion - Key similarities: 1. Both have default probability p=0.5 2. Both support fisheye distortion - Key differences: 1. Parameter specification: - Kornia: * Separate `center_x`, `center_y` for distortion center * `gamma` for distortion strength - Albumentations: * Single `distort_limit` parameter * `mode` selection ('camera' or 'fisheye') 2. Distortion models: - Kornia: Fisheye only - Albumentations: * Camera matrix model * Fisheye model 3. Additional features in Albumentations: * Separate interpolation methods for image and mask * Support for masks, bboxes, keypoints 4. Target handling: - Kornia: Image tensors only - Albumentations: Images, masks, bboxes, keypoints
RandomHorizontalFlip	HorizontalFlip	- Similar core functionality: flip image horizontally - Key similarities: 1. Both have default probability p=0.5 2. Simple operation with same visual result - Key differences: 1. Batch handling: - Kornia: * Additional `p_batch` parameter * `same_on_batch` option - Albumentations: No batch-specific parameters 2. Target handling: - Kornia: Image tensors only - Albumentations: Images, masks, bboxes, keypoints
RandomPerspective	Perspective	- Similar core functionality: apply perspective transformation - Key similarities: 1. Both have default probability p=0.5 2. Both transform image by moving corners 3. Both support different interpolation methods: - Kornia: via `resample` (BILINEAR, NEAREST) - Albumentations: via `interpolation` (INTER_LINEAR, INTER_NEAREST, etc.) - Key differences: 1. Distortion control: - Kornia: * `distortion_scale` (0 to 1, default: 0.5) * `sampling_method` ('basic' or 'area_preserving') - Albumentations: * `scale` tuple for corner movement range * `fit_output` option for image capture 2. Output handling: - Kornia: * `align_corners` parameter * `keepdim` for batch form - Albumentations: * `keep_size` for output dimensions * Border mode and fill options * Separate mask interpolation 3. Target handling: - Kornia: Image tensors only - Albumentations: Images, masks, keypoints, bboxes
RandomResizedCrop	RandomResizedCrop	- Similar core functionality: crop random patches and resize - Key similarities: 1. Both have default probability p=1.0 2. Same default parameters: * `scale` (0.08, 1.0) * `ratio` (~0.75, ~1.33) 3. Both support different interpolation methods: - Kornia: via `resample` - Albumentations: via `interpolation` - Key differences: 1. Implementation options: - Kornia: * `cropping_mode` ('slice' or 'resample') * `align_corners` parameter * `keepdim` for batch form - Albumentations: * Separate mask interpolation * Fallback to center crop after 10 attempts 2. Target handling: - Kornia: Image tensors only - Albumentations: Images, masks, bboxes, keypoints
RandomRotation90	RandomRotate90	- Similar core functionality: rotate image by 90 degrees - Key similarities: 1. Both have default probability p=0.5 2. Both rotate in 90-degree increments - Key differences: 1. Rotation control: - Kornia: * `times` parameter to specify range of rotations * `resample` and `align_corners` for interpolation - Albumentations: * Simpler implementation (0-3 rotations) 2. Target handling: - Kornia: Image tensors only - Albumentations: Images, masks, bboxes, keypoints
RandomRotation	Rotate	- Similar core functionality: rotate image by random angle - Key similarities: 1. Both have default probability p=0.5 2. Both support different interpolation methods - Key differences: 1. Angle specification: - Kornia: `degrees` parameter (if single value, range is (-degrees, +degrees)) - Albumentations: `limit` parameter (default: (-90, 90)) 2. Additional features: - Kornia: * `align_corners` for interpolation - Albumentations: * Border mode options * Fill values for padding * `rotate_method` for bboxes * `crop_border` option * Separate mask interpolation 3. Target handling: - Kornia: Image tensors only - Albumentations: Images, masks, bboxes, keypoints
RandomShear	Affine (shear parameter)	- Similar core functionality: apply shear transformation - Key similarities: 1. Both have default probability p=0.5 2. Both support different interpolation methods 3. Both support independent x/y shear control - Key differences: 1. Parameter specification: - Kornia: * Dedicated shear transform * `shear` parameter supports float, tuple(2), or tuple(4) * Simple padding modes (zeros, border, reflection) - Albumentations: * Part of general Affine transform * `shear` supports number, tuple, or dict format * More border modes and fill options 2. Additional features in Albumentations: * Separate mask interpolation * `fit_output` option * Combined with other affine transforms 3. Target handling: - Kornia: Image tensors only - Albumentations: Images, masks, keypoints, bboxes
RandomThinPlateSpline	ThinPlateSpline	- Similar core functionality: apply smooth, non-rigid deformations - Key similarities: 1. Both have default probability p=0.5 2. Both use thin plate spline algorithm 3. Both support interpolation options - Key differences: 1. Deformation control: - Kornia: * Single `scale` parameter (default: 0.2) * Fixed control point grid - Albumentations: * `scale_range` tuple for range of deformation * Configurable `num_control_points` 2. Implementation details: - Kornia: * `align_corners` parameter * Binary mode choice (bilinear/nearest) - Albumentations: * OpenCV interpolation flags * More granular control over grid 3. Target handling: - Kornia: Image tensors only - Albumentations: Images, masks, keypoints, bboxes
RandomVerticalFlip	VerticalFlip	- Similar core functionality: flip image vertically - Key similarities: 1. Both have default probability p=0.5 2. Simple operation with same visual result - Key differences: 1. Implementation: - Kornia: * Additional `p_batch` parameter - Albumentations: * Simpler implementation 2. Target handling: - Kornia: Image tensors only - Albumentations: Images, masks, bboxes, keypoints

Key Differences 🔗

Compared to TorchVision 🔗

Albumentations operates on numpy arrays instead of PyTorch tensors
Albumentations typically provides more parameters for fine-tuning transformations
Most Albumentations transforms support both image and mask augmentation
Better support for bounding box and keypoint augmentation

Compared to Kornia 🔗

Kornia operates directly on GPU tensors, while Albumentations works with numpy arrays
Albumentations provides more comprehensive support for object detection and segmentation tasks
Albumentations typically offers better performance for CPU-based augmentations

Performance Comparison 🔗

According to benchmarking results, Albumentations generally offers superior CPU performance compared to TorchVision and Kornia for most transforms. Here are some key highlights: Common Transforms Performance (images/second, higher is better)

Transform	Albumentations	TorchVision	Kornia	Notes
HorizontalFlip	8,618	914	390	Albumentations is ~9x faster than TorchVision, ~22x faster than Kornia
VerticalFlip	22,847	3,198	1,212	Albumentations is ~7x faster than TorchVision, ~19x faster than Kornia
RandomResizedCrop	2,828	511	287	Albumentations is ~5.5x faster than TorchVision, ~10x faster than Kornia
Normalize	1,196	519	626	Albumentations is ~2x faster than both
ColorJitter	628	46	55	Albumentations is ~13x faster than both

Key Performance Insights: 🔗

Basic Operations: Albumentations excels at basic transforms like flips and crops, often being 5-20x faster than alternatives
Complex Operations: For more complex transforms like elastic deformation, the performance gap narrows
Memory Efficiency: Working with numpy arrays (Albumentations) is generally more memory efficient than tensor operations (Kornia/TorchVision) on CPU

When to Choose Each Library: 🔗

Albumentations: Best choice for CPU-based preprocessing pipelines and when maximum performance is needed
Kornia: Consider when doing augmentation on GPU with existing PyTorch tensors
TorchVision: Good choice when deeply integrated into PyTorch ecosystem and GPU performance isn't critical

Note: Benchmarks performed on macOS-15.0.1-arm64 with Python 3.12.7. Your results may vary based on hardware and setup.

Code Examples 🔗

TorchVision to Albumentations 🔗

# TorchVision
transforms = T.Compose([
    T.RandomHorizontalFlip(p=0.5),
    T.RandomRotation(10),
    T.Normalize(mean=[0.485, 0.456, 0.406],
                std=[0.229, 0.224, 0.225])
])

# Albumentations equivalent
transforms = A.Compose([
    A.HorizontalFlip(p=0.5),
    A.Rotate(limit=10),
    A.Normalize(mean=[0.485, 0.456, 0.406],
                std=[0.229, 0.224, 0.225])
])

Kornia to Albumentations 🔗

# Kornia
transforms = K.AugmentationSequential(
    K.RandomHorizontalFlip(p=0.5),
    K.RandomRotation(degrees=10),
    K.Normalize(mean=[0.485, 0.456, 0.406],
                std=[0.229, 0.224, 0.225])
)

# Albumentations equivalent
transforms = A.Compose([
    A.HorizontalFlip(p=0.5),
    A.Rotate(limit=10),
    A.Normalize(mean=[0.485, 0.456, 0.406],
                std=[0.229, 0.224, 0.225])
])

Transform Library Comparison Guide 🔗

Key Differences 🔗

Compared to TorchVision 🔗

Compared to Kornia 🔗

Common Transform Mappings 🔗

Basic Geometric Transforms 🔗

Kornia to Albumentations 🔗

Key Differences 🔗

Compared to TorchVision 🔗

Compared to Kornia 🔗

Performance Comparison 🔗

Key Performance Insights: 🔗

When to Choose Each Library: 🔗

Code Examples 🔗

TorchVision to Albumentations 🔗

Kornia to Albumentations 🔗

Additional Resources 🔗