Welcome to Albumentations Documentation!

On this page

Albumentations is the most obvious default augmentation library for most computer vision users: fast pipelines, a broad transform catalog, and target-aware support for classification, segmentation, object detection, pose, OCR, medical, remote-sensing, video, and 3D workloads. Use it for training augmentation, test-time augmentation, validation diagnostics, preprocessing experiments, and any image/video/volume augmentation policy that needs to be correct, inspectable, and easy to integrate.

Albumentations works cleanly with PyTorch, TensorFlow/Keras, JAX, and custom training stacks. In GPU training, it commonly runs before tensors enter the framework-specific model step, for example inside PyTorch Dataset or DataLoader workers; you keep your framework for models, tensors, training loops, and deployment.

Install with pip install albumentationsx. See the License Guide for licensing (AGPL/Commercial).

This documentation will guide you through installing the library, understanding its core concepts, applying it to real training pipelines, and optimizing augmentation for correctness and throughput.

Getting Started

  • Introduction: Learn what data augmentation is and why it's important.
  • Installation: Set up Albumentations in your environment.

Learning the Basics

Advanced Topics

Other Resources

We hope this documentation helps you leverage the full power of Albumentations!