AlbumentationsX MCP integration

What MCP adds

AlbumentationsX is already a Python library with a direct API. MCP adds a structured interface that AI assistants can call safely and repeatedly.

With AlbumentationsX MCP, a host can:

discover AlbumentationsX transforms and supported targets;
recommend starter pipelines for classification, detection, segmentation, OCR, or balanced workflows;
validate pipelines before rendering;
validate local preview requests before reading image files;
render deterministic preview batches and contact sheets;
render annotation overlays for bounding boxes, masks, and keypoints;
compare baseline and candidate preview runs;
collect concrete feedback tags such as too_noisy:high, too_blurry, or too_distorted;
export accepted pipelines as Python, JSON, or YAML;
keep a short tuning-session record for review handoff.

This is especially useful for the common human-in-the-loop flow:

The assistant proposes a first augmentation pipeline.
You inspect a small preview sheet.
You say what is wrong in concrete terms.
The assistant adjusts the pipeline.
You compare the previous and adjusted preview runs.
You export the accepted pipeline for training or review.

Installation

Install the MCP server from PyPI with uvx:

uvx --from albumentationsx-mcp albumentationsx-mcp

For local preview work, always bound filesystem access explicitly:

uvx --from albumentationsx-mcp albumentationsx-mcp \
  --allowed-root /absolute/path/to/images-or-dataset \
  --artifact-root /absolute/path/to/albumentationsx-mcp-artifacts

--allowed-root limits which local images and masks the server can read. --artifact-root controls where generated previews, contact sheets, manifests, and reports are written.

The server also supports environment variables for host setups that prefer config-only launch commands:

ALBU_MCP_ALLOWED_ROOTS: os.pathsep-separated list of allowed local roots;
ALBU_MCP_ARTIFACT_ROOT: artifact output directory;
ALBU_MCP_MAX_PREVIEW_RUNS: optional preview-run retention limit.

Host configuration

The exact MCP config format depends on the host. The examples below use a PyPI install and local preview bounds.

Claude Desktop

{
  "mcpServers": {
    "albumentationsx": {
      "command": "uvx",
      "args": [
        "--from",
        "albumentationsx-mcp",
        "albumentationsx-mcp",
        "--allowed-root",
        "/absolute/path/to/images-or-dataset",
        "--artifact-root",
        "/absolute/path/to/albumentationsx-mcp-artifacts"
      ]
    }
  }
}

Restart Claude Desktop after editing its MCP config.

Cursor

Cursor uses a similar JSON server config:

{
  "mcpServers": {
    "albumentationsx": {
      "command": "uvx",
      "args": [
        "--from",
        "albumentationsx-mcp",
        "albumentationsx-mcp",
        "--allowed-root",
        "/absolute/path/to/images-or-dataset",
        "--artifact-root",
        "/absolute/path/to/albumentationsx-mcp-artifacts"
      ]
    }
  }
}

Refresh MCP discovery after changing the config.

Claude Code

claude mcp add-json albumentationsx '{
  "type": "stdio",
  "command": "uvx",
  "args": [
    "--from",
    "albumentationsx-mcp",
    "albumentationsx-mcp",
    "--allowed-root",
    "/absolute/path/to/images-or-dataset",
    "--artifact-root",
    "/absolute/path/to/albumentationsx-mcp-artifacts"
  ]
}'

Codex

[mcp_servers.albumentationsx]
command = "uvx"
args = [
  "--from",
  "albumentationsx-mcp",
  "albumentationsx-mcp",
  "--allowed-root",
  "/absolute/path/to/images-or-dataset",
  "--artifact-root",
  "/absolute/path/to/albumentationsx-mcp-artifacts",
]

First smoke check

After connecting the server, ask the host to run a smoke check before rendering previews:

Read albumentationsx://examples/client-smoke, then run run_host_smoke_check.
Continue only if preview_ready is true. If it is not ready, explain the remediation actions.

The smoke check verifies the local environment, safe roots, artifact output, recipe recommendation, pipeline validation, and a bounded preview request template.

Recommended preview workflow

Use this prompt for a first local preview:

Use AlbumentationsX MCP to create a small augmentation preview for this folder:
/absolute/path/to/images-or-dataset

Do not render anything until validate_preview_request returns valid=true.
Start with a low-intensity pipeline. Render one variant per image. Create a contact sheet and explain what changed.

A good host should follow this sequence:

Call plan_dataset_onboarding.
Inspect the returned preview_request_template.
Call validate_preview_request.
Call render_preview_batch only after validation succeeds.
Inspect the generated contact sheet and manifest.
Ask for concrete feedback before increasing intensity or batch size.

Feedback-driven tuning

After you inspect the first preview, give specific feedback:

The geometric variation is useful, but examples 3 and 8 are too noisy. Reduce noise and keep the crop/flip behavior.
Compare the adjusted preview with the baseline before exporting anything.

The host can then call:

adjust_pipeline to produce a candidate pipeline from feedback tags;
render_preview_batch to render the candidate;
compare_preview_runs to compare baseline and candidate manifests;
export_pipeline when you accept the result.

For multi-step reviews, ask the host to start a tuning session:

Start a tuning session. Record the baseline run, the candidate run, my feedback, and the final accepted pipeline.
Export the session as Markdown when we are done.

Detection and segmentation datasets

plan_dataset_onboarding detects common local dataset layouts and annotation formats. For sampled images, it can build annotation-aware preview templates.

Supported annotation inputs include:

COCO bounding boxes;
YOLO bounding boxes;
COCO segmentation polygons;
COCO compressed and uncompressed RLE masks;
YOLO segmentation polygons.

For detection workflows, the generated preview request can include pascal_voc bounding boxes and labels so the preview renderer can produce overlay contact sheets.

For segmentation workflows, use ["image", "mask"] targets. The onboarding template keeps mask-only payloads for segmentation-safe previews and avoids sending bounding boxes to pipelines that do not declare bbox_params.

Example prompt:

Plan a segmentation-safe preview for this dataset. Use image and mask targets. Detect COCO or YOLO-seg annotations if
available. Render a small overlay contact sheet and summarize mask coverage before and after augmentation.

The onboarding response includes annotation_summary, which reports matched sample counts, missing annotations, mask formats, and RLE counts. This helps the host explain whether it found the expected annotations before rendering.

Exporting a final pipeline

When the preview looks acceptable, ask for a reproducible export:

Export the accepted pipeline as Python and JSON. Include the seed, targets, bbox params or mask targets if used, and a
short note about the feedback that led to this version.

The exported Python can be copied into a training pipeline and reviewed like ordinary AlbumentationsX code.

Safety and privacy

AlbumentationsX MCP is designed for local, bounded preview work:

It does not train models.
It does not overwrite datasets.
It does not execute arbitrary user-provided Python.
It rejects preview paths outside --allowed-root.
It writes generated artifacts under --artifact-root.
It validates preview requests before reading image bytes.

Avoid uploading private datasets, production images, unredacted local paths, secrets, or credentials into public issues or chat transcripts. Use small synthetic samples or redacted contact sheets when reporting problems.

Troubleshooting

If the host cannot see the server, first run the command manually:

uvx --from albumentationsx-mcp albumentationsx-mcp --help

If preview setup fails, ask the host:

Read albumentationsx://diagnostics/guide and call diagnose_environment. Explain the failing checks and remediation
actions.

Common issues:

Missing uvx in the host environment.
Dataset path is outside --allowed-root.
Artifact directory is not writable.
Mask paths are outside the allowed root.
Annotation count does not match the number of input images.
A pipeline includes bounding boxes but does not declare compatible bbox_params.