Skip to content

AdaCLIP — Vision-Language Anomaly Detection

AdaCLIP couples a frozen CLIP backbone with a small adapter trained against hyperspectral data, producing anomaly scores conditioned on a natural-language prompt. It's the workhorse pipeline for the XMR_Demo_Industrial_FOD_Lentils use case: "is this a lentil, or something else?"

This tutorial walks through three AdaCLIP variants, each one a different recipe for getting CLIP to work on hyperspectral data.

Run the example:

What you'll learn

  • How external plugin nodes integrate with cuvis-ai pipelines.
  • Reducing 60+ band hyperspectral data to 3 channels CLIP can consume (PCA, Concrete, DRCNN).
  • Using restore-pipeline to run AdaCLIP on new cu3s data after training.

When to reach for AdaCLIP

  • You want a strong anomaly detector that benefits from CLIP's pre-training but operates on hyperspectral data.
  • You're comparing a frozen baseline against a trainable adapter and want both available in the same pipeline shape.
  • You need text-prompted anomaly detection (the prompt conditions what counts as anomalous).