Modules
Foreground/background separation — "remove the background from this image".
Image captioning — generates a natural-language description of an image.
Image classification — what's in this image?
Object detection — where are the objects in this image?
Face detection — where are the faces in this image?
Image segmentation — which pixels belong to which object?
Zero-shot image classification — classify an image against arbitrary labels you supply at call time, without retraining.
ImageVision provides simple, opinionated image vision operations that sit alongside the image library.
On-disk cache for ONNX model weights downloaded from Hugging Face.
Mix Tasks
Pre-downloads the default models used by Image.Classification,
Image.Segmentation, Image.Detection, Image.Background,
Image.Captioning, and Image.ZeroShot so that the first call does not
wait on a model download and the application can run fully offline.