English pipeline_image_classifier_vit_base_patch16_224_in21k_snacks ViTForImageClassification from matteopilotto

Description

Pretrained VIT model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.image_classifier_vit_base_patch16_224_in21k_snacks is a English model originally trained by matteopilotto.

Predicted Entities

salad, candy, muffin, banana, grape, popcorn, pretzel, pineapple, juice, orange, doughnut, carrot, waffle, cake, cookie, ice cream, watermelon, hot dog, apple, strawberry

Download Copy S3 URI

How to use

    pipeline = PretrainedPipeline('pipeline_image_classifier_vit_base_patch16_224_in21k_snacks', lang = 'en')
    annotations =  pipeline.transform(imageDF)
    

    val pipeline = new PretrainedPipeline("pipeline_image_classifier_vit_base_patch16_224_in21k_snacks", lang = "en")
    val annotations = pipeline.transform(imageDF)
    

Model Information

Model Name:	pipeline_image_classifier_vit_base_patch16_224_in21k_snacks
Type:	pipeline
Compatibility:	Spark NLP 4.2.1+
License:	Open Source
Edition:	Official
Language:	en
Size:	322.0 MB

Included Models

ImageAssembler
ViTForImageClassification

PREVIOUSEnglish pipeline_image_classifier_vit_base_patch16_224_in21k_finetuned_cifar10 ViTForImageClassification from tanlq

NEXTEnglish pipeline_image_classifier_vit_base_patch16_224_in21k_ucSat ViTForImageClassification from YKXBCi