Description
Pretrained VIT model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.image_classifier_vit_base_patch16_224_in21k_snacks is a English model originally trained by matteopilotto.
Predicted Entities
salad, candy, muffin, banana, grape, popcorn, pretzel, pineapple, juice, orange, doughnut, carrot, waffle, cake, cookie, ice cream, watermelon, hot dog, apple, strawberry
How to use
pipeline = PretrainedPipeline('pipeline_image_classifier_vit_base_patch16_224_in21k_snacks', lang = 'en')
annotations = pipeline.transform(imageDF)
val pipeline = new PretrainedPipeline("pipeline_image_classifier_vit_base_patch16_224_in21k_snacks", lang = "en")
val annotations = pipeline.transform(imageDF)
Model Information
| Model Name: | pipeline_image_classifier_vit_base_patch16_224_in21k_snacks |
| Type: | pipeline |
| Compatibility: | Spark NLP 4.2.1+ |
| License: | Open Source |
| Edition: | Official |
| Language: | en |
| Size: | 322.0 MB |
Included Models
- ImageAssembler
- ViTForImageClassification