Description
Pretrained ViTForImageClassification model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.deit_base_patch16_224 is a English model originally trained by facebook.
How to use
		
		
image_assembler = ImageAssembler()\
  .setInputCol("image")\
  .setOutputCol("image_assembler")
imageClassifier = ViTForImageClassification.pretrained(""deit_base_patch16_224","en")\
  .setInputCols("image_assembler")\
  .setOutputCol("class")
pipeline = Pipeline(stages=[
  image_assembler,
  imageClassifier,
])
pipelineModel = pipeline.fit(imageDF)
pipelineDF = pipelineModel.transform(imageDF)
val imageAssembler = new ImageAssembler()
    .setInputCol("image")
    .setOutputCol("image_assembler")
val imageClassifier =  ViTForImageClassification.pretrained("deit_base_patch16_224","en") 
    .setInputCols("image_assembler") 
    .setOutputCol("class") 
val pipeline = new Pipeline().setStages(Array(imageAssembler, imageClassifier))
val pipelineModel = pipeline.fit(imageDF)
val pipelineDF = pipelineModel.transform(imageDF)
Model Information
| Model Name: | deit_base_patch16_224 | 
| Compatibility: | Spark NLP 5.5.1+ | 
| License: | Open Source | 
| Edition: | Official | 
| Input Labels: | [image_assembler] | 
| Output Labels: | [label] | 
| Language: | en | 
| Size: | 324.1 MB | 
References
https://huggingface.co/facebook/deit-base-patch16-224