Arabic asr_swin_exp_w2v2t_ar_hubert_s290 TFHubertForCTC from jonatasgrosman

Description

Pretrained Hubert model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.asr_swin_exp_w2v2t_ar_hubert_s290 is a Arabic model originally trained by jonatasgrosman.

Download Copy S3 URI

How to use


audio_assembler = AudioAssembler() \
    .setInputCol("audio_content") \
    .setOutputCol("audio_assembler")

speech_to_text = HubertForCTC \
    .pretrained("asr_swin_exp_w2v2t_ar_hubert_s290", "ar")\
    .setInputCols("audio_assembler") \
    .setOutputCol("text")

pipeline = Pipeline(stages=[
  audio_assembler,
  speech_to_text,
])

pipelineModel = pipeline.fit(audioDf)

pipelineDF = pipelineModel.transform(audioDf)

val audioAssembler = new AudioAssembler()
    .setInputCol("audio_content") 
    .setOutputCol("audio_assembler")

val speechToText = HubertForCTC
    .pretrained("asr_swin_exp_w2v2t_ar_hubert_s290", "ar")
    .setInputCols("audio_assembler") 
    .setOutputCol("text") 

val pipeline = new Pipeline().setStages(Array(audioAssembler, speechToText))

val pipelineModel = pipeline.fit(audioDf)

val pipelineDF = pipelineModel.transform(audioDf)

Model Information

Model Name: asr_swin_exp_w2v2t_ar_hubert_s290
Compatibility: Spark NLP 4.4.0+
License: Open Source
Edition: Official
Input Labels: [image_assembler]
Output Labels: [class]
Language: ar
Size: 2.4 GB