Arabic asr_swin_exp_w2v2t_ar_hubert_s290 TFHubertForCTC from jonatasgrosman

Description

Pretrained Hubert model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.asr_swin_exp_w2v2t_ar_hubert_s290 is a Arabic model originally trained by jonatasgrosman.

Download Copy S3 URI

How to use

audio_assembler = AudioAssembler() \
    .setInputCol("audio_content") \
    .setOutputCol("audio_assembler")

speech_to_text = HubertForCTC \
    .pretrained("asr_swin_exp_w2v2t_ar_hubert_s290", "ar")\
    .setInputCols("audio_assembler") \
    .setOutputCol("text")

pipeline = Pipeline(stages=[
  audio_assembler,
  speech_to_text,
])

pipelineModel = pipeline.fit(audioDf)

pipelineDF = pipelineModel.transform(audioDf)

val audioAssembler = new AudioAssembler()
    .setInputCol("audio_content") 
    .setOutputCol("audio_assembler")

val speechToText = HubertForCTC
    .pretrained("asr_swin_exp_w2v2t_ar_hubert_s290", "ar")
    .setInputCols("audio_assembler") 
    .setOutputCol("text") 

val pipeline = new Pipeline().setStages(Array(audioAssembler, speechToText))

val pipelineModel = pipeline.fit(audioDf)

val pipelineDF = pipelineModel.transform(audioDf)

Model Information

Model Name:	asr_swin_exp_w2v2t_ar_hubert_s290
Compatibility:	Spark NLP 4.4.0+
License:	Open Source
Edition:	Official
Input Labels:	[image_assembler]
Output Labels:	[class]
Language:	ar
Size:	2.4 GB

PREVIOUSEnglish asr_swin_distilhubert_timit TFHubertForCTC from patrickvonplaten

NEXTArabic asr_swin_exp_w2v2t_ar_hubert_s693 TFHubertForCTC from jonatasgrosman