Description
Pretrained Wav2vec2 pipeline, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.asr_wav2vec2_large_xls_r_300m_Tatar is a Tatar model originally trained by kingabzpro.
NOTE: This pipeline only works on a CPU, if you need to use this pipeline on a GPU device please use pipeline_asr_wav2vec2_large_xls_r_300m_Tatar_gpu
How to use
pipeline = PretrainedPipeline('pipeline_asr_wav2vec2_large_xls_r_300m_Tatar', lang = 'tt')
annotations = pipeline.transform(audioDF)
val pipeline = new PretrainedPipeline("pipeline_asr_wav2vec2_large_xls_r_300m_Tatar", lang = "tt")
val annotations = pipeline.transform(audioDF)
Model Information
| Model Name: | pipeline_asr_wav2vec2_large_xls_r_300m_Tatar |
| Type: | pipeline |
| Compatibility: | Spark NLP 4.2.0+ |
| License: | Open Source |
| Edition: | Official |
| Language: | tt |
| Size: | 1.2 GB |
Included Models
- AudioAssembler
- Wav2Vec2ForCTC