Description
Pretrained Wav2vec2 pipeline, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.asr_wav2vec2_base_timit_demo_google_colab_by_hnhoangdz
is a English model originally trained by hnhoangdz.
NOTE: This pipeline only works on a CPU, if you need to use this pipeline on a GPU device please use pipeline_asr_wav2vec2_base_timit_demo_google_colab_by_hnhoangdz_gpu
How to use
pipeline = PretrainedPipeline('pipeline_asr_wav2vec2_base_timit_demo_google_colab_by_hnhoangdz', lang = 'en')
annotations = pipeline.transform(audioDF)
val pipeline = new PretrainedPipeline("pipeline_asr_wav2vec2_base_timit_demo_google_colab_by_hnhoangdz", lang = "en")
val annotations = pipeline.transform(audioDF)
Model Information
Model Name: | pipeline_asr_wav2vec2_base_timit_demo_google_colab_by_hnhoangdz |
Type: | pipeline |
Compatibility: | Spark NLP 4.4.0+ |
License: | Open Source |
Edition: | Official |
Language: | en |
Size: | 349.3 MB |
Included Models
- AudioAssembler
- Wav2Vec2ForCTC