NER Pipeline for 9 African Languages

Description

This pretrained pipeline is built on top of the distilbert_base_token_classifier_masakhaner model.

Predicted Entities

How to use

# Python
from sparknlp.pretrained import PretrainedPipeline

masakhaner_pipeline = PretrainedPipeline("distilbert_base_token_classifier_masakhaner_pipeline", lang = "xx")

masakhaner_pipeline.annotate("Ilé-iṣẹ́ẹ Mohammed Sani Musa, Activate Technologies Limited, ni ó kó ẹ̀rọ Ìwé-pélébé Ìdìbò Alálòpẹ́ (PVCs) tí a lò fún ìbò ọdún-un 2019, ígbà tí ó jẹ́ òǹdíjedupò lábẹ́ ẹgbẹ́ olóṣèlúu tí ó ń tukọ̀ ètò ìṣèlú lọ́wọ́ All rogressives Congress (APC) fún Aṣojú Ìlà-Oòrùn Niger, ìyẹn gẹ́gẹ́ bí ilé iṣẹ́ aṣèwádìí, Premium Times ṣe tẹ̀ ẹ́ jáde.")

// Scala
import com.johnsnowlabs.nlp.pretrained.PretrainedPipeline

val masakhaner_pipeline = new PretrainedPipeline("distilbert_base_token_classifier_masakhaner_pipeline", lang = "xx")

masakhaner_pipeline.annotate("Ilé-iṣẹ́ẹ Mohammed Sani Musa, Activate Technologies Limited, ni ó kó ẹ̀rọ Ìwé-pélébé Ìdìbò Alálòpẹ́ (PVCs) tí a lò fún ìbò ọdún-un 2019, ígbà tí ó jẹ́ òǹdíjedupò lábẹ́ ẹgbẹ́ olóṣèlúu tí ó ń tukọ̀ ètò ìṣèlú lọ́wọ́ All rogressives Congress (APC) fún Aṣojú Ìlà-Oòrùn Niger, ìyẹn gẹ́gẹ́ bí ilé iṣẹ́ aṣèwádìí, Premium Times ṣe tẹ̀ ẹ́ jáde.")

Results

+-----------------------------+---------+
|chunk                        |ner_label|
+-----------------------------+---------+
|Mohammed Sani Musa           |PER      |
|Activate Technologies Limited|ORG      |
|ọdún-un 2019                 |DATE     |
|All rogressives Congress     |ORG      |
|APC                          |ORG      |
|Aṣojú Ìlà-Oòrùn Niger        |LOC      |
|Premium Times                |ORG      |
+-----------------------------+---------+
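
If only the printed table is available (for example, copied from a notebook cell), it can be turned back into structured data with a few lines of plain Python. The sketch below is illustrative only: the helper names `parse_show_table` and `group_by_label` are hypothetical, it uses a subset of the rows shown above, and it assumes no cell contains a `|` character.

```python
from collections import defaultdict


def parse_show_table(text):
    """Parse a Spark show()-style ASCII table into a list of row dicts."""
    rows = []
    header = None
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and +-----+ border lines.
        if not line.startswith("|"):
            continue
        cells = [cell.strip() for cell in line.strip("|").split("|")]
        if header is None:
            header = cells  # the first data line holds the column names
        else:
            rows.append(dict(zip(header, cells)))
    return rows


def group_by_label(rows):
    """Group chunk texts by their entity label, preserving row order."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row["ner_label"]].append(row["chunk"])
    return dict(grouped)


# A subset of the rows from the Results table above.
table = """
+-----------------------------+---------+
|chunk                        |ner_label|
+-----------------------------+---------+
|Mohammed Sani Musa           |PER      |
|Activate Technologies Limited|ORG      |
|APC                          |ORG      |
|Premium Times                |ORG      |
+-----------------------------+---------+
"""

entities = group_by_label(parse_show_table(table))
print(entities)
```

Grouping by label makes it easy to pull out, say, every organization mentioned in a document without re-running the pipeline.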



Model Information

Model Name: distilbert_base_token_classifier_masakhaner_pipeline
Type: pipeline
Compatibility: Spark NLP 4.4.2+
License: Open Source
Edition: Official
Language: xx
Size: 505.8 MB

Included Models

  • DocumentAssembler
  • SentenceDetector
  • TokenizerModel
  • DistilBertForTokenClassification
  • NerConverter
  • Finisher
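
The stages listed above could be assembled by hand roughly as follows. This is a minimal sketch, not the pipeline's actual definition: the column names (`text`, `document`, `sentence`, `token`, `ner`, `ner_chunk`) and the function names are assumptions, and the packaged pipeline may configure its stages differently. `TokenizerModel` in the list is the fitted form of the `Tokenizer` stage used here. The Spark NLP imports sit inside the functions so the definitions load even where Spark NLP is not installed; calling them requires a running session, for example one created with `sparknlp.start()`.

```python
# Stage order as listed under "Included Models".
STAGE_NAMES = [
    "DocumentAssembler",
    "SentenceDetector",
    "TokenizerModel",
    "DistilBertForTokenClassification",
    "NerConverter",
    "Finisher",
]


def build_masakhaner_stages():
    """Return unfitted stages mirroring the list above (column names are assumed)."""
    from sparknlp.base import DocumentAssembler, Finisher
    from sparknlp.annotator import (
        DistilBertForTokenClassification,
        NerConverter,
        SentenceDetector,
        Tokenizer,
    )

    document_assembler = DocumentAssembler().setInputCol("text").setOutputCol("document")
    sentence_detector = SentenceDetector().setInputCols(["document"]).setOutputCol("sentence")
    # Tokenizer becomes a TokenizerModel once the pipeline is fitted.
    tokenizer = Tokenizer().setInputCols(["sentence"]).setOutputCol("token")
    # Downloads the underlying model named in the Description section.
    token_classifier = (
        DistilBertForTokenClassification.pretrained(
            "distilbert_base_token_classifier_masakhaner", "xx"
        )
        .setInputCols(["sentence", "token"])
        .setOutputCol("ner")
    )
    # Merges token-level tags into entity chunks.
    ner_converter = NerConverter().setInputCols(["sentence", "token", "ner"]).setOutputCol("ner_chunk")
    # Converts annotation structs into plain string arrays.
    finisher = Finisher().setInputCols(["ner_chunk"])
    return [
        document_assembler,
        sentence_detector,
        tokenizer,
        token_classifier,
        ner_converter,
        finisher,
    ]


def fit_masakhaner_pipeline(spark):
    """Fit the stages on an empty DataFrame, the usual pattern for pretrained stages."""
    from pyspark.ml import Pipeline

    empty_df = spark.createDataFrame([[""]]).toDF("text")
    return Pipeline(stages=build_masakhaner_stages()).fit(empty_df)
```

Building the stages manually is mainly useful when one of them needs different settings, such as dropping the `Finisher` to keep full annotation metadata; otherwise the pretrained pipeline is the simpler route.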