XLM-RoBERTa Base NER Pipeline

Description

This pretrained pipeline is built on top of the xlm_roberta_base_token_classifier_ontonotes model.

Predicted Entities


How to use

Python:

from sparknlp.pretrained import PretrainedPipeline

pipeline = PretrainedPipeline("xlm_roberta_base_token_classifier_ontonotes_pipeline", lang = "en")
pipeline.annotate("My name is John and I have been working at John Snow Labs since November 2020.")

Scala:

import com.johnsnowlabs.nlp.pretrained.PretrainedPipeline

val pipeline = new PretrainedPipeline("xlm_roberta_base_token_classifier_ontonotes_pipeline", lang = "en")
pipeline.annotate("My name is John and I have been working at John Snow Labs since November 2020.")

Results

+--------------+---------+
|chunk         |ner_label|
+--------------+---------+
|John          |PERSON   |
|John Snow Labs|ORG      |
|November 2020 |DATE     |
+--------------+---------+
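
The table above pairs each detected chunk with its label. As a minimal sketch, the same layout can be reproduced in plain Python once the pairs have been collected. The helper name render_entity_table is hypothetical and not part of Spark NLP, and how the pairs are read out of the pipeline result depends on the pipeline's output column names, which this page does not list.

```python
def render_entity_table(pairs, headers=("chunk", "ner_label")):
    """Render (chunk, label) pairs as a fixed-width text table.

    Hypothetical helper for illustration only; it is not a Spark NLP API.
    """
    rows = [tuple(headers)] + [tuple(p) for p in pairs]
    # Each column is as wide as its longest cell, header included.
    widths = [max(len(row[i]) for row in rows) for i in range(2)]
    border = "+" + "+".join("-" * w for w in widths) + "+"

    def fmt(row):
        return "|" + "|".join(cell.ljust(w) for cell, w in zip(row, widths)) + "|"

    lines = [border, fmt(rows[0]), border]
    lines += [fmt(row) for row in rows[1:]]
    lines.append(border)
    return "\n".join(lines)


# The pairs below are copied from the Results table on this page.
pairs = [
    ("John", "PERSON"),
    ("John Snow Labs", "ORG"),
    ("November 2020", "DATE"),
]
print(render_entity_table(pairs))
```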



Model Information

Model Name: xlm_roberta_base_token_classifier_ontonotes_pipeline
Type: pipeline
Compatibility: Spark NLP 4.4.2+
License: Open Source
Edition: Official
Language: en
Size: 858.4 MB

Included Models

  • DocumentAssembler
  • SentenceDetectorDLModel
  • TokenizerModel
  • XlmRoBertaForTokenClassification
  • NerConverter
  • Finisher
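
To illustrate what the last stages of the list above contribute, here is a simplified sketch of how per-token labels from a token classifier can be merged into entity chunks, which is the role NerConverter plays in the pipeline. This is not Spark NLP's implementation. The function merge_iob is hypothetical, and the B-/I-/O tag scheme and the per-token tags in the example are assumptions chosen to match the Results table.

```python
def merge_iob(tokens, tags):
    """Merge IOB-tagged tokens into (chunk_text, label) pairs.

    Simplified illustration only; assumes tags of the form
    "B-LABEL", "I-LABEL", or "O".
    """
    chunks = []
    current_tokens, current_label = [], None

    def flush():
        # Emit the chunk collected so far, if any.
        if current_tokens:
            chunks.append((" ".join(current_tokens), current_label))

    for token, tag in zip(tokens, tags):
        prefix, _, label = tag.partition("-")
        if prefix == "B" or (prefix == "I" and label != current_label):
            # A new entity starts here (an I- tag with a different label
            # is treated as the start of a new chunk).
            flush()
            current_tokens, current_label = [token], label
        elif prefix == "I":
            # Continuation of the current entity.
            current_tokens.append(token)
        else:
            # "O": outside any entity, so close the open chunk.
            flush()
            current_tokens, current_label = [], None
    flush()
    return chunks


# Assumed tokenization and tags for the example sentence on this page.
tokens = ["My", "name", "is", "John", "and", "I", "have", "been", "working",
          "at", "John", "Snow", "Labs", "since", "November", "2020", "."]
tags = ["O", "O", "O", "B-PERSON", "O", "O", "O", "O", "O",
        "O", "B-ORG", "I-ORG", "I-ORG", "O", "B-DATE", "I-DATE", "O"]
print(merge_iob(tokens, tags))
```

With these assumed tags, the merged chunks correspond to the three rows of the Results table.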