Typo Detector Pipeline for Icelandic

Description

This pretrained pipeline is built on top of the distilbert_token_classifier_typo_detector_is model.

Predicted Entities

How to use

# Python
from sparknlp.pretrained import PretrainedPipeline

typo_pipeline = PretrainedPipeline("distilbert_token_classifier_typo_detector_pipeline", lang = "is")

typo_pipeline.annotate("Það er miög auðvelt að draga marktækar álykanir af texta með Spark NLP.")

// Scala
import com.johnsnowlabs.nlp.pretrained.PretrainedPipeline

val typo_pipeline = new PretrainedPipeline("distilbert_token_classifier_typo_detector_pipeline", lang = "is")

typo_pipeline.annotate("Það er miög auðvelt að draga marktækar álykanir af texta með Spark NLP.")

Results

+--------+---------+
|chunk   |ner_label|
+--------+---------+
|miög    |PO       |
|álykanir|PO       |
+--------+---------+
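
Once the flagged chunks are available, it can help to show them in context. The following is a minimal sketch in plain Python, assuming the chunks have already been collected into a list of strings. How to read them out of the pipeline output depends on its output column names, which this page does not state. The helper name mark_typos and the bracket markers are hypothetical and not part of Spark NLP.

```python
import re

def mark_typos(text, chunks, left="[", right="]"):
    """Wrap every whole-word occurrence of a flagged chunk in marker characters.

    Hypothetical helper for illustration only; not a Spark NLP API.
    """
    if not chunks:
        return text
    # Try longer chunks first so a shorter chunk cannot shadow a longer one.
    ordered = sorted(set(chunks), key=len, reverse=True)
    alternatives = "|".join(re.escape(c) for c in ordered)
    # (?<!\w) and (?!\w) keep matches on word boundaries; \w is Unicode-aware
    # for str patterns, so Icelandic letters count as word characters.
    pattern = re.compile(r"(?<!\w)(" + alternatives + r")(?!\w)")
    return pattern.sub(lambda m: left + m.group(1) + right, text)

sentence = "Það er miög auðvelt að draga marktækar álykanir af texta með Spark NLP."
flagged = ["miög", "álykanir"]  # chunks labelled PO in the table above
print(mark_typos(sentence, flagged))
```

With the two chunks from the table, the sketch brackets "miög" and "álykanir" and leaves the rest of the sentence unchanged.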


Model Information

Model Name: distilbert_token_classifier_typo_detector_pipeline
Type: pipeline
Compatibility: Spark NLP 4.4.2+
License: Open Source
Edition: Official
Language: is
Size: 505.8 MB

Included Models

  • DocumentAssembler
  • TokenizerModel
  • DistilBertForTokenClassification
  • NerConverter
  • Finisher