Match Pattern

Description

The match_pattern is a pretrained pipeline that we can use to process text with a simple pipeline that performs basic processing steps and matches pattrens . It performs most of the common text processing tasks on your dataframe

Predicted Entities

Download Copy S3 URI

How to use

from sparknlp.pretrained import PretrainedPipeline
pipeline = PretrainedPipeline("match_pattern", "en", "clinical/models")
	result = pipeline.annotate("""I love johnsnowlabs!  """)

Model Information

Model Name: match_pattern
Type: pipeline
Compatibility: Spark NLP 4.4.2+
License: Open Source
Edition: Official
Language: en
Size: 17.4 KB

Included Models

  • DocumentAssembler
  • SentenceDetector
  • TokenizerModel
  • RegexMatcherModel