Match Pattern

Description

The match_pattern is a pretrained pipeline that we can use to process text with a simple pipeline that performs basic processing steps and matches pattrens . It performs most of the common text processing tasks on your dataframe

Download Copy S3 URI

How to use

from sparknlp.pretrained import PretrainedPipeline
pipeline = PretrainedPipeline("match_pattern", "en", "clinical/models")
	result = pipeline.annotate("""I love johnsnowlabs!  """)

Model Information

Model Name: match_pattern
Type: pipeline
Compatibility: Spark NLP 4.0.0+
License: Open Source
Edition: Community
Language: en
Size: 29.0 KB

Included Models

  • DocumentAssembler
  • SentenceDetector
  • TokenizerModel
  • RegexMatcherModel