Match Pattern

Description

match_pattern is a pretrained pipeline that runs basic text processing steps and then matches patterns in the text. It performs the most common text processing tasks on your dataframe.

How to use

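import sparknlp
from sparknlp.pretrained import PretrainedPipeline

# Start a Spark session with Spark NLP (skip if one is already running)
spark = sparknlp.start()
# Download and load the pretrained pipeline, then annotate a sample string
pipeline = PretrainedPipeline("match_pattern", "en", "clinical/models")
result = pipeline.annotate("""I love johnsnowlabs!  """)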

Model Information

Model Name:	match_pattern
Type:	pipeline
Compatibility:	Spark NLP 4.0.0+
License:	Open Source
Edition:	Community
Language:	en
Size:	29.0 KB

Included Models

DocumentAssembler
SentenceDetector
TokenizerModel
RegexMatcherModel
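
To make the role of each stage concrete, here is a minimal pure-Python sketch of what a pipeline with these four stages does conceptually. It is not the Spark NLP implementation. The regex rule, the splitting and tokenizing logic, and the output keys (`document`, `sentence`, `token`, `regex`) are all assumptions made for illustration, since this card does not list the rules shipped with match_pattern.

```python
import re

# Hypothetical rule: the real pipeline's regex rules are not listed in this card.
PATTERN = re.compile(r"[a-z]+labs", re.IGNORECASE)

def split_sentences(text):
    """Naive stand-in for SentenceDetector: split after '.', '!' or '?'."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]

def tokenize(sentence):
    """Naive stand-in for the tokenizer: words and single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", sentence)

def match_pattern(text):
    """Run the toy stages in order and collect each stage's output."""
    sentences = split_sentences(text)
    return {
        "document": [text],  # assumed key names, chosen for illustration
        "sentence": sentences,
        "token": [tok for s in sentences for tok in tokenize(s)],
        "regex": [m.group(0) for s in sentences for m in PATTERN.finditer(s)],
    }

result = match_pattern("I love johnsnowlabs!  ")
print(result["token"])  # ['I', 'love', 'johnsnowlabs', '!']
print(result["regex"])  # ['johnsnowlabs']
```

The stages run in order, and each one works on the output of the one before it. The pretrained pipeline chains its annotators in the same way.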