Description
match_pattern is a pretrained pipeline that runs basic text-processing steps (document assembly, sentence detection, and tokenization) and then matches patterns in the text with a regex matcher. It covers the common preprocessing needed before pattern matching, so you can run it directly on the raw text in your dataframe.
How to use
from sparknlp.pretrained import PretrainedPipeline

# Download the pretrained pipeline, then annotate a sample string
pipeline = PretrainedPipeline("match_pattern", "en", "clinical/models")
result = pipeline.annotate("""I love johnsnowlabs!  """)
Model Information
| Model Name: | match_pattern | 
| Type: | pipeline | 
| Compatibility: | Spark NLP 4.0.0+ | 
| License: | Open Source | 
| Edition: | Community | 
| Language: | en | 
| Size: | 29.0 KB | 
Included Models
- DocumentAssembler
- SentenceDetector
- TokenizerModel
- RegexMatcherModel
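
To make the order of the stages listed above concrete, here is a rough plain-Python analogue that uses only the standard `re` module. It is a sketch, not the Spark NLP implementation: the regex rule, the naive sentence and token splitting, and the output keys (`sentence`, `token`, `regex`) are all assumptions chosen for illustration, since this card does not list the rules or output columns that ship with match_pattern.

```python
import re
from typing import Dict, List

# Hypothetical rule for illustration only; the patterns bundled with
# match_pattern are not documented in this card.
PATTERN = re.compile(r"\b[a-z]+labs\b", re.IGNORECASE)


def split_sentences(document: str) -> List[str]:
    """Naive stand-in for sentence detection: split after ., ! or ?."""
    parts = re.split(r"(?<=[.!?])\s+", document.strip())
    return [p for p in parts if p]


def tokenize(sentence: str) -> List[str]:
    """Naive stand-in for tokenization: words and single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", sentence)


def run_stages(text: str) -> Dict[str, List[str]]:
    """Run the stages in order: document -> sentences -> tokens -> regex matches."""
    sentences = split_sentences(text)
    tokens = [tok for s in sentences for tok in tokenize(s)]
    matches = [m.group(0) for s in sentences for m in PATTERN.finditer(s)]
    # The key names are assumed, not taken from the real pipeline's output.
    return {"sentence": sentences, "token": tokens, "regex": matches}


result = run_stages("I love johnsnowlabs!  ")
print(result)
```

The real pipeline's `annotate` call likewise returns a dictionary of lists, but its keys and matches depend on the stages' configured output columns and rules, so inspect `result.keys()` on the actual output instead of relying on the names used here.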