com.johnsnowlabs.nlp.annotators.sbd.pragmatic
PragmaticContentFormatter
Companion object PragmaticContentFormatter
class PragmaticContentFormatter extends AnyRef
rule-based formatter that adds regex rules to different marking steps Symbols protect from ambiguous bounds to be considered splitters
- Alphabetic
- By Inheritance
- PragmaticContentFormatter
- AnyRef
- Any
- Hide All
- Show All
- Public
- All
Instance Constructors
-
new
PragmaticContentFormatter(text: String)
- text
text to tag, which is modified in place with Symbols
Value Members
-
final
def
!=(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
##(): Int
- Definition Classes
- AnyRef → Any
-
final
def
==(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
asInstanceOf[T0]: T0
- Definition Classes
- Any
-
def
clone(): AnyRef
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native()
-
final
def
eq(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
def
equals(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
def
finalize(): Unit
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( classOf[java.lang.Throwable] )
-
def
finish: String
ToDo: NOT DOING replace_parens IN LISTS
-
def
formatAbbreviations(useDictAbbreviations: Boolean): PragmaticContentFormatter.this.type
Find abbreviations in non sentence breaks regex should match escape character replace with non separation symbol
- def formatBasicBreakers: PragmaticContentFormatter.this.type
-
def
formatBetweenPunctuations: PragmaticContentFormatter.this.type
Find punctuation rules NON-BREAKING characters regex should match entire wrapped sentence protect entire sentence
-
def
formatCustomBounds(factory: RuleFactory): PragmaticContentFormatter.this.type
Arbitrarely mark bounds with user provided characters
-
def
formatEllipsisRules: PragmaticContentFormatter.this.type
Find ellipsis BREAKING characters WITH REPLACEMENT regex should match the ellipsis replace with non breaking symbol
-
def
formatExclamationPoint: PragmaticContentFormatter.this.type
Specific cases for exclamation marks regex should match exclamation mark replace with symbol
-
def
formatGeoLocations: PragmaticContentFormatter.this.type
Find specific coordinates non-breaking characters regex should match non breaking symbol replace with non breaking symbol
-
def
formatLists(useLists: Boolean): PragmaticContentFormatter.this.type
Find simple lists regex should match entire enumeration prepend separation symbol
-
def
formatMultiplePeriods: PragmaticContentFormatter.this.type
Find sentence multiple non-breaking character regex should match group 2 as symbol replace with symbol
-
def
formatNumbers: PragmaticContentFormatter.this.type
Find numbers in non sentence breaks regex should match escape character replace with non separation symbol
-
def
formatPunctuations: PragmaticContentFormatter.this.type
Find sentence breaking symbols regex should match entire symbol append end breaking symbol
-
def
formatQuotationMarkInQuotation: PragmaticContentFormatter.this.type
Specific case for question mark in quotes regex should match question mark replace with symbol
-
final
def
getClass(): Class[_]
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
-
def
hashCode(): Int
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
-
final
def
isInstanceOf[T0]: Boolean
- Definition Classes
- Any
-
final
def
ne(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
final
def
notify(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
-
final
def
notifyAll(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
-
final
def
synchronized[T0](arg0: ⇒ T0): T0
- Definition Classes
- AnyRef
-
def
toString(): String
- Definition Classes
- AnyRef → Any
-
final
def
wait(): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
final
def
wait(arg0: Long, arg1: Int): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
final
def
wait(arg0: Long): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native()