`sparknlp.reader.layout_aligner_for_text`#

Module Contents#

Classes#

LayoutAlignerForText

Rebuilds final text by combining aligned document chunks and image captions.

class LayoutAlignerForText[source]#

Rebuilds final text by combining aligned document chunks and image captions.

This transformer is designed to consume aligned_doc + image_caption pairs and produce coherent text output with re-computed begin and end indexes.

Input Annotation types	Output Annotation type
`DOCUMENT, DOCUMENT`	`DOCUMENT`

name = 'LayoutAlignerForText'[source]#

inputAnnotatorTypes[source]#

outputAnnotatorType = 'document'[source]#

joinDelimiter[source]#

inlinePrefixThreshold[source]#

explodeElements[source]#

setParams()[source]#

setJoinDelimiter(value)[source]#

setInlinePrefixThreshold(value)[source]#

setExplodeElements(value)[source]#

sparknlp.reader.layout_aligner_for_text#

Module Contents#

Classes#

`sparknlp.reader.layout_aligner_for_text`#