sparknlp.reader.layout_aligner_for_text#

Module Contents#

Classes#

LayoutAlignerForText

Rebuilds final text by combining aligned document chunks and image captions.

class LayoutAlignerForText[source]#

Rebuilds final text by combining aligned document chunks and image captions.

This transformer is designed to consume aligned_doc + image_caption pairs and produce coherent text output with re-computed begin and end indexes.

Input Annotation types

Output Annotation type

DOCUMENT, DOCUMENT

DOCUMENT

name = 'LayoutAlignerForText'[source]#
inputAnnotatorTypes[source]#
outputAnnotatorType = 'document'[source]#
joinDelimiter[source]#
inlinePrefixThreshold[source]#
explodeElements[source]#
setParams()[source]#
setJoinDelimiter(value)[source]#
setInlinePrefixThreshold(value)[source]#
setExplodeElements(value)[source]#