sparknlp.reader.layout_aligner_for_vision
Module Contents
Classes
LayoutAlignerForVision: Aligns document chunks with nearby images and emits paired outputs.
- class LayoutAlignerForVision
Aligns document chunks with nearby images and emits paired outputs.
The output is written to three derived columns based on
outputCol: <outputCol>_doc, <outputCol>_image, and <outputCol>_prompt.

Input Annotation types: DOCUMENT, IMAGE
Output Annotation type: DOCUMENT
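
The derived column names described above follow a plain suffix pattern on outputCol. The sketch below illustrates only that naming rule. The helper `derived_columns` is hypothetical and not part of the Spark NLP API, and it does not call LayoutAlignerForVision itself.

```python
# Hypothetical helper (not part of Spark NLP): illustrates the naming rule
# "<outputCol>_doc, <outputCol>_image, <outputCol>_prompt" from the class docs.
from typing import Dict

SUFFIXES = ("doc", "image", "prompt")


def derived_columns(output_col: str) -> Dict[str, str]:
    """Map each suffix to the derived column name for the given outputCol value."""
    if not output_col:
        raise ValueError("output_col must be a non-empty string")
    return {suffix: f"{output_col}_{suffix}" for suffix in SUFFIXES}


# Example: with outputCol set to "aligned", these are the columns to select downstream.
columns = derived_columns("aligned")
print(list(columns.values()))
```

A mapping like this can be handy when selecting the three columns from the transformed DataFrame, assuming the annotator was configured with the same outputCol value.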