Class AbstractBankPDFPageFirst

java.lang.Object
org.apache.pdfbox.contentstream.PDFStreamEngine
org.apache.pdfbox.text.PDFTextStripper
org.apache.pdfbox.text.PDFTextStripperByArea
de.frankmuenster.mahoe.pdfextractor.AbstractBankPDFPage
de.frankmuenster.mahoe.pdfextractor.AbstractBankPDFPageFirst
Direct Known Subclasses:
SantanderPdfPageFirst, TargoBankPdfPageFirst

public abstract class AbstractBankPDFPageFirst extends AbstractBankPDFPage
Defines the first page of a PDF account sheet. It extends AbstractBankPDFPage to add regions for general account information such as account number, owner, and balance.
Author:
Frank Münster
  • Field Summary

    Fields inherited from class org.apache.pdfbox.text.PDFTextStripper

    charactersByArticle, document, LINE_SEPARATOR, output
  • Constructor Summary

    Constructors
    Modifier
    Constructor
    Description
    protected
    AbstractBankPDFPageFirst()
    Standard constructor; calls the superclass constructor.
  • Method Summary

    Modifier and Type
    Method
    Description
    protected float
    computeFontHeight(org.apache.pdfbox.pdmodel.font.PDFont arg0)
     
    protected abstract Map<String,Rectangle>
    defineFirstPageRegions()
    Defines the regions in a Map.
    protected void
    showGlyph(org.apache.pdfbox.util.Matrix arg0, org.apache.pdfbox.pdmodel.font.PDFont arg1, int arg2, org.apache.pdfbox.util.Vector arg3)
     

    Methods inherited from class de.frankmuenster.mahoe.pdfextractor.AbstractBankPDFPage

    defineRegions, getRectangleFrom, getXPos, setRegion, writeString

    Methods inherited from class org.apache.pdfbox.text.PDFTextStripperByArea

    addRegion, extractRegions, getRegions, getTextForRegion, processTextPosition, removeRegion, setShouldSeparateByBeads, writePage

    Methods inherited from class org.apache.pdfbox.text.PDFTextStripper

    endArticle, endDocument, endPage, getAddMoreFormatting, getArticleEnd, getArticleStart, getAverageCharTolerance, getCharactersByArticle, getCurrentPageNo, getDropThreshold, getEndBookmark, getEndPage, getIndentThreshold, getLineSeparator, getListItemPatterns, getOutput, getPageEnd, getPageStart, getParagraphEnd, getParagraphStart, getSeparateByBeads, getSortByPosition, getSpacingTolerance, getStartBookmark, getStartPage, getSuppressDuplicateOverlappingText, getText, getWordSeparator, matchPattern, processPage, processPages, setAddMoreFormatting, setArticleEnd, setArticleStart, setAverageCharTolerance, setDropThreshold, setEndBookmark, setEndPage, setIndentThreshold, setLineSeparator, setListItemPatterns, setPageEnd, setPageStart, setParagraphEnd, setParagraphStart, setSortByPosition, setSpacingTolerance, setStartBookmark, setStartPage, setSuppressDuplicateOverlappingText, setWordSeparator, startArticle, startArticle, startDocument, startPage, writeCharacters, writeLineSeparator, writePageEnd, writePageStart, writeParagraphEnd, writeParagraphSeparator, writeParagraphStart, writeString, writeText, writeWordSeparator

    Methods inherited from class org.apache.pdfbox.contentstream.PDFStreamEngine

    addOperator, applyTextAdjustment, beginMarkedContentSequence, beginText, decreaseLevel, endMarkedContentSequence, endText, getAppearance, getCurrentPage, getGraphicsStackSize, getGraphicsState, getInitialMatrix, getLevel, getResources, getTextLineMatrix, getTextMatrix, increaseLevel, operatorException, processAnnotation, processChildStream, processOperator, processOperator, processSoftMask, processTilingPattern, processTilingPattern, processTransparencyGroup, processType3Stream, restoreGraphicsStack, restoreGraphicsState, saveGraphicsStack, saveGraphicsState, setLineDashPattern, setTextLineMatrix, setTextMatrix, showAnnotation, showFontGlyph, showForm, showText, showTextString, showTextStrings, showTransparencyGroup, showType3Glyph, transformedPoint, transformWidth, unsupportedOperator

    Methods inherited from class java.lang.Object

    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
  • Constructor Details

    • AbstractBankPDFPageFirst

      protected AbstractBankPDFPageFirst() throws IOException
      Standard constructor; calls the superclass constructor.
      Throws:
      IOException
  • Method Details

    • defineFirstPageRegions

      protected abstract Map<String,Rectangle> defineFirstPageRegions()
      Defines the regions in a Map. The rectangle values of each region must be convertible to PDF measures. The following keys need to be defined in the Map:
      • AbstractBankPDFPage.KARTEN_KONTO
      • AbstractBankPDFPage.KARTEN_INHABER
      • AbstractBankPDFPage.RECHNUNGS_DATUM
      • AbstractBankPDFPage.SALDO
      • AbstractBankPDFPage.FAELLIG_BETRAG
      • AbstractBankPDFPage.FAELLIG_DATUM
      • AbstractBankPDFPage.FAELLIG_IBAN
      • AbstractBankPDFPage.FAELLIG_BIC
      Returns:
      a Map with the regions defined
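
      A minimal sketch of what an implementation of defineFirstPageRegions() might return. It is not taken from SantanderPdfPageFirst or TargoBankPdfPageFirst. Several things here are assumptions: Rectangle is taken to be java.awt.Rectangle, the string constants are local stand-ins for the AbstractBankPDFPage keys (their real values are not documented on this page), all coordinates are invented, and the region values are assumed to be in millimetres. The hypothetical helper mmToPoints only illustrates one possible conversion to PDF measures, using the default PDF user-space unit of 1/72 inch.

```java
import java.awt.Rectangle;
import java.util.LinkedHashMap;
import java.util.Map;

// Sketch only: stands in for a concrete subclass of AbstractBankPDFPageFirst.
class FirstPageRegionsSketch {

    // Hypothetical stand-ins for the AbstractBankPDFPage key constants;
    // the real constant values are not shown in this documentation.
    static final String KARTEN_KONTO = "KARTEN_KONTO";
    static final String KARTEN_INHABER = "KARTEN_INHABER";
    static final String RECHNUNGS_DATUM = "RECHNUNGS_DATUM";
    static final String SALDO = "SALDO";
    static final String FAELLIG_BETRAG = "FAELLIG_BETRAG";
    static final String FAELLIG_DATUM = "FAELLIG_DATUM";
    static final String FAELLIG_IBAN = "FAELLIG_IBAN";
    static final String FAELLIG_BIC = "FAELLIG_BIC";

    // Assumption: region values are given in millimetres.
    // One default PDF user-space unit is 1/72 inch, and 1 inch is 25.4 mm.
    static float mmToPoints(float mm) {
        return mm * 72f / 25.4f;
    }

    // Builds the map with all eight required keys.
    // Rectangle(x, y, width, height); every coordinate below is made up.
    static Map<String, Rectangle> defineFirstPageRegions() {
        Map<String, Rectangle> regions = new LinkedHashMap<>();
        regions.put(KARTEN_KONTO, new Rectangle(20, 40, 80, 6));
        regions.put(KARTEN_INHABER, new Rectangle(20, 48, 80, 6));
        regions.put(RECHNUNGS_DATUM, new Rectangle(130, 40, 50, 6));
        regions.put(SALDO, new Rectangle(130, 60, 50, 6));
        regions.put(FAELLIG_BETRAG, new Rectangle(130, 68, 50, 6));
        regions.put(FAELLIG_DATUM, new Rectangle(130, 76, 50, 6));
        regions.put(FAELLIG_IBAN, new Rectangle(20, 90, 100, 6));
        regions.put(FAELLIG_BIC, new Rectangle(20, 98, 100, 6));
        return regions;
    }

    public static void main(String[] args) {
        // Print each region with its origin converted to PDF points.
        for (Map.Entry<String, Rectangle> e : defineFirstPageRegions().entrySet()) {
            Rectangle r = e.getValue();
            System.out.println(e.getKey() + " -> x=" + mmToPoints(r.x)
                    + "pt, y=" + mmToPoints(r.y) + "pt");
        }
    }
}
```

      In a real subclass the method would be an instance method overriding the abstract declaration, and the keys would be the constants inherited from AbstractBankPDFPage rather than local strings.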
    • showGlyph

      protected void showGlyph(org.apache.pdfbox.util.Matrix arg0, org.apache.pdfbox.pdmodel.font.PDFont arg1, int arg2, org.apache.pdfbox.util.Vector arg3) throws IOException
      Overrides:
      showGlyph in class org.apache.pdfbox.contentstream.PDFStreamEngine
      Throws:
      IOException
    • computeFontHeight

      protected float computeFontHeight(org.apache.pdfbox.pdmodel.font.PDFont arg0) throws IOException
      Throws:
      IOException