Package org.apache.pdfbox.pdfparser
The pdfparser package contains classes to parse PDF documents and objects within the document.

Interface Summary

Interface    Description
PDFXRef      -

Class Summary

Class                    Description
BaseParser               Contains parsing logic shared by both the PDFParser and the COSStreamParser.
ConformingPDFParser      -
NonSequentialPDFParser   A PDFParser that first reads startxref and the xref tables to determine which objects are valid, and parses only those objects.
PDFObjectStreamParser    Parses a PDF 1.5 object stream and extracts all of the objects from the stream.
PDFParser                Handles the parsing of the PDF document.
PDFStreamParser          Parses a PDF byte stream and extracts its operands and operators.
PDFXRefStream            -
PDFXrefStreamParser      Parses a PDF 1.5 (or later) xref stream and extracts the xref information from the stream.
VisualSignatureParser    -
XrefTrailerResolver      Collects all XRef/trailer objects and, after all objects are read, builds the correct xref/trailer information using startxref and 'Prev' information (unused XRef/trailer objects are discarded).
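
The NonSequentialPDFParser entry above mentions reading startxref before anything else. As a rough illustration of what that first step involves, the following is a minimal sketch that scans a PDF byte array backwards for the last "startxref" keyword and reads the decimal offset that follows it. It is not PDFBox code: the class StartXrefLocator and its methods are hypothetical names chosen for this example, it uses only the Java standard library, and it assumes the whole file fits in memory.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not part of org.apache.pdfbox.pdfparser.
final class StartXrefLocator {

    private static final byte[] KEYWORD =
            "startxref".getBytes(StandardCharsets.ISO_8859_1);

    /**
     * Returns the offset written after the last "startxref" keyword,
     * or -1 if the keyword or a following number cannot be found.
     */
    static long findStartXref(byte[] pdf) {
        // Scan backwards so that the last startxref in the file wins.
        for (int i = pdf.length - KEYWORD.length; i >= 0; i--) {
            if (matchesAt(pdf, i)) {
                return parseOffset(pdf, i + KEYWORD.length);
            }
        }
        return -1;
    }

    // True if the keyword bytes occur in data starting at pos.
    private static boolean matchesAt(byte[] data, int pos) {
        for (int k = 0; k < KEYWORD.length; k++) {
            if (data[pos + k] != KEYWORD[k]) {
                return false;
            }
        }
        return true;
    }

    // Skips whitespace, then reads a run of ASCII digits as a decimal number.
    private static long parseOffset(byte[] data, int pos) {
        while (pos < data.length && isWhitespace(data[pos])) {
            pos++;
        }
        long value = 0;
        boolean sawDigit = false;
        while (pos < data.length && data[pos] >= '0' && data[pos] <= '9') {
            value = value * 10 + (data[pos] - '0');
            sawDigit = true;
            pos++;
        }
        return sawDigit ? value : -1;
    }

    // The whitespace characters defined by the PDF syntax.
    private static boolean isWhitespace(byte b) {
        return b == ' ' || b == '\r' || b == '\n' || b == '\t' || b == '\f' || b == 0;
    }

    public static void main(String[] args) {
        byte[] sample = "%PDF-1.4\n...\nstartxref\n1234\n%%EOF\n"
                .getBytes(StandardCharsets.ISO_8859_1);
        System.out.println(findStartXref(sample)); // prints 1234
    }
}
```

A parser working this way would next seek to the returned offset, read the xref table or xref stream found there, and follow any 'Prev' entries in the trailer to earlier sections, which is the information the XrefTrailerResolver entry describes collecting. Those later steps are omitted from the sketch.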