Package com.itextpdf.text.pdf.parser

Interface Summary
ContentOperator Interface implemented by a series of content operators
ExtRenderListener Simply extends the RenderListener interface to provide additional methods.
LocationTextExtractionStrategy.TextChunkFilter Specifies a filter for filtering LocationTextExtractionStrategy.TextChunk objects during text extraction
RenderListener A callback interface that receives notifications from the PdfContentStreamProcessor as various render operations are required.
Shape Represents segment from a PDF path.
TextExtractionStrategy Defines an interface for RenderListeners that can return text
XObjectDoHandler  
 

Class Summary
BezierCurve Represents a Bezier curve.
ContentByteUtils  
FilteredRenderListener A text render listener that filters text operations before passing them on to a delegate
FilteredTextRenderListener A text render listener that filters text operations before passing them on to a delegate
GlyphRenderListener  
GlyphTextRenderListener  
GraphicsState Keeps all the parameters of the graphics state.
ImageRenderInfo Represents image data from a PDF
InlineImageInfo Represents an inline image from a PDF
InlineImageUtils Utility methods to help with processing of inline images
Line Represents a line.
LineDashPattern Represents the line dash pattern.
LineSegment Represents a line segment in a particular coordinate system.
LocationTextExtractionStrategy Development preview - this class (and all of the parser classes) are still experiencing heavy development, and are subject to change both behavior and interface.
LocationTextExtractionStrategy.TextChunk Represents a chunk of text, it's orientation, and location relative to the orientation vector
MarkedContentInfo Represents a Marked Content block in a PDF
MarkedContentRenderFilter A RenderFilter that only allows text within a specified marked content sequence.
Matrix Keeps all the values of a 3 by 3 matrix and allows you to do some math with matrices.
MultiFilteredRenderListener  
Path Paths define shapes, trajectories, and regions of all sorts.
PathConstructionRenderInfo Contains information relating to construction the current path.
PathPaintingRenderInfo Contains information relating to painting current path.
PdfContentReaderTool Tool that parses the content of a PDF document.
PdfContentStreamProcessor Processor for a PDF content Stream.
PdfImageObject An object that contains an image dictionary and image bytes.
PdfReaderContentParser A utility class that makes it cleaner to process content from pages of a PdfReader through a specified RenderListener.
PdfTextExtractor Extracts text from a PDF file.
RegionTextRenderFilter A RenderFilter that only allows text within a specified rectangular region
RenderFilter Interface for defining filters for use with FilteredRenderListener
SimpleTextExtractionStrategy A simple text extraction renderer.
Subpath As subpath is a part of a path comprising a sequence of connected segments.
TaggedPdfReaderTool Converts a tagged PDF document into an XML file.
TextMarginFinder Allows you to find the rectangle that contains all the text in a page.
TextRenderInfo Provides information and calculations needed by render listeners to display/evaluate text render operations.
Vector Represents a vector (i.e. a point in space).
 

Enum Summary
PdfImageObject.ImageBytesType Different types of data that can be stored in the bytes of a PdfImageObject
 

Exception Summary
InlineImageUtils.InlineImageParseException Simple class in case users need to differentiate an exception from processing inline images vs other exceptions
 



Copyright © 2015. All Rights Reserved.