public class PdfTextExtractorByArea extends Object
| Constructor and Description |
|---|
PdfTextExtractorByArea() |
| Modifier and Type | Method and Description |
|---|---|
String |
extractAddedText(org.sejda.sambox.pdmodel.PDPage page,
Point2D position) |
String |
extractFooterText(org.sejda.sambox.pdmodel.PDPage page) |
String |
extractHeaderText(org.sejda.sambox.pdmodel.PDPage page) |
String |
extractTextFromArea(org.sejda.sambox.pdmodel.PDPage page,
Rectangle2D area)
Extracts the text found in a specific page bound to a specific rectangle area Eg: extract footer text from a certain page
|
List<String> |
extractTextFromAreas(org.sejda.sambox.pdmodel.PDPage page,
List<Rectangle> areas) |
public String extractFooterText(org.sejda.sambox.pdmodel.PDPage page) throws TaskIOException
page - TaskIOExceptionpublic String extractHeaderText(org.sejda.sambox.pdmodel.PDPage page) throws TaskIOException
TaskIOExceptionpublic String extractAddedText(org.sejda.sambox.pdmodel.PDPage page, Point2D position) throws TaskIOException
TaskIOExceptionpublic String extractTextFromArea(org.sejda.sambox.pdmodel.PDPage page, Rectangle2D area) throws TaskIOException
page - the page to extract the text fromarea - the rectangular area to extractTaskIOExceptionpublic List<String> extractTextFromAreas(org.sejda.sambox.pdmodel.PDPage page, List<Rectangle> areas) throws TaskIOException
TaskIOExceptionCopyright © 2019 sejda. All rights reserved.