| Package | Description |
|---|---|
| org.apache.tika.parser.pdf | |
| org.apache.tika.parser.pdf.image |
| Class and Description |
|---|
| AccessChecker
Checks whether or not a document allows extraction generally
or extraction for accessibility only.
|
| PDFParserConfig
Config for PDFParser.
|
| PDFParserConfig.IMAGE_STRATEGY |
| PDFParserConfig.OCR_RENDERING_STRATEGY |
| PDFParserConfig.OCR_STRATEGY |
| PDFParserConfig.OCRStrategyAuto
Encapsulate the numbers used to control OCR Strategy when set to auto
If the total characters on the page < this.totalCharsPerPage
or
total unmapped unicode characters on the page > this.unmappedUnicodeCharsPerPage
then we will perform OCR on the page
If unamppedUnicodeCharsPerPage is an integer > 0, then we compare absolute number of characters.
|
| Class and Description |
|---|
| PDFParserConfig
Config for PDFParser.
|
Copyright © 2007–2025 The Apache Software Foundation. All rights reserved.