- java.lang.Object
-
- io.annot8.common.components.AbstractComponentDescriptor<T,S>
-
- io.annot8.common.components.AbstractProcessorDescriptor<OCR.Processor,OCR.Settings>
-
- io.annot8.components.tesseract.processors.OCR
-
- All Implemented Interfaces:
io.annot8.api.components.Annot8ComponentDescriptor<OCR.Processor,OCR.Settings>,io.annot8.api.components.ProcessorDescriptor<OCR.Processor,OCR.Settings>
@ComponentName("Tesseract OCR") @ComponentDescription("Use Tesseract to extract text from images stored in FileContent, or directly from Image content") @SettingsClass(Settings.class) @ComponentTags({"image","text","ocr","tesseract"}) public class OCR extends io.annot8.common.components.AbstractProcessorDescriptor<OCR.Processor,OCR.Settings>Takes FileContent containing either an image or PDF file, or Image content directly, and produces a Text content with the text from the image as extracted by Tesseract
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static classOCR.Processorstatic classOCR.SettingsSettings class forOCR
-
Constructor Summary
Constructors Constructor Description OCR()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description io.annot8.api.capabilities.Capabilitiescapabilities()protected OCR.ProcessorcreateComponent(io.annot8.api.context.Context context, OCR.Settings settings)-
Methods inherited from class io.annot8.common.components.AbstractComponentDescriptor
create, getName, getSettings, setName, setSettings
-
-
-
-
Method Detail
-
createComponent
protected OCR.Processor createComponent(io.annot8.api.context.Context context, OCR.Settings settings)
- Specified by:
createComponentin classio.annot8.common.components.AbstractComponentDescriptor<OCR.Processor,OCR.Settings>
-
capabilities
public io.annot8.api.capabilities.Capabilities capabilities()
-
-