Class SXWPFWordExtractorDecorator
java.lang.Object
org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
org.apache.tika.parser.microsoft.ooxml.SXWPFWordExtractorDecorator
- All Implemented Interfaces:
OOXMLExtractor
This is an experimental, alternative extractor for docx files.
This streams the main document content rather than loading the
full document into memory.
This will be better for some use cases than the classic docx extractor; and, it will be worse for others.
- Since:
- 1.15
-
Constructor Summary
ConstructorsConstructorDescriptionSXWPFWordExtractorDecorator(Metadata metadata, ParseContext context, XWPFEventBasedWordExtractor extractor) -
Method Summary
Methods inherited from class org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor
getDocument, getMetadataExtractor, getXHTML
-
Constructor Details
-
SXWPFWordExtractorDecorator
public SXWPFWordExtractorDecorator(Metadata metadata, ParseContext context, XWPFEventBasedWordExtractor extractor)
-