public abstract class AbstractOOXMLExtractor extends Object implements OOXMLExtractor

Subclasses implement the abstract method buildXHTML(XHTMLContentHandler), which
populates the XHTMLContentHandler object received as a parameter.

| Constructor and Description |
|---|
| AbstractOOXMLExtractor(ParseContext context, org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor) |

| Modifier and Type | Method and Description |
|---|---|
| org.apache.poi.ooxml.POIXMLDocument | getDocument(): Returns the opened document. |
| MetadataExtractor | getMetadataExtractor(): POIXMLTextExtractor.getMetadataTextExtractor() is not yet supported for OOXML by POI. |
| void | getXHTML(ContentHandler handler, Metadata metadata, ParseContext context): Parses the document into a sequence of XHTML SAX events sent to the given content handler. |
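
The relationship between getXHTML and buildXHTML described on this page resembles a template method: the base class emits the document-level SAX events and a subclass hook supplies the body. The following is a minimal sketch of that shape using only the JDK's org.xml.sax types. SketchExtractor, ParagraphExtractor, and extractText are hypothetical stand-ins, not Tika or POI classes, and the hook here takes a plain ContentHandler rather than an XHTMLContentHandler; it does not reproduce the actual implementation.

```java
import org.xml.sax.ContentHandler;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.AttributesImpl;
import org.xml.sax.helpers.DefaultHandler;

public class TemplateExtractorSketch {

    /** Hypothetical stand-in for the abstract base class (assumption, not Tika's code). */
    static abstract class SketchExtractor {
        static final String XHTML_NS = "http://www.w3.org/1999/xhtml";

        /** Emits the document-level events, delegating the body to the subclass hook. */
        public final void getXHTML(ContentHandler handler) throws SAXException {
            handler.startDocument();
            handler.startElement(XHTML_NS, "body", "body", new AttributesImpl());
            buildXHTML(handler);
            handler.endElement(XHTML_NS, "body", "body");
            handler.endDocument();
        }

        /** Hook that subclasses implement to populate the handler with content. */
        protected abstract void buildXHTML(ContentHandler handler) throws SAXException;
    }

    /** Hypothetical subclass that emits one p element per paragraph string. */
    static class ParagraphExtractor extends SketchExtractor {
        private final String[] paragraphs;

        ParagraphExtractor(String... paragraphs) {
            this.paragraphs = paragraphs;
        }

        @Override
        protected void buildXHTML(ContentHandler handler) throws SAXException {
            for (String paragraph : paragraphs) {
                char[] chars = paragraph.toCharArray();
                handler.startElement(XHTML_NS, "p", "p", new AttributesImpl());
                handler.characters(chars, 0, chars.length);
                handler.endElement(XHTML_NS, "p", "p");
            }
        }
    }

    /** Runs the sketch extractor and collects character data, one line per paragraph. */
    static String extractText(String... paragraphs) throws SAXException {
        StringBuilder text = new StringBuilder();
        ContentHandler collector = new DefaultHandler() {
            @Override
            public void characters(char[] ch, int start, int length) {
                text.append(ch, start, length);
            }

            @Override
            public void endElement(String uri, String localName, String qName) {
                if ("p".equals(localName)) {
                    text.append('\n');
                }
            }
        };
        new ParagraphExtractor(paragraphs).getXHTML(collector);
        return text.toString();
    }

    public static void main(String[] args) throws SAXException {
        System.out.print(extractText("Hello", "World"));
    }
}
```

Keeping the document-level events in the base class means each subclass only describes its own content, which is the division of labor the class description implies.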

public AbstractOOXMLExtractor(ParseContext context, org.apache.poi.ooxml.extractor.POIXMLTextExtractor extractor)

public org.apache.poi.ooxml.POIXMLDocument getDocument()
Description copied from interface: OOXMLExtractor
Returns the opened document.
Specified by: getDocument in interface OOXMLExtractor
See Also: OOXMLExtractor.getDocument()

public MetadataExtractor getMetadataExtractor()
Description copied from interface: OOXMLExtractor
POIXMLTextExtractor.getMetadataTextExtractor() is not yet supported for OOXML by POI.
Specified by: getMetadataExtractor in interface OOXMLExtractor
See Also: OOXMLExtractor.getMetadataExtractor()

public void getXHTML(ContentHandler handler, Metadata metadata, ParseContext context) throws SAXException, XmlException, IOException, TikaException
Description copied from interface: OOXMLExtractor
Parses the document into a sequence of XHTML SAX events sent to the given content handler.
Specified by: getXHTML in interface OOXMLExtractor
Throws: SAXException, XmlException, IOException, TikaException
See Also: OOXMLExtractor.getXHTML(ContentHandler, Metadata, ParseContext)

Copyright © 2010 - 2020 Adobe. All Rights Reserved