Class XPSTextExtractor
- java.lang.Object
-
- org.apache.poi.extractor.POITextExtractor
-
- org.apache.poi.ooxml.extractor.POIXMLTextExtractor
-
- org.apache.tika.parser.microsoft.ooxml.xps.XPSTextExtractor
-
- All Implemented Interfaces:
java.io.Closeable,java.lang.AutoCloseable
public class XPSTextExtractor extends org.apache.poi.ooxml.extractor.POIXMLTextExtractorCurrently, mostly a pass-through class to hold pkg and properties and keep the general framework similar to our other POI-integrated extractors.
-
-
Constructor Summary
Constructors Constructor Description XPSTextExtractor(OPCPackage pkg)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description org.apache.poi.ooxml.POIXMLProperties.CorePropertiesgetCoreProperties()org.apache.poi.ooxml.POIXMLProperties.CustomPropertiesgetCustomProperties()org.apache.poi.ooxml.POIXMLProperties.ExtendedPropertiesgetExtendedProperties()OPCPackagegetPackage()java.lang.StringgetText()Retrieves all the text from the document.-
Methods inherited from class org.apache.poi.ooxml.extractor.POIXMLTextExtractor
close, getDocument, getMetadataTextExtractor
-
Methods inherited from class org.apache.poi.extractor.POITextExtractor
setFilesystem
-
-
-
-
Constructor Detail
-
XPSTextExtractor
public XPSTextExtractor(OPCPackage pkg) throws OpenXML4JException, XmlException, java.io.IOException
- Throws:
OpenXML4JExceptionXmlExceptionjava.io.IOException
-
-
Method Detail
-
getPackage
public OPCPackage getPackage()
- Overrides:
getPackagein classorg.apache.poi.ooxml.extractor.POIXMLTextExtractor
-
getText
public java.lang.String getText()
Description copied from class:POITextExtractorRetrieves all the text from the document. How cells, paragraphs etc are separated in the text is implementation specific - see the javadocs for a specific project for details.- Specified by:
getTextin classPOITextExtractor- Returns:
- All the text from the document
-
getCoreProperties
public org.apache.poi.ooxml.POIXMLProperties.CoreProperties getCoreProperties()
- Overrides:
getCorePropertiesin classorg.apache.poi.ooxml.extractor.POIXMLTextExtractor
-
getExtendedProperties
public org.apache.poi.ooxml.POIXMLProperties.ExtendedProperties getExtendedProperties()
- Overrides:
getExtendedPropertiesin classorg.apache.poi.ooxml.extractor.POIXMLTextExtractor
-
getCustomProperties
public org.apache.poi.ooxml.POIXMLProperties.CustomProperties getCustomProperties()
- Overrides:
getCustomPropertiesin classorg.apache.poi.ooxml.extractor.POIXMLTextExtractor
-
-