Class XWPFEventBasedWordExtractor
- java.lang.Object
-
- org.apache.tika.parser.microsoft.ooxml.xwpf.XWPFEventBasedWordExtractor
-
- All Implemented Interfaces:
java.io.Closeable,java.lang.AutoCloseable,POITextExtractor,POIXMLTextExtractor
public class XWPFEventBasedWordExtractor extends java.lang.Object implements POIXMLTextExtractor
Experimental class that is based on POI's XSSFEventBasedExcelExtractor
-
-
Constructor Summary
Constructors Constructor Description XWPFEventBasedWordExtractor(OPCPackage container)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description POIXMLProperties.CorePropertiesgetCoreProperties()Returns the core document propertiesPOIXMLProperties.CustomPropertiesgetCustomProperties()Returns the custom document propertiesPOIXMLDocumentgetDocument()Returns opened documentPOIXMLProperties.ExtendedPropertiesgetExtendedProperties()Returns the extended document propertiesjava.io.CloseablegetFilesystem()OPCPackagegetPackage()Returns the opened OPCPackage that contains the documentjava.lang.StringgetText()Retrieves all the text from the document.booleanisCloseFilesystem()voidsetCloseFilesystem(boolean b)-
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Methods inherited from interface org.apache.poi.ooxml.extractor.POIXMLTextExtractor
checkMaxTextSize, close, getMetadataTextExtractor
-
-
-
-
Constructor Detail
-
XWPFEventBasedWordExtractor
public XWPFEventBasedWordExtractor(OPCPackage container) throws XmlException, OpenXML4JException, java.io.IOException
- Throws:
XmlExceptionOpenXML4JExceptionjava.io.IOException
-
-
Method Detail
-
getPackage
public OPCPackage getPackage()
Description copied from interface:POIXMLTextExtractorReturns the opened OPCPackage that contains the document- Specified by:
getPackagein interfacePOIXMLTextExtractor- Returns:
- the opened OPCPackage
-
getCoreProperties
public POIXMLProperties.CoreProperties getCoreProperties()
Description copied from interface:POIXMLTextExtractorReturns the core document properties- Specified by:
getCorePropertiesin interfacePOIXMLTextExtractor- Returns:
- the core document properties
-
getExtendedProperties
public POIXMLProperties.ExtendedProperties getExtendedProperties()
Description copied from interface:POIXMLTextExtractorReturns the extended document properties- Specified by:
getExtendedPropertiesin interfacePOIXMLTextExtractor- Returns:
- the extended document properties
-
getCustomProperties
public POIXMLProperties.CustomProperties getCustomProperties()
Description copied from interface:POIXMLTextExtractorReturns the custom document properties- Specified by:
getCustomPropertiesin interfacePOIXMLTextExtractor- Returns:
- the custom document properties
-
getDocument
public POIXMLDocument getDocument()
Description copied from interface:POIXMLTextExtractorReturns opened document- Specified by:
getDocumentin interfacePOITextExtractor- Specified by:
getDocumentin interfacePOIXMLTextExtractor- Returns:
- the opened document
-
getText
public java.lang.String getText()
Description copied from interface:POITextExtractorRetrieves all the text from the document. How cells, paragraphs etc are separated in the text is implementation specific - see the javadocs for a specific project for details.- Specified by:
getTextin interfacePOITextExtractor- Returns:
- All the text from the document
-
setCloseFilesystem
public void setCloseFilesystem(boolean b)
- Specified by:
setCloseFilesystemin interfacePOITextExtractor- Parameters:
b-true(default), if underlying resources/filesystem should be closed onPOITextExtractor.close()
-
isCloseFilesystem
public boolean isCloseFilesystem()
- Specified by:
isCloseFilesystemin interfacePOITextExtractor- Returns:
true, if resources/filesystem should be closed onPOITextExtractor.close()
-
getFilesystem
public java.io.Closeable getFilesystem()
- Specified by:
getFilesystemin interfacePOITextExtractor- Returns:
- The underlying resources/filesystem
-
-