Package org.apache.poi.hssf.extractor
Class EventBasedExcelExtractor
java.lang.Object
org.apache.poi.hssf.extractor.EventBasedExcelExtractor
- All Implemented Interfaces:
Closeable,AutoCloseable,POIOLE2TextExtractor,POITextExtractor,ExcelExtractor
public class EventBasedExcelExtractor
extends Object
implements POIOLE2TextExtractor, ExcelExtractor
A text extractor for Excel files, that is based
on the HSSF EventUserModel API.
It will typically use less memory than
ExcelExtractor, but may not provide
the same richness of formatting.
Returns the textual content of the file, suitable for
indexing by something like Lucene, but not really
intended for display to the user.
To turn an excel file into a CSV or similar, then see the XLS2CSVmra example
- See Also:
-
Constructor Summary
ConstructorsConstructorDescription -
Method Summary
Modifier and TypeMethodDescriptionvoidclose()Allows to free resources of the Extractor as soon as it is not needed any more.Would return the document information metadata for the document, if we supported itReturn the underlying POIDocumentgetRoot()Return the underlying DirectoryEntry of this document.Would return the summary information metadata for the document, if we supported itgetText()Retreives the text contents of the filebooleanvoidsetCloseFilesystem(boolean doCloseFilesystem) voidsetFormulasNotResults(boolean formulasNotResults) Should we return the formula itself, and not the result it produces? Default is falsevoidsetIncludeCellComments(boolean includeComments) Would control the inclusion of cell comments from the document, if we supported itvoidsetIncludeHeadersFooters(boolean includeHeadersFooters) Would control the inclusion of headers and footers from the document, if we supported itvoidsetIncludeSheetNames(boolean includeSheetNames) Should sheet names be included? Default is trueMethods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitMethods inherited from interface org.apache.poi.extractor.POIOLE2TextExtractor
getMetadataTextExtractor
-
Constructor Details
-
EventBasedExcelExtractor
-
EventBasedExcelExtractor
-
-
Method Details
-
getDocSummaryInformation
Would return the document information metadata for the document, if we supported it- Specified by:
getDocSummaryInformationin interfacePOIOLE2TextExtractor- Returns:
- The Document Summary Information or null if it could not be read for this document.
-
getSummaryInformation
Would return the summary information metadata for the document, if we supported it- Specified by:
getSummaryInformationin interfacePOIOLE2TextExtractor- Returns:
- The Summary information for the document or null if it could not be read for this document.
-
setIncludeCellComments
public void setIncludeCellComments(boolean includeComments) Would control the inclusion of cell comments from the document, if we supported it- Specified by:
setIncludeCellCommentsin interfaceExcelExtractor- Parameters:
includeComments-trueif cell comments should be included
-
setIncludeSheetNames
public void setIncludeSheetNames(boolean includeSheetNames) Should sheet names be included? Default is true- Specified by:
setIncludeSheetNamesin interfaceExcelExtractor- Parameters:
includeSheetNames-trueif the sheet names should be included
-
setFormulasNotResults
public void setFormulasNotResults(boolean formulasNotResults) Should we return the formula itself, and not the result it produces? Default is false- Specified by:
setFormulasNotResultsin interfaceExcelExtractor- Parameters:
formulasNotResults-trueif the formula itself is returned
-
getText
Retreives the text contents of the file- Specified by:
getTextin interfaceExcelExtractor- Specified by:
getTextin interfacePOITextExtractor- Returns:
- All the text from the document
-
setCloseFilesystem
public void setCloseFilesystem(boolean doCloseFilesystem) - Specified by:
setCloseFilesystemin interfacePOITextExtractor- Parameters:
doCloseFilesystem-true(default), if underlying resources/filesystem should be closed onPOITextExtractor.close()
-
isCloseFilesystem
public boolean isCloseFilesystem()- Specified by:
isCloseFilesystemin interfacePOITextExtractor- Returns:
true, if resources/filesystem should be closed onPOITextExtractor.close()
-
getFilesystem
- Specified by:
getFilesystemin interfacePOITextExtractor- Returns:
- The underlying resources/filesystem
-
getDocument
Description copied from interface:POIOLE2TextExtractorReturn the underlying POIDocument- Specified by:
getDocumentin interfacePOIOLE2TextExtractor- Specified by:
getDocumentin interfacePOITextExtractor- Returns:
- the underlying POIDocument
-
getRoot
Description copied from interface:POIOLE2TextExtractorReturn the underlying DirectoryEntry of this document.- Specified by:
getRootin interfacePOIOLE2TextExtractor- Returns:
- the DirectoryEntry that is associated with the POIDocument of this extractor.
-
close
Description copied from interface:POITextExtractorAllows to free resources of the Extractor as soon as it is not needed any more. This may include closing open file handles and freeing memory. The Extractor cannot be used after close has been called.- Specified by:
closein interfaceAutoCloseable- Specified by:
closein interfaceCloseable- Specified by:
closein interfacePOITextExtractor- Throws:
IOException
-