Package org.apache.poi.hwpf.extractor
Class Word6Extractor
- java.lang.Object
-
- org.apache.poi.hwpf.extractor.Word6Extractor
-
- All Implemented Interfaces:
java.io.Closeable, java.lang.AutoCloseable, POIOLE2TextExtractor, POITextExtractor
public final class Word6Extractor extends java.lang.Object implements POIOLE2TextExtractor
Class to extract the text from old (Word 6 / Word 95) Word documents. Use it only on these older files; for most uses, call WordExtractor, which deals properly with HWPF.
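The following sketch shows one way to prefer WordExtractor and fall back to Word6Extractor only for old files. It assumes WordExtractor rejects Word 6 / Word 95 input with org.apache.poi.hwpf.OldWordFileFormatException, and that both the poi and poi-scratchpad jars are on the classpath. The class name WordTextFallback and the method extractText are hypothetical.

```java
import java.io.File;
import java.io.IOException;

import org.apache.poi.hwpf.OldWordFileFormatException;
import org.apache.poi.hwpf.extractor.Word6Extractor;
import org.apache.poi.hwpf.extractor.WordExtractor;
import org.apache.poi.poifs.filesystem.POIFSFileSystem;

// Hypothetical helper class, not part of Apache POI.
public class WordTextFallback {

    /** Returns the document text, using Word6Extractor only when the file is too old for WordExtractor. */
    public static String extractText(File file) throws IOException {
        // Open the OLE2 container read-only; try-with-resources closes it when done.
        try (POIFSFileSystem fs = new POIFSFileSystem(file, true)) {
            try {
                // Normal path: Word 97 and later.
                WordExtractor extractor = new WordExtractor(fs);
                return extractor.getText();
            } catch (OldWordFileFormatException e) {
                // Assumed signal that the file is Word 6 / Word 95.
                Word6Extractor extractor = new Word6Extractor(fs);
                return extractor.getText();
            }
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println(extractText(new File(args[0])));
    }
}
```

The extractors are not closed individually here because the enclosing try-with-resources already closes the shared filesystem.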
-
-
Constructor Summary
Constructors
- Word6Extractor(java.io.InputStream is): Create a new Word Extractor
- Word6Extractor(HWPFOldDocument doc): Create a new Word Extractor
- Word6Extractor(DirectoryNode dir)
- Word6Extractor(DirectoryNode dir, POIFSFileSystem fs): Deprecated. Use Word6Extractor(DirectoryNode) instead
- Word6Extractor(POIFSFileSystem fs): Create a new Word Extractor
-
Method Summary
Modifier and Type, Method: Description
- HWPFOldDocument getDocument(): Return the underlying POIDocument
- HWPFOldDocument getFilesystem()
- java.lang.String[] getParagraphText(): Deprecated.
- java.lang.String getText(): Retrieves all the text from the document.
- boolean isCloseFilesystem()
- void setCloseFilesystem(boolean doCloseFilesystem)
-
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Methods inherited from interface org.apache.poi.extractor.POIOLE2TextExtractor
getDocSummaryInformation, getMetadataTextExtractor, getRoot, getSummaryInformation
-
Methods inherited from interface org.apache.poi.extractor.POITextExtractor
close
-
-
-
-
Constructor Detail
-
Word6Extractor
public Word6Extractor(java.io.InputStream is) throws java.io.IOException
Create a new Word Extractor
- Parameters:
is - InputStream containing the Word file
- Throws:
java.io.IOException
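As a minimal sketch of the InputStream constructor, the example below reads a file from disk and returns its text. Because the class implements java.io.Closeable, it can be managed with try-with-resources. The class name Word6StreamDemo and the method extract are hypothetical, and the input path is supplied by the caller.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

import org.apache.poi.hwpf.extractor.Word6Extractor;

// Hypothetical helper class, not part of Apache POI.
public class Word6StreamDemo {

    /** Reads a Word 6 / Word 95 file from the given path and returns its text. */
    public static String extract(Path path) throws IOException {
        // Resources are closed in reverse order: the extractor first, then the stream.
        try (InputStream in = Files.newInputStream(path);
             Word6Extractor extractor = new Word6Extractor(in)) {
            return extractor.getText();
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println(extract(Paths.get(args[0])));
    }
}
```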
-
Word6Extractor
public Word6Extractor(POIFSFileSystem fs) throws java.io.IOException
Create a new Word Extractor
- Parameters:
fs - POIFSFileSystem containing the Word file
- Throws:
java.io.IOException
-
Word6Extractor
@Deprecated public Word6Extractor(DirectoryNode dir, POIFSFileSystem fs) throws java.io.IOException
Deprecated. Use Word6Extractor(DirectoryNode) instead
- Throws:
java.io.IOException
-
Word6Extractor
public Word6Extractor(DirectoryNode dir) throws java.io.IOException
- Throws:
java.io.IOException
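One way to move off the deprecated two-argument constructor is to pass only the directory node, for example the root of an opened POIFSFileSystem. This is a sketch under the assumption that the Word document sits at the root of the OLE2 container; the class name Word6DirectoryDemo and the method extract are hypothetical.

```java
import java.io.File;
import java.io.IOException;

import org.apache.poi.hwpf.extractor.Word6Extractor;
import org.apache.poi.poifs.filesystem.DirectoryNode;
import org.apache.poi.poifs.filesystem.POIFSFileSystem;

// Hypothetical helper class, not part of Apache POI.
public class Word6DirectoryDemo {

    /** Extracts text using the single-argument DirectoryNode constructor. */
    public static String extract(File file) throws IOException {
        // Open read-only; the filesystem is closed by try-with-resources.
        try (POIFSFileSystem fs = new POIFSFileSystem(file, true)) {
            DirectoryNode root = fs.getRoot();
            // Replaces new Word6Extractor(root, fs), which is deprecated.
            Word6Extractor extractor = new Word6Extractor(root);
            return extractor.getText();
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println(extract(new File(args[0])));
    }
}
```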
-
Word6Extractor
public Word6Extractor(HWPFOldDocument doc)
Create a new Word Extractor
- Parameters:
doc - The HWPFOldDocument to extract from
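If the caller already holds an HWPFOldDocument, the extractor can wrap it directly. The sketch below assumes the document is loaded with the HWPFOldDocument(POIFSFileSystem) constructor from org.apache.poi.hwpf; the class name Word6DocumentDemo and the method extract are hypothetical.

```java
import java.io.File;
import java.io.IOException;

import org.apache.poi.hwpf.HWPFOldDocument;
import org.apache.poi.hwpf.extractor.Word6Extractor;
import org.apache.poi.poifs.filesystem.POIFSFileSystem;

// Hypothetical helper class, not part of Apache POI.
public class Word6DocumentDemo {

    /** Loads the old-format document first, then hands it to the extractor. */
    public static String extract(File file) throws IOException {
        try (POIFSFileSystem fs = new POIFSFileSystem(file, true)) {
            // Parse the Word 6 / Word 95 document from the OLE2 container.
            HWPFOldDocument doc = new HWPFOldDocument(fs);
            // This constructor declares no checked exceptions.
            Word6Extractor extractor = new Word6Extractor(doc);
            return extractor.getText();
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println(extract(new File(args[0])));
    }
}
```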
-
-
Method Detail
-
getParagraphText
@Deprecated public java.lang.String[] getParagraphText()
Deprecated.
Get the text from the Word file, as an array with one String per paragraph
-
getText
public java.lang.String getText()
Description copied from interface: POITextExtractor
Retrieves all the text from the document. How cells, paragraphs etc. are separated in the text is implementation specific; see the javadocs for a specific project for details.
- Specified by:
getText in interface POITextExtractor
- Returns:
All the text from the document
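Since getParagraphText() is deprecated, callers that want per-paragraph strings may prefer to post-process the result of getText(). The sketch below is plain Java and takes the returned string as input. It assumes paragraph boundaries show up as line breaks, which this page does not guarantee; the class name TextLines and the method nonBlankLines are hypothetical.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical helper class, not part of Apache POI.
public class TextLines {

    /** Splits extracted text on any line break and drops lines that are blank after trimming. */
    public static List<String> nonBlankLines(String text) {
        // \R matches \r\n, \n, \r and other Unicode line breaks as a single separator.
        return Arrays.stream(text.split("\\R"))
                .map(String::trim)
                .filter(line -> !line.isEmpty())
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // In real use, the argument would be the string returned by getText().
        for (String line : nonBlankLines("First paragraph\r\n\r\nSecond paragraph\n")) {
            System.out.println(line);
        }
    }
}
```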
-
getDocument
public HWPFOldDocument getDocument()
Description copied from interface: POIOLE2TextExtractor
Return the underlying POIDocument
- Specified by:
getDocument in interface POIOLE2TextExtractor
- Specified by:
getDocument in interface POITextExtractor
- Returns:
the underlying POIDocument
-
setCloseFilesystem
public void setCloseFilesystem(boolean doCloseFilesystem)
- Specified by:
setCloseFilesystem in interface POITextExtractor
- Parameters:
doCloseFilesystem - true (default), if underlying resources/filesystem should be closed on POITextExtractor.close()
-
isCloseFilesystem
public boolean isCloseFilesystem()
- Specified by:
isCloseFilesystem in interface POITextExtractor
- Returns:
true, if resources/filesystem should be closed on POITextExtractor.close()
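The close-filesystem flag matters when the caller owns the underlying POIFSFileSystem and wants it to outlive the extractor. The sketch below turns the flag off so that closing the extractor leaves the filesystem open for the enclosing code to close. The class name Word6CloseDemo and the method extract are hypothetical.

```java
import java.io.File;
import java.io.IOException;

import org.apache.poi.hwpf.extractor.Word6Extractor;
import org.apache.poi.poifs.filesystem.POIFSFileSystem;

// Hypothetical helper class, not part of Apache POI.
public class Word6CloseDemo {

    /** Extracts text while keeping ownership of the filesystem in this method. */
    public static String extract(File file) throws IOException {
        // The outer try owns the filesystem and closes it exactly once.
        try (POIFSFileSystem fs = new POIFSFileSystem(file, true)) {
            String text;
            try (Word6Extractor extractor = new Word6Extractor(fs)) {
                // Default is true; false asks close() to leave the resources open.
                extractor.setCloseFilesystem(false);
                text = extractor.getText();
            }
            // The extractor is closed here; fs is closed when the outer try exits.
            return text;
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println(extract(new File(args[0])));
    }
}
```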
-
getFilesystem
public HWPFOldDocument getFilesystem()
- Specified by:
getFilesystem in interface POITextExtractor
- Returns:
The underlying resources/filesystem
-
-