Package org.apache.tika.sax
Class WriteOutContentHandler
java.lang.Object
org.xml.sax.helpers.DefaultHandler
org.apache.tika.sax.ContentHandlerDecorator
org.apache.tika.sax.WriteOutContentHandler
- All Implemented Interfaces:
ContentHandler,DTDHandler,EntityResolver,ErrorHandler
- Direct Known Subclasses:
LanguageHandler,ProfilingHandler,RichTextContentHandler
SAX event handler that writes content up to an optional write
limit out to a character stream or other decorated handler.
-
Constructor Summary
ConstructorsConstructorDescriptionCreates a content handler that writes character events to an internal string buffer.WriteOutContentHandler(int writeLimit) Creates a content handler that writes character events to an internal string buffer.WriteOutContentHandler(OutputStream stream) Creates a content handler that writes character events to the given output stream using the default encoding.WriteOutContentHandler(Writer writer) Creates a content handler that writes character events to the given writer.WriteOutContentHandler(Writer writer, int writeLimit) Creates a content handler that writes content up to the given write limit to the given character stream.WriteOutContentHandler(ContentHandler handler, int writeLimit) Creates a content handler that writes content up to the given write limit to the given content handler. -
Method Summary
Modifier and TypeMethodDescriptionvoidcharacters(char[] ch, int start, int length) Writes the given characters to the given character stream.voidignorableWhitespace(char[] ch, int start, int length) booleanChecks whether the given exception (or any of it's root causes) was thrown by this handler as a signal of reaching the write limit.Methods inherited from class org.apache.tika.sax.ContentHandlerDecorator
endDocument, endElement, endPrefixMapping, processingInstruction, setDocumentLocator, skippedEntity, startDocument, startElement, startPrefixMapping, toStringMethods inherited from class org.xml.sax.helpers.DefaultHandler
error, fatalError, notationDecl, resolveEntity, unparsedEntityDecl, warningMethods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, wait, wait, waitMethods inherited from interface org.xml.sax.ContentHandler
declaration
-
Constructor Details
-
WriteOutContentHandler
Creates a content handler that writes content up to the given write limit to the given content handler.- Parameters:
handler- content handler to be decoratedwriteLimit- write limit- Since:
- Apache Tika 0.10
-
WriteOutContentHandler
Creates a content handler that writes content up to the given write limit to the given character stream.- Parameters:
writer- character streamwriteLimit- write limit- Since:
- Apache Tika 0.10
-
WriteOutContentHandler
Creates a content handler that writes character events to the given writer.- Parameters:
writer- writer
-
WriteOutContentHandler
Creates a content handler that writes character events to the given output stream using the default encoding.- Parameters:
stream- output stream
-
WriteOutContentHandler
public WriteOutContentHandler(int writeLimit) Creates a content handler that writes character events to an internal string buffer. Use theContentHandlerDecorator.toString()method to access the collected character content.The internal string buffer is bounded at the given number of characters. If this write limit is reached, then a
SAXExceptionis thrown. TheisWriteLimitReached(Throwable)method can be used to detect this case.- Parameters:
writeLimit- maximum number of characters to include in the string, or -1 to disable the write limit- Since:
- Apache Tika 0.7
-
WriteOutContentHandler
public WriteOutContentHandler()Creates a content handler that writes character events to an internal string buffer. Use theContentHandlerDecorator.toString()method to access the collected character content.The internal string buffer is bounded at 100k characters. If this write limit is reached, then a
SAXExceptionis thrown. TheisWriteLimitReached(Throwable)method can be used to detect this case.
-
-
Method Details
-
characters
Writes the given characters to the given character stream.- Specified by:
charactersin interfaceContentHandler- Overrides:
charactersin classContentHandlerDecorator- Throws:
SAXException
-
ignorableWhitespace
- Specified by:
ignorableWhitespacein interfaceContentHandler- Overrides:
ignorableWhitespacein classContentHandlerDecorator- Throws:
SAXException
-
isWriteLimitReached
Checks whether the given exception (or any of it's root causes) was thrown by this handler as a signal of reaching the write limit.- Parameters:
t- throwable- Returns:
trueif the write limit was reached,falseotherwise- Since:
- Apache Tika 0.7
-