Index

A C D E G I M P R S 
All Classes and Interfaces|All Packages|Serialized Form

A

append(Node) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Append a node to the current container.

C

cdata(char[], int, int) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Receive notification of cdata.
characters(char[], int, int) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Receive notification of character data.
charactersRaw(char[], int, int) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
If available, when the disable-output-escaping attribute is used, output raw text without escaping.
com.digitalpebble.stormcrawler.tika - package com.digitalpebble.stormcrawler.tika
 
comment(char[], int, int) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Report an XML comment anywhere in the document.

D

declareOutputFields(OutputFieldsDeclarer) - Method in class com.digitalpebble.stormcrawler.tika.ParserBolt
 
declareOutputFields(OutputFieldsDeclarer) - Method in class com.digitalpebble.stormcrawler.tika.RedirectionBolt
 
DOMBuilder - Class in com.digitalpebble.stormcrawler.tika
This class takes SAX events (in addition to some extra events that SAX doesn't handle yet) and adds the result to a document or document fragment.
DOMBuilder(Document, Node) - Constructor for class com.digitalpebble.stormcrawler.tika.DOMBuilder
DOMBuilder instance constructor...

E

endCDATA() - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Report the end of a CDATA section.
endDocument() - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Receive notification of the end of a document.
endDTD() - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Report the end of DTD declarations.
endElement(String, String, String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Receive notification of the end of an element.
endEntity(String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Report the end of an entity.
endPrefixMapping(String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
End the scope of a prefix-URI mapping.
entityReference(String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Receive notivication of a entityReference.
execute(Tuple) - Method in class com.digitalpebble.stormcrawler.tika.ParserBolt
 
execute(Tuple) - Method in class com.digitalpebble.stormcrawler.tika.RedirectionBolt
 

G

getDefaultNamespaceURI() - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
 

I

ignorableWhitespace(char[], int, int) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Receive notification of ignorable whitespace in element content.
isUpperCaseElementNames() - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
 

M

m_inCData - Variable in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Flag indicating that we are processing a CData section

P

ParserBolt - Class in com.digitalpebble.stormcrawler.tika
Uses Tika to parse the output of a fetch and extract text + metadata
ParserBolt() - Constructor for class com.digitalpebble.stormcrawler.tika.ParserBolt
 
prepare(Map<String, Object>, TopologyContext, OutputCollector) - Method in class com.digitalpebble.stormcrawler.tika.ParserBolt
 
prepare(Map, TopologyContext, OutputCollector) - Method in class com.digitalpebble.stormcrawler.tika.RedirectionBolt
 
processingInstruction(String, String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Receive notification of a processing instruction.

R

RedirectionBolt - Class in com.digitalpebble.stormcrawler.tika
Uses Tika only if a document has not been parsed with anything else.
RedirectionBolt() - Constructor for class com.digitalpebble.stormcrawler.tika.RedirectionBolt
 

S

setDefaultNamespaceURI(String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
 
setDocumentLocator(Locator) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Receive an object for locating the origin of SAX document events.
setIDAttribute(String, Element) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Set an ID string to node association in the ID table.
setUpperCaseElementNames(boolean) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
 
skippedEntity(String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Receive notification of a skipped entity.
startCDATA() - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Report the start of a CDATA section.
startDocument() - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Receive notification of the beginning of a document.
startDTD(String, String, String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Report the start of DTD declarations, if any.
startElement(String, String, String, Attributes) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Receive notification of the beginning of an element.
startEntity(String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Report the beginning of an entity.
startPrefixMapping(String, String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
Begin the scope of a prefix-URI Namespace mapping.
A C D E G I M P R S 
All Classes and Interfaces|All Packages|Serialized Form