Index
All Classes and Interfaces|All Packages|Serialized Form
A
- append(Node) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Append a node to the current container.
C
- cdata(char[], int, int) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Receive notification of cdata.
- characters(char[], int, int) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Receive notification of character data.
- charactersRaw(char[], int, int) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
If available, when the disable-output-escaping attribute is used, output raw text without escaping.
- com.digitalpebble.stormcrawler.tika - package com.digitalpebble.stormcrawler.tika
- comment(char[], int, int) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Report an XML comment anywhere in the document.
D
- declareOutputFields(OutputFieldsDeclarer) - Method in class com.digitalpebble.stormcrawler.tika.ParserBolt
- declareOutputFields(OutputFieldsDeclarer) - Method in class com.digitalpebble.stormcrawler.tika.RedirectionBolt
- DOMBuilder - Class in com.digitalpebble.stormcrawler.tika
-
This class takes SAX events (in addition to some extra events that SAX doesn't handle yet) and adds the result to a document or document fragment.
- DOMBuilder(Document, Node) - Constructor for class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
DOMBuilder instance constructor...
E
- endCDATA() - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Report the end of a CDATA section.
- endDocument() - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Receive notification of the end of a document.
- endDTD() - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Report the end of DTD declarations.
- endElement(String, String, String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Receive notification of the end of an element.
- endEntity(String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Report the end of an entity.
- endPrefixMapping(String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
End the scope of a prefix-URI mapping.
- entityReference(String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Receive notivication of a entityReference.
- execute(Tuple) - Method in class com.digitalpebble.stormcrawler.tika.ParserBolt
- execute(Tuple) - Method in class com.digitalpebble.stormcrawler.tika.RedirectionBolt
G
- getDefaultNamespaceURI() - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
I
- ignorableWhitespace(char[], int, int) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Receive notification of ignorable whitespace in element content.
- isUpperCaseElementNames() - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
M
- m_inCData - Variable in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Flag indicating that we are processing a CData section
P
- ParserBolt - Class in com.digitalpebble.stormcrawler.tika
-
Uses Tika to parse the output of a fetch and extract text + metadata
- ParserBolt() - Constructor for class com.digitalpebble.stormcrawler.tika.ParserBolt
- prepare(Map<String, Object>, TopologyContext, OutputCollector) - Method in class com.digitalpebble.stormcrawler.tika.ParserBolt
- prepare(Map, TopologyContext, OutputCollector) - Method in class com.digitalpebble.stormcrawler.tika.RedirectionBolt
- processingInstruction(String, String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Receive notification of a processing instruction.
R
- RedirectionBolt - Class in com.digitalpebble.stormcrawler.tika
-
Uses Tika only if a document has not been parsed with anything else.
- RedirectionBolt() - Constructor for class com.digitalpebble.stormcrawler.tika.RedirectionBolt
S
- setDefaultNamespaceURI(String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
- setDocumentLocator(Locator) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Receive an object for locating the origin of SAX document events.
- setIDAttribute(String, Element) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Set an ID string to node association in the ID table.
- setUpperCaseElementNames(boolean) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
- skippedEntity(String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Receive notification of a skipped entity.
- startCDATA() - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Report the start of a CDATA section.
- startDocument() - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Receive notification of the beginning of a document.
- startDTD(String, String, String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Report the start of DTD declarations, if any.
- startElement(String, String, String, Attributes) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Receive notification of the beginning of an element.
- startEntity(String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Report the beginning of an entity.
- startPrefixMapping(String, String) - Method in class com.digitalpebble.stormcrawler.tika.DOMBuilder
-
Begin the scope of a prefix-URI Namespace mapping.
All Classes and Interfaces|All Packages|Serialized Form