Package com.digitalpebble.stormcrawler.tika
package com.digitalpebble.stormcrawler.tika
-
ClassesClassDescriptionThis class takes SAX events (in addition to some extra events that SAX doesn't handle yet) and adds the result to a document or document fragment.Uses Tika to parse the output of a fetch and extract text + metadataUses Tika only if a document has not been parsed with anything else.