Package com.digitalpebble.stormcrawler.tika


package com.digitalpebble.stormcrawler.tika
  • Classes
    Class
    Description
    This class takes SAX events (in addition to some extra events that SAX doesn't handle yet) and adds the result to a document or document fragment.
    Uses Tika to parse the output of a fetch and extract text + metadata
    Uses Tika only if a document has not been parsed with anything else.