Class ParserBolt

java.lang.Object
org.apache.storm.topology.base.BaseComponent
org.apache.storm.topology.base.BaseRichBolt
com.digitalpebble.stormcrawler.tika.ParserBolt
All Implemented Interfaces:
Serializable, org.apache.storm.task.IBolt, org.apache.storm.topology.IComponent, org.apache.storm.topology.IRichBolt

public class ParserBolt extends org.apache.storm.topology.base.BaseRichBolt
Uses Tika to parse the output of a fetch and extract text + metadata
See Also:
  • Constructor Summary

    Constructors
    Constructor
    Description
     
  • Method Summary

    Modifier and Type
    Method
    Description
    void
    declareOutputFields(org.apache.storm.topology.OutputFieldsDeclarer declarer)
     
    void
    execute(org.apache.storm.tuple.Tuple tuple)
     
    void
    prepare(@NotNull Map<String,Object> conf, @NotNull org.apache.storm.task.TopologyContext context, @NotNull org.apache.storm.task.OutputCollector collector)
     

    Methods inherited from class org.apache.storm.topology.base.BaseRichBolt

    cleanup

    Methods inherited from class org.apache.storm.topology.base.BaseComponent

    getComponentConfiguration

    Methods inherited from class java.lang.Object

    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

    Methods inherited from interface org.apache.storm.topology.IComponent

    getComponentConfiguration
  • Constructor Details

    • ParserBolt

      public ParserBolt()
  • Method Details

    • prepare

      public void prepare(@NotNull @NotNull Map<String,Object> conf, @NotNull @NotNull org.apache.storm.task.TopologyContext context, @NotNull @NotNull org.apache.storm.task.OutputCollector collector)
    • execute

      public void execute(org.apache.storm.tuple.Tuple tuple)
    • declareOutputFields

      public void declareOutputFields(org.apache.storm.topology.OutputFieldsDeclarer declarer)