Index
All Classes and Interfaces|All Packages
B
- BoilerpipeContentHandler - Class in org.apache.tika.sax.boilerpipe
-
Uses the boilerpipe library to automatically extract the main content from a web page.
- BoilerpipeContentHandler(Writer) - Constructor for class org.apache.tika.sax.boilerpipe.BoilerpipeContentHandler
-
Creates a content handler that writes XHTML body character events to the given writer.
- BoilerpipeContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.boilerpipe.BoilerpipeContentHandler
-
Creates a new boilerpipe-based content extractor, using the
DefaultExtractorextraction rules and "delegate" as the content handler. - BoilerpipeContentHandler(ContentHandler, BoilerpipeExtractor) - Constructor for class org.apache.tika.sax.boilerpipe.BoilerpipeContentHandler
-
Creates a new boilerpipe-based content extractor, using the given extraction rules.
C
- characters(char[], int, int) - Method in class org.apache.tika.sax.boilerpipe.BoilerpipeContentHandler
E
- endDocument() - Method in class org.apache.tika.sax.boilerpipe.BoilerpipeContentHandler
- endElement(String, String, String) - Method in class org.apache.tika.sax.boilerpipe.BoilerpipeContentHandler
G
- getTextDocument() - Method in class org.apache.tika.sax.boilerpipe.BoilerpipeContentHandler
-
Retrieves the built TextDocument
I
- isIncludeMarkup() - Method in class org.apache.tika.sax.boilerpipe.BoilerpipeContentHandler
O
- org.apache.tika.sax.boilerpipe - package org.apache.tika.sax.boilerpipe
S
- setIncludeMarkup(boolean) - Method in class org.apache.tika.sax.boilerpipe.BoilerpipeContentHandler
- startDocument() - Method in class org.apache.tika.sax.boilerpipe.BoilerpipeContentHandler
- startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.boilerpipe.BoilerpipeContentHandler
- startPrefixMapping(String, String) - Method in class org.apache.tika.sax.boilerpipe.BoilerpipeContentHandler
All Classes and Interfaces|All Packages