public interface BoilerpipeExtractor extends BoilerpipeFilter
| Modifier and Type | Method and Description |
|---|---|
String |
getText(InputSource is)
Extracts text from the HTML code available from the given
InputSource. |
String |
getText(Reader r)
Extracts text from the HTML code available from the given
Reader. |
String |
getText(String html)
Extracts text from the HTML code given as a String.
|
String |
getText(TextDocument doc)
Extracts text from the given
TextDocument object. |
processString getText(String html) throws BoilerpipeProcessingException
html - The HTML code as a String.BoilerpipeProcessingExceptionString getText(InputSource is) throws BoilerpipeProcessingException
InputSource.is - The InputSource containing the HTMLBoilerpipeProcessingExceptionString getText(Reader r) throws BoilerpipeProcessingException
Reader.r - The Reader containing the HTMLBoilerpipeProcessingExceptionString getText(TextDocument doc) throws BoilerpipeProcessingException
TextDocument object.doc - The TextDocument.BoilerpipeProcessingExceptionCopyright © 2013-2014. All Rights Reserved.