Class HTMLPageParser
java.lang.Object
com.opensymphony.module.sitemesh.parser.HTMLPageParser
- All Implemented Interfaces:
PageParser
- Direct Known Subclasses:
DivExtractingPageParser
Builds an HTMLPage object from an HTML document. It behaves similarly to FastPageParser, but it is a complete rewrite designed to make custom features, such as extraction and transformation of elements, simpler to add.
To customize the rules used, extend this class and override the addUserDefinedRules() method.
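The extension point described above is a protected hook that the parser calls while setting up its rules. The following is a minimal sketch of that pattern only, not SiteMesh's implementation: SketchState, SketchPageBuilder, SketchPageParser and DivRuleParser are hypothetical stand-ins so the example compiles without the SiteMesh jar, and the tag extraction is a naive string search assumed purely for illustration. In real code the override would take the State and PageBuilder arguments listed in the method summary.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for the rule registry (the role State plays in SiteMesh).
class SketchState {
    private final List<String> tagNames = new ArrayList<>();
    void addRule(String tagName) { tagNames.add(tagName); }
    List<String> tagNames() { return tagNames; }
}

// Hypothetical stand-in for the page under construction (the role of PageBuilder).
class SketchPageBuilder {
    private final Map<String, String> properties = new HashMap<>();
    void addProperty(String name, String value) { properties.put(name, value); }
    String getProperty(String name) { return properties.get(name); }
}

// Hypothetical base parser: registers a built-in rule, then calls the hook.
class SketchPageParser {
    public SketchPageBuilder parse(char[] buffer) {
        SketchState html = new SketchState();
        SketchPageBuilder page = new SketchPageBuilder();
        html.addRule("title");            // built-in rule
        addUserDefinedRules(html, page);  // subclasses add their rules here

        // Naive extraction: first <tag>...</tag> pair for each registered tag.
        String text = new String(buffer);
        for (String tag : html.tagNames()) {
            String open = "<" + tag + ">";
            String close = "</" + tag + ">";
            int start = text.indexOf(open);
            int end = (start < 0) ? -1 : text.indexOf(close, start + open.length());
            if (end >= 0) {
                page.addProperty(tag, text.substring(start + open.length(), end));
            }
        }
        return page;
    }

    // The hook: does nothing by default.
    protected void addUserDefinedRules(SketchState html, SketchPageBuilder page) {
    }
}

// A subclass customizes parsing by overriding only the hook.
class DivRuleParser extends SketchPageParser {
    @Override
    protected void addUserDefinedRules(SketchState html, SketchPageBuilder page) {
        super.addUserDefinedRules(html, page);
        html.addRule("div");  // extra rule: also extract the first <div>
    }
}
```

With these stand-ins, new DivRuleParser().parse(html.toCharArray()) exposes a "div" property that the base parser leaves unset. Calling super first keeps any rules a parent class adds, which matters once there is more than one level of subclassing.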
- Author:
- Joe Walnes
Constructor Summary
Constructors
- HTMLPageParser()
Method Summary
Modifier and Type, Method, Description
- protected void addUserDefinedRules(State html, PageBuilder page): Adds the user defined rules.
- Page parse(char[] buffer): Parse the given buffer into a Page object.
- Page parse(SitemeshBuffer buffer): Parse the given buffer into a page object.
-
Constructor Details
-
HTMLPageParser
public HTMLPageParser()
-
-
Method Details
-
parse
Description copied from interface: PageParser
Parse the given buffer into a Page object.
- Specified by:
parse in interface PageParser
- Parameters:
buffer - The buffer for the page.
- Returns:
- The parsed page
- Throws:
IOException - if an error occurs
-
parse
Description copied from interface: PageParser
Parse the given buffer into a page object. DefaultSitemeshBuffer is the appropriate implementation of this interface to pass in.
- Specified by:
parse in interface PageParser
- Parameters:
buffer - The buffer for the page.
- Returns:
- The parsed page
- Throws:
IOException - if an error occurs
-
addUserDefinedRules
Adds the user defined rules.
- Parameters:
html - the html
page - the page
-