java.lang.Object
com.lowagie.text.pdf.parser.MarkedUpTextAssembler
- All Implemented Interfaces:
TextAssembler
We'll get called on a variety of marked section content (perhaps including the results of nested sections), and will
assemble it into an order as we can.
- Author:
- dgd
-
Method Summary
Modifier and TypeMethodDescriptionendParsingContext(String containingElementName) protected PdfReaderGetter.assembler can calculate an identifier for each word on a page, for use in markup.voidSlot fully-assembled chunk into our result at the current location.voidprocess(ParsedText unassembled, String contextName) Remember an unassembled chunk until we hit the end of this element, or we hit an assembled chunk, and need to pull things together.voidvoidrenderText(FinalText finalText) voidrenderText(ParsedTextImpl partialWord) Captures text using a simplified algorithm for inserting hard returns and spacesvoidreset()voidsetPage(int page)
-
Method Details
-
process
Remember an unassembled chunk until we hit the end of this element, or we hit an assembled chunk, and need to pull things together.- Specified by:
processin interfaceTextAssembler- Parameters:
unassembled- chunk of text rendering instruction to contribute to final textcontextName- Name of the element context we are in. Null value if it's an Artifact.
-
process
Slot fully-assembled chunk into our result at the current location. If there are unassembled chunks waiting, assemble them first.- Specified by:
processin interfaceTextAssembler- Parameters:
completed- This is a chunk from a nested elementcontextName- Name of the element context we are in. Null value if it's an Artifact.
-
process
- Specified by:
processin interfaceTextAssembler- Parameters:
completed- process a complete chunk -- just add this subsection into the proper place.contextName- Name of the element context we are in. Null value if it's an Artifact.- See Also:
-
endParsingContext
- Specified by:
endParsingContextin interfaceTextAssembler- Parameters:
containingElementName- This is an element name to surround the extracted text- Returns:
- the final text for the set of fragments and fully parsed items we were passed during processing.
- See Also:
-
reset
public void reset()- Specified by:
resetin interfaceTextAssembler- See Also:
-
renderText
- Specified by:
renderTextin interfaceTextAssembler- Parameters:
finalText- process a complete chunk -- just add this subsection into the proper place.
-
renderText
Captures text using a simplified algorithm for inserting hard returns and spaces- Specified by:
renderTextin interfaceTextAssembler- Parameters:
partialWord- process one of a number of raw pdf text chunks, with placement, font, etc.- See Also:
-
getReader
Getter.- Returns:
- reader
-
setPage
public void setPage(int page) - Specified by:
setPagein interfaceTextAssembler- Parameters:
page- number of the page we are assembling- See Also:
-
getWordId
assembler can calculate an identifier for each word on a page, for use in markup.- Specified by:
getWordIdin interfaceTextAssembler- Returns:
- the new unique id.
- See Also:
-