DocTextFinder Class Documentation
classDocTextFinderNamespace:com::datalogics::PDFL
Detailed Description
A class used to extract words from a document using a regular expression.
Constructor & Destructor Documentation
DocTextFinder
DocTextFinder(Documentdoc, WordFinderConfigwbConfig)Parameters
doc: Documentdocument to search
wbConfig: WordFinderConfigWordFinder configuration to use
Create a DocTextFinder
DocTextFinder
DocTextFinder(Documentdoc, WordFinderConfigwbConfig, DocTextFinderConfigdtfConfig)Parameters
doc: Documentdocument to search
wbConfig: WordFinderConfigWordFinder configuration to use
dtfConfig: DocTextFinderConfigDocTextFinder configuration to use
Create a DocTextFinder with a DocTextFinderConfig
Member Function Documentation
DisposeChildren
voidDisposeChildren()Returns:
void[static initializer]
static void[static initializer]()delete
synchronized voiddelete(Booleandisposing)Parameters
disposing: Boolean
Returns:
synchronized voiddelete
synchronized voiddelete()Returns:
synchronized voidfinalize
voidfinalize()Returns:
voidgetMatchList
java.util.List< DocTextFinderMatch >getMatchList(intbeginPageNumber, intendPageNumber, Stringregexstr)Parameters
beginPageNumber: intthe beginning page number included in the search.
endPageNumber: intthe ending page number included in the search.
regexstr: Stringthe regular expression to use for the search.
Returns:
a structure containing matching phrases along with its quads.Finds all matching strings for a given regular expression for the specified page range. It returns a vector containing words along with the page and matching phrase that was found.