DocTextFinder Class Documentation
classDocTextFinder : SystemIDisposableNamespace:Datalogics::PDFL
- Inherits from:
SystemIDisposable
Detailed Description
A class used to extract words from a document using a regular expression.
Referenced by
Constructor & Destructor Documentation
DocTextFinder
DocTextFinder(DocTextFinderrhs, InternalConstructsignifier)Parameters
rhs: DocTextFindersignifier: InternalConstruct
DocTextFinder
DocTextFinder(Documentdoc, WordFinderConfigwbConfig)Parameters
doc: Documentdocument to search
wbConfig: WordFinderConfigWordFinder configuration to use
Create a DocTextFinder
DocTextFinder
DocTextFinder(Documentdoc, WordFinderConfigwbConfig, DocTextFinderConfigdtfConfig)Parameters
doc: Documentdocument to search
wbConfig: WordFinderConfigWordFinder configuration to use
dtfConfig: DocTextFinderConfigDocTextFinder configuration to use
Create a DocTextFinder with a DocTextFinderConfig
~DocTextFinder
~DocTextFinder()Member Function Documentation
ConstructorRetrieve
static DocTextFinderConstructorRetrieve(System.IntPtrignored)Parameters
ignored: System.IntPtr
Returns:
DocTextFinderDispose
voidDispose()Returns:
voidDisposeChildren
voidDisposeChildren()Returns:
voidGetMatchList
System.Collections.Generic.IList< DocTextFinderMatch >GetMatchList(intbeginPageNumber, intendPageNumber, stringregexstr)Parameters
beginPageNumber: intthe beginning page number included in the search.
endPageNumber: intthe ending page number included in the search.
regexstr: stringthe regular expression to use for the search.
Returns:
a structure containing matching phrases along with its quads.Finds all matching strings for a given regular expression for the specified page range. It returns a vector containing words along with the page and matching phrase that was found.