DocTextFinder Class Documentation
class DocTextFinder
Namespace: datalogics_interface
Detailed Description
A class used to extract words from a document using a regular expression.
Constructor & Destructor Documentation
DocTextFinder
DocTextFinder(DocTextFinder &&)
Move constructor.
DocTextFinder
DocTextFinder(Document &doc)
Parameters
doc (Document &): The document to search.
Create a DocTextFinder for the given document.
DocTextFinder
DocTextFinder(Document &doc, WordFinderConfig &config)
Parameters
doc (Document &): The document to search.
config (WordFinderConfig &): Word finder configuration controlling text extraction behavior.
Create a DocTextFinder with a word-finder configuration.
~DocTextFinder
~DocTextFinder()
Member Function Documentation
get_match_list
std::vector<DocTextFinderMatch> get_match_list(int begin_page, int end_page, const std::string &regex)
Parameters
begin_page (int): The first page number included in the search.
end_page (int): The last page number included in the search.
regex (const std::string &): The regular expression to use for the search.
Returns:
A vector of DocTextFinderMatch structures, each containing a matching phrase along with its quads.
Finds all strings that match the given regular expression within the specified page range. The returned vector contains the words along with the page and the matching phrase that was found.
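The search semantics described above (an inclusive page range, one regular expression, one result entry per match) can be illustrated with a small stand-alone sketch. It does not use the library: `SketchMatch` and `find_matches` are hypothetical names, pages are modeled as plain strings, page numbers are assumed to be zero-based, quads are omitted, and `std::regex` is used as a stand-in whose dialect may differ from the one DocTextFinder accepts.

```cpp
#include <cassert>
#include <cstddef>
#include <regex>
#include <string>
#include <vector>

// Hypothetical stand-in for a match record; not the library's DocTextFinderMatch.
struct SketchMatch {
    int page;            // page index of the match (zero-based, an assumption)
    std::string phrase;  // the text that matched the regular expression
};

// Searches pages[begin_page..end_page], both ends inclusive, and records
// every non-overlapping match of `regex` in page order.
std::vector<SketchMatch> find_matches(const std::vector<std::string> &pages,
                                      int begin_page, int end_page,
                                      const std::string &regex) {
    // The range must be ordered and lie inside the document.
    assert(begin_page >= 0 && begin_page <= end_page &&
           end_page < static_cast<int>(pages.size()));

    const std::regex pattern(regex);
    std::vector<SketchMatch> matches;
    for (int page = begin_page; page <= end_page; ++page) {
        const std::string &text = pages[static_cast<std::size_t>(page)];
        for (std::sregex_iterator it(text.begin(), text.end(), pattern), end;
             it != end; ++it) {
            matches.push_back({page, it->str()});
        }
    }
    return matches;
}
```

With the real class, the equivalent call would be `get_match_list(begin_page, end_page, regex)` on a DocTextFinder constructed from a Document; the sketch only mirrors the shape of that call and of its result.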
operator=
DocTextFinder &operator=(DocTextFinder &&)