Enum text

WordAttributeFlags

Namespace: Datalogics::PDFL

Description

Flags returned by Word.GetAttributes.

Values

HasNonalphanum

The word contains a character outside the ranges A-Z, a-z, and 0-9.

HasLetter

The word contains a character in the range A-Z or a-z.

HasUppercase

The word contains a character in the range A-Z.

HasDigit

One or more characters in the word are digits.

HasPunctuation

One or more characters in the word are punctuation marks. Other flag bits can be checked to test whether the punctuation was at the beginning of the word, the end of the word, or elsewhere in the word.
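
As an illustration of combining HasPunctuation with the position flags, here is a minimal sketch in Python. The `WordAttributeFlags` class below is a hypothetical stand-in built on `enum.Flag`, with placeholder bit values and Python-style member names. It is not the library's binding. The helper `punctuation_positions` is also an assumption made for this example.

```python
from enum import Flag, auto


class WordAttributeFlags(Flag):
    # Hypothetical stand-in; the bit values are placeholders, not the library's.
    HAS_PUNCTUATION = auto()
    HAS_LEADING_PUNCTUATION = auto()
    HAS_TRAILING_PUNCTUATION = auto()


def punctuation_positions(attrs):
    """Report where punctuation occurs, judging only by the flag bits."""
    if not attrs & WordAttributeFlags.HAS_PUNCTUATION:
        return []
    positions = []
    if attrs & WordAttributeFlags.HAS_LEADING_PUNCTUATION:
        positions.append("leading")
    if attrs & WordAttributeFlags.HAS_TRAILING_PUNCTUATION:
        positions.append("trailing")
    if not positions:
        # Punctuation is present but neither edge flag is set.
        positions.append("interior")
    return positions
```

When a leading or trailing flag is set, the flags alone cannot tell whether the word also has punctuation in its interior. That would require inspecting the word text itself.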

HasHyphen

There is a hyphen in the word.

HasSoftHyphen

There is a soft hyphen in the word.

HasLigature

The word contains a ligature.

HasLeadingPunctuation

The first character in the word is a punctuation mark.

HasTrailingPunctuation

The last character in the word is a punctuation mark.

HasUnmappedCharacters

One or more characters in the word cannot be represented in the output font encoding.

AdjacentToSpace

The character following the end of the word is a space (either an explicit space character encoded in a string, or one that appears implicitly because the drawing point was moved).

Rotated

The writing direction of the word is not a multiple of 90 degrees, or the bounding box of the text is skewed. When this flag is set, use the quads of the word, rather than its bounding box, to specify the highlight area correctly.
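
One way to act on the Rotated flag when building highlight geometry is sketched below. Everything here is hypothetical: `WordInfo` is an illustrative container, not a library type, and the `WordAttributeFlags` stand-in uses a placeholder bit value. The sketch assumes each quad is a sequence of four (x, y) corner points.

```python
from dataclasses import dataclass
from enum import Flag, auto


class WordAttributeFlags(Flag):
    # Hypothetical stand-in; the bit value is a placeholder.
    ROTATED = auto()


@dataclass
class WordInfo:
    # Illustrative container, not a library type.
    attrs: WordAttributeFlags
    quads: list  # each quad: four (x, y) corner points


def highlight_shapes(word):
    """Return polygons to highlight: the quads themselves for rotated or
    skewed words, otherwise one axis-aligned rectangle around all quads."""
    if word.attrs & WordAttributeFlags.ROTATED:
        return [list(quad) for quad in word.quads]
    xs = [x for quad in word.quads for x, _ in quad]
    ys = [y for quad in word.quads for _, y in quad]
    left, right, bottom, top = min(xs), max(xs), min(ys), max(ys)
    return [[(left, bottom), (right, bottom), (right, top), (left, top)]]
```

For an upright word the rectangle and the quads cover the same area, so the simpler shape is enough. For a rotated word an axis-aligned rectangle would cover more than the glyphs, which is why the quads are returned unchanged.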

VerticalFlow

The writing direction of the word is either 90 or 180 degrees. This flag ignores the page rotation parameter of the page dictionary. Therefore, if the page is rotated 90 degrees, this flag will be set on each word that appears horizontally on the screen.
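
Because VerticalFlow ignores page rotation, a viewer that needs the on-screen orientation has to combine the flag with the page's rotation value. The sketch below shows one possible way to do that. The `WordAttributeFlags` stand-in and the `appears_vertical_on_screen` helper are hypothetical. Only the case of a flagged word on a page rotated 90 degrees is stated in the description above. The handling of 270 degrees and of unflagged words is an assumption that extends it by symmetry.

```python
from enum import Flag, auto


class WordAttributeFlags(Flag):
    # Hypothetical stand-in; the bit value is a placeholder.
    VERTICAL_FLOW = auto()


def appears_vertical_on_screen(attrs, page_rotation_degrees):
    """Combine the page-independent flag with the page rotation.

    Assumption: a quarter-turn page rotation (90 or 270 degrees) swaps
    horizontal and vertical as seen on screen; 0 and 180 do not.
    """
    flagged = bool(attrs & WordAttributeFlags.VERTICAL_FLOW)
    quarter_turned = page_rotation_degrees % 180 == 90
    return flagged != quarter_turned
```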

WordBreak

LastWordOnLine

The word is at the end of the current text line (for example, the word is followed by a line break). In tagged PDF documents, this flag is set according to the tags in the document. To determine visual line breaks in tagged PDF, use IsLastWordInRegion instead.

FrontTab

A tab character should be inserted before this word.
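
The AdjacentToSpace, LastWordOnLine, and FrontTab flags together carry enough information to rebuild whitespace when joining extracted words into plain text. The following is a minimal sketch of that idea. The `WordAttributeFlags` stand-in (placeholder bit values, Python-style names) and the `join_words` helper are hypothetical, and the input is assumed to be (text, flags) pairs in reading order.

```python
from enum import Flag, auto


class WordAttributeFlags(Flag):
    # Hypothetical stand-in; the bit values are placeholders.
    ADJACENT_TO_SPACE = auto()
    LAST_WORD_ON_LINE = auto()
    FRONT_TAB = auto()


def join_words(words):
    """Rebuild a text string from (text, flags) pairs in reading order."""
    parts = []
    for text, attrs in words:
        if attrs & WordAttributeFlags.FRONT_TAB:
            parts.append("\t")
        parts.append(text)
        if attrs & WordAttributeFlags.LAST_WORD_ON_LINE:
            # A line break takes priority over a trailing space.
            parts.append("\n")
        elif attrs & WordAttributeFlags.ADJACENT_TO_SPACE:
            parts.append(" ")
    return "".join(parts)
```

A word with none of these flags is joined directly to the next one, which keeps fragments of a split word together. Giving the line break priority over the space is a design choice of this sketch, made to avoid trailing spaces at line ends.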

EncodingWarning

An unreliable encoding conversion occurred in this word. Check the encoding flags of each character for details.

ReverseDirection

The writing direction is right-to-left or top-to-bottom.

WordIsUnicode

The text is in Unicode format.

ExtCharOffsets

The word has extended character offset information.