EncodingInfo Class Documentation

classEncodingInfo

Namespace:datalogics_interface

Detailed Description

Encoding is the process that transforms a set of Unicode characters into a series of bytes.

With PDF files, Unicode values are used to represent individual character within a font set. Unicode assigns a symbolic value to represent every single character in a wide variety of fonts, scripts, and languages.

For example, the Unicode value "U+0041" refers to the capital letter "A" in a Latin script.

The EncodingInfo Class is used to provide basic information about an encoding. For example, EncodingInfo could be used to provide just the code name and description for a Unicode standard font or language encoding, like this:

IBM860 (Portuguese, DOS)

ibm861 (Icelandic, DOS)

DOS-862 (Hebrew, DOS)

IBM863 (French Canadian, DOS)

IBM864 (Arabic, 864)

iso-8859-9 (Turkish, ISO)

iso-8859-13 (Estonian, ISO)

(c)2007 Datalogics Inc. Datalogics is a registered trademark of Datalogics Incorporated. Other company and brand, product and service names are for identification purposes only and may be trademarks or registered trademarks of their respective holders. Data is subject to change without notice.

Referenced by

Constructor & Destructor Documentation

~EncodingInfo

~EncodingInfo()

Member Function Documentation

get_char_names

std::vector< std::string >get_char_names()

Returns:

std::vector< std::string >

container for differences array/character names

get_encoding_name

std::stringget_encoding_name()

Returns:

std::string

Return the encoding name/cmap name for the font eg: