PHP get_html_translation_table() Function

The get_html_translation_table() function returns the translation table used by the htmlentities() and htmlspecialchars() functions.

array get_html_translation_table ([ int $table = HTML_SPECIALCHARS [, int $flags = ENT_COMPAT | ENT_HTML401 [, string $encoding = "UTF-8" ]]] )

Example -

Example #1 Translation Table Example

The above example will output something similar to : -

array(1510) {
  ["
"]=>
  string(9) "
"
  ["!"]=>
  string(6) "!"
  ["""]=>
  string(6) """
  ["#"]=>
  string(5) "#"
  ["$"]=>
  string(8) "$"
  ["%"]=>
  string(8) "%"
  ["&"]=>
  string(5) "&"
  ["'"]=>
  string(6) "'"
  // ...
}

ParameterDescription
functionOptional. Specifies which translation table to return.
Possible values:
  • HTML_SPECIALCHARS - Default. Translates some characters that need URL-encoding to be shown properly on a HTML page
  • HTML_ENTITIES - Translates all characters that need URL-encoding to be shown properly on a HTML page
  • flagsOptional. Specifies which quotes the table will contain and which document type the table is for.
    The available quote styles are:
  • ENT_COMPAT - Default. Table contains entities for double quotes, not single quotes
  • ENT_QUOTES - Table contains entities for double and single quotes
  • ENT_NOQUOTES - Table will not contain entities for double and single quotes
  • Additional flags for specifying which doctype the table is for:
  • ENT_HTML401 - Default. Table for HTML 4.01
  • ENT_HTML5 - Table for HTML 5
  • ENT_XML1 - Table for XML 1
  • ENT_XHTML - Table for XHTML
  • character-setOptional. A string that specifies which character-set to use.
    Allowed values are:
  • UTF-8 - Default. ASCII compatible multi-byte 8-bit Unicode
  • ISO-8859-1 - Western European
  • ISO-8859-15 - Western European (adds the Euro sign + French and Finnish letters missing in ISO-8859-1)
  • cp866 - DOS-specific Cyrillic charset
  • cp1251 - Windows-specific Cyrillic charset
  • cp1252 - Windows specific charset for Western European
  • KOI8-R - Russian
  • BIG5 - Traditional Chinese, mainly used in Taiwan
  • GB2312 - Simplified Chinese, national standard character set
  • BIG5-HKSCS - Big5 with Hong Kong extensions
  • Shift_JIS - Japanese
  • EUC-JP - Japanese
  • MacRoman - Character-set that was used by Mac OS
  • Returns the translation table as an array, with the original characters as keys and entities as values.