20.3. html.entities — Definitions of HTML general entities

Source code: Lib/html/entities.py


This module defines four dictionaries, html5, name2codepoint, codepoint2name, and entitydefs.

html.entities.html5

A dictionary that maps HTML5 named character references [1] to the equivalent Unicode character(s), e.g. html5['gt;'] == '>'. Note that the trailing semicolon is part of the name (e.g. 'gt;'). Some names are accepted by the standard even without the semicolon; those names are present both with and without the ';'. See also html.unescape().

New in version 3.3.
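The following short sketch illustrates the points above: keys carry the trailing semicolon, a few legacy names also appear without it, and some references expand to more than one character. The variable names are illustrative only.

```python
import html
from html.entities import html5

# Keys include the trailing semicolon.
gt = html5['gt;']

# A few legacy names are also present without the semicolon.
has_bare_gt = 'gt' in html5

# Some references map to more than one Unicode character,
# hence "character(s)" in the description above.
multi_char_names = [name for name, value in html5.items() if len(value) > 1]

# html.unescape() resolves named references in running text.
unescaped = html.unescape('a &gt; b')

print(repr(gt), has_bare_gt, len(multi_char_names) > 0, unescaped)
```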

html.entities.entitydefs

A dictionary mapping XHTML 1.0 entity definitions to their replacement text in ISO Latin-1.
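As a minimal sketch, assuming CPython 3, where the values are one-character strings: the keys of entitydefs are bare entity names, without the leading '&' or the trailing ';'.

```python
from html.entities import entitydefs

# Keys are bare names such as 'amp' or 'eacute'.
amp = entitydefs['amp']
lt = entitydefs['lt']
eacute = entitydefs['eacute']

print(repr(amp), repr(lt), repr(eacute))
```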

html.entities.name2codepoint

A dictionary that maps HTML entity names to their Unicode code points.

html.entities.codepoint2name

A dictionary that maps Unicode code points to HTML entity names.
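The two dictionaries are inverses of each other, which the sketch below illustrates. The helper to_named_entities is hypothetical, not part of the module; it shows one way codepoint2name might be used to replace characters with named references.

```python
from html.entities import codepoint2name, name2codepoint

# Name -> code point (an int), and back again.
amp_codepoint = name2codepoint['amp']
amp_name = codepoint2name[amp_codepoint]


def to_named_entities(text):
    """Hypothetical helper: replace each character that has an entry in
    codepoint2name with the corresponding '&name;' reference."""
    pieces = []
    for ch in text:
        name = codepoint2name.get(ord(ch))
        if name is None:
            pieces.append(ch)  # no named entity known for this character
        else:
            pieces.append('&' + name + ';')
    return ''.join(pieces)


encoded = to_named_entities('caf\u00e9 & cr\u00e8me')
print(amp_codepoint, amp_name, encoded)
```

Because the '&' character itself has an entry, this helper also escapes ampersands, so it should only be applied to text that has not already been escaped.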

Footnotes

[1] See http://www.w3.org/TR/html5/syntax.html#named-character-references
