A numeric character reference expanded to the C1 controls range
Introduction
HTML document should not contain «control characters»,
C1 control range contains symbols that are related to bits from 0x80 to
0x9f (from 128 to 159) in Latin-1, or from U+0080 to U+009F in Unicode.
Unicode considers them as «control characters», which are directly forbidden
by HTML5 parcer.
Special HTML symbols, from C1 controls range
Sym bol
| Forbid den code
| Valid subst.
| Alter native
|
€ | € | € |
|
|  | |
|
‚ | ‚ | ‚ |
|
ƒ | ƒ | ƒ |
|
„ | „ | „ |
|
… | … | … |
|
† | † | † |
|
‡ | ‡ | ‡ |
|
ˆ | ˆ | ˆ |
|
‰ | ‰ | ‰ | ‰
|
Š | Š | Š | Š
|
‹ | ‹ | ‹ | ‹
|
Œ | Œ | Œ | Œ
|
|  | |
|
Ž | Ž | Ž |
|
|  | |
|
|  | |
|
‘ | ‘ | ‘ | ‘
|
’ | ’ | ’ |
|
“ | “ | “ |
|
” | ” | ” |
|
• | • | • | •
|
– | – | – | –
|
— | — | — | —
|
˜ | ˜ | ˜ | ˜
|
™ | ™ | ™ | ™
|
š | š | š | š
|
› | › | › | ›
|
œ | œ | œ | œ
|
|  | |
|
ž | ž | ž |
|
Ÿ | Ÿ | Ÿ | Ÿ
|