Excerpts from the Unicode Standard, Version 14.0 - ionathanch/ionathanch.github.io GitHub Wiki
Code Points for Pictures for Control Codes. By definition, control codes themselves are manifested only by their action. However, it is sometimes necessary to show the position of a control code within a data stream. Conventional illustrations for the ASCII C0 control codes have been developed, but the characters U+2400..U+241F and U+2424 are intended for use as unspecified graphics for the corresponding control codes.
␀␁␂␃␄␅␆␇␈␉␊␋␌␍␎␏
␐␑␒␓␔␕␖␗␘␙␚␛␜␝␞␟
␡␤
Pictures for ASCII Space. By definition, the space is a blank graphic. Conventions have also been established for the visible representation of the space. Three specific characters are provided that may be used to visually represent the ASCII space character, U+2420 symbol for space, U+2422 blank symbol, and U+2423 open box.
␠␢␣
- U+2425 SYMBOL FOR DELETE FORM TWO:
␥
- U+2426 SYMBOL FOR SUBSTITUTE FORM TWO:
␦
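The correspondence between the C0 control codes and their pictures is a fixed offset, which makes it easy to show control codes in a data stream. The following is a minimal sketch, not a reference implementation: the function name `show_controls` and the `show_space` flag are hypothetical, and the choice of U+2423 OPEN BOX for the space is just one of the three options listed above.

```python
def show_controls(text: str, show_space: bool = False) -> str:
    """Replace C0 control codes and DEL with Control Pictures characters."""
    out = []
    for ch in text:
        cp = ord(ch)
        if cp < 0x20:
            # U+2400..U+241F line up one-to-one with U+0000..U+001F.
            out.append(chr(0x2400 + cp))
        elif cp == 0x7F:
            out.append("\u2421")  # SYMBOL FOR DELETE
        elif ch == " " and show_space:
            out.append("\u2423")  # OPEN BOX (an arbitrary choice for this sketch)
        else:
            out.append(ch)
    return "".join(out)
```

Under these assumptions, `show_controls("a\tb\r\n")` yields the letter a, U+2409, the letter b, U+240D, and U+240A.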
Keytop Labels. Where possible, keytop labels have been unified with other symbols of like appearance: for example, U+21E7 upwards white arrow to indicate the Shift key. While symbols such as U+2318 place of interest sign and U+2388 helm symbol are generic symbols that have been adapted to use on keytops, other symbols specifically follow ISO/IEC 9995-7.
The Miscellaneous Technical block contains a large number of symbols from ISO/IEC 9995-7:1994, Information technology - Keyboard layouts for text and office systems - Part 7: Symbols used to represent functions.
โโ
⌤⌥⌦⌧⌨⌫
โโ
⎀⎁⎂⎃⎄⎅⎆⎇⎈⎉⎊⎋⎌
โโโโโ
Floor and Ceiling. The floor and ceiling symbols encoded at U+2308..U+230B are tall, narrow mathematical delimiters.
⌈⌉
⌊⌋
Crops and Quine Corners. Crops and quine corners are most properly used in two-dimensional layout but may be referred to in plain text.
โโ โโ
โโ โโ
Angle Brackets. U+2329 left-pointing angle bracket and U+232A right-pointing angle bracket have long been canonically equivalent to the CJK punctuation characters U+3008 left angle bracket and U+3009 right angle bracket, respectively. This canonical equivalence implies that the use of the latter (CJK) code points is preferred and that U+2329 and U+232A are also “wide” characters. (See Unicode Standard Annex #11, “East Asian Width,” for the definition of the East Asian wide property.) For this reason, the use of U+2329 and U+232A is deprecated for mathematics and for technical publication, where the wide property of the characters has the potential to interfere with the proper formatting of mathematical formulae. The angle brackets specifically provided for mathematics, U+27E8 mathematical left angle bracket and U+27E9 mathematical right angle bracket, should be used instead.
〈〉
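The canonical equivalence and the wide property described above can be observed with Python's standard `unicodedata` module. This is only an illustrative sketch: the helper `to_math_brackets` is a hypothetical name, and the property values reported depend on the Unicode version bundled with the Python interpreter in use.

```python
import unicodedata

# Hypothetical helper: swap the deprecated angle brackets for the
# mathematical ones recommended in the text.
_ANGLE_TO_MATH = {0x2329: 0x27E8, 0x232A: 0x27E9}

def to_math_brackets(text: str) -> str:
    return text.translate(_ANGLE_TO_MATH)

# Canonical equivalence: NFC normalization replaces U+2329 with U+3008.
nfc_left = unicodedata.normalize("NFC", "\u2329")

# East Asian Width property of the deprecated and mathematical brackets.
width_deprecated = unicodedata.east_asian_width("\u2329")
width_math = unicodedata.east_asian_width("\u27e8")
```

Because normalization rewrites U+2329 and U+232A to the CJK code points, a substitution like `to_math_brackets` would have to run before any normalization step, or else also cover U+3008 and U+3009.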
APL Functional Symbols. APL (A Programming Language) makes extensive use of functional symbols constructed by composition with other, more primitive functional symbols. It used backspace and overstrike mechanisms in early computer implementations. In principle, functional composition is productive in APL; in practice, a relatively small number of composed functional symbols have become standard operators in APL. This relatively small set is encoded in its entirety in this block.
⌶⌷⌸⌹⌺⌻⌼⌽⌾⌿
⍀⍁⍂⍃⍄⍅⍆⍇⍈⍉⍊⍋⍌⍍⍎⍏
⍐⍑⍒⍓⍔⍕⍖⍗⍘⍙⍚⍛⍜⍝⍞⍟
⍠⍡⍢⍣⍤⍥⍦⍧⍨⍩⍪⍫⍬⍭⍮⍯
⍰⍱⍲⍳⍴⍵⍶⍷⍸⍹⍺⎕
Symbol Pieces. The characters in the range U+239B..U+23B3, plus U+23B7, constitute a set of bracket and other symbol fragments for use in mathematical typesetting. These pieces originated in older font standards but have been used in past mathematical processing as characters in their own right to make up extra-tall glyphs for enclosing multiline mathematical formulae.
⎧⎫
⎪⎪
⎨⎬
⎪⎪
⎩⎭
⎛⎞⎡⎤⌠
⎜⎟⎢⎥⎮
⎝⎠⎣⎦⌡
⎲
⎳
โฏโ
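As an illustration of how the bracket pieces stack into extra-tall delimiters, the sketch below wraps several rows of text in parentheses built from U+239B..U+23A0. The function name `tall_parens` is hypothetical, and the result only lines up in a monospaced font whose cells abut vertically.

```python
def tall_parens(rows):
    """Wrap rows of text in parentheses assembled from symbol pieces."""
    n = len(rows)
    if n < 2:
        # A single row needs no pieces; ordinary parentheses suffice.
        return ["(" + row + ")" for row in rows]
    width = max(len(row) for row in rows)
    out = []
    for i, row in enumerate(rows):
        if i == 0:
            left, right = "\u239b", "\u239e"   # upper hooks
        elif i == n - 1:
            left, right = "\u239d", "\u23a0"   # lower hooks
        else:
            left, right = "\u239c", "\u239f"   # extensions
        out.append(left + row.ljust(width) + right)
    return out
```

The same pattern would apply to the square bracket pieces (U+23A1..U+23A6) and the curly bracket pieces (U+23A7..U+23AD), with the curly bracket also needing its middle piece on the center row.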
Horizontal Brackets. In mathematical equations, delimiters are often used horizontally, where they expand to the width of the expression they encompass. The six bracket characters in the range U+23DC..U+23E1 can be used for this purpose. In the context of mathematical layout, U+23B4 top square bracket and U+23B5 bottom square bracket are also used that way.
⏜⏞⏠⎴
⏝⏟⏡⎵⎶
Decimal Exponent Symbol. U+23E8 decimal exponent symbol is for compatibility with the Russian standard GOST 10859-64, as well as the paper tape and punch card standard, Alcor (DIN 66006). It represents a fixed token introducing the exponent of a real number in scientific notation, comparable to the more common usage of “e” in similar notations: 1.621e5. It was used in the early computer language ALGOL-60, and appeared in some Soviet-manufactured computers, such as the BESM-6 and its emulators. In the Unicode Standard it is treated simply as an atomic symbol; it is not considered to be equivalent to a generic subscripted form of the numeral “10” and is not given a decomposition.
⏨
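Since the symbol plays the same role as “e” in 1.621e5, a reader of legacy data could map it to “e” before parsing. The sketch below assumes that this simple substitution is adequate; `parse_real` is a hypothetical helper and does not attempt to cover the full ALGOL-60 number syntax.

```python
import unicodedata

DECIMAL_EXPONENT = "\u23e8"  # DECIMAL EXPONENT SYMBOL

def parse_real(token: str) -> float:
    """Parse a number whose exponent is introduced by U+23E8."""
    return float(token.replace(DECIMAL_EXPONENT, "e"))

# The character is atomic: it has no decomposition mapping.
has_decomposition = unicodedata.decomposition(DECIMAL_EXPONENT) != ""
```

With these definitions, parsing `1.621` followed by U+23E8 and `5` gives the same value as `float("1.621e5")`.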
Dental Symbols. The symbols from U+23BE to U+23CC are taken from JIS X 0213 and are used in dental notation.
⎾⎿⏀⏁⏂⏃⏄⏅⏆⏇⏈⏉⏊⏋⏌
Metrical Symbols. The symbols in the range U+23D1..U+23D9 are a set of spacing symbols used in the metrical analysis of poetry and lyrics.
⏑⏒⏓⏔⏕⏖⏗⏘⏙
Electrotechnical Symbols. The Miscellaneous Technical block also contains a smattering of electrotechnical symbols. These characters are not intended to constitute a complete encoding of all symbols used in electrical diagrams, but rather are compatibility characters encoded primarily for mapping to other standards. The symbols in the range U+238D..U+2394 are from the character set with the International Registry number 181. U+23DA earth ground and U+23DB fuse are from HKSCS-2001.
⎍⎎⎏⎐⎑⎒⎓⎔⏚⏛
User Interface Symbols. The characters U+231A, U+231B, and U+23E9 through U+23FA are often found in user interfaces for media players, clocks, alarms, and timers, as well as in text discussing those user interfaces. The black medium triangles (U+23F4..U+23F7) are the preferred shapes for User Interface purposes, rather than the similar geometric shapes located in the Geometric Shapes block: U+25A0..U+25FF.
⌚⌛⏩⏪⏫⏬⏭⏮⏯⏰⏱⏲⏳⏴⏵⏶⏷⏸⏹⏺
Miscellaneous technical.
⌀⌁⌂⌃⌄⌅⌆⌇
โโโโโโโโโ
⌢⌣
⏍⏢⏤⏥⏦⏧⏿
Chemistry symbols.
⌬⏣
Drafting symbols.
⌭⌮⌯⌰⌱⌲⌳⌴⌵
Graphics for control codes.
⍻⍽⍾⍿
Terminal characters.
⎷
⎸
⎹
⎺⎻⎼⎽
Power symbols.
⏻⏼⏽⏾
The Optical Character Recognition block (U+2440..U+245F) includes those symbolic characters of the OCR-A character set that do not correspond to ASCII characters, as well as magnetic ink character recognition (MICR) symbols used in check processing.
⑀⑁⑂⑃⑄⑅⑆⑇⑈⑉⑊
Most of the symbols in the Symbols for Legacy Computing block (U+1FB00..U+1FBFF) are semi-graphics: block-style symbols which can be combined to simulate an all-points-addressable graphic display. Many platforms used these semi-graphic characters to support a graphics mode: small blocks that would be plotted at various coordinates, resulting in the appropriate full-sized block character consisting of the necessary “on” and “off” blocks. Other symbols in the Symbols for Legacy Computing block include box drawing and shading characters, and miscellaneous arrows and stick figures. In the teletext specification, symbols in this group can be displayed either with cells joined together or with a narrow space between cells. The Symbols for Legacy Computing block also includes clones of the ASCII digits 0 through 9 (U+1FBF0..U+1FBF9), styled as upright seven-segment digits that were often used in Atari 16-bit applications for game scores.
Block mosaic terminal graphic characters.
🬀🬁🬂🬃🬄🬅🬆🬇🬈🬉🬊🬋🬌🬍🬎🬏
🬐🬑🬒🬓🬔🬕🬖🬗🬘🬙🬚🬛🬜🬝🬞🬟
🬠🬡🬢🬣🬤🬥🬦🬧🬨🬩🬪🬫🬬🬭🬮🬯
🬰🬱🬲🬳🬴🬵🬶🬷🬸🬹🬺🬻
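The idea of plotting small blocks that resolve to a full-sized character can be sketched with the sextant characters, which divide each cell into a grid of two columns by three rows. The sketch below assumes the ordering implied by the code chart names (BLOCK SEXTANT-1 at U+1FB00, numbered left to right and top to bottom), with the empty cell, the two half blocks, and the full block taken from characters encoded elsewhere. The names `sextant` and `render` are hypothetical.

```python
def sextant(bits: int) -> str:
    """Map a 6-bit pattern to a character; bit 0 is the top-left sextant."""
    if not 0 <= bits <= 63:
        raise ValueError("pattern must fit in six bits")
    if bits == 0:
        return " "
    if bits == 63:
        return "\u2588"  # FULL BLOCK
    if bits == 21:
        return "\u258c"  # LEFT HALF BLOCK (sextants 1, 3, 5)
    if bits == 42:
        return "\u2590"  # RIGHT HALF BLOCK (sextants 2, 4, 6)
    # The block skips the two half-block patterns, so shift past them.
    offset = bits - 1 - (bits > 21) - (bits > 42)
    return chr(0x1FB00 + offset)

def render(pixels):
    """Render rows of 0/1 pixels; height must be a multiple of 3, width of 2."""
    lines = []
    for y in range(0, len(pixels), 3):
        cells = []
        for x in range(0, len(pixels[0]), 2):
            bits = 0
            for dy in range(3):
                for dx in range(2):
                    if pixels[y + dy][x + dx]:
                        bits |= 1 << (dy * 2 + dx)
            cells.append(sextant(bits))
        lines.append("".join(cells))
    return lines
```

A bitmap of 2 by 3 pixels therefore collapses into a single character cell, which is how a text-mode display can approximate a low-resolution graphics mode.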
Smooth mosaic terminal graphic characters.
🬼🬽🬾🬿
🭀🭁🭂🭃🭄🭅🭆🭇🭈🭉🭊🭋🭌🭍🭎🭏
🭐🭑🭒🭓🭔🭕🭖🭗🭘🭙🭚🭛🭜🭝🭞🭟
🭠🭡🭢🭣🭤🭥🭦🭧🭨🭩🭪🭫🭬🭭🭮🭯
🮚🮛
Block elements.
🭰🭱🭲🭳🭴🭵🭶🭷🭸🭹🭺🭻🭼🭽🭾🭿🮀🮁🮂🮃🮄🮅🮆🮇🮈🮉🮊🮋
Shade characters.
🮌🮍🮎🮏🮐🮑🮒🮔
🮜🮝🮞🮟
Fill characters.
🮕🮖🮗🮘🮙
Character cell diagonals.
🮠🮡🮢🮣🮤🮥🮦🮧🮨🮩🮪🮫🮬🮭🮮🮯
Terminal graphic characters.
🮰🮱🮲🮳🮹🮺🮻🮼🮽🮾🮿🯀🯁🯂🯃🯄🯅🯆🯇🯈🯉🯊
Arrows.
🮴🮵🮶🮷🮸
Segmented digits.
🯰🯱🯲🯳🯴🯵🯶🯷🯸🯹
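Because the segmented digits are laid out in the same order as the ASCII digits, a score display could be produced by a fixed offset from U+1FBF0. A minimal sketch, with the hypothetical name `to_segmented`:

```python
def to_segmented(text: str) -> str:
    """Replace ASCII digits with the segmented digits U+1FBF0..U+1FBF9."""
    return "".join(
        chr(0x1FBF0 + ord(ch) - ord("0")) if "0" <= ch <= "9" else ch
        for ch in text
    )
```

Characters other than the ASCII digits pass through unchanged, so a string such as `SCORE 042` keeps its label and only the digits change.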
Box Drawing. The Box Drawing block (U+2500..U+257F) contains a collection of graphic compatibility characters that originate in legacy standards in use prior to 1990 and that are intended for drawing boxes of various shapes and line widths for user interface components in character-cell-based graphic systems.
The “light,” “heavy,” and “double” attributes for some of these characters reflect the fact that the original sets often had a two-way distinction, between a light versus heavy line or a single versus double line, and included sufficient pieces to enable construction of graphic boxes with distinct styles that abutted each other in display.
─━│┃┄┅┆┇┈┉┊┋┌┍┎┏
┐┑┒┓└┕┖┗┘┙┚┛├┝┞┟
┠┡┢┣┤┥┦┧┨┩┪┫┬┭┮┯
┰┱┲┳┴┵┶┷┸┹┺┻┼┽┾┿
╀╁╂╃╄╅╆╇╈╉╊╋╌╍╎╏
═║╒╓╔╕╖╗╘╙╚╛╜╝╞╟
╠╡╢╣╤╥╦╧╨╩╪╫╬╭╮╯
╰╱╲╳╴╵╶╷╸╹╺╻╼╽╾╿
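As a small illustration of how the pieces abut to form a user interface component, the sketch below frames some lines of text with the light box drawing characters. The function name `light_box` is hypothetical, and the output is only aligned in a character-cell (monospaced) display.

```python
def light_box(lines):
    """Frame lines of text with light box drawing characters."""
    width = max((len(line) for line in lines), default=0)
    top = "\u250c" + "\u2500" * width + "\u2510"      # corner, horizontals, corner
    bottom = "\u2514" + "\u2500" * width + "\u2518"
    body = ["\u2502" + line.ljust(width) + "\u2502" for line in lines]
    return [top] + body + [bottom]
```

Swapping in the heavy or double counterparts of the same six characters would give the other box styles mentioned above.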
Block Elements. The Block Elements block (U+2580..U+259F) contains another collection of graphic compatibility characters. Unlike the box drawing characters, the legacy block elements are designed to fill some defined fraction of each display cell or to fill each display cell with some defined degree of shading. These elements were used to create crude graphic displays in terminals or in terminal modes on displays where bit-mapped graphics were unavailable.
▀▁▂▃▄▅▆▇█▉▊▋▌▍▎▏
▐░▒▓▔▕▖▗▘▙▚▛▜▝▞▟
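One common use of the fractional fills is a horizontal bar with sub-cell resolution. The sketch below relies on the left-aligned eighth blocks running from U+2589 (seven eighths) down to U+258F (one eighth), just below U+2588 full block. The function name `bar` and its parameters are hypothetical.

```python
def bar(fraction: float, cells: int = 10) -> str:
    """Draw a bar `cells` wide, filled to `fraction`, in eighths of a cell."""
    fraction = min(max(fraction, 0.0), 1.0)
    eighths = round(fraction * cells * 8)
    full, rest = divmod(eighths, 8)
    out = "\u2588" * full                 # FULL BLOCK for each complete cell
    if rest:
        out += chr(0x2590 - rest)         # U+258F (1/8) through U+2589 (7/8)
    return out.ljust(cells)               # pad the unfilled part with spaces
```

The result always occupies exactly `cells` character cells, so successive bars line up in a column.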