App C Segoe Character Set - practicalseries/GitHub-Wiki-Design-and-Implementation GitHub Wiki
The following is a full character set for the Segoe UI font. All characters are renderable on GitHub (with the exception of certain control characters, these function as control characters but have no renderable appearance, these are labelled N/A).
The character set is very large (approximately 26,700 characters in total) and has been split across several pages (GitHub limits a Wiki page to a maximum of 484,000 characters).
The character set is split as follows:
C.2. Characters U+00000 to U+00FFF
C.3. Characters U+01000 to U+01FFF
C.4. Characters U+02000 to U+02FFF
C.5. Characters U+03000 to U+09FFF
C.6. Characters U+0A000 to U+0AFFF
C.7. Characters U+0B000 to U+0FFFF
C.8. Characters U+10000 to U+10FFF
C.9. Characters U+11000 to U+11FFF
C.10. Characters U+12000 to U+12FFF
C.11. Characters U+13000 to U+15FFF
C.12. Characters U+16000 to U+1CFFF
C.13. Characters U+1D000 to U+1EFFF
C.14. Characters U+1F000 to U+3FFFF
There is a spread sheet version of all this data here:
You may be wondering why I’ve bothered to include these tables of Unicode character sets.
The answer is: “I already had them”. As part of the process of procuring the PAL Software Library, it was necessary to identify the characters that were in one of the distributed fonts needed to program the various devices. This font was called Siemens TIA Portal and I put together the whole spreadsheet of available characters for it.
It turns out that the Siemens TIA Portal font and Segoe UI have very similar character sets and, since the spreadsheet already existed, it was very easy to just change the font to Segoe UI and then check to see which characters no longer worked and edit them from the spreadsheet.
The second reason for doing this is that it is actually quite difficult to find a full list of Unicode characters all in one place with their descriptions.
The closest I could get was a CSV file on the Unicode site that has all the descriptions, you can get it here:
http://unicode.org/Public/UCD/latest/ucd/UnicodeData.txt
It's a bit brutal, just plain CSV without column headings.
So here it all is, if you are looking for the full list, I suggest you use the downloadable spreadsheet, it just has all the information in one place.
Any character can be inserted into a Markdown file by incorporating its Unicode value (either decimal or hexadecimal) in an escape code.
For example, the “A” character can be inserted using the hexadecimal Unicode value:
A
Within any Markdown file. Hexadecimal codes should be preceded by &#x
and followed by a semicolon (;
).
The decimal equivalent is:
A
Decimal codes should be preceded by &#
and followed by a semicolon (;
).
Although the tables show hexadecimal numbers with leading zeros, the leading zeros can be included or omitted in the escape code (it makes no difference).