07 Special characters and escaping characters - practicalseries/GitHub-Wiki-Design-and-Implementation GitHub Wiki

7 Special characters and escaping characters

GitHub uses the Segoe UI font (pronounced seg-o-ee) as standard across all its Wiki pages (at least it does on Windows-based browsers). Segoe UI is a large font with a very large number of characters, all of which can be accessed by using escape codes.

“Escaping” is a term that applies to special characters that would otherwise be used to format the text in some way (asterisks, for example). It allows the character to be displayed as a character rather than being interpreted as a formatting instruction.

This “escaping” process takes different forms for different languages (Markdown, HTML &c.), but always results in the true character being rendered.

Escape codes also allow the full range of the Segoe UI font characters to be accessed (not just the ones that can be entered from a keyboard). Things like this:

   ▛ ▜ ♑ ♞ ♬

There is a spreadsheet with the full Segoe UI character set here:

It’s a big list: there are 40,000 characters in it (not all of which render on GitHub, but 26,684 of them do).

These are also listed in Appendix C of this Wiki.

7.1 Escape characters and character codes

Markdown has a mechanism for displaying characters that would otherwise be used to format text (asterisks, for example); this mechanism is called “escaping the character”. It is also possible to use a Unicode value to display a specific character (in either decimal or hexadecimal format), and all the HTML symbol codes (the ones that begin with an ampersand) are also supported.

7.1.1 Markdown escape sequences

With Markdown, to display a literal character (i.e. to make the character appear in the text rather than format the text), precede it with a backslash character \.

For example if the following Markdown text were used:

Markdown:

* Without a backslash this is rendered as a list.

GitHub output:

  • Without a backslash this is rendered as a list.

Section 8 explains about lists. The point here is that if we wish to display the asterisk as an asterisk, we need to escape it (by adding a backslash):

Markdown:

\* With a backslash it renders as an asterisk.

GitHub output:

* With a backslash it renders as an asterisk.

The following characters can all be “escaped” by placing a backslash before the character:

    Character   Name                  Escape sequence
    \           Backslash             \\
    `           Backtick              \`
    *           Asterisk              \*
    _           Underscore            \_
    { }         Braces                \{ \}
    [ ]         Brackets              \[ \]
    < >         Angle brackets        \< \>
    ( )         Parentheses           \( \)
    #           Hash sign             \#
    +           Plus sign             \+
    -           Minus sign (hyphen)   \-
    .           Full stop             \.
    !           Exclamation mark      \!
    |           Pipe                  \|

Table 7.1: Markdown escapable characters
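The escaping rule in Table 7.1 can be applied mechanically. The following is a minimal Python sketch, not anything GitHub provides: the names `MARKDOWN_SPECIALS` and `escape_markdown` are hypothetical, and the function escapes every listed character wherever it occurs, which is more aggressive than is strictly needed (a full stop in the middle of a sentence, for instance, rarely needs escaping).

```python
import re

# The characters from Table 7.1 that Markdown allows to be backslash-escaped.
MARKDOWN_SPECIALS = r"\`*_{}[]<>()#+-.!|"

# Build a character class; re.escape protects the regex metacharacters.
_SPECIALS_RE = re.compile("[" + re.escape(MARKDOWN_SPECIALS) + "]")


def escape_markdown(text: str) -> str:
    """Put a backslash in front of every Markdown special character."""
    return _SPECIALS_RE.sub(lambda match: "\\" + match.group(0), text)


print(escape_markdown("* not a list item"))  # prints: \* not a list item
```

Escaping everything is the simplest approach to reason about; a more selective version would only escape characters in positions where Markdown would actually treat them as formatting.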

7.1.2 HTML escape sequences

Like Markdown, HTML has reserved characters, mostly the less than < and greater than signs >. These can also be escaped in HTML by using escape sequences.

There are several of these reserved characters in HTML:

| Name | Character | Replacement code |
|---|---|---|
| Less than | < | &lt; |
| Greater than | > | &gt; |
| Ampersand | & | &amp; |
| Double quotation mark | " | &quot; |
| Single quotation mark | ' | &apos; |

Table 7.2: HTML reserved characters and escape sequences

HTML provides a series of escape sequences (sometimes called symbol codes) that start with an ampersand & followed by a meaningful group of characters (well, meaningful in a way, some require a degree of interpretation) and ending with a semicolon ;. For example, the escape sequence for a less than symbol < is &lt;.

Escape sequences always start with the ampersand character & and end with a semicolon ;.

Basically, HTML escape sequences are a group of characters that are translated by the browser into a specific symbol.

Whenever the browser comes across the sequence of characters &lt;, it will display a less than sign <.

There are many other HTML escape sequences for characters not accessible via the keyboard; &mu;, for example, displays the Greek mu character μ. Appendix A contains a full list of all HTML escape sequences.

All of these HTML escape sequences work in Markdown: just put them in the text and GitHub will display them correctly.
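The translation between reserved characters and escape sequences can be tried outside the browser. This short sketch uses Python's standard `html` module purely as an illustration; it plays no part in how GitHub renders a page, and the sample strings are made up.

```python
import html

# html.escape replaces the reserved characters with escape sequences.
# With the default quote=True it also converts double and single quotes.
escaped = html.escape('<p class="x"> & </p>')
print(escaped)  # &lt;p class=&quot;x&quot;&gt; &amp; &lt;/p&gt;

# html.unescape goes the other way, as a browser does when it displays text.
print(html.unescape("&lt; &gt; &amp; &mu;"))  # < > & μ
```

Note that Python writes the single quotation mark as the hexadecimal code `&#x27;` rather than `&apos;`; both display the same character.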

7.1.3 HTML decimal and hexadecimal escape codes

The escape sequences of the previous section are one way of displaying reserved and non-keyboard characters. These escape sequences are intended to be intuitive mnemonics for the symbols they represent (that said, I usually have to look them up). The problem is that not every character has one.

It is possible to use the Unicode value of the character as an “escape code” (as opposed to an escape sequence).

Every character that can be displayed has a Unicode value (a number, known as a code point); web pages normally store these values using the Unicode transformation format-8 (UTF-8💠1). For example, the letter “A” has a Unicode value of 65, “B” is 66 &c. There is a full list on Wikipedia: https://en.wikipedia.org/wiki/List_of_Unicode_characters.

Appendix C contains a spreadsheet with the full character set.

In HTML and GitHub Flavoured Markdown, any character can be entered by using its Unicode value as an escape code. HTML escape codes are preceded by the ampersand and hash characters &# and finished with a semicolon ;.

Continuing the previous example, the Unicode value for the letter “A” is 65 (decimal). To enter the letter “A” in HTML using an escape code, use the following:

   &#65;

GitHub Markdown accepts both HTML escape sequences and escape codes.
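A decimal escape code is simply the character's Unicode value wrapped in &# and ;. As a small illustration in Python (the function name `decimal_escape` is hypothetical), the built-in `ord` gives the Unicode value and `html.unescape` confirms the round trip:

```python
import html


def decimal_escape(char: str) -> str:
    """Return the decimal HTML escape code for a single character."""
    return f"&#{ord(char)};"


print(decimal_escape("A"))          # &#65;
print(decimal_escape("\u00d7"))     # &#215; (the multiplication sign)
print(html.unescape("&#65;&#66;"))  # AB
```

The leading zeros used in the tables of this section (&#0215; for example) are optional; &#215; and &#0215; display the same character.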

The following tables give a list of common escape sequences and escape codes. Appendix A has a complete list of all HTML escape sequences and codes (most of them work in GitHub Markdown; the exceptions are listed in section 7.2.1).

| Symbol | Mathematical | HTML | Code | Symbol | Mathematical | HTML | Code |
|---|---|---|---|---|---|---|---|
| × | Multiplication sign | &times; | &#0215; | ∩ | Intersection | &cap; | &#8745; |
| ÷ | Division sign | &divide; | &#0247; | ∫ | Integral | &int; | &#8747; |
| − | Minus sign | &minus; | &#8722; | ≈ | Almost equal to | &asymp; | &#8776; |
| ± | Plus/minus sign | &plusmn; | &#0177; | ≠ | Not equal to | &ne; | &#8800; |
| ⁄ | Fraction slash | &frasl; | &#8260; | ≡ | Identical to | &equiv; | &#8801; |
| ∏ | N-ary product | &prod; | &#8719; | < | Less than | &lt; | &#0060; |
| ∑ | N-ary summation | &sum; | &#8721; | > | Greater than | &gt; | &#0062; |
| √ | Square root | &radic; | &#8730; | ≤ | Less than or equal to | &le; | &#8804; |
| ∞ | Infinity | &infin; | &#8734; | ≥ | Greater than or equal to | &ge; | &#8805; |

| Symbol | HTML reserved | HTML | Code | Symbol | HTML reserved | HTML | Code |
|---|---|---|---|---|---|---|---|
| < | Less than | &lt; | &#0060; | " | Quotation mark | &quot; | &#0034; |
| > | Greater than | &gt; | &#0062; | ' | Single quote | &apos; | &#0039; |
| & | Ampersand | &amp; | &#0038; | | | | |

| Symbol | Miscellaneous | HTML | Code | Symbol | Miscellaneous | HTML | Code |
|---|---|---|---|---|---|---|---|
| ← | Leftwards arrow | &larr; | &#8592; | ¦ | Broken vertical bar | &brvbar; | &#0166; |
| ↑ | Upwards arrow | &uarr; | &#8593; | ° | Degree sign | &deg; | &#0176; |
| → | Rightwards arrow | &rarr; | &#8594; | · | Middle dot | &middot; | &#0183; |
| ↓ | Downwards arrow | &darr; | &#8595; | • | Bullet | &bull; | &#8226; |
| ↔ | Left right arrow | &harr; | &#8596; | | | | |

| Symbol | Spacing | HTML | Code | Symbol | Spacing | HTML | Code |
|---|---|---|---|---|---|---|---|
| █ █ | Em space | &emsp; | &#8195; | █ █ | Space | | &#0032; |
| █ █ | Number space | &numsp; | &#8199; | █ █ | Em/4 space | &emsp14; | &#8197; |
| █ █ | En space | &ensp; | &#8194; | █ █ | Punctuation space | &puncsp; | &#8200; |
| █ █ | Em/3 space | &emsp13; | &#8196; | █ █ | Thin space | &thinsp; | &#8201; |
| █ █ | Non-breaking space | &nbsp; | &#0160; | █ █ | Hair space | &hairsp; | &#8202; |

| Symbol | Currency | HTML | Code | Symbol | Currency | HTML | Code |
|---|---|---|---|---|---|---|---|
| $ | Dollar | &dollar; | &#0036; | ¢ | Cent sign | &cent; | &#0162; |
| £ | Pound sign | &pound; | &#0163; | ¥ | Yen | &yen; | &#0165; |
| € | Euro sign | &euro; | &#8364; | ¤ | Currency sign | &curren; | &#0164; |

| Symbol | Numbers | HTML | Code | Symbol | Numbers | HTML | Code |
|---|---|---|---|---|---|---|---|
| ¹ | Superscript one | &sup1; | &#0185; | ½ | Fraction one half | &frac12; | &#0189; |
| ² | Superscript two | &sup2; | &#0178; | ¼ | Fraction one quarter | &frac14; | &#0188; |
| ³ | Superscript three | &sup3; | &#0179; | ¾ | Fraction three quarters | &frac34; | &#0190; |

| Symbol | Punctuation | HTML | Code | Symbol | Punctuation | HTML | Code |
|---|---|---|---|---|---|---|---|
| ¡ | Inverted exclamation mark | &iexcl; | &#0161; | … | Horizontal ellipsis | &hellip; | &#8230; |
| ¿ | Inverted question mark | &iquest; | &#0191; | ‾ | Overline | &oline; | &#8254; |
| “ | Left double quote | &ldquo; | &#8220; | § | Section sign | &sect; | &#0167; |
| ” | Right double quote | &rdquo; | &#8221; | ¶ | Paragraph sign | &para; | &#0182; |
| „ | Double low-9 quote | &bdquo; | &#8222; | © | Copyright sign | &copy; | &#0169; |
| ‘ | Left single quote | &lsquo; | &#8216; | ® | Registered trademark sign | &reg; | &#0174; |
| ’ | Right single quote | &rsquo; | &#8217; | ™ | Trademark sign | &trade; | &#8482; |
| ‚ | Single low-9 quote | &sbquo; | &#8218; | ¬ | Not sign | &not; | &#0172; |
| ◊ | Lozenge | &loz; | &#9674; | µ | Micro sign | &micro; | &#0181; |
| « | Left double angle quote | &laquo; | &#0171; | ‰ | Per mille sign | &permil; | &#8240; |
| » | Right double angle quote | &raquo; | &#0187; | ′ | Prime (straight quote) | &prime; | &#8242; |
| ‹ | Single left angle quote | &lsaquo; | &#8249; | ″ | Double prime (straight quote) | &Prime; | &#8243; |
| › | Single right angle quote | &rsaquo; | &#8250; | † | Dagger | &dagger; | &#8224; |
| – | En dash | &ndash; | &#8211; | ‡ | Double dagger | &Dagger; | &#8225; |
| — | Em dash | &mdash; | &#8212; | | | | |

| Symbol | Greek small letters | HTML | Code | Symbol | Greek capital letters | HTML | Code |
|---|---|---|---|---|---|---|---|
| α | Alpha | &alpha; | &#0945; | Α | Alpha | &Alpha; | &#0913; |
| β | Beta | &beta; | &#0946; | Β | Beta | &Beta; | &#0914; |
| γ | Gamma | &gamma; | &#0947; | Γ | Gamma | &Gamma; | &#0915; |
| δ | Delta | &delta; | &#0948; | Δ | Delta | &Delta; | &#0916; |
| ε | Epsilon | &epsilon; | &#0949; | Ε | Epsilon | &Epsilon; | &#0917; |
| ζ | Zeta | &zeta; | &#0950; | Ζ | Zeta | &Zeta; | &#0918; |
| η | Eta | &eta; | &#0951; | Η | Eta | &Eta; | &#0919; |
| θ | Theta | &theta; | &#0952; | Θ | Theta | &Theta; | &#0920; |
| ι | Iota | &iota; | &#0953; | Ι | Iota | &Iota; | &#0921; |
| κ | Kappa | &kappa; | &#0954; | Κ | Kappa | &Kappa; | &#0922; |
| λ | Lambda | &lambda; | &#0955; | Λ | Lambda | &Lambda; | &#0923; |
| μ | Mu | &mu; | &#0956; | Μ | Mu | &Mu; | &#0924; |
| ν | Nu | &nu; | &#0957; | Ν | Nu | &Nu; | &#0925; |
| ξ | Xi | &xi; | &#0958; | Ξ | Xi | &Xi; | &#0926; |
| ο | Omicron | &omicron; | &#0959; | Ο | Omicron | &Omicron; | &#0927; |
| π | Pi | &pi; | &#0960; | Π | Pi | &Pi; | &#0928; |
| ρ | Rho | &rho; | &#0961; | Ρ | Rho | &Rho; | &#0929; |
| ς | Sigma 1 | &sigmaf; | &#0962; | Σ | Sigma | &Sigma; | &#0931; |
| σ | Sigma 2 | &sigma; | &#0963; | | | | |
| τ | Tau | &tau; | &#0964; | Τ | Tau | &Tau; | &#0932; |
| υ | Upsilon | &upsilon; | &#0965; | Υ | Upsilon | &Upsilon; | &#0933; |
| φ | Phi | &phi; | &#0966; | Φ | Phi | &Phi; | &#0934; |
| χ | Chi | &chi; | &#0967; | Χ | Chi | &Chi; | &#0935; |
| ψ | Psi | &psi; | &#0968; | Ψ | Psi | &Psi; | &#0936; |
| ω | Omega | &omega; | &#0969; | Ω | Omega | &Omega; | &#0937; |

Table 7.3: HTML common escape sequences and codes



Hexadecimal escape codes

The escape codes listed above use decimal numbers for the Unicode characters: &#65;, for example, where 65 is the decimal value of the Unicode number for “A”.

The escape codes can also be given in hexadecimal format, and this will work within GitHub Markdown and Wiki pages. The hexadecimal equivalent of 65 is 41. To use the hexadecimal number in an escape code, precede it with &#x and follow it with a semicolon ;. Thus:

&#65; and &#x41; both display the “A” character.

Tip

Being able to use hexadecimal notation is useful simply because Unicode characters are generally given in hexadecimal format: U+0041 is Unicode for “A”.
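Going from the usual U+ notation to either form of escape code is a one-line conversion. The following Python sketch assumes the input is a string such as "U+0041"; the function name `codepoint_to_escapes` is made up for this example.

```python
import html


def codepoint_to_escapes(notation: str):
    """Turn a 'U+XXXX' string into (decimal, hexadecimal) HTML escape codes."""
    if not notation.upper().startswith("U+"):
        raise ValueError("expected a string such as 'U+0041'")
    value = int(notation[2:], 16)  # the digits after 'U+' are hexadecimal
    return f"&#{value};", f"&#x{value:X};"


decimal_code, hex_code = codepoint_to_escapes("U+0041")
print(decimal_code, hex_code)  # &#65; &#x41;

# Both codes decode to the same character.
print(html.unescape(decimal_code) == html.unescape(hex_code) == "A")  # True
```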

7.2 Special space characters

Markdown (and GitHub Markdown) ignores multiple spaces. In the following example, the two words “TEST” are separated by five spaces:

Markdown:

TEST     TEST

GitHub output:

TEST TEST

Table 7.4: Markdown ignores multiple consecutive spaces

Markdown simply collapses the multiple, consecutive spaces into one.

Markdown does not, however, ignore the non-breaking space character &nbsp;. This has exactly the same width as a normal space character, but will always be rendered by Markdown.

This is the same example with five non-breaking spaces between the two words “TEST”:

Markdown:

TEST&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;TEST

GitHub output:

TEST     TEST

Table 7.5: Markdown does not ignore multiple consecutive non-breaking spaces
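Typing long runs of &nbsp; by hand is tedious, so the substitution can be scripted. This is a minimal Python sketch of one way to do it; the function name `preserve_spaces` is hypothetical, and single spaces are deliberately left alone so that ordinary text still wraps normally.

```python
import re


def preserve_spaces(text: str) -> str:
    """Replace each run of two or more spaces with the same number of &nbsp;."""
    return re.sub(r" {2,}", lambda match: "&nbsp;" * len(match.group(0)), text)


print(preserve_spaces("TEST     TEST"))
# TEST&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;TEST
print(preserve_spaces("one space stays"))
# one space stays
```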

Markdown supports several such space characters. This is a full list; the spaces are bounded by full blocks to give an idea of the width of each type of space, and the second column shows four of each type of space to emphasize the different relative sizes:

| Single space | Four spaces | Name | Escape sequence |
|---|---|---|---|
| █ █ | █    █ | Em space | &emsp; |
| █ █ | █    █ | Number space | &numsp; |
| █ █ | █    █ | En space | &ensp; |
| █ █ | █    █ | Em/3 space | &emsp13; |
| █ █ | █    █ | Non-breaking space | &nbsp; |
| █ █ | █    █ | Em/4 space | &emsp14; |
| █ █ | █    █ | Punctuation space | &puncsp; |
| █ █ | █    █ | Thin space | &thinsp; |
| █ █ | █    █ | Hair space | &hairsp; |

Table 7.6: Different spaces and relative widths

These different size spaces are used extensively in the PracticalSeries Wiki page headings and tables of contents to ensure that the gaps between the heading numbers and the heading text are consistent.

The size of the gap between the heading number on the left and the heading text on the right depends on how many digits there are in the number: 90.10.20 has six digits, 1.2.4 only has three. Both are valid section numbers, but the first will have a smaller space between the last full stop and the heading text.

The width of each type of space depends on where the space is used. A space in a heading (all heading levels are different) is generally larger than the same space used in body text. Similarly, if the text is in a sidebar or footer, the spacings are again different for headings and body text.

The following tables give the width in pixels of each different type of space character for all headings and body text in both the main page area and sidebars/footers (there is no difference between space sizes in sidebars and in footers, they are the same).

Space widths in pixels for the main page

| Space | H1 | H2 | H3 | H4 | H5 | H6 | Body text |
|---|---|---|---|---|---|---|---|
| Em space | 32.00 | 24.00 | 20.00 | 16.00 | 14.00 | 13.60 | 16.00 |
| Number space | 17.77 | 13.33 | 11.10 | 8.87 | 7.77 | 7.53 | 8.63 |
| En space | 16.00 | 12.00 | 10.00 | 8.00 | 7.00 | 6.80 | 8.00 |
| Em/3 space | 10.63 | 8.00 | 6.63 | 5.33 | 4.67 | 4.53 | 5.33 |
| Normal space | 8.80 | 6.60 | 5.47 | 4.40 | 3.87 | 3.73 | 4.37 |
| Non-breaking space | 8.80 | 6.60 | 5.47 | 4.40 | 3.87 | 3.73 | 4.37 |
| Em/4 space | 8.00 | 6.00 | 5.00 | 4.00 | 3.50 | 3.40 | 4.00 |
| Punctuation space | 7.70 | 5.80 | 4.80 | 3.87 | 3.37 | 3.27 | 3.47 |
| Thin space | 6.40 | 4.83 | 4.00 | 3.20 | 2.80 | 2.70 | 3.20 |
| Hair space | 4.00 | 3.00 | 2.50 | 2.00 | 1.77 | 1.70 | 2.00 |
| Two blocks ██ | 47.00 | 35.00 | 30.00 | 24.00 | 21.00 | 21.00 | 24.00 |

Table 7.7: Space widths in the main page (in pixels)
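The measured widths make it possible to work out in advance how wide a particular combination of spaces will be. The sketch below is only an illustration of that arithmetic: it copies the Body text column of Table 7.7 into a Python dictionary, and the names `BODY_TEXT_WIDTHS` and `gap_width` are hypothetical. The figures apply only under the measurement conditions used for the tables.

```python
# Body-text widths in pixels, copied from the "Body text" column of Table 7.7.
BODY_TEXT_WIDTHS = {
    "&emsp;": 16.00,
    "&numsp;": 8.63,
    "&ensp;": 8.00,
    "&emsp13;": 5.33,
    "&nbsp;": 4.37,
    "&emsp14;": 4.00,
    "&puncsp;": 3.47,
    "&thinsp;": 3.20,
    "&hairsp;": 2.00,
}


def gap_width(spaces):
    """Return the total width in pixels of a list of space escape sequences."""
    return sum(BODY_TEXT_WIDTHS[space] for space in spaces)


# An em space followed by an en space is as wide as two full blocks (24 px).
print(gap_width(["&emsp;", "&ensp;"]))  # 24.0
```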

Space widths in pixels for sidebars and footers

| Space | H1 | H2 | H3 | H4 | H5 | H6 | Body text |
|---|---|---|---|---|---|---|---|
| Em space | 24.00 | 18.00 | 15.00 | 12.00 | 10.50 | 10.20 | 12.00 |
| Number space | 13.33 | 10.00 | 8.30 | 6.67 | 5.83 | 5.63 | 6.47 |
| En space | 12.00 | 9.00 | 7.50 | 6.00 | 5.27 | 5.10 | 6.00 |
| Em/3 space | 8.00 | 6.00 | 4.97 | 4.00 | 3.50 | 3.40 | 4.00 |
| Normal space | 6.60 | 4.93 | 4.10 | 3.30 | 2.90 | 2.80 | 3.30 |
| Non-breaking space | 6.60 | 4.93 | 4.10 | 3.30 | 2.90 | 2.80 | 3.30 |
| Em/4 space | 6.00 | 4.50 | 3.73 | 3.00 | 2.63 | 2.53 | 3.00 |
| Punctuation space | 5.80 | 4.33 | 3.60 | 2.90 | 2.53 | 2.43 | 2.60 |
| Thin space | 4.83 | 3.60 | 3.00 | 2.40 | 2.10 | 2.03 | 2.40 |
| Hair space | 3.00 | 2.23 | 1.87 | 1.47 | 1.33 | 1.27 | 1.50 |
| Two blocks ██ | 35.00 | 27.00 | 23.00 | 18.00 | 16.00 | 16.00 | 18.00 |

Table 7.8: Space widths in sidebars and footers (in pixels)

Note

All widths in the above tables are measured using the Edge browser with page magnification set to 100% on a monitor set to its native resolution (2560 × 1440 px).

7.2.1 Escape sequence restrictions in GitHub HTML

For some reason, some HTML escape sequences do not work in GitHub Wiki Markdown.

This is true only when the escape sequences are between HTML tags, i.e. in a table <table>…</table> or between <p>…</p> tags, for example.

Mainly this affects some of the special space characters:

| Name | Non-functional escape sequence | Replacement escape code (decimal) | Replacement escape code (hexadecimal) |
|---|---|---|---|
| Number space | &numsp; | &#8199; | &#x2007; |
| Em/3 space | &emsp13; | &#8196; | &#x2004; |
| Em/4 space | &emsp14; | &#8197; | &#x2005; |
| Punctuation space | &puncsp; | &#8200; | &#x2008; |
| Hair space | &hairsp; | &#8202; | &#x200A; |

Table 7.9: Escape sequences that do not work in GitHub Markdown HTML

Important

This is only a partial list of the most common escape sequences; a full list is available in Appendix A.2.

The alternate decimal and hexadecimal escape codes work everywhere.

Note

The above escape sequences work perfectly well with just Markdown; it is only when they are inside an HTML tag that problems occur.

To complicate things, it is only Wiki Markdown that is affected; all the escape sequences work perfectly well in repository Markdown (see section 5.6).
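Because the numeric codes work everywhere, one workaround is to swap the affected named sequences for their decimal equivalents before the text goes into a Wiki page. This Python sketch covers only the five sequences in Table 7.9, not the full list in Appendix A.2, and the names `WIKI_SAFE_CODES` and `make_wiki_safe` are hypothetical.

```python
# Named sequences from Table 7.9 mapped to their decimal replacements.
WIKI_SAFE_CODES = {
    "&numsp;": "&#8199;",
    "&emsp13;": "&#8196;",
    "&emsp14;": "&#8197;",
    "&puncsp;": "&#8200;",
    "&hairsp;": "&#8202;",
}


def make_wiki_safe(text: str) -> str:
    """Replace each non-functional named sequence with its decimal code."""
    for name, code in WIKI_SAFE_CODES.items():
        text = text.replace(name, code)
    return text


print(make_wiki_safe("<p>7.1&numsp;Heading&hairsp;text</p>"))
# <p>7.1&#8199;Heading&#8202;text</p>
```

Sequences that already work, such as &emsp; and &nbsp;, pass through unchanged.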

7.3 Emojis and emoticons

Emojis and emoticons are pictograms that can be embedded in text to convey some form of emotion, smiley face symbols, that sort of thing. They are popular with teenagers and the intellectually challenged.

GitHub supports a full set of Unicode emojis. These can be pasted directly into a Wiki or Markdown page, entered using short name abbreviations, or entered as either decimal escape codes &#…; or hexadecimal escape codes &#x…;.

There is a standard version of the short names that can be used for emojis. These are managed by the Unicode CLDR (Common Locale Data Repository), available here: https://cldr.unicode.org/.

The Unicode CLDR provides a full list of all emoji characters, their Unicode character (or string of characters) and the formal short form name. The list is available here: https://unicode.org/emoji/charts/full-emoji-list.html.

GitHub allows short names to be used. These are surrounded by a colon : before and after; thus, the crossed fingers emoji is displayed in Markdown with the short name:

   :crossed_fingers: it looks like this: 🤞

The problem with this approach is that GitHub, in its wisdom, decided not to use the standardised (Unicode CLDR) short names; it uses its own versions with slightly different names.

I thought at first this was so that GitHub could use shorter names than the standard CLDR, for example where the CLDR has the name grinning face (😀) and GitHub just has :grinning:.

This argument falls down with the CLDR smiling face with hearts and the GitHub :smiling_face_with_three_hearts: (🥰). So I’ve no idea why GitHub has differed.

Appendix B contains a full list of all the emojis. For completeness, it shows both the GitHub short name and the standardised CLDR short name, the decimal escape code and the hexadecimal escape code.
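Since some emojis are a string of Unicode characters rather than a single one, the escape codes have to be produced one character at a time. The following Python sketch shows the idea; the function name `emoji_escape_codes` is made up for the example, and the emojis are written as \U escapes so that the code does not depend on the editor's font.

```python
def emoji_escape_codes(emoji: str):
    """Return (decimal, hexadecimal) escape codes, one code per character."""
    decimal = "".join(f"&#{ord(char)};" for char in emoji)
    hexadecimal = "".join(f"&#x{ord(char):X};" for char in emoji)
    return decimal, hexadecimal


crossed_fingers = "\U0001F91E"  # the crossed fingers emoji, U+1F91E
print(emoji_escape_codes(crossed_fingers))
# ('&#129310;', '&#x1F91E;')
```

An emoji made of two characters, such as a flag built from two regional indicator symbols, simply produces two escape codes one after the other.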



A note by the Author about emojis

Well, I’m an engineer and engineers in particular are not at home to emojis (we’re not generally tainted with emotion).

Engineers have only one rule for using emojis:

DON'T EVER!

7.4 Comments

It is possible to insert comments in Markdown text.

Comments are visible in the Markdown source, but are not displayed when the page is rendered (in a web browser).

Comments in Markdown are identical to those in HTML.

Any text between <!-- and --> is a comment and will not be displayed:

Markdown:

Comments <!-- Like this --> are not displayed

HTML:

<p>Comments <!-- Like this --> are not displayed</p>

GitHub output:

Comments are not displayed

Table 7.10: Body text examples
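To preview what a reader will actually see, the comments can be stripped from the source text. This is a rough Python sketch using a regular expression, not the method GitHub itself uses; the names `COMMENT_RE` and `strip_comments` are hypothetical.

```python
import re

# Non-greedy match, so each comment ends at its own closing -->.
# re.DOTALL lets a comment run over several lines.
COMMENT_RE = re.compile(r"<!--.*?-->", re.DOTALL)


def strip_comments(markdown: str) -> str:
    """Remove every <!-- ... --> comment from the text."""
    return COMMENT_RE.sub("", markdown)


print(strip_comments("Comments <!-- Like this --> are not displayed"))
```

Removing the comment leaves two consecutive spaces behind; as described in section 7.2, these are collapsed into one when the page is rendered.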

I think I’m one of the only people who bother putting comments in their Markdown.


Footnotes:


Note

💠1 UTF-8 is a Unicode character encoding that is backwards compatible with the old 7-bit ASCII characters that those of us of a certain age will remember. The 8 means it uses 8-bit blocks (bytes to most people, but octets in the Unicode standard) to represent characters; a character can take up to 4 bytes, and UTF-8 can represent all Unicode characters (there are a lot of them, ’bout a million).

UTF-8 is the standard character encoding for web pages and email.
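The variable length of UTF-8 is easy to see by encoding a few characters. This short Python illustration uses only the built-in `str.encode`; the helper name `utf8_bytes` is hypothetical, and the four characters are just examples chosen to need one, two, three and four bytes.

```python
def utf8_bytes(char: str) -> bytes:
    """Return the UTF-8 encoding of a single character."""
    return char.encode("utf-8")


# "A", the micro sign, the euro sign and the crossed fingers emoji.
for char in ["A", "\u00b5", "\u20ac", "\U0001F91E"]:
    encoded = utf8_bytes(char)
    print(f"U+{ord(char):04X} takes {len(encoded)} byte(s)")

# Plain ASCII text is byte-for-byte identical in UTF-8.
print(utf8_bytes("A") == "A".encode("ascii"))  # True
```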



