Header Ads

HTML Charset

HTML Charset

What is Character Encoding?

Character encoding in HTML provides a way for us to convert bites into letters. In order for us to display the HTML document correctly, we have to choose the appropriate character encoding of the program. Character coding in HTML is used to cater to many different purposes.
In HTML, character encoding is also known as character set or character map in two forms.

Uses on a computer usually uses character set or character encoding ASCII - (American International Code for Information Interchange)

ASCII (American International Code for Information Interchange) encoding only upper- and
 Those lowercase letters are Latin alphabet, they support them, numbers 0-9, and some extra characters like! $ + - () @ <>., It has a total of 128 characters.

There are many languages, which either do either in the Latin alphabet or in completely different letters. ASCII does not address such characters; Therefore, if you want, if you use any non-ASCII characters, it is necessary that you fully understand character encoding.
In order to display the HTML document correctly and to deal with various national characters, the International Standards Organization has created a series of character sets.

The ISO (American International Code for Information Interchange) -885 9-1 has been the default character for HTML4. This character set has also supported the character code, which is a total of 256 character code in the number.

By default, the character used for HTML5 is encoding UTF-8

HTML Charset

HTML Chartset Attribution

HTML is used to display HTML page correctly in a web browser
Specifies this within the <Meta> tag( In <head> element ):

<meta http-equiv="Content-Type" content="text/html;charset=ISO-8859-1">

<meta charset="UTF-8">

Differences between character group
The above table will be shown about the following Character sets:
Numb ASCII ANSI 8859 UTF-8 Description
33!!!!exclamation mark
34""""quotation mark
35####number sign
36$$$$dollar sign
37%%%%percent sign
40((((left parenthesis
41))))right parenthesis
43++++plus sign
46....full stop
480000digit zero
491111digit one
502222digit two
513333digit three
524444digit four
535555digit five
546666digit six
557777digit seven
568888digit eight
579999digit nine
60<<<<less-than sign
61====equals sign
62>>>>greater-than sign
63????question mark
64@@@@commercial at
65AAAALatin capital letter A
66BBBBLatin capital letter B
67CCCCLatin capital letter C
68DDDDLatin capital letter D
69EEEELatin capital letter E
70FFFFLatin capital letter F
71GGGGLatin capital letter G
72HHHHLatin capital letter H
73IIIILatin capital letter I
74JJJJLatin capital letter J
75KKKKLatin capital letter K
76LLLLLatin capital letter L
77MMMMLatin capital letter M
78NNNNLatin capital letter N
79OOOOLatin capital letter O
80PPPPLatin capital letter P
81QQQQLatin capital letter Q
82RRRRLatin capital letter R
83SSSSLatin capital letter S
84TTTTLatin capital letter T
85UUUULatin capital letter U
86VVVVLatin capital letter V
87WWWWLatin capital letter W
88XXXXLatin capital letter X
89YYYYLatin capital letter Y
90ZZZZLatin capital letter Z
91[[[[left square bracket
92\\\\reverse solidus
93]]]]right square bracket
94^^^^circumflex accent
95____low line
96````grave accent
97aaaaLatin small letter a
98bbbbLatin small letter b
99ccccLatin small letter c
100ddddLatin small letter d
101eeeeLatin small letter e
102ffffLatin small letter f
103ggggLatin small letter g
104hhhhLatin small letter h
105iiiiLatin small letter i
106jjjjLatin small letter j
107kkkkLatin small letter k
108llllLatin small letter l
109mmmmLatin small letter m
110nnnnLatin small letter n
111ooooLatin small letter o
112ppppLatin small letter p
113qqqqLatin small letter q
114rrrrLatin small letter r
115ssssLatin small letter s
116ttttLatin small letter t
117uuuuLatin small letter u
118vvvvLatin small letter v
119wwwwLatin small letter w
120xxxxLatin small letter x
121yyyyLatin small letter y
122zzzzLatin small letter z
123{{{{left curly bracket
124||||vertical line
125}}}}right curly bracket
128euro sign
129NOT USED
130single low-9 quotation mark
131ƒLatin small letter f with hook
132double low-9 quotation mark
133horizontal ellipsis
135double dagger
136ˆmodifier letter circumflex accent
137per mille sign
138ŠLatin capital letter S with caron
139single left-pointing angle quotation mark
140ŒLatin capital ligature OE
141NOT USED
142ŽLatin capital letter Z with caron
143NOT USED
144NOT USED
145left single quotation mark
146right single quotation mark
147left double quotation mark
148right double quotation mark
150en dash
151em dash
152˜small tilde
153trade mark sign
154šLatin small letter s with caron
155single right-pointing angle quotation mark
156œLatin small ligature oe
157NOT USED
158žLatin small letter z with caron
159ŸLatin capital letter Y with diaeresis
160no-break space
161¡¡¡inverted exclamation mark
162¢¢¢cent sign
163£££pound sign
164¤¤¤currency sign
165¥¥¥yen sign
166¦¦¦broken bar
167§§§section sign
169©©©copyright sign
170ªªªfeminine ordinal indicator
171«««left-pointing double angle quotation mark
172¬¬¬not sign
173­­­soft hyphen
174®®®registered sign
176°°°degree sign
177±±±plus-minus sign
178²²²superscript two
179³³³superscript three
180´´´acute accent
181µµµmicro sign
182pilcrow sign
183···middle dot
185¹¹¹superscript one
186ºººmasculine ordinal indicator
187»»»right-pointing double angle quotation mark
188¼¼¼vulgar fraction one quarter
189½½½vulgar fraction one half
190¾¾¾vulgar fraction three quarters
191¿¿¿inverted question mark
192ÀÀÀLatin capital letter A with grave
193ÁÁÁLatin capital letter A with acute
194ÂÂÂLatin capital letter A with circumflex
195ÃÃÃLatin capital letter A with tilde
196ÄÄÄLatin capital letter A with diaeresis
197ÅÅÅLatin capital letter A with ring above
198ÆÆÆLatin capital letter AE
199ÇÇÇLatin capital letter C with cedilla
200ÈÈÈLatin capital letter E with grave
201ÉÉÉLatin capital letter E with acute
202ÊÊÊLatin capital letter E with circumflex
203ËËËLatin capital letter E with diaeresis
204ÌÌÌLatin capital letter I with grave
205ÍÍÍLatin capital letter I with acute
206ÎÎÎLatin capital letter I with circumflex
207ÏÏÏLatin capital letter I with diaeresis
208ÐÐÐLatin capital letter Eth
209ÑÑÑLatin capital letter N with tilde
210ÒÒÒLatin capital letter O with grave
211ÓÓÓLatin capital letter O with acute
212ÔÔÔLatin capital letter O with circumflex
213ÕÕÕLatin capital letter O with tilde
214ÖÖÖLatin capital letter O with diaeresis
215×××multiplication sign
216ØØØLatin capital letter O with stroke
217ÙÙÙLatin capital letter U with grave
218ÚÚÚLatin capital letter U with acute
219ÛÛÛLatin capital letter U with circumflex
220ÜÜÜLatin capital letter U with diaeresis
221ÝÝÝLatin capital letter Y with acute
222ÞÞÞLatin capital letter Thorn
223ßßßLatin small letter sharp s
224àààLatin small letter a with grave
225áááLatin small letter a with acute
226âââLatin small letter a with circumflex
227ãããLatin small letter a with tilde
228äääLatin small letter a with diaeresis
229åååLatin small letter a with ring above
230æææLatin small letter ae
231çççLatin small letter c with cedilla
232èèèLatin small letter e with grave
233éééLatin small letter e with acute
234êêêLatin small letter e with circumflex
235ëëëLatin small letter e with diaeresis
236ìììLatin small letter i with grave
237íííLatin small letter i with acute
238îîîLatin small letter i with circumflex
239ïïïLatin small letter i with diaeresis
240ðððLatin small letter eth
241ñññLatin small letter n with tilde
242òòòLatin small letter o with grave
243óóóLatin small letter o with acute
244ôôôLatin small letter o with circumflex
245õõõLatin small letter o with tilde
246öööLatin small letter o with diaeresis
247÷÷÷division sign
248øøøLatin small letter o with stroke
249ùùùLatin small letter u with grave
250úúúLatin small letter u with acute
251ûûûLatin small letter with circumflex
252üüüLatin small letter u with diaeresis
253ýýýLatin small letter y with acute
254þþþLatin small letter thorn
255ÿÿÿLatin small letter y with diaeresis

Here is the list of using the character

Sr.No Character Set & Description
1 ISO-8859-1
Latin alphabet part 1
Covering North America,Western Europe, Latin America, theCaribbean, Canada, Africa
2 ISO-8859-2
Latin alphabet part 2
Covering Eastern Europe
3 ISO-8859-3
Latin alphabet part 3
Covering SE Europe, Esperanto, miscellaneous others
4 ISO-8859-4
Latin alphabet part 4
Covering Scandinavia/Baltics (and others not in ISO-8859-1)
5 ISO-8859-5
Latin/Cyrillic alphabet part 5
6 ISO-8859-6
Latin/Arabic alphabet part 6
7 ISO-8859-7
Latin/Greek alphabet part 7
8 ISO-8859-8
Latin/Hebrew alphabet part 8
9 ISO-8859-9
Latin 5 alphabet part 9
Same as ISO-8859-1 except Turkish characters replace Icelandic ones
10 ISO-8859-10
Latin 6 Latin 6 Lappish, Nordic, and Eskimo
11 ISO-8859-15
The same as ISO-8859-1 but with more characters added
12 ISO-2022-JP
Latin/Japanese alphabet part 1
13 ISO-2022-JP-2
Latin/Japanese alphabet part 2
14 ISO-2022-KR
Latin/Korean alphabet part 1

If you want to create an HTML document using characters of multiple character sets, you will be able to do this by using a single Unicode character encoding.
Unicode encoding specifies which can deal with a string in special ways so that
 He could make enough space for the huge character set. Those used to do this, they are known as UTF8, UTF-16, and UTF-32.

Sr.No Character Set & Description
1 UTF-8
A Unicode translation format that comes in 8-bit units, it comes in bytes. By making a character UTF 8 variable width in UTF 8, it can be up to 1 to 4 bytes long.
2 UTF-16
A Unicode translation format which comes in 16-bit units, which comes in this shorts. This UTF can be up to 1 or 2 shorts, making 16 variable widths.
3 UTF-32
A Unicode Translation Format that comes in 32-bit units that is, it comes in longs. It is a fixed-width format and is always 1 "long" in length.

No comments