Unicode, .NET Programming

What is Unicode?

To understand Unicode, we first need to step back and look at ASCII. ASCII (pronounced "ask key") stands for American Standard Code for Information Interchange. ASCII itself is a 7-bit code that defines 128 characters (2^7), but each character is stored in one byte (8 bits), so a byte has room for 256 values (2^8). Before Unicode came into the picture, programmers used code pages to represent characters in different languages. A code page is an extension of the ASCII set: it keeps the first 128 values for the ASCII (English) characters and tailors the remaining 128 values to a specific language.
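The split between the shared lower half and the language-specific upper half can be tried out with a small sketch. It is written in Python purely for illustration (it is not part of the original answer), and the two code pages picked here, `cp1252` (Western European) and `cp1251` (Cyrillic), are just example choices:

```python
# Values 0-127 are plain ASCII and decode the same under every code page.
ascii_byte = bytes([0x41])
letter_western = ascii_byte.decode("cp1252")
letter_cyrillic = ascii_byte.decode("cp1251")

# Values 128-255 belong to the language-specific half of the code page,
# so the same byte value maps to a different character in each one.
high_byte = bytes([0xE9])
western = high_byte.decode("cp1252")    # Latin small letter e with acute
cyrillic = high_byte.decode("cp1251")   # Cyrillic small letter short i

print(letter_western == letter_cyrillic)              # True
print(f"U+{ord(western):04X} U+{ord(cyrillic):04X}")  # U+00E9 U+0439
```

The byte 0x41 is "A" under both code pages, while the byte 0xE9 is a different letter in each.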

Figure: Code page in action

The code page approach has the following drawbacks:

(i) Some languages, such as Chinese, have more than 5000 characters, which cannot be represented in a 128-character set.

(ii) Only two languages can be supported at a time: 128 values are used for English and the remaining 128 for one other language.

(iii) The end client must have the same code page installed.

(iv) The code representation changes according to the operating system and language used.

(v) As a result, the same character can be represented by different numbers depending on the operating system.
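Two of these drawbacks are easy to reproduce. The following is a minimal Python sketch, not taken from the original answer; the helper `fits_in_code_page` and the sample strings are hypothetical names introduced only for this illustration:

```python
def fits_in_code_page(text, code_page):
    """Return True if every character of text has a slot in code_page."""
    try:
        text.encode(code_page)
        return True
    except UnicodeEncodeError:
        return False

chinese = "\u4e2d"                                  # a common Chinese character
greeting = "\u041f\u0440\u0438\u0432\u0435\u0442"   # Russian word "Privet"

# Drawback (i): the Chinese character has no slot in a single-byte code page.
chinese_fits = fits_in_code_page(chinese, "cp1252")

# Drawbacks (iii) to (v): bytes written under the Cyrillic code page are
# silently misread when the receiver assumes the Western European one.
raw = greeting.encode("cp1251")
misread = raw.decode("cp1252")

print(chinese_fits)         # False
print(raw.hex())            # cff0e8e2e5f2
print(misread == greeting)  # False
```

No error is raised in the second case. The receiver simply sees six accented Latin letters instead of the Russian word, which is why a mismatched code page is hard to detect.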

Unicode was introduced to address all of the above problems. Unicode was originally designed to represent each character with 2 bytes, that is, 16 bits, which gives 2^16 = 65,536 characters. That is enough to cover the characters of most languages in everyday use. Furthermore, by using surrogates (pairs of 2-byte units that together represent one character) you get roughly 1 million additional characters, which leaves room for practically any script, including historic ones.
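The arithmetic, and the way a surrogate pair is formed, can be checked with a short Python sketch (an illustration only, not part of the original answer). The function `to_surrogate_pair` is a hypothetical helper, and the Gothic letter U+10348 is just one example of a historic character outside the 2-byte range:

```python
BITS_PER_UNIT = 2 * 8            # two bytes are 16 bits
BMP_SIZE = 2 ** BITS_PER_UNIT    # 65536 values fit in one 2-byte unit
SURROGATE_RANGE = 1024 * 1024    # 1048576 extra characters via pairs

def to_surrogate_pair(code_point):
    """Split a code point above U+FFFF into (high, low) 16-bit surrogates."""
    offset = code_point - 0x10000
    high = 0xD800 + (offset >> 10)    # top 10 bits of the offset
    low = 0xDC00 + (offset & 0x3FF)   # bottom 10 bits of the offset
    return high, low

chinese = "\u4e2d"       # fits in a single 2-byte unit
gothic = "\U00010348"    # GOTHIC LETTER HWAIR, needs a surrogate pair

chinese_bytes = chinese.encode("utf-16-be")
gothic_bytes = gothic.encode("utf-16-be")

print(BMP_SIZE, SURROGATE_RANGE)                         # 65536 1048576
print(chinese_bytes.hex(), gothic_bytes.hex())           # 4e2d d800df48
print([hex(u) for u in to_surrogate_pair(ord(gothic))])  # ['0xd800', '0xdf48']
```

The Chinese character takes 2 bytes, while the Gothic letter takes 4 bytes, made up of the high surrogate 0xD800 followed by the low surrogate 0xDF48.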

With code pages, the representation of a character varied according to the operating system and language. Unicode, in contrast, assigns a unique number to every character irrespective of the language or the operating system, which makes programmers' lives much easier when developing internationalized applications.
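As a rough illustration of the "one number per character" idea, the Python sketch below (not from the original answer; `samples` and `survives_round_trip` are hypothetical names) prints the Unicode number and official name of a few characters from different scripts:

```python
import unicodedata

# One Latin, one accented Latin, one Cyrillic and one Chinese character.
samples = ["A", "\u00e9", "\u0439", "\u4e2d"]

for ch in samples:
    # ord() returns the character's Unicode number on every platform.
    print(f"U+{ord(ch):04X}  {unicodedata.name(ch)}")

def survives_round_trip(ch, encoding):
    """Return True if encoding and then decoding gives back the same character."""
    return ch.encode(encoding).decode(encoding) == ch

all_ok = all(
    survives_round_trip(ch, enc)
    for ch in samples
    for enc in ("utf-8", "utf-16")
)
print(all_ok)  # True
```

The accented Latin letter and the Cyrillic letter each have their own number here (U+00E9 and U+0439), so they can appear together in the same text without any code page switch.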

Posted Date: 9/25/2012 3:05:50 AM | Location : United States






