Unicode, DOT NET Programming

What is Unicode?

In order to understand the concept of Unicode we require  to move a little back and understand the ANSI code. The ASCII (ask key) stands for American Standard Code for Information Interchange. In the ASCII format every character is represented by one byte (i.e. 8 bits). So in brief we can have 256 characters (2^8). Before UNICODE came into picture the programmers used code page to represent characters in different languages. The Code page is a different interpretation of ASCII set. The Code pages keep 128 characters for English and the rest 128 characters are tailored for a specific language.

The pictorial representation of the same is shown below:

2468_Code page.png

Figure:- Code page in action

There are following demerits of the CODE page approach:-

(i)Some languages such as Chinese have more than 5000 characters that is difficult to represent only 128 character set.

(ii)Only 2 languages can be supported at one time. As said you can use only 128 for English and the rest 128 for the other language.

(iii)The end client must have the code page.

(iv)The Code Representation modify according to the Operating system and Language used.

(v)That means a character can be represented in different numbers depending on the operating system.

For all the above problems the UNICODE was introduced. The UNICODE represents characters with 2 bytes. Therefore if its two bytes that means 18 bits. You can now have 2^16 characters that is 65536 characters. That's a huge number you can include any language in the world. In further if you use surrogates you can have additional 1 million characters...Hmm that can involve type of language including historian characters.

The ASCII representation varied according to the operating system and language. But in the UNICODE it assigns a unique letter for every character irrespective of the Language or the operating system which makes programmers life much easier while developing international compatible applications.

Posted Date: 9/25/2012 3:05:50 AM | Location : United States







Related Discussions:- Unicode, Assignment Help, Ask Question on Unicode, Get Answer, Expert's Help, Unicode Discussions

Write discussion on Unicode
Your posts are moderated
Related Questions
Computer programming Project Description: Solving some issues about traffic jam Skills required: C# Programming, Java, C++ Programming, PHP, Software Architecture

What are satellite assemblies? We use resource files to store data according to the localized languages. But when you actually go for the deployment you will not like to also i

How we get the current culture of the environment in windows and ASP.NET? The "CultureInfo.CurrentCulture" represents the current culture of the environment. For illustrate if

Reading strings from the keyboard   using System; class Prog3_1 {       public static void Main()   {    Console.Write ("Enter Your First Name : "); // Displaying to writ

What is SIX sigma? The Sigma means the deviation in Greek language. The Deviation means how much variations exist in a set of data. For illustrate  let's say in a software main


Project Description: We are a small development house in Singapore, and have a project which is 90 percent complete; thud our previous developer could not finish due to other co

Prevent .NET DLL to be decompiled By design phase .NET embeds rich Metadata inside the executable code using the MSIL. Anyone can easily  decompile DLL back using tools such as

how we can display a alphabets by identifies an image and if its different than show someother alpbabets.

What is the difference between the Take and Skip clauses? The Take clause returns a specified number of elements. For instance, you can use the Take clause to return two values