Unicode, DOT NET Programming

What is Unicode?

In order to understand the concept of Unicode we require  to move a little back and understand the ANSI code. The ASCII (ask key) stands for American Standard Code for Information Interchange. In the ASCII format every character is represented by one byte (i.e. 8 bits). So in brief we can have 256 characters (2^8). Before UNICODE came into picture the programmers used code page to represent characters in different languages. The Code page is a different interpretation of ASCII set. The Code pages keep 128 characters for English and the rest 128 characters are tailored for a specific language.

The pictorial representation of the same is shown below:

2468_Code page.png

Figure:- Code page in action

There are following demerits of the CODE page approach:-

(i)Some languages such as Chinese have more than 5000 characters that is difficult to represent only 128 character set.

(ii)Only 2 languages can be supported at one time. As said you can use only 128 for English and the rest 128 for the other language.

(iii)The end client must have the code page.

(iv)The Code Representation modify according to the Operating system and Language used.

(v)That means a character can be represented in different numbers depending on the operating system.

For all the above problems the UNICODE was introduced. The UNICODE represents characters with 2 bytes. Therefore if its two bytes that means 18 bits. You can now have 2^16 characters that is 65536 characters. That's a huge number you can include any language in the world. In further if you use surrogates you can have additional 1 million characters...Hmm that can involve type of language including historian characters.

The ASCII representation varied according to the operating system and language. But in the UNICODE it assigns a unique letter for every character irrespective of the Language or the operating system which makes programmers life much easier while developing international compatible applications.

Posted Date: 9/25/2012 3:05:50 AM | Location : United States







Related Discussions:- Unicode, Assignment Help, Ask Question on Unicode, Get Answer, Expert's Help, Unicode Discussions

Write discussion on Unicode
Your posts are moderated
Related Questions
Simple FB Integration in MS-SQL and ASP.NET I am seeking urgent project to be done, please I am interested in someone who will present demo for retrieving personal informatio

What are the naming conventions required for XML elements tags? The following are the naming conventions that need to be followed for XML elements tags: Element names should

Assembly: 1) Assembly is a unit of deployment such as EXE or a DLL. 2) An assembly consists one or more files (dlls, exe's, html files etc.),& represents a group of resources,

Assignment for inheritance based on c#

Briefly explain how server form post-back works ?  Post Back: The process in which a Web page sends data back to the similar page on the server. View State : View State i

Project Description: I am preparing a speech recognition program and need a quick code to get it done I am seeking for someone to create a Gmail API that can get subject, and co

Develop PC Cleaner Software Project Description: We are seeking experienced developer who can develop Registry Cleaner / PC Cleaner software for us. Software should be like c

read a file which has emp.no,name,age as per fields/you want to calculate no.of employess whose age more than50

Describe the XmlWriter class. The XmlWriter class is used to write XML to a stream, a file, or a Textwriter object. This class works in a forward-only, non-cached manner. You c

What's the use of resource manager class? The ResourceManager class helps us to read the resource files and get the values by using key. At First you require to create the obje