W3C, Unicode move to head off character clash

Published: Wednesday, 18 June 2003
The two approaches to adding relevance and functionality to documents -- character encoding in Unicode and markup in XML (Extensible Markup Language) -- have begun to overlap in some areas, and the two organizations, the W3C and the Unicode Consortium, have said they are keen to iron out any conflicts.

Unicode defines a character set, with 65,536 code points in its basic 16-bit range alone, that covers the letters of the world's alphabets and syllabaries, the radicals used in logographic (pictorial) scripts such as Chinese, and the diacritical marks many scripts use to indicate vowels or voice tones. It also includes many characters that define the direction in which text runs, such as right to left in Arabic scripts or top to bottom in Japanese, as well as paragraph separation codes and ways of dealing with odd items such as fractions and superscripts.

It is mainly in these areas that Unicode and XML have begun to grate against one another. The two organizations have decided that markup, as used in XML, is generally more robust and functional than Unicode's character encoding for matters not strictly related to producing exotic characters.
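To make the overlap concrete, the short Python sketch below looks up a few of the Unicode characters of the kind described above (a direction override, a paragraph separator, a superscript and a fraction) and prints each one next to a markup construct that does a similar job. The pairings and the helper name `describe` are illustrative assumptions, not taken from either organization's documents; the markup shown uses common HTML elements as stand-ins for what an XML vocabulary might provide.

```python
import unicodedata

# Characters that carry layout or formatting meaning, each paired with a
# rough markup counterpart. The pairings are illustrative assumptions.
EXAMPLES = [
    ("\u202E", '<bdo dir="rtl">...</bdo>'),  # RIGHT-TO-LEFT OVERRIDE
    ("\u2029", "<p>...</p>"),                # PARAGRAPH SEPARATOR
    ("\u00B2", "<sup>2</sup>"),              # SUPERSCRIPT TWO
    ("\u00BD", "fraction built in markup"),  # VULGAR FRACTION ONE HALF
]


def describe(ch):
    """Return the code point, official name and general category of ch."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch)} [{unicodedata.category(ch)}]"


for ch, markup in EXAMPLES:
    print(f"{describe(ch)} -> {markup}")
```

The general category in square brackets hints at why these characters are awkward in marked-up text: `Cf` (format) and `Zp` (paragraph separator) characters are invisible instructions embedded in the character stream, which is the job markup already does with explicit, nestable elements.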
Source: IDG News Service
Copying, publishing or announcing any information from the News.lt portal without the written permission of the News.lt editorial office is prohibited.
