W3C, Unicode move to head off character clash

Published: 18 June 2003 y., Wednesday
The different approaches to adding relevance and functionality to documents -- character encoding in Unicode and markup in XML (Extensible Markup Language) -- were beginning to overlap in some areas, and the two organizations have stated their keenness to iron out any areas of conflict. Unicode defines a 65,536-character set which holds all the letters used in alphabets and syllabaries worldwide, radical characters used in logographic (pictorial) languages such as Chinese and diacritical markers used in many scripts to mark vowels or voice tones. But it also includes many characters which define the direction which text runs in, such as from right to left as in Arabic scripts or from top to bottom as in Japanese, paragraph separation codes and ways to deal with odd items such as fractions and superscripts. It is mainly in these areas where Unicode and XML have begun to grate against one another. The two organizations have decided that markup, as used in XML, is generally more robust and functional than Unicode's character encoding for matters not strictly related to producing exotic characters.
Šaltinis: IDG News Service
Copying, publishing, announcing any information from the News.lt portal without written permission of News.lt editorial office is prohibited.

Facebook Comments

New comment


Captcha

Associated articles

Italian police shut down hacker rings

Tipped off by American officials, Italian police shut down two rings of hackers who attacked Web sites belonging to the U.S. Army and NASA more »

Yokohama to let residents decide participation in network

Yokohama Mayor Hiroshi Nakada decided Friday to allow residents of the city to choose whether their personal data can be registered in a national resident registry network to be launched Monday by the central government more »

Light speed

An Israeli startup takes on Moore's law--and Texas Instruments more »

Cheap PCs With Lindows Are Well Intentioned but Flawed

Wal-Mart, the most mass-market retailer imaginable, is committing an outrageous form of computing heresy: On its Web site, it's selling Windows-compatible personal computers without Windows more »

Users divided on the meaning of spam

Businesses in the US and UK agree that spam is a problem, but according to MessageLabs many users cannot reach a consensus on its definition more »

search.lt news

search.lt presents newest links more »

The investigation

FORMER FSB OFFICER TESTIFIES ABOUT 1999 APARTMENT-BUILDING BOMBINGS... more »

Gates: Slow going for .Net

Microsoft on Wednesday acknowledged that its .Net plan has been slow to catch on and laid out an agenda to move the software strategy ahead more »

Virus Dials 911

Police Show Up Only to Find Infected WebTVs. more »

AOL blasted for anti-semitic postings

Filters fail to block 'pro-terrorist' messages more »