Common Locale Data Repository

From Wikipedia, the free encyclopedia
Developed by: Unicode Consortium
Initial release: 19 December 2003[1]
Latest release: CLDR 25 (19 March 2014)[1]
Container for: XML[2]
Website: cldr.unicode.org

The Common Locale Data Repository Project, often abbreviated as CLDR, is a project of the Unicode Consortium that provides locale data in XML format for use in computer applications. CLDR contains the locale-specific information that an operating system typically provides to applications. The data is written in LDML (Locale Data Markup Language). It is currently used in International Components for Unicode, Apple's Mac OS X, OpenOffice.org, and IBM's AIX, among other applications and operating systems.
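As a rough illustration of how LDML data can be consumed, the sketch below reads a small XML fragment and flattens it into path/value pairs, in the spirit of the <path,value> view described in the DTD note under References. The fragment is hand-written in the style of an LDML file rather than copied from a CLDR release, and the helper names `flatten` and `load_pairs` are hypothetical, not part of any CLDR tool.

```python
import xml.etree.ElementTree as ET

# Hand-written sample in the style of LDML; not taken from a CLDR release.
SAMPLE_LDML = """
<ldml>
  <identity>
    <language type="en"/>
  </identity>
  <localeDisplayNames>
    <languages>
      <language type="de">German</language>
      <language type="fr">French</language>
    </languages>
    <territories>
      <territory type="CH">Switzerland</territory>
    </territories>
  </localeDisplayNames>
</ldml>
"""


def flatten(element, prefix="/"):
    """Yield (path, value) pairs for every element that carries text."""
    step = element.tag
    # Attributes become part of the path so that sibling elements stay distinct.
    for name, value in sorted(element.attrib.items()):
        step += f'[@{name}="{value}"]'
    path = f"{prefix}/{step}"
    text = (element.text or "").strip()
    if text:
        yield path, text
    for child in element:
        yield from flatten(child, path)


def load_pairs(xml_text):
    """Parse an XML string and return a dict mapping path to value."""
    return dict(flatten(ET.fromstring(xml_text)))


pairs = load_pairs(SAMPLE_LDML)
for path, value in pairs.items():
    print(path, "=", value)
```

Keying the data by path means a lookup such as `pairs['//ldml/localeDisplayNames/languages/language[@type="de"]']` does not depend on element order in the file.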

Among the types of data that CLDR includes are the following:

  • Translations for language names.
  • Translations for territory and country names.
  • Translations for currency names, including singular and plural forms.
  • Translations for weekday, month, era, and period-of-day names, in full and abbreviated forms.
  • Translations for time zones and example cities (or similar) for time zones.
  • Translations for calendar fields.
  • Patterns for formatting/parsing dates or times of day.
  • Exemplar sets of characters used for writing the language.
  • Patterns for formatting/parsing numbers.
  • Rules for language-adapted collation.
  • Rules for formatting numbers in traditional numeral systems (such as Roman or Armenian numerals).
  • Rules for spelling out numbers as words.
  • Rules for transliteration between scripts, many of which are based on BGN/PCGN romanization.
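To make the singular/plural currency names in the list above more concrete, here is a minimal sketch of how an application might pick a display name by plural category. The `CURRENCY_NAMES` table is illustrative data keyed by count category, and `plural_category_en` and `currency_phrase` are hypothetical helpers. The rule is a simplified version of the English cardinal rule ("one" for exactly 1 with no visible fraction digits) and is not a general plural-rule implementation.

```python
from decimal import Decimal

# Illustrative data shaped like count-keyed currency display names.
CURRENCY_NAMES = {
    "en": {"USD": {"one": "US dollar", "other": "US dollars"}},
}


def plural_category_en(number):
    """Simplified English rule: 'one' only for 1 written without fraction digits."""
    d = Decimal(str(number))
    has_fraction_digits = d.as_tuple().exponent < 0
    if abs(d) == 1 and not has_fraction_digits:
        return "one"
    return "other"


def currency_phrase(amount, code, locale="en"):
    """Combine an amount with the currency name matching its plural category."""
    names = CURRENCY_NAMES[locale][code]
    category = plural_category_en(amount)
    # Fall back to 'other' if a category has no dedicated name.
    return f"{amount} {names.get(category, names['other'])}"


print(currency_phrase(1, "USD"))
print(currency_phrase(2, "USD"))
print(currency_phrase("1.0", "USD"))
```

Passing the amount as a string preserves trailing zeros, which matters here because "1.0" carries a visible fraction digit and therefore takes the plural form.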

CLDR overlaps somewhat with ISO 15897 (POSIX locales). POSIX locale information can be derived from CLDR data using some of CLDR's conversion tools.

CLDR is maintained by the CLDR technical committee, which includes representatives from IBM, Apple, Sun Microsystems, and some government-based organizations. The committee is currently chaired by John Emmons (IBM), with Mark Davis (Google) as vice-chair.[3]

References

  1. ^ a b CLDR Releases/Downloads
  2. ^ Updating DTDs. CLDR makes special use of XML because of the way it is structured. In particular, the XML is designed so that you can read in a CLDR XML file and interpret it as an unordered list of <path,value> pairs, called a CLDRFile internally. These path/value pairs can be added or deleted, and then the CLDRFile can be written back out to disk, resulting in a valid XML file. That is a very powerful mechanism, and it also allows for the CLDR inheritance model.
  3. ^ http://cldr.unicode.org/index/process#TOC-Officers

External links

CLDR Tools