Latin characters in Unicode

Latin characters in Unicode

Unicode as of version 5.1 defines the following ranges for encoding the Latin alphabet and derived characters:
*Basic Latin 0000–007F: identical to ASCII (0000–001F, 007F are control characters, 0020–003F are punctuation and Arabic numerals)
*Latin-1 Supplement 0080–00FF: identical to ISO/IEC 8859-1 (0080–009F are control characters, 00A0–00BF are currency symbols, punctuation and numerals)
*Latin Extended-A 0100–017F
*Latin Extended-B 0180–024F
*IPA Extensions 0250–02AF
*Phonetic Extensions 1D00–1D7F
*Phonetic Extensions Supplement 1D80–1DBF
*Latin Extended Additional 1E00–1EFF
*Letterlike Symbols 2100–214F
*Enclosed Alphanumerics 2460–24FF
*Latin Extended-C 2C60–2C7F
*Latin Extended-D A720–A7FF
*Alphabetic Presentation Forms (Latin ligatures) FB00–FB4F
*Halfwidth and Fullwidth Forms (fullwidth Latin letters) FF00–FFEF
*Mathematical Alphanumeric Symbols 1D400–1D7FF

The "extended" ranges contain mainly precomposed diacritics that may be equivalently encoded with combining diacritics, as well as some ligatures, used in the orthography of various African languages (including click symbols in Latin Extended-B) and the Vietnamese language (Latin Extended Additional).Latin Extended-C contains additions for Uighur and the Claudian letters. Latin Extended-D is made up of characters that are mostly of interest to medievalists.

ee also

*Mapping of Unicode characters
*Letterlike Symbols
*List of Latin letters

External links

* [http://www.unicode.org/charts/PDF/U0000.pdf Basic Latin]
* [http://www.unicode.org/charts/PDF/U0080.pdf Latin-1]
* [http://www.unicode.org/charts/PDF/U0100.pdf Latin Extended A]
* [http://www.unicode.org/charts/PDF/U0180.pdf Latin Extended B]
* [http://www.unicode.org/charts/PDF/U0250.pdf IPA Extensions]
* [http://www.unicode.org/charts/PDF/U1D00.pdf Phonetic Extensions]
* [http://www.unicode.org/charts/PDF/U1D80.pdf Phonetic Extensions Supplement]
* [http://www.unicode.org/charts/PDF/U1E00.pdf Latin Extended Additional]
* [http://www.unicode.org/charts/PDF/U2100.pdf Letterlike Symbols]
* [http://www.unicode.org/charts/PDF/U2460.pdf Enclosed Alphanumerics]
* [http://www.unicode.org/charts/PDF/U2C60.pdf Latin Extended C]
* [http://www.unicode.org/charts/PDF/UA720.pdf Latin Extended D]
* [http://www.unicode.org/charts/PDF/UFB00.pdf Latin Ligatures (Alphabetic Presentation Forms)]
* [http://www.unicode.org/charts/PDF/UFF00.pdf Fullwidth Latin Letters (Halfwith and Fullwidth forms)]
* [http://www.unicode.org/charts/PDF/U1D400.pdf Mathematical Alphanumeric Symbols]


Wikimedia Foundation. 2010.

Игры ⚽ Нужна курсовая?

Look at other dictionaries:

  • List of precomposed Latin characters in Unicode — This is the list of all precomposed characters in Unicode. Unicode typefaces (e.g. Fixedsys Excelsior) may be needed for these to display correctly. Latin Alphabets with diacritics Ligatures Row cell order. See also *Alphabets derived from the… …   Wikipedia

  • Latin Extended-A unicode block — The Latin Extended A unicode block is a unicode block of the Unicode standard.It encodes Latin letters from the Latin ISO character sets other than Latin 1 (which is already encoded in the Latin 1 Supplement unicode block) and also legacy… …   Wikipedia

  • Duplicate characters in Unicode — Unicode has a certain amount of duplication of characters. These are pairs of single Unicode code points that are canonically equivalent. The reason for this are compatibility issues with legacy systems. Unless two characters are canonically… …   Wikipedia

  • Unicode Phonetic Symbols — Unicode supports several phonetic alphabets and notations through the existing writing systems and the addition of several phonetic extension blocks. *IPA Extensions (0250–02AF); Spacing Modifier Letters (02B0–02FF); Phonetic Extensions… …   Wikipedia

  • Cyrillic characters in Unicode — Cyrillic script Slavic letters А Б В Г Ґ Д …   Wikipedia

  • Unicode symbols — v · Character Types Scripts Unihan ideographs, etc. Phonetic characters Punctuation and separators Diacritics and other marks Symbols Numerals Compatibility characters …   Wikipedia

  • Latin alphabet — Infobox Writing system name=Latin alphabet type=Alphabet languages=Latin and Romance languages; most languages of Europe; Romanizations exist for practically all known languages. time= 700 B.C. to the present. fam1=Egyptian hieroglyphs fam2=Proto …   Wikipedia

  • Latin — Infobox Language name=Latin nativename= la. Lingua Latina pronunciation=/laˈtiːna/ states=Vatican City speakers= Native: none Second Language Fluent: estimated at 5,000Fact|date=April 2007 Second Language Literate: estimated 25,000Fact|date=April …   Wikipedia

  • Unicode character property — Unicode assigns character properties to each code point.[1] These properties can be used to handle characters (code points) in processes, like in line breaking, script direction right to left or applying controls. Slightly inconsequently, some… …   Wikipedia

  • Unicode — For the 1889 Universal Telegraphic Phrase book, see Commercial code (communications). The Unicode official logo since October 2009 …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”