| advertise add site services publishers database health videos | ![]() | about toolbar stats live show health store more stuff JOIN/LOGIN |
Articles on the English Wikipedia may contain words or texts written in different languages and scripts. To be able to correctly view and edit these articles requires that you have the appropriate fonts installed and to have correctly configured your operating system and browser. This guide will help you to do so.
[edit] Overview[edit] UnicodeArticles on Wikipedia are encoded using Unicode (specifically UTF-8)[1], an industry standard designed to allow text and symbols from all of the writing systems of the world to be consistently represented and manipulated by computers. Because UTF-8 is backwards compatible with ASCII, and most modern browsers have at least basic Unicode support, most users will experience little difficulty reading and editing Wikipedia. For older browsers, MediaWiki, the Wikipedia software, serves the wikitext in a safe mode upon editing. Characters that cannot be represented in ASCII are temporarily converted to hexadecimal character references, looking like ሴ. Existing hexadecimal character references get an additional leading zero so they are not converted to actual characters when the page is saved, and look like ሴ. Likewise, to create a hexadecimal character reference in safe mode, not the character itself, a leading zero should be added. One can check whether safe mode is used by editing this section. If M looks like M rather than M, safe mode is used. [edit] FontMost computers with Microsoft Windows or Microsoft Office will already have several fonts with support for Latin, Greek, Cyrillic, Hebrew, Arabic, Chinese, Japanese, Korean and the International Phonetic Alphabet installed. Several historic and accented characters (used in the transliteration of foreign scripts) are missing, though.
[edit] Other available unicode fonts
[edit] Browsers
[edit] Scripts[edit] East AsianMain article: Help:Multilingual support (East Asian)
[edit] EthiopicMain article: Help:Multilingual support (Ethiopic) The Ethiopic syllabary is used in central east Africa for Amharic, Bilen, Oromo, Tigré, Tigrinya, and other languages. It evolved from the script for classical Ge'ez, which is now strictly a liturgical language.
[edit] IndicMain article: Help:Multilingual support (Indic) The following table compares how a correctly enabled computer would render the following scripts with how your computer renders them:
[edit] BurmaMain article: Help:Multilingual support (Burmese) [edit] Fonts
[edit] CopticThis is the Language used in Egypt before Arabic. It is currently used solely as a liturgical language.
[edit] Cherokee
[edit] Special cases[edit] Esperanto
Mediawiki installations configured for Esperanto use UTF-8 for storage and display. However when editing the text is converted to a form that is designed to be easier to edit with a standard keyboard. The characters for which this applies are: Ĉ, Ĝ, Ĥ, Ĵ, Ŝ, Ŭ, ĉ, ĝ, ĥ, ĵ, ŝ, ŭ. you may enter these directly in the edit box if you have the facilities to do so. However when you edit the page again you will see them encoded as Sx. This form is referred to as "x-sistemo" or "x-kodo". In order to preserve round trip capability when one or more x's follow these characters or their non-accented forms (A, G, H, J, S, U, c, g, h, j, s, u), the number of x's in the edit box is double the number in the actual stored article text. For example, the interlanguage link [[en:Luxury car]] to en:Luxury car has to be entered in the edit box as [[en:Luxxury car]] on eo:. This has caused problems with interwiki update bots in the past. [edit] RomanianThe Romanian alphabet contains an S-comma (Ș ș) and T-comma (Ț ț). These characters were added to Unicode 3.0 at the request of the Romanian standardization institute. Font support for these characters is poor, so the Romanian Wikipedia represents these letters with an S-cedilla (Ş ş) and T-cedilla (Ţ ţ) instead.[4] [edit] See also
[edit] Notes
[edit] External links |
| ↑ top of page ↑ | about thumbshots |