English in computing
| Content languages used for websites |
|---|
| Source: W3Techs, 24 March 2013[1] |
The English language is sometimes described as the lingua franca of computing. In comparison to other sciences, where Latin and Greek are the principal sources of vocabulary, Computer Science borrows more extensively from English. Due to the technical limitations of early computers, and the lack of international standards on the Internet, computer users were limited to using English and the Latin alphabet. However, this historical limitation is less present today. Most software products are localized in numerous languages and the use of the Unicode character encoding has resolved problems with non-Latin alphabets. Some limitations have only been changed recently, such as with domain names, which previously allowed only ASCII characters.
Contents |
Influence on other languages [edit]
The computing terminology of many languages borrows from English. Some language communities resist actively to that trend, and in other cases English is used extensively and more directly. This section gives some examples for the use of English terminology in other languages, and also mentions any notable differences.
Bulgarian [edit]
Both English and Russian have influence over Bulgarian computing vocabulary. However, in many cases the borrowed terminology is translated, and not transcribed phonetically. Combined with the use of Cyrillic this can make it difficult to recognize loanwords. For example the Bulgarian term for motherboard is 'дънна платка' (IPA /danna platka/ or literally "bottom board" ).
- компютър /kompiutar/ - computer
- твърд диск /tvard disk/ - hard disk
- дискета /disketa/ - floppy disk; like the French disquette
- уеб сайт /web sait/ - web site; but also "интернет страница" /internet page/
Faroese [edit]
The Faroese language has a sparse scientific vocabulary based on the language itself. Many Faroese scientific words are borrowed and/or modified versions of especially Nordic and English equivalents. The vocabulary is constantly evolving and thus new words often die out, and only a few survive and become widely used. Examples of successful words include e.g. "telda" (computer), "kurla" (at sign) and "ambætari" (server).[2]
French [edit]
In French there are some generally accepted English loan-words, but there is also a distinct effort to avoid them. In France, the Académie française is responsible for the standardisation of the language and often coins new technological terms. Some of them are accepted in practice, in other cases the English loanwords remain predominant. In Quebec, the Office québécois de la langue française has a similar function.
- email/mail (in Europe); courriel (mainly in Quebec, but increasingly used in French speaking Europe); informally mél; more formally "courrier électronique"
- pourriel - Spam
- hameçonnage, phishing - Phishing
- télécharger - to download
- site web - web site
- lien - website hyper-link
- base de données - Database
- caméra web - Webcam
- amorcer, démarrer, booter - to boot
- redémarrer, rebooter - to reboot
- arrêter, éteindre - to shutdown
- amorçable, bootable - Bootable
- overclocking, surfréquençage, surcadençage - Overclocking
- watercooling: refroidissement à l'eau
- tuning PC: case modding
German [edit]
In German, English words are very often used as well:
- noun: Computer, Website, Software, E-Mail, Blog
- verb: downloaden, booten, crashen
Icelandic [edit]
The Icelandic language has its own vocabulary of scientific terms, still English borrowings exist. English or Icelandicised words are mostly used in casual conversations, whereas the Icelandic words might be longer or not widespread.
Polish [edit]
Polish language words derived from English:
Russian [edit]
Spanish [edit]
The English influence on the software industry and the internet in Latin America has borrowed significantly from the Castilian lexicon.
- Frequently untranslated, and their Spanish equivalent
- email: correo electrónico
- mouse (only in Latin America): ratón (mainly in Spain)
- messenger: mensajero
- webcam: cámara web
- website: página web, sitio web
- blog: bitácora, 'blog'
- Not translated
- web
- flog
- Undecided
Many computing terms in Spanish share a common root with their English counterpart. In these cases, both terms are understood, but the Spanish is preferred for formal use:
- link vs enlace or vínculo
- net vs red
Character encoding [edit]
The early computer software and hardware had very little support for alphabets other than the Latin. As a result of this it was difficult or impossible to represent languages based on other scripts. The ASCII character encoding, created in the 1960s, only supported 256 different characters. With the use of additional software it was possible to provide support for some languages, for instance those based on the Cyrillic alphabet. However, complex-script languages like Chinese or Japanese need more characters than the 256 limit imposed by ASCII. Some computers created in the former USSR had native support for the Cyrillic alphabet.
The wide adoption of Unicode, and UTF-8 on the web, resolved most of these historical limitations. ASCII remains the de facto standard for command interpreters, programming languages and text-based communication protocols.
- Mojibake - Common mistakes
Programming language [edit]
The syntax of most programming languages uses English keywords, and therefore it could be argued some knowledge of English is required in order to use them. However, it is important to recognize all programming languages are in the class of formal languages. They are very different from any natural language, including English.
Some examples of non-English programming languages:
- Although it uses English keywords, Ruby allows the use of Japanese characters in variable names, and other elements of the code.
- Arabic: ARLOGO, قلب
- Bengali: BangaBhasha
- Chinese: Chinese BASIC
- Dutch: Superlogo
- French: LSE, WinDev, Pascal (although the English version is more widespread)
- Hebrew: Hebrew Programming Language
- Icelandic: Fjölnir
- Indian Languages: Hindawi Programming System
- Russian: Glagol
- Spanish: Lexico
Communication protocols [edit]
Many application protocols, especially those depending on widespread standardisation to be effective, use text strings for requests and parameters, rather than the binary values commonly used in lower layer protocols. The request strings are generally based on English words, although in some cases the strings are contractions or acronyms of English expressions, which renders them somewhat cryptic to anyone not familiar with the protocol, whatever their proficiency in English. Nevertheless, the use of word-like strings is a convenient mnemonic device that allows a person skilled in the art (and with sufficient knowledge of English) to execute the protocol manually from a keyboard, usually for the purpose of finding a problem with the service.
Examples:
- FTP: USER, PASS (password), PASV (passive), PORT, RETR (retrieve), STOR (store), QUIT
- SMTP: HELO (hello), MAIL, RCPT (recipient), DATA, QUIT
- HTTP: GET, PUT, POST, HEAD (headers), DELETE, TRACE, OPTIONS
It is notable that response codes, that is, the strings sent back by the recipient of a request, are typically numeric: for instance, in HTTP (and some borrowed by other protocols)
- 200 OK request succeeded
- 301 Moved Permanently to redirect the request to a new address
- 404 Not Found the requested page does not exist
This is because response codes also need to convey unambiguous information, but can have various nuances that the requester may optionally use to vary its subsequent actions. To convey all such "sub-codes" with alphabetic words would be unwieldy, and negate the advantage of using pseudo-English words. Since responses are usually generated by software they do not need to be mnemonic. Numeric codes are also more easily analysed and categorised when they are processed by software, instead of a human testing the protocol by manual input.
Localization [edit]
BIOS [edit]
Many personal computers have a BIOS chip, displaying text in English during boot time.
Keyboard shortcut [edit]
Keyboard shortcuts are usually defined in terms of English keywords such as CTRL+F for find.
English on the World Wide Web [edit]
English is the largest language on the World Wide Web, with 27% of internet users. Please refer to the article for Internet linguistic patterns for more details.
English speakers [edit]
Web user percentages usually focus on raw comparisons of the first language of those who access the web. Just as important is a consideration of second- and foreign-language users; i.e., the first language of a user does not necessarily reflect which language he or she regularly employs when using the web.
Native speakers [edit]
English-language users appear to be a plurality of web users, consistently cited as around one-third of the overall (near one billion). This reflects the relative affluence of English-speaking countries and high Internet penetration rates in them.
This lead may be eroding due mainly to a rapid increase of Chinese users,[4] which broadly parallels China's advance on other economic fronts. In fact, if first-language speakers are compared, Chinese ought, in time, to outstrip English by a wide margin (837+ million for Mandarin Chinese, 370+ million for English).
First-language users among other relatively affluent countries appear generally stable, the two largest being German and Japanese, which each have between 5% and 10% of the overall share.
As a foreign language [edit]
|
|
This section may stray from the topic of the article. (June 2011) |
If a gradual decline in English first-language users is inevitable, it does not necessarily follow that English will not continue to be the language of choice for those accessing the World Wide Web. There is an enormous pool of English second-language speakers who employ the language in technical, governmental and educational spheres[5] and access the Internet in English.
A classic example of this scenario is India, the world's second most populated country. With economic growth, English has begun exploding as the emerging lingua franca in India. In 1995 it was thought that perhaps only 4% of the population was truly fluent in English (still an impressive 40 million).[6] A decade later, by 2005, India had the world's largest English-speaking and understanding population [7] and second largest "Fluent English" speaking population (led only by the U.S.). It is expected to have the world's largest number of English speakers within a decade.[8]
Chinese is rarely employed as a lingua franca outside of China by non-ethnic Chinese; even countries bordering the country or with large Chinese minorities (Malaysia) tend toward English as a commercial and educational language[citation needed]. Further, China is not truly monoglot: Mandarin is official but different spoken variants of Chinese are often mutually unintelligible; the diaspora disproportionately speaks Cantonese[citation needed]. There is, however, an existing written standard that serves as a common written language.
In the future, then, English and Chinese may have roughly equal positions at the top of the overall web first-language users, but English will likely continue to dominate as the default choice for those accessing the World Wide Web in a second language.
Other world languages that could conceivably begin to challenge English include Spanish and Arabic, though it remains to be seen if these, too, will be largely isolated to first-language speakers on the Internet as is Chinese.
World Wide Web content [edit]
One widely quoted figure for the amount of web content in English is 80%.[9] Other sources show figures five to fifteen points lower, though still well over 50%.[10] [11] [12] There are two notable facts about these percentages:
The English web content is greater than the number of first-language English users by as much as 2 to 1.[citation needed]
Given the enormous lead it already enjoys and its increasing use as a lingua franca in other spheres, English web content may continue to dominate even as English first-language Internet users decline. This is a classic positive feedback loop: new Internet users find it helpful to learn English and employ it on-line, thus reinforcing the language's prestige and forcing subsequent new users to learn English as well.
Certain other factors (some predating the medium's appearance) have propelled English into a majority web-content position. Most notable in this regard is the tendency for researchers and professionals to publish in English to ensure maximum exposure. The largest database of medical bibliographical information, for example, shows English was the majority language choice for the past forty years and its share has continually increased over the same period.[13]
The fact that non-Anglophones regularly publish in English only reinforces the language's dominance. English has a rich technical vocabulary[citation needed] (largely because native and non-native speakers alike use it to communicate technical ideas) and many IT and technical professionals use English regardless of country of origin (Linus Torvalds, for instance, comments his code in English, despite being from Finland and having Swedish as his first language).
Notes [edit]
- ^ "Usage of content languages for websites", W3Techs, retrieved 24 March 2013]
- ^ List of Faroese-English-Danish IT words
- ^ "dżojstik". Słownik języka polskiego. Polish Scientific Publishers PWN. Retrieved 2012-09-26.
- ^ English grip on internet being eroded | Technology | Guardian Unlimited
- ^ NationMaster - English status (most recent) by country
- ^ English in India - and Who Speaks English to Whom and When?
- ^ ABC NewsRadio: wordwatch, English speakers
- ^ Inglish as She's Spoke
- ^ What percentage of the internet is in English?
- ^ "Usage of content languages for websites". W3Techs.com. Retrieved 30 December 2011.
- ^ http://www.verisign.com/press_releases/pr/page_029135.html
- ^ English could snowball on Net TRN 112101
- ^ Language and country preponderance trends in MEDLINE and its causes