|Born||July 17, 1894
Reedsburg, Wisconsin, USA
|Died||24 November 1978
New Milford, Connecticut, USA
|Known for||machine translation|
|Awards||Kalinga Prize (1964)|
Warren Weaver (July 17, 1894 – November 24, 1978) was an American scientist, mathematician, and science administrator. He is widely recognized as one of the pioneers of machine translation, and as an important figure in creating support for science in the United States.
Weaver received three degrees from the University of Wisconsin–Madison: a Bachelor of Science in 1916, a civil engineering degree in 1917, and a Ph.D. in 1921. He became an assistant professor of mathematics at Throop College (now California Institute of Technology). He served as a second lieutenant in the Air Service during World War I. After the war, he returned to teach mathematics at Wisconsin (1920–32). Weaver married Mary Hemenway, one of his fellow students at Wisconsin, a few years after their graduation. They had a son, Warren Jr., and a daughter, Helen.
Weaver was director of the Division of Natural Sciences at the Rockefeller Foundation (1932–55), and was science consultant (1947–51), trustee (1954), and vice president (from 1958) at the Sloan-Kettering Institute for Cancer Research. His chief research interests were the problems of communication in science and the mathematical theory of probability and statistics.
At the Rockefeller Foundation, he was responsible for approving grants for major projects in molecular engineering and genetics, in agriculture (particularly for developing new strains of wheat and rice), and in medical research. During World War II, he was seconded from the foundation to head the Applied Mathematics Panel at the U.S. Office of Scientific Research and Development, directing the work of mathematicians in operations research with the assistance of Mina Rees. He was familiar with the development of electronic calculating machines and the successful application of mathematical and statistical techniques in cryptography.
When Claude Shannon's landmark 1948 articles on communication theory were republished in 1949 as The Mathematical Theory of Communication, the book also republished a much shorter article authored by Weaver, which discusses the implications of Shannon's more technical work for a general audience.
With Max Mason he co-authored the book The Electromagnetic Field, first published in 1929 and re-issued in 1959. He also authored the book Lady Luck: The Theory of Probability, first published in 1963 and republished in 1982.
The "Translation" memorandum
Weaver first mentioned the possibility of using digital computers to translate documents between natural human languages in March 1947, in a letter to the cyberneticist Norbert Wiener. Over the following two years, his colleagues at the Rockefeller Foundation urged him to elaborate on his ideas. The result was a memorandum, entitled simply "Translation", which he wrote in July 1949 at Carlsbad, New Mexico.
Said to be probably the single most influential publication in the early days of machine translation, it formulated goals and methods before most people had any idea of what computers might be capable of, and was the direct stimulus for the beginnings of research first in the United States and then later, indirectly, throughout the world. The impact of Weaver's memorandum is attributable not only to his widely recognized expertise in mathematics and computing, but also, and perhaps even more, to the influence he enjoyed with major policy-makers in U.S. government agencies.
Weaver's memorandum was designed to suggest more fruitful methods than any simplistic word-for-word approach, which had grave limitations. He put forward four proposals. The first was that the problem of multiple meanings might be tackled by the examination of immediate context. For example, the English word fast has at least two meanings which we can paraphrase as rapid or motionless. If we wish to translate an English text, it is likely that these two senses of fast correspond to different words in the target language, and in order to translate the word correctly one needs to know which sense is intended. Weaver proposed that this problem could be solved by looking at the words that occur in the vicinity of the word to be translated, and he conjectured that the number of context words that would be required is fairly small.
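The context-window idea can be illustrated with a minimal sketch. This is not Weaver's own procedure, which the memorandum describes only in outline; the cue-word lists, the function name `disambiguate`, and the window size are all assumptions made up for this example.

```python
# Toy illustration of choosing a word sense from nearby words.
# The cue words below are invented for this example, not taken from any lexicon.
SENSE_CUES = {
    "rapid": {"run", "runs", "car", "train", "speed", "quickly"},
    "motionless": {"stuck", "held", "tied", "stand", "firmly"},
}


def disambiguate(tokens, index, window=2):
    """Return the sense whose cue words overlap most with the context.

    The context is the `window` tokens on either side of tokens[index].
    Returns None when no cue word appears in the context.
    """
    start = max(0, index - window)
    context = set(tokens[start:index] + tokens[index + 1:index + 1 + window])
    scores = {sense: len(cues & context) for sense, cues in SENSE_CUES.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


s1 = "the train runs fast today".split()
s2 = "the door was stuck fast".split()
print(disambiguate(s1, s1.index("fast")))  # sense chosen from "train", "runs", "today"
print(disambiguate(s2, s2.index("fast")))  # sense chosen from "was", "stuck"
```

With a window of two words on each side, the first sentence is resolved through "train" and "runs" and the second through "stuck", which reflects Weaver's conjecture that only a small amount of context is needed.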
The second proposal in the memorandum was inspired by work on an early type of neural network by McCulloch and Pitts. Weaver interpreted these results as meaning that given a set of premises, any logical conclusion could be deduced automatically by computer. To the extent that human language has a logical basis, Weaver hypothesized that translation could be addressed as a problem of formal logic, deducing "conclusions" in the target language from "premises" in the source language.
The third proposal was that cryptographic methods were possibly applicable to translation. If we want to translate, say, a Russian text into English, we can take the Russian original as an encrypted version of the English plaintext. Weaver was especially impressed with the potential of Shannon's classified work on cryptography and information theory from World War II.
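One way to picture the "encrypted English" view is the noisy-channel scoring used in later statistical machine translation, which picks the English candidate e that maximizes P(e) × P(r | e) for an observed Russian word r. The sketch below is an illustration of that later formulation, not something spelled out in the memorandum; the probabilities, table names, and the function `decode` are invented for the example.

```python
# Toy noisy-channel "decoding" of a single Russian word into English.
# All numbers are invented for illustration; they are not corpus estimates.

# P(e): how plausible each English candidate is on its own (hypothetical prior).
PRIOR = {"castle": 0.3, "lock": 0.7}

# P(r | e): how likely the Russian word is as an "encoding" of each candidate.
CHANNEL = {
    ("замок", "castle"): 0.6,
    ("замок", "lock"): 0.2,
}


def decode(russian_word):
    """Return the English candidate with the highest P(e) * P(r | e)."""
    scores = {
        e: PRIOR[e] * CHANNEL.get((russian_word, e), 0.0)
        for e in PRIOR
    }
    return max(scores, key=scores.get)


print(decode("замок"))
```

A real system would score whole sentences and estimate both tables from data; the single-word version only shows how the "decipherment" framing turns translation into a search for the most probable plaintext.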
Finally, the fourth proposal was that there may also be linguistic universals underlying all human languages which could be exploited to make the problem of translation more straightforward. Weaver argued for this position with what is one of the best-known metaphors in the literature of machine translation: "Think, by analogy, of individuals living in a series of tall closed towers, all erected over a common foundation. When they try to communicate with one another, they shout back and forth, each from his own closed tower. It is difficult to make the sound penetrate even the nearest towers, and communication proceeds very poorly indeed. But, when an individual goes down his tower, he finds himself in a great open basement, common to all the towers. Here he establishes easy and useful communication with the persons who have also descended from their towers".
Weaver's memorandum triggered immediate action on the part of other MT specialists. One of the first people on the scene was Erwin Reifler, who is mentioned in the memorandum itself. In a study carried out in January 1950, he put forward the idea of pre- and post-editing, on the assumption that fully automated translation could only be done on the basis of word-for-word substitutions, which would cause inadequacies and errors in the generated translation. His suggestion for eliminating the problem was to employ a human pre-editor with knowledge of the output language, who would add symbols for grammatical, lexical and logical correctness. The post-editor, in turn, would have the task of rendering the text generated by MT reasonable and logical; ideally, he would have knowledge of the source language.
Yehoshua Bar-Hillel was appointed as a research assistant in the Research Laboratory of Electronics at the Massachusetts Institute of Technology (MIT) in 1951, with responsibility for exploring the possibilities for MT implementation and planning further research. In a survey carried out in 1951, he argued that the benefits of MT lay in meeting the demand for financial, diplomatic, and scientific translation and for rapid translation of newspapers and journals. According to him, machine translation could also contribute to explaining certain issues associated with linguistics and communication. A year later, in 1952, he organized the first conference devoted to MT at MIT, and the field developed in the following years as Bar-Hillel and Reifler published further articles. The latter focused on pre- and post-editing, the translation of German compound nouns, and methods for eliminating lexical ambiguity within sentences.
The most significant outcome of the 1952 MIT conference was Leon Dostert's decision to develop a program able to demonstrate the feasibility of MT. A small-scale system for translating some Russian sentences into English was developed, and on 7 January 1954 a demonstration took place at the New York headquarters of IBM. Although its limitations were acknowledged, those attending the demonstration were impressed by the machine-generated translation, which resulted in financial support for MT research.
Advocate for science
Weaver understood early how greatly the tools and techniques of physics and chemistry could advance knowledge of biological processes, and used his position in the Rockefeller Foundation to identify, support, and encourage young scientists who years later earned Nobel Prizes and other honours for their contributions to genetics or molecular biology.
He had a deep personal commitment to improving the public understanding of science. He was president of the American Association for the Advancement of Science in 1954 and chairman of the board in 1955, a member or chairman of numerous boards and committees, and the primary author of the Arden House Statement, a 1951 declaration of principle and guide to setting the association's goals, plans, and procedures. Weaver was awarded the Public Welfare Medal from the National Academy of Sciences in 1957. In 1965 he was awarded the first Arches of Science Medal for outstanding contributions to the public understanding of the meaning of science to contemporary men and women, and UNESCO's Kalinga Prize for distinguished contributions to the popular understanding of science.
Weaver was fascinated by Lewis Carroll's Alice's Adventures in Wonderland. In 1964, having built up a collection of 160 versions in 42 languages, Weaver wrote a book about the translation history of Alice called Alice in Many Tongues: The Translations of Alice in Wonderland. Among other features, it provides excerpts from the business correspondence of author Lewis Carroll (the Reverend Charles Dodgson) dealing with publishing royalties and permissions as Alice's fame snowballed worldwide. Ever the scientist, even in the area of literature, Weaver devised a design for evaluating the quality of the various translations, focusing on the nonsense, puns and logical jokes in the Mad Tea-Party scene. His range of contacts provided an impressive if eccentric list of collaborators in the evaluation exercise, including anthropologist Margaret Mead (for the South Pacific Pidgin translation), longtime Jerusalem mayor Teddy Kollek, and Nobel laureate biochemist Hugo Theorell (Swedish). The 2015 book Alice in a World of Wonderlands continues and updates Weaver's endeavour and analyzes Alice translations in 174 languages in a similar vein.
- Piore, Emanuel R. (April 1979). "Obituary: Warren Weaver". Physics Today. 32 (4): 72. Bibcode:1979PhT....32d..72P. doi:10.1063/1.2995512.
- Lovett, Charlie (2000). Warren Weaver: Scientist Humanitarian Carrollian. Lewis Carroll Society of North America.
- Reproduced in: Locke, W.N.; Booth, D.A., eds. (1955). "Translation" (PDF). Machine Translation of Languages. Cambridge, Massachusetts: MIT Press. pp. 15–23. ISBN 0-8371-8434-7.
- Novak, Matt (30 May 2012). "The Cold War origins of Google Translate". BBC News. Retrieved 2012-05-31.
- Hutchins, John. "First Steps in Mechanical Translation" (PDF).
- "Public Welfare Award". National Academy of Sciences. Retrieved 17 February 2011.
- Weaver, Warren (1964). Alice in many tongues. The translations of Alice in Wonderland. Madison: University of Wisconsin Press.
- Lindseth, Jon A., ed. (2015). Alice in a World of Wonderlands: The Translations of Lewis Carroll’s Masterpiece. I. New Castle: Oak Knoll Press. pp. 21–22. ISBN 978-1-58456-331-0.
- Hutchins, W. J. (2000). Early years in machine translation: Memoirs and biographies of pioneers. Amsterdam: John Benjamins.
- Shannon, Claude E. & Weaver, Warren (1949). The Mathematical Theory of Communication. Urbana: The University of Illinois Press.