Wikipedia:What SYNTH is not
|While this essay is not a Wikipedia policy or guideline itself, it is intended to supplement the Wikipedia:No original research page, to which editors should defer in case of inconsistency between that page and this one.|
|This page in a nutshell: Although avoiding original research is an important part of ensuring that Wikipedia content is verifiable, use some common sense about it, and particularly about asserting original research by synthesis.|
In this article, the term SYNTH refers to Wikipedia's policy of forbidding original research by synthesis, and to its forms and nature. SYNTH cautions against original research by synthesis, where an editor combines reliably sourced statements in a way that makes or suggests a new statement not supported by any one of the sources. This essay is intended to help explain the spirit of that policy. In particular, this essay is intended to oppose taking an excessively strict interpretation of the policy in many cases. After all, Wikipedia does not have firm rules.
- 1 SYNTH is not useless
- 2 SYNTH is not unnecessary
- 3 SYNTH is not merely unclear writing
- 4 SYNTH is not an advocacy tool
- 5 SYNTH is not directly applicable to talk pages
- 6 SYNTH is not a rigid rule
- 7 SYNTH is not mere juxtaposition
- 8 SYNTH is not summary
- 9 SYNTH is not explanation
- 10 SYNTH is not the word "thus"
- 11 SYNTH is not ubiquitous
- 12 SYNTH is not presumed
- 13 SYNTH is not a catch-all
- 14 SYNTH is not important per se
- 15 SYNTH is not a policy
- 16 SYNTH is not obvious I
- 17 SYNTH is not obvious II
- 18 SYNTH is not a secondary-school question
- 19 SYNTH is not a matter of grammar
- 20 SYNTH is not just any synthesis
- 21 SYNTH is not primarily point-by-point
- 22 SYNTH is not NPOV, when it is point-by-point
- 23 SYNTH is not unpublishably unoriginal
- 24 SYNTH is not numerical summarization
- 25 See also
SYNTH is not useless
"SYNTH" as a policy is not a re-statement of Wikipedia's policy against Original Research. SYNTH provides a guideline for helping to determine the difference between summarizing the information from sources and extrapolating new information from sources.
Here's a hypothetical example of SYNTH:
The editor who added the citation-needed tag is correct.
The sourced numbers would seem to imply, by simple arithmetic, that 500 men went missing. But if the editors working on the article had read the full text of the hypothetical order instead of only the excerpt quoted in source #1, they would have found that men are counted as "missing" for the purposes of the desertion order only if they still haven't been reunited with their unit after five days, whereas source #2 was reporting on the number of men who did not make roll call the evening after the battle, and that only 148 men deserted during the battle of Salamander Creek. To avoid this sort of error, we need a reliable source who has made the same inference, rather than having editors bring together disparate pieces of information themselves. This helps provide a clear solution to many content disputes.
SYNTH is not unnecessary
"SYNTH" refers both to a policy forbidding original research by synthesis, and to such synthesis itself. The policy is not unnecessary. You should not be able to use Wikipedia to publish a crackpot theory, even if you can cite sources for all the premises of your arguments. You should not be able to count something as verified when it really isn't, even if you can cite sources for all the premises of your arguments.
Here's an example of using SYNTH to push a fringe theory:
The first and last sentences in that paragraph makes claims which are not supported by any source. Even assuming that each quote is supported by the in-line citations, those two claims are not only SYNTH, but factually incorrect. None of the quotes state explicit support for the hypothesis, nor do any of the quotes indicate that the quoted party is actively pursuing a geological model based on it. Even if reading all of those sources provides one with the distinct impression that the paragraph and all of its claims are true, it is not necessarily so, and it would be wrong to present this view as a fact.
The purpose of the policy is to ensure the accuracy of Wikipedia. The above quote pushes the view that the hypothesis of a Hollow Earth is a serious subject of scientific inquiry, when the truth is that it is not. Absent this policy, articles about controversial subjects or fringe theories could easily devolve into pure advocacy or POV pushing.
SYNTH is not merely unclear writing
Unclear writing can cause editors to commit SYNTH unintentionally. If your writing is so garbled that you say something you didn't intend to, well, you still said it. But if it's merely unclear, then there's no new thesis introduced. There's just an unclear statement that could be taken as introducing a new assertion. And if some writing is unclear, the response shouldn't be to challenge the presumed assertion as SYNTHful. The response should be to either ask for clarification on the article talk page (if it's a high-traffic article where other editors will be able to clarify the statement), ask for clarification on the author's user talk page, or boldly clarify it yourself.
SYNTH is not an advocacy tool
If someone doesn't like what was said, and they therefore cry SYNTH, others almost certainly will be right to cry foul. Virtually anything can be shoehorned into a broad reading of SYNTH, but the vast majority of it shouldn't be.
SYNTH is not directly applicable to talk pages
A talk page is the right place to claim that something in an article is SYNTH. The policy does not forbid inferences on talk pages that would be SYNTH if made in an article.
Drawing non-trivial inferences is the heart of argument, and on talk pages, you're supposed to present arguments. As the policy consensus says, "The quality of an argument is more important than whether it represents a minority or a majority view." Likewise the guideline PNSD says, "Wikipedia works by building consensus. When conflicts arise, they are resolved through discussion, debate and collaboration." Of course, these are arguments about what the sources and policy say, or what will or won't improve an article, not arguments about the substantive issues themselves. As the talk page guidelines say, "Talk pages are not a forum for editors to argue their personal point of view about a controversial issue. They are a forum to discuss how the points of view of reliable sources should be included in the article, so that the end result is neutral."
SYNTH is not a rigid rule
Wikipedia doesn't have them, supposedly. But if a policy gets enforced zealously, it can be hard to tell the difference. The solution is not to enforce policies zealously. Never use a policy in such a way that the net effect will be to stop people from improving an article.
SYNTH is not mere juxtaposition
SYNTH is when two or more reliably-sourced statements are combined to produce a new thesis that isn't verifiable from the sources. Given just about any two juxtaposed statements, one can imagine that something might be insinuated by the juxtaposition. Don't. If the juxtaposition really does constitute SYNTH, the insinuation will be obvious to everyone. Gray-area cases aren't SYNTH, just unclear writing.
Nothing is insinuated by the mere fact that these sentences are in the same paragraph. The reader would get the same meaning from these sentences if they were in separate paragraphs, or in different parts of the article.
SYNTH is not summary
SYNTH is when two or more reliably-sourced statements are combined to produce a new thesis that isn't verifiable from the sources. Summary is necessary to reduce the information in lengthy sources to an encyclopedic length -- even when the information being summarized comes from multiple sources. It's not necessary to find a source that summarizes the information. As long as what's in the article is an accurate, neutral summary, and each of the statements is verified by an appropriate source, then the summary is also verified by the same sources. Summary is not forbidden by any Wikipedia policy. On the contrary, "coming up with summary statements for difficult, involved problems" has been described as "the essence of the NPOV process". 
SYNTH is not explanation
SYNTH is when two or more reliably-sourced statements are combined to produce a new thesis that isn't verifiable from the sources. If you're just explaining the same material in a different way, there's no new thesis.
SYNTH is not the word "thus"
When the word "thus", "therefore", or any equivalent appears in an article, that doesn't mean SYNTH has been committed. The word helps explain how facts or ideas are related. It doesn't mean that the previously cited facts are being used as sources for the statement that follows it. More often, if the "thus" statement is not followed by an inline citation, the editor believed it unlikely to be challenged and therefore not one of the four things required to have an inline citation.
SYNTH is not ubiquitous
If your understanding of SYNTH includes 90% of what's on Wikipedia, your understanding of SYNTH is wrong. If your understanding of SYNTH includes all instances of reading a table, because reading a table requires "synthesizing" the entry in the table with the label of what the table is, your understanding of SYNTH is wrong. Objective straightforward basic descriptions of an illustration is not SYNTH. If your understanding of SYNTH includes reading any sentence that continues from one page to the next of an online article, because that involves "synthesizing" stuff from two different URLs, your understanding of SYNTH is wrong.
If you think the letter of the policy supports such an understanding, you may be right. Even policies are subject to occasional revision, and even with consensus there can be blunders in wording. But there was never consensus to obliterate 90% of Wikipedia. The correct response is to go to the policy talk page and get consensus to clarify it, or just ignore it.
SYNTH is not presumed
If you want to revert something on the grounds that it's SYNTH, you should be able to explain what new thesis is being introduced and why it's not verified by the sources. You don't have to put the whole explanation in the edit summary, but if someone asks on the talk page, you should have something better ready than "Of course it's SYNTH. You prove it isn't." The burden of proof is light: just explaining what new assertion is made will do, and then it's up to the other editor to show that your reading is unreasonable. But in any disagreement, the initial burden of proof is on the person making the claim, and the claim that something is SYNTH is no exception.
SYNTH is not a catch-all
If there's something bugging you about an edit, but you're not sure what, why not use SYNTH? After all, everything under the sun can be shoehorned into a broad-enough reading of SYNTH. Well, because it isn't SYNTH. It's shoehorning. To claim SYNTH, you should be able to explain what new claim was made, and what sort of additional research a source would have to do in order to support the claim.
SYNTH is not important per se
Please note that this section does not apply to potentially-controversial statements about living people, about which Wikipedia policy is stricter than about other statements.
What matters is that all material in Wikipedia is verifiable, not that it's actually verified. By this we mean that it is important that a suitable reliable source that supports this material has been published in the real world, not that someone has gotten around to typing up a specific bibliographic citation in the article.
Citations are not an end in themselves. If there's a statement for which no source is cited, that's normally ok, as with the example on Wikipedia:No original research: "Paris is the capital of France" needs no source because no one is likely to object to it, but we know that sources for it exist. Likewise with very many unsourced statements, regardless of whether they could be deduced from sourced statements in the same article, we know the sources exist.
SYNTH is not a policy
It's part of a policy: no original research. If a putative SYNTH doesn't constitute original research, then it doesn't constitute SYNTH. The section points out that synthesis can and often does constitute original research. It does not follow that all synthesis constitutes original research.
SYNTH is not obvious I
It's not always obvious whether something is SYNTH. To be able to say that something is SYNTH, you have to be able to understand what it says, what the sources say, and whether the sources suffice to verify the assertion. If you don't understand something, don't say it's SYNTH. Say it's too advanced for the article. Say it's unclear writing. Boldly try to clarify it. Allege on the noticeboard that it's SYNTH. But don't revert it indiscriminately for being SYNTH.
SYNTH is not obvious II
If something is obvious to anyone who reads and understands the sources that are supposed to support it, then it's not SYNTH. An example of a perfectly valid citation is given in the guideline on citations, at WP:Bundling: "The sun is pretty big, but the moon is not so big." The bundled citation uses one source for the size of the sun, and another for the size of the moon. Neither says that the sun is bigger than the moon, but the article is making that comparison. Given the two sources, the conclusion is obvious. So a typical reader can use the sources to check the accuracy of the comparison.
Beware however of using vague concepts defined in one source to interpret statements in another source. Many concepts in social sciences have vague definitions. The same words may denote slightly different things in different sources. In such cases it's better to attribute the varying concepts rather than to make definite statements about the most general concept you can synthesize. See the sorites paradox for an example of slippery slope when using vague concepts.
SYNTH is not a secondary-school question
Most Wikipedia articles, those on subjects of general interest, should be comprehensible to a typical secondary-school student. It does not follow that a secondary student should be able to evaluate whether the cited sources suffice to verify a particular assertion. Inevitably, many sources are more advanced than the article. Normally, however, an ordinary educated layperson can understand the sources adequately. If such a person can confirm that the sources suffice to verify the claim, then the claim is not SYNTH -- even if a typical secondary-school student would have trouble evaluating the question.
SYNTH is not a matter of grammar
The policy gives one example of something that's not SYNTH and says, "The first paragraph is fine, because each of the sentences is carefully sourced." SYNTH is when two or more reliably-sourced statements are combined to produce a new thesis that isn't verifiable from the sources: it's about what the writing says, not the grammatical structure of how it says it.
If you want to take one source that says "allergic reactions can lead to death" and a second source that says "some people are allergic to apples", and you want to add these up into a new conclusion, "so nobody should eat apples unless they're trying to commit suicide", then there is no possible way to say this without violating SYNTH. It does not matter what grammar or structure you use: introducing your own new idea is prohibited.
On the other hand, if every single idea (considered separately as well as the overall effect) is taken from reliable sources (rather than your own new ideas), then there is no grammar structure or way of expressing these ideas that will make the material violate SYNTH. There are many ways of expressing these concepts that make the article badly written, but bad writing that introduces no novel ideas is bad writing, not a violation of SYNTH.
SYNTH is not just any synthesis
SYNTH is original research by synthesis, not synthesis per se. In 2004, Jimbo Wales actually contrasted synthesis with original research: "In many cases, the distinction between original research and synthesis of published work will require thoughtful editorial judgment."  It seems clear to me that "synthesis of published work" was assumed to be part of the legitimate role of Wikipedia.
Some old versions of NOR even said "Wikipedia is a secondary source (one that analyzes, assimilates, evaluates, interprets, and/or synthesizes primary sources) or tertiary source ..." (emphasis added). Which is really helpful for those editors with time travel capabilities who can go back and edit Wikipedia before community consensus changed the policy to specifically remove that connotation.
SYNTH is not primarily point-by-point
Very old versions of NOR focused on crackpot theories being published whole, not on single statements within the exposition of a body of well-established fact. The under-the-microscope level of scrutiny often practiced now, which demands removal of a single clause until a source can be found that presents the material the same way, is more a result of creep than of well-thought-out policy.
Crackpot theories should be removed quickly; their exclusion is a good reason to have NOR be a central tenet of Wikipedia policy. By contrast, unsourced but uncontroversial individual statements can be left in place indefinitely, with at most a citation-needed template. If such statements were what the policy was primarily about, it would be a minor guideline.
SYNTH is not NPOV, when it is point-by-point
The policy forbids "synthesis of published material that advances a position" (emphasis added). A NPOV article gives appropriate weight to all positions, if there are multiple positions on a subject, by including multiple statements. A single assertion, by contrast, can be NPOV only if it doesn't advance any particular position to begin with. So if a single statement, taken in isolation, is NPOV, then it's not SYNTH.
SYNTH is not unpublishably unoriginal
When you look at a case of putative SYNTH, apply the following test. Suppose you took this claim to a journal that does publish original research. Would they (A) vet your article for correctness, documentation, and style, and publish it if it met their standards in those areas? Or would they (B) laugh in your face because your "original research" is utterly devoid of both originality and research, having been common knowledge in the field since ten years before you were born? If you chose (B), it's not original research -- even if it violates the letter of WP:SYNTH.
SYNTH is not numerical summarization
- See more details at Wikipedia:About Valid Routine Calculations.
Treatment of numeric data is an encyclopedic issue: summarization by sum, average, etc. are necessary expedients, and should not be confused with original research. As an example, if a source shows (without any total following it) "
1+1+1+1", a Wikipedia article can express the same data with summarization "
1+1+1+1=4". (Whether to express only the result
4, if it is not explicitly given by any source, could be a point for discussion, but in any case it is not SYNTH.)
Summarizations based on statistical methods, however, are original research by synthesis, as they involve the reinterpretation of data, and decisions about which statistical methods and significance levels are appropriate. It is common to confuse the arithmetic mean (summarization) with the expected value (mean and standard deviation calculated by a reliable source). If in any doubt about summarization vs. statistical reinterpretation, discuss it first.