Talk:Fisher's exact test

From Wikipedia, the free encyclopedia
Jump to: navigation, search
WikiProject Statistics (Rated Start-class, High-importance)
WikiProject icon

This article is within the scope of the WikiProject Statistics, a collaborative effort to improve the coverage of statistics on Wikipedia. If you would like to participate, please visit the project page or join the discussion.

Start-Class article Start  This article has been rated as Start-Class on the quality scale.
 High  This article has been rated as High-importance on the importance scale.

Please note that the minimum expected value for a chi-squared test to be appropriate is 10 not 5, in the particular case where there is only one degree of freedom (see any responsible stats cookbook). This was correct in earlier versions of the page and I have put it back now. seglea 21:30, 11 May 2006 (UTC)

In Bob Moore. (2004) On Log-Likelihood-Ratios and the Significance of Rare Events. In Proc. of the ACL 2004., Moore shows that Fischer's exact is not really prohibitively more expensive to compute than Chi-square. In light of this, the introductory paragraph suggesting that its computational complexity is a major consideration may deserve some qualification. —Preceding unsigned comment added by (talk) 21:16, 19 February 2008 (UTC)

I checked the link to and found there were no content for general cases. So I deleted it. Lixiaoxu (talk) 13:33, 9 November 2008 (UTC)

Yes you're quite right. I've also removed the link at the bottom. RupertMillard (Talk) 10:53, 24 March 2009 (UTC)
I don't understand this. I quote from the beginning of the mathworld article, "Let there exist two such variables X and Y, with m and n observed states, respectively...."; and there then follows a paragraph giving the formula and some description of procedures for the m x n case. In what sense can this be described as having no content for the m x ncase? I have therefore restored the link and reference. seglea (talk) 23:34, 24 March 2009 (UTC)
Oh yes - you're right. I think I'm going mad! Thank you for putting the link back in. I don't think the article's brilliantly clear, but it's a start - very vague about the other measures of association that are required for case. RupertMillard (Talk) 07:10, 25 March 2009 (UTC)
Agreed. I only put it in because at least it states unambiguously that the is possible, and so many students (and not a few lecturers) believe that only 2 x 2 can be done. There might be a reference to a better source in some SPSS manual, since SPSS will calculate the case, but I don't have one to hand. seglea (talk) 21:43, 25 March 2009 (UTC)

In the example the notation switches from girls and boys to men and women. Perhaps it would be less confusing to maintain one label. Australisergosum (talk) 01:41, 16 December 2008 (UTC)

I wonder if the example gender x dieting is well-chosen... It is a requirement of the standard exact fisher test that both marginals are fixed; it can easily be assumed that a researcher could choose to include an equal number of men and women in his/her sample, but how about dieters versus non-dieters? These particular marginal counts seem to be random to me? —Preceding unsigned comment added by (talk) 23:09, 31 March 2010 (UTC)

The link to points to an applet that only calculates P(Cutoff), and not the actual probability of the null hypothesis. calculates the interesting probability correctly, and works for NxN matrices. (talk) 21:12, 22 January 2009 (UTC)

I just looked at the link for the Fisher exact test calculator that you gave: Fisher Exact Test Calculators: 2-by-2 and N-by_N, but the HTML was rather mangled, so it is not rendered in Firefox 12 or IE9. Looking at the source, I see that the page has good information. Here are the direct (working) links to the calculators:
Everettr2 (talk) 20:37, 8 May 2012 (UTC)

Reference does not exist - Exact inference in categorical data. Biometrics, 53(1), 112-117.'[edit]

Mehta, C. R.& Patel, N. R. 1997. Exact inference in categorical data. Biometrics, 53(1), 112-117. definitely does not exist.[1] Is the intention to reference Mehta CR. Exact inference for categorical data. Encyclopedia of Biostatistics 1998; 2:1411–1422 as per[2]? I would probably cite this as Corcoran, Christopher D; Senchaudhuri, Pralay; Mehta, Cyrus R; Patel, Nitin R, Exact Inference for Categorical Data, doi:10.1002/0470011815.b2a10019 . Anyway, I have removed the reference for now, as it was superfluous to the 1984 reference. RupertMillard (Talk) 10:47, 24 March 2009 (UTC)

Very odd. That reference was added by an anon in April 2008, presumably relying on a secondary source. seglea (talk) 23:43, 24 March 2009 (UTC)


Can someone spell out how the value from Fisher exact is used please? Is fisher exact value same as p-value? What is considered to be statistically significant? —Preceding unsigned comment added by Sedoc (talkcontribs) 16:09, 5 June 2009 (UTC)

You should try the mathematics reference desk for a question like that. Baccyak4H (Yak!) 17:41, 5 June 2009 (UTC)

Is there any confirmation on the minimum value of n=5 or 10 or it it still a debated topic? I have seen textbooks (Biostatistics the bare essentials 2nd edition - Geoffrey R Norman/David L Streiner) and statistics professors in the flesh that says otherwise. Any paper/summary would help the layman to understand the debate if any. Thanks a million. —Preceding unsigned comment added by (talk) 04:51, 30 October 2009 (UTC)

Fisher-Irwin Test[edit]

This is the same as the Fisher-Irwin test, correct? If so there should at least be a redirect, and a mention in the article. Esoxidtcontribs 18:13, 12 January 2013 (UTC)


I changed the example from dieting to studying. Female teenagers are particularly likely to develop eating disorders, and dieting seems to be influenced by societal expectations that it's normal to diet (see eating disorder). Since there's no reason whatsoever that this example must be about dieting, I changed it. "Studiers" is an awkward word, so feel free to change it to "slackers" and "keeners" or whatever you can think of that fits better. But really, it would make most sense to find an example that doesn't involve made up statistics about the habits of people who happen to have penises vs people without penises. For example, it could be two sets of patients taking a new medication.