Jump to content

Talk:Truncated normal distribution

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Heheman3000 (talk | contribs) at 23:42, 29 December 2012 (→‎Simulation). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

WikiProject iconMathematics Start‑class Low‑priority
WikiProject iconThis article is within the scope of WikiProject Mathematics, a collaborative effort to improve the coverage of mathematics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
StartThis article has been rated as Start-class on Wikipedia's content assessment scale.
LowThis article has been rated as Low-priority on the project's priority scale.
WikiProject iconStatistics Unassessed
WikiProject iconThis article is within the scope of WikiProject Statistics, a collaborative effort to improve the coverage of statistics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
???This article has not yet received a rating on Wikipedia's content assessment scale.
???This article has not yet received a rating on the importance scale.

Mean

The expression for the mean is given as: . This must be incorrect, because it sometimes gives mean values outside the truncation bounds. For example, , , , gives a mean of 1.53. I believe the correct expression is . Agreed? Jtmcg1128 (talk) 18:03, 13 October 2011 (UTC)[reply]

  • The expression is correct. It's an issue of being defined as the standard normal pdf: , see discussion on the pdf definition below. The pdf of a distribution with arbitrary mean and standard deviation is . (Karekafi (talk) 17:25, 14 November 2011 (UTC))[reply]

Regarding the pdf

  • I am concerned that , cannot be the formula for a PDF of a truncated random normal variable. Say there is a left truncated (a = 0) normal random variable with positive mean. If we choose X to be a negative value, then is positive, is 1 and is positive. Altogether, the PDF cannot be zero as it should be. Perhaps defining it piecewise is the most logical idea because I cannot think of an explicit formula.
    • Well, the article already mentioned that the domain of X is [a,b]. Thus is zero outside a and b. So, in your example, if a = 0, then f(x) is zero. Robbyjo (talk) 20:04, 20 February 2008 (UTC)[reply]
  • In my opinion the formula for is incorrect. It should be: . In the current version, if you truncate at a=-inf, b=+inf you will not get Normal distribution. Compare also: http://rss.acs.unt.edu/Rdoc/library/msm/html/tnorm.html —Preceding unsigned comment added by 128.143.16.201 (talk) 20:31, 20 February 2008 (UTC)[reply]
    • It's not a typo; note that is the standard normal pdf. So gives you the pdf for . Write that out and you'll see why. Josuechan (talk) 23:11, 20 February 2008 (UTC)[reply]
The correct Density function is this one . This function is both positive and integrates to 1 (because of change of variables, ). The incorrect version, listed above, integrates to .Iwaterpolo (talk) 01:57, 3 June 2010 (UTC)[reply]
  • I agree on the PDF formulation. For an easier understanding I added the exact definition of the standard normal pdf in the text below. This may help people not completely familiar with the exact notation to more quickly understand the problem. (Karekafi (talk) 16:40, 14 November 2011 (UTC))[reply]

Interactive calculators

Entropy formula

Appears to be wrong. The values from the formula don't agree with numerical computation of of the entropy. I derived the case for a one-sided truncated normal, and that differs from this case, but I haven't had time to go back and derive the two-sided case. Would be nice if someone can track down a reference for this or find the correct formula. --Jpillow (talk) 17:20, 12 January 2012 (UTC)[reply]

Simulation

The simulation section appears to be wrong/misleading. I understand the formula as basically simulating rejecting sampling by basically drawing uniformly from the lower and upper bounds of the CDF of the non-truncated normal (with appropriate parameters), then inverting to obtain the actual value. However, in this case the use of is misleading because it is referring to the CDF of the normal distribution with parameters instead of the standard normal.

There are several ways to resolve, this, but I feel that the following would be easiest. Note that using with the standard normal CDF instead of would require the result to be multiplied by then added to .

A random variate x defined as with the CDF of a normal distribution with mean and variance , and its inverse, a uniform random number on , follows the distribution truncated to the range .