Sunrise problem

From Wikipedia, the free encyclopedia
Jump to: navigation, search

The sunrise problem can be expressed as follows: "What is the probability that the sun will rise tomorrow?" The sunrise problem illustrates the difficulty of using probability theory when evaluating the plausibility of statements or beliefs.

According to the Bayesian interpretation of probability, probability theory can be used to evaluate the plausibility of the statement, "The sun will rise tomorrow." We just need a hypothetical random process that determines whether the sun will rise tomorrow or not. Based on past observations, we can infer the parameters of this random process, and from there evaluate the probability that the sun will rise tomorrow.

One sun, many days[edit]

The sunrise problem was first introduced in the 18th century by Pierre-Simon Laplace, who treated it by means of his rule of succession.[1] Let p be the long-run frequency of sunrises, i.e., the sun rises on 100 × p% of days. Prior to knowing of any sunrises, one is completely ignorant of the value of p. Laplace represented this prior ignorance by means of a uniform probability distribution on p. Thus the probability that p is between 20% and 50% is just 30%. This must not be interpreted to mean that in 30% of all cases, p is between 20% and 50%; that would be a frequentist approach to applied probability. Rather, it means that one's state of knowledge (or ignorance) justifies one in being 30% sure that the sun rises between 20% of the time and 50% of the time. Given the value of p, and no other information relevant to the question of whether the sun will rise tomorrow, the probability that the sun will rise tomorrow is p. But we are not "given the value of p". What we are given is the observed data: the sun has risen every day on record. Laplace inferred the number of days by saying that the universe was created about 6000 years ago, based on a young-earth creationist reading of the Bible. To find the conditional probability distribution of p given the data, one uses Bayes theorem, which some call the Bayes–Laplace rule. Having found the conditional probability distribution of p given the data, one may then calculate the conditional probability, given the data, that the sun will rise tomorrow. That conditional probability is given by the rule of succession. The plausibility that the sun will rise tomorrow increases with the number of days on which the sun has risen so far.

Laplace, however, recognised this to be a misapplication of the rule of succession through not taking into account all the prior information available immediately after deriving the result:

But this number [the probability of the sun coming up tomorrow] is far greater for him who, seeing in the totality of phenomena the principle regulating the days and seasons, realizes that nothing at present moment can arrest the course of it.

It is noted by Jaynes & Bretthorst (2003) that Laplace's warning had gone unheeded by workers in the field.[2]

A reference class problem arises: the plausibility inferred will depend on whether we take the past experience of one person, of humanity, or of the earth. A consequence is that each referent would hold different plausibility of the statement. In Bayesianism, any probability is a conditional probability given what one knows. That varies from one person to another.

One day, many suns[edit]

Alternatively, one could say that a sun is selected from all the possible stars every day, being the star that one sees in the morning. The plausibility of the "sun will rise tomorrow" (i.e., the probability of that being true) will then be the proportion of stars that do not "die", e.g., by becoming novae, and so failing to "rise" on their planets (those that still exist, irrespective of the probability that there may then be none, or that there may then be no observers).

One faces a similar reference class problem: which sample of stars should one use. All the stars? The stars with the same age as the sun? The same size?

Mankind's knowledge of star formations will naturally lead one to select the stars of same age and size, and so on, to resolve this problem. In other cases, one's lack of knowledge of the underlying random process then makes the use of Bayesian reasoning less useful. Less accurate, if the knowledge of the possibilities is very unstructured, thereby necessarily having more nearly uniform prior probabilities (by the principle of indifference). Less certain too, if there are effectively few subjective prior observations, and thereby a more nearly minimal total of pseudocounts, giving fewer effective observations, and so a greater estimated variance in expected value, and probably a less accurate estimate of that value.

See also[edit]

References[edit]

  1. ^ Chung, K. L. & AitSahlia, F. (2003). Elementary probability theory: with stochastic processes and an introduction to mathematical finance. Springer. pp. 129–130. ISBN 978-0-387-95578-0.
  2. ^ ch 18, pp 387–391 of Jaynes, E. T. & Bretthorst, G. L. (2003). Probability Theory: The Logic of Science. Cambridge University Press. ISBN 978-0-521-59271-0

Further reading[edit]

  • Howie, David. (2002). Interpreting probability: controversies and developments in the early twentieth century. Cambridge University Press. pp. 24. ISBN 978-0-521-81251-1