Poisson process

From Wikipedia, the free encyclopedia
Jump to: navigation, search

In probability theory, a Poisson process is a stochastic process which counts the number of events[note 1] and the time that these events occur in a given time interval. The time between each pair of consecutive events has an exponential distribution with parameter λ and each of these inter-arrival times is assumed to be independent of other inter-arrival times. The process is named after the French mathematician Siméon-Denis Poisson and is a good model of radioactive decay,[1] telephone calls[2] and requests for a particular document on a web server.[3]

The Poisson process is a continuous-time process; the sum of a Bernoulli process can be thought of as its discrete-time counterpart. A Poisson process is a pure-birth process, the simplest example of a birth-death process. It is also a point process on the real half-line.

Contents

[edit] Definition

The basic form of Poisson process, often referred to simply as "the Poisson process", is a continuous-time counting process {N(t), t ≥ 0} that possesses the following properties:

  • N(0) = 0
  • Independent increments (the numbers of occurrences counted in disjoint intervals are independent from each other)
  • Stationary increments (the probability distribution of the number of occurrences counted in any time interval only depends on the length of the interval)
  • No counted occurrences are simultaneous.

Consequences of this definition include:

Other types of Poisson process are described below.

[edit] Types

[edit] Homogeneous

Sample Path of a Poisson process N(t)

The homogeneous Poisson process is one of the most well-known Lévy processes. This process is characterized by a rate parameter λ, also known as intensity, such that the number of events in time interval (tt + τ] follows a Poisson distribution with associated parameter λτ. This relation is given as

 P [(N(t+ \tau) - N(t)) = k] = \frac{e^{-\lambda \tau} (\lambda \tau)^k}{k!}  \qquad k= 0,1,\ldots,

where N(t + τ) − N(t) = k is the number of events in time interval (tt + τ].

Just as a Poisson random variable is characterized by its scalar parameter λ, a homogeneous Poisson process is characterized by its rate parameter λ, which is the expected number of "events" or "arrivals" that occur per unit time.

N(t) is a sample homogeneous Poisson process, not to be confused with a density or distribution function.

[edit] Non-homogeneous

In general, the rate parameter may change over time; such a process is called a non-homogeneous Poisson process or inhomogeneous Poisson process. In this case, the generalized rate function is given as λ(t). Now the expected number of events between time a and time b is

\lambda_{a,b} = \int_a^b \lambda(t)\,dt.

Thus, the number of arrivals in the time interval (ab], given as N(b) − N(a), follows a Poisson distribution with associated parameter λa,b

 P [(N(b) - N(a)) = k] = \frac{e^{-\lambda_{a,b}} (\lambda_{a,b})^k}{k!} \qquad k= 0,1,\ldots.

A homogeneous Poisson process may be viewed as a special case when λ(t) = λ, a constant rate.

[edit] Spatial

An important variation on the (notionally time-based) Poisson process is the spatial Poisson process. In the case of a one-dimension space (a line) the theory differs from that of a time-based Poisson process only in the interpretation of the index variable. For higher dimension spaces, where the index variable (now x) is in some vector space V (e.g. R2 or R3), a spatial Poisson process can be defined by the requirement that the random variables defined as the counts of the number of "events" inside each of a number of non-overlapping finite sub-regions of V should each have a Poisson distribution and should be independent of each other.

[edit] Space-time

A further variation on the Poisson process, the space-time Poisson process, allows for separately distinguished space and time variables. Even though this can theoretically be treated as a pure spatial process by treating "time" as just another component of a vector space, it is convenient in most applications to treat space and time separately, both for modeling purposes in practical applications and because of the types of properties of such processes that it is interesting to study.

In comparison to a time-based inhomogeneous Poisson process, the extension to a space-time Poisson process can introduce a spatial dependence into the rate function, such that it is defined as \lambda(x,t), where x \in V for some vector space V (e.g. R2 or R3). However a space-time Poisson process may have a rate function that is constant with respect to either or both of x and t. For any set S \subset V (e.g. a spatial region) with finite measure \mu(S), the number of events occurring inside this region can be modeled as a Poisson process with associated rate function λS(t) such that

\lambda_S(t) = \int_S \lambda(x,t)\,d\mu(x).

[edit] Separable space-time processes

In the special case that this generalized rate function is a separable function of time and space, we have:

\lambda(x,t) = f(x) \lambda(t) \,

for some function f(x). Without loss of generality, let

\int_V f(x) \, d\mu(x)=1.

(If this is not the case, λ(t) can be scaled appropriately.) Now, f(x) represents the spatial probability density function of these random events in the following sense. The act of sampling this spatial Poisson process is equivalent to sampling a Poisson process with rate function λ(t), and associating with each event a random vector X sampled from the probability density function f(x). A similar result can be shown for the general (non-separable) case.

[edit] Characterisation

In its most general form, the only two conditions for a counting process to be a Poisson process are:[citation needed]

  • Orderliness: which roughly means
\lim_{\Delta t\to 0} P(N(t+\Delta t) - N(t) > 1 \mid N(t+\Delta t) - N(t) \geq 1)=0
which implies that arrivals don't occur simultaneously (but this is actually a mathematically stronger statement).
  • Memorylessness (also called evolution without after-effects): the number of arrivals occurring in any bounded interval of time after time t is independent of the number of arrivals occurring before time t.

These seemingly unrestrictive conditions actually impose a great deal of structure in the Poisson process. In particular, they imply that the time between consecutive events (called interarrival times) are independent random variables. For the homogeneous Poisson process, these inter-arrival times are exponentially distributed with parameter λ (mean 1/λ).

Proof : Let \tau_1 be the first arrival time of the Poisson process. Its distribution satisfies


\begin{align}
Pr[\tau_1=x]=&\lim_{dt\to 0}\frac{Pr[N_{x+dt}>0,N_x=0]}{dt}\\
=&\lim_{dt\to 0}\frac{1-Pr[N_{dt}=0]}{dt}Pr[N_x=0]\\
=&\lim_{dt\to 0}\frac{1-(1-\lambda dt +O(dt^2))}{dt}\exp(-\lambda x)\\
=&\lambda\exp(-\lambda x)
\end{align}

Also, the memorylessness property entails that the number of events in any time interval is independent of the number of events in any other interval that is disjoint from it. This latter property is known as the independent increments property of the Poisson process.


[edit] Properties

As defined above, the stochastic process {N(t)} is a Markov process, or more specifically, a continuous-time Markov process.[citation needed]

To illustrate the exponentially-distributed inter-arrival times property, consider a homogeneous Poisson process N(t) with rate parameter λ, and let Tk be the time of the kth arrival, for k = 1, 2, 3, ... . Clearly the number of arrivals before some fixed time t is less than k if and only if the waiting time until the kth arrival is more than t. In symbols, the event [N(t) < k] occurs if and only if the event [Tk > t] occurs. Consequently the probabilities of these events are the same:

P(T_k>t) = P(N(t)<k). \,

In particular, consider the waiting time until the first arrival. Clearly that time is more than t if and only if the number of arrivals before time t is 0. Combining this latter property with the above probability distribution for the number of homogeneous Poisson process events in a fixed interval gives

P(T_1>t)=P(N(t)=0)=P [(N(t) - N(0)) = 0] = \frac{e^{-\lambda t} (\lambda t)^0}{0!} = e^{-\lambda t}.

Consequently, the waiting time until the first arrival T1 has an exponential distribution, and is thus memoryless. One can similarly show that the other interarrival times Tk − Tk−1 share the same distribution. Hence, they are independent, identically-distributed (i.i.d.) random variables with parameter λ > 0; and expected value 1/λ. For example, if the average rate of arrivals is 5 per minute, then the average waiting time between arrivals is 1/5 minute.

[edit] Applications

The classic example of phenomena well modelled by a Poisson process is deaths due to horse kick in the Prussian army, as shown by Ladislaus Bortkiewicz in 1898.[4][5] The following examples are also well-modeled by the Poisson process:

  • The number of goals in a soccer match[6]
  • The arrival of customers in a queue.
  • The number of raindrops falling within a specified area.
  • The number of telephone calls arriving at a switchboard.
  • The number of particles emitted via radioactive decay by an unstable substance. In this case the Poisson process is non-homogeneous in a predictable manner - the emission rate declines as particles are emitted.
  • The number of requests for individual documents on a web server.[7]

[edit] Occurrence

The Palm–Khintchine theorem provides a result that shows that the superposition of many low intensity non-Poisson point processes will be close to a Poisson process.

[edit] See also

[edit] Further reading

  • Cox, D. R.; Isham, V. I. (1980). Point Processes. Chapman & Hall. ISBN 0-412-21910-7. 
  • Ross, S. M. (1995). Stochastic Processes. Wiley. ISBN 978-0-471-12062-9. 
  • Snyder, D. L.; Miller, M. I. (1991). Random Point Processes in Time and Space. Springer-Verlag. ISBN 0-387-97577-2. 

[edit] Notes

  1. ^ The word event used here is not an instance of the concept of event as frequently used in probability theory.

[edit] References

  1. ^ Cannizzaro, F.; Greco, G.; Rizzo, S.; Sinagra, E. (1978). "Results of the measurements carried out in order to verify the validity of the poisson-exponential distribution in radioactive decay events". The International Journal of Applied Radiation and Isotopes 29 (11): 649. doi:10.1016/0020-708X(78)90101-1.  edit
  2. ^ Willkomm, D.; Machiraju, S.; Bolot, J.; Wolisz, A. (2009). "Primary user behavior in cellular networks and implications for dynamic spectrum access". IEEE Communications Magazine 47 (3): 88. doi:10.1109/MCOM.2009.4804392.  edit
  3. ^ Arlitt, Martin F.; Williamson, Carey L. (1997). "Internet Web servers: Workload characterization and performance implications". IEEE/ACM Transactions on Networking 5 (5): 631. doi:10.1109/90.649565.  edit
  4. ^ Ladislaus von Bortkiewicz, Das Gesetz der kleinen Zahlen [The law of small numbers] (Leipzig, Germany: B.G. Teubner, 1898). On page 1, Bortkiewicz presents the Poisson distribution. On pages 23-25, Bortkiewicz presents his famous analysis of "4. Beispiel: Die durch Schlag eines Pferdes im preussischen Heere Getöteten." (4. Example: Those killed in the Prussian army by a horse's kick.).
  5. ^ Gibbons, Robert D.; Bhaumik, Dulal; Aryal, Subhash (2009). Statistical Methods for Groundwater Monitoring. John Wiley and Sons. p. 72. ISBN 0470164964. 
  6. ^ Heuer, A.; Müller, C.; Rubner, O. (2010). "Soccer: Is scoring goals a predictable Poissonian process?". EPL (Europhysics Letters) 89 (3): 38007. doi:10.1209/0295-5075/89/38007.  edit
  7. ^ Arlitt, Martin F.; Williamson, Carey L. (1997). "Internet Web servers: Workload characterization and performance implications". IEEE/ACM Transactions on Networking 5 (5): 631. doi:10.1109/90.649565.  edit
Personal tools
Namespaces

Variants
Actions
Navigation
Interaction
Toolbox
Print/export
Languages