Markov process
In probability theory and statistics, a Markov process or Markoff process, named after the Russian mathematician Andrey Markov, is a stochastic process that satisfies the Markov property. A Markov process can be thought of as 'memoryless': loosely speaking, a process satisfies the Markov property if one can make predictions for the future of the process based solely on its present state just as well as one could knowing the process's full history. i.e., conditional on the present state of the system, its future and past are independent.[1]
Introduction
A Markov process is a stochastic model that has the Markov property. It can be used to model a random system that changes states according to a transition rule that only depends on the current state. This article describes the Markov process in a very general sense, which is a concept that is usually specified further. Particularly, the system's state space and time parameter index needs to be specified. The following table gives an overview of the different instances of Markov processes for different levels of state space generality and for discrete time vs. continuous time.
Countable state space | Continuous or general state space | |
---|---|---|
Discrete-time | Markov chain on a countable or finite state space | Harris chain (Markov chain on a general state space) |
Continuous-time | Continuous-time Markov process | Any continuous stochastic process with the Markov property, e.g. the Wiener process |
Note that there is no definitive agreement in literature on the use of some of the terms that signify special cases of Markov processes. For example, often the term "Markov chain" is used to indicate a Markov process which has a finite or countable state-space, but Markov chains on a general state space fall under the same description. Similarly, a Markov chain would usually be defined for a discrete set of times (i.e. a discrete-time Markov chain)[2] although some authors use the same terminology where "time" can take continuous values.[3] In addition, there are other extensions of Markov processes that are referred to as such but do not necessarily fall within any of these four categories (see Markov model). Moreover, the time index need not necessarily be real-valued; like with the state space, there are conceivable processes that move through index sets with other mathematical constructs. Notice that the general state space continuous-time Markov chain is general to such a degree that it has no designated term.
Markov processes arise in probability and statistics in one of two ways. A stochastic process, defined via a separate argument, may be shown mathematically to have the Markov property, and as a consequence to have the properties that can be deduced from this for all Markov processes. Alternately, in modelling a process, one may assume the process to be Markov, and take this as the basis for a construction. In modelling terms, assuming that the Markov property holds is one of a limited number of simple ways of introducing statistical dependence into a model for a stochastic process in such a way that allows the strength of dependence at different lags to decline as the lag increases.
Markov property
This section may be too technical for most readers to understand.(December 2012) |
The general case
Let be a probability space with a filtration , for some (totally ordered) index set ; and let be a measure space. An S-valued stochastic process adapted to the filtration is said to possess the Markov property with respect to the if, for each and each with s < t,
A Markov process is a stochastic process which satisfies the Markov property with respect to its natural filtration.
For discrete-time Markov chains
In the case where is a discrete set with the discrete sigma algebra and , this can be reformulated as follows:
- .
Examples
Gambling
Suppose that you start with $10, and you wager $1 on an unending, fair, coin toss indefinitely, or until you lose all of your money. If represents the number of dollars you have after n tosses, with , then the sequence is a Markov process. If I know that you have $12 now, then it would be expected that with even odds, you will either have $11 or $13 after the next toss. This guess is not improved by the added knowledge that you started with $10, then went up to $11, down to $10, up to $11, and then to $12.
The process described here is a Markov chain on a countable state space that follows a random walk.
A birth-death process
If one pops one hundred kernels of popcorn, each kernel popping at an independent exponentially-distributed time, then this would be a continuous-time Markov process. If denotes the number of kernels which have popped up to time t, the problem can be defined as finding the number of kernels that will pop in some later time. The only thing one needs to know is the number of kernels that have popped prior to the time "t". It is not necessary to know when they popped, so knowing for previous times "t" is not relevant.
The process described here is an approximation of a Poisson point process - Poisson processes are also Markov processes.
A non-Markov example
Suppose that you have a coin purse containing five quarters (each worth 25c), five nickels (each worth 5c) and five dimes (each worth 10c), and one-by-one, you randomly draw coins from the purse and set them on a table. If represents the total value of the coins set on the table after n draws, with , then the sequence is not a Markov process.
To see why this is the case, suppose that in your first six draws, you draw all five nickels, and then a quarter. So . If we know not just , but the earlier values as well, then we can determine which coins have been drawn, and we know that the next coin will not be a nickel, so we can determine that with probability 1. But if we do not know the earlier values, then based only on the value we might guess that we had drawn four dimes and two nickels, in which case it would certainly be possible to draw another nickel next. Thus, our guesses about are impacted by our knowledge of values prior to .
Markovian representations
In some cases, apparently non-Markovian processes may still have Markovian representations, constructed by expanding the concept of the 'current' and 'future' states. For example, let X be a non-Markovian process. Then define a process Y, such that each state of Y represents a time-interval of states of X. Mathematically, this takes the form:
If Y has the Markov property, then it is a Markovian representation of X.
An example of a non-Markovian process with a Markovian representation is an autoregressive time series of order greater than one.[5]
See also
References
- ^ Markov process (mathematics) - Britannica Online Encyclopedia
- ^ Everitt,B.S. (2002) The Cambridge Dictionary of Statistics. CUP. ISBN 0-521-81099-X
- ^ Dodge, Y. The Oxford Dictionary of Statistical Terms, OUP. ISBN 0-19-920613-9
- ^ Durrett, Rick (2010). Probability: Theory and Examples (Fourth ed.). Cambridge: Cambridge University Press. ISBN 978-0-521-76539-8.
- ^ Doblinger, G., 1998. Smoothing of Noise AR Signals Using an Adaptive Kalman Filter. In EUSIPCO 98. pp. 781–784. Available at: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.251.3078 [Accessed January 15, 2015].