|
|

Stochastic processIn the mathematics of probability, a stochastic process can be thought of as a random variable function (mathematics). In practical applications, the domain over which the function is defined is a time interval (a stochastic process of this kind is called a time series in applications) or a region of space (a stochastic process being called a random field). Familiar examples of time series include stock market and exchange rate fluctuations, signals such as speech, audio and video; medical data such as a patient's EKG, EEG, blood pressure or temperature; and random movement such as Brownian motion or random walk. Examples of random fields include static images, random topographies (landscapes), or composition variations of an inhomogeneous material. == Definition == A stochastic process is an indexed collection of random variables, each of which is defined on the same probability space and takes values on the same codomain (often the real number ). An important case is the discrete set :, where ''i'' runs over some discrete index set ''I'' - for example if the probability distributions of the satisfy the Markov property the process is a Markov chain. is often called (stochastic) ''transition function'' or ''stochastic kernel''. In a continuous stochastic process the index set is continuous (usually space or time), resulting in an infinite number of random variables. Each point in the sample space ; corresponds to a particular value for each of the random variables and the resulting function (mapping a point in the index set to the value of the random variable attached to it) is known as a ''realisation'' of the stochastic process. A particular stochastic process is determined by specifying the joint probability distributions of the various random variables . Stochastic processes may be defined in higher dimensions by attaching a multivariate random variable to each point in the index set, which is equivalent to using a multidimensional index set. Indeed a multivariate random variable can itself be viewed as a stochastic process with index set === Examples === The paradigm continuous stochastic process is that of Brownian motion. In its original form the problem was concerned with a particle floating on a liquid surface, receiving "kicks" from the molecules of the liquid. The particle is then viewed as being subject to a random force which, since the molecules are very small and very close together, is treated as being continuous and, since the particle is constrained to the surface of the liquid by surface tension, is at each point in time a vector parallel to the surface. Thus the random force is described by a two component stochastic process; two real-valued random variables are associated to each point in the index set, time, (note that since the liquid is viewed as being homogeneous the force is independent of the spatial coordinates) with the domain of the two random variables being , giving the ''x'' and ''y'' components of the force. A treatment of Brownian motion generally also includes the effect of viscosity, resulting in an equation of motion known as the Langevin equation. As another example, take the domain to be , the natural numbers, and our range to be , the real numbers. Then, a function is a sequence of real numbers, and a stochastic process with domain and range is a random sequence. The following questions arise: # How is a random sequence specified? # How do we find the answers to typical questions about sequences, such as ## what is the probability distribution of the value of ? ## what is the probability that ''f'' is bounded function? ## what is the probability that ''f'' is monotonic? ## what is the probability that has a limit as ? ## if we construct a series from , what is the probability that the series convergence? What is the probability distribution of the sum? Another important class of examples is when the domain is not a discrete space such as the natural numbers, but a continuous space such as the unit interval , the positive real numbers or the entire real line, . In this case, we have a different set of questions that we might want to answer: # How is a random function specified? # How do we find the answers to typical questions about functions, such as ## what is the probability distribution of the value of ? ## what is the probability that ''f'' is bounded/integrable/continuous/differentiable...? ## what is the probability that has a limit as ? ## what is the probability distribution of the integral ? There is an effective way to answer all of these questions, but it is rather technical (see ''Constructing Stochastic Processes'' below). === Interesting special cases === *Homogeneous processes: processes where the domain has some symmetry and the finite-dimensional probability distributions also have that symmetry. Special cases include stationary processes, also called time-homogeneous. *process with independent increments: processes where the domain is at least partially ordered and, if , all the variables are independent. Markov chains are a special case. *See also continuous-time Markov chain. *Markov processes are those in which the future is conditionally independent of the past given the present. *Point processes: random arrangements of points in a space . They can be modelled as stochastic processes where the domain is a sufficiently large family of subsets of , ordered by inclusion; the range is the set of natural numbers; and, if A is a subset of B, with probability 1. *Gaussian processes: processes where all linear combinations of coordinates are normal distribution random variables. *Poisson processes *Gauss-Markov processes: processes that are both Gaussian and Markov *Martingales -- processes with constraints on the expectation *Galton-Watson processes *Elevator paradox *Branching processes *Bernoulli processes *Many stochastic processes are Lévy processes. == Constructing stochastic processes == In the ordinary axiomatization of probability theory by means of measure theory, the problem is to construct a sigma-algebra of measurable set of the space of all functions, and then put a finite measure on it. For this purpose one traditionally uses a method called Kolmogorov extension. There is at least one alternative axiomatization of probability theory by means of expectation on C-star algebra algebra of random variables. In this case the method goes by the name of Gelfand-Naimark-Segal construction. This is analogous to the two approaches to measure and integration, where one has the choice to construct measures of sets first and define integrals later, or construct integrals first and define set measures as integrals of characteristic functions. === The Kolmogorov extension === The Kolmogorov extension proceeds along the following lines: assuming that a probability measure on the space of all functions exists, then it can be used to specify the probability distribution of finite-dimensional random variables . Now, from this n-dimensional probability distribution we can deduce an (n-1)-dimensional marginal probability distribution for . There is an obvious compatibility condition, namely, that this marginal probability distribution be the same as the one derived from the full-blown stochastic process. When this condition is expressed in terms of probability density function, the result is called the Chapman-Kolmogorov equation. The Kolmogorov extension theorem guarantees the existence of a stochastic process with a given family of finite-dimensional probability distribution satisfying the Chapman-Kolmogorov compatibility condition. === Separability, or what the Kolmogorov extension does not provide === Recall that, in the Kolmogorov axiomatization, measurable sets are the sets which have a probability or, in other words, the sets corresponding to yes/no questions that have a probabilistic answer. The Kolmogorov extension starts by declaring to be measurable all sets of functions where finitely many coordinates are restricted to lie in measurable subsets of . In other words, if a yes/no question about f can be answered by looking at the values of at most finitely many coordinates, then it has a probabilistic answer. In measure theory, if we have a countably infinite collection of measurable sets, then the union and intersection of all of them is a measurable set. For our purposes, this means that yes/no questions that depend on countably many coordinates have a probabilistic answer. The good news is that the Kolmogorov extension makes it possible to construct stochastic processes with fairly arbitrary finite-dimensional distributions. Also, every question that one could ask about a sequence has a probabilistic answer when asked of a random sequence. The bad news is that certain questions about functions on a continuous domain don't have a probabilistic answer. One might hope that the questions that depend on uncountably many values of a function be of little interest, but the really bad news is that virtually all concepts of calculus are of this sort. For example: #bounded function #continuous function #differentiability all require knowledge of uncountably many values of the function. One solution to this problem is to require that the stochastic process be separable. In other words, that there be some countable set of coordinates whose values determine the whole random function ''f''. ==See also== * List of stochastic processes topics Stochastic processes Stochastic processThe definition given on this page for "stochastic process" as a random function is an elegant mathematical definition if one takes the viewpoint that there is a family of functions on a common domain and range and a probability measure on a sigma algebra of subsets of that family. But might a reader interpret this to mean that there could be a random choice of a function from a set of functions with different domains and ranges? (And would that fall within the intended definition?) :One way to deal with varying domains is by taking direct sums of the function spaces involved. -- User:Miguel Whatever the final verdict on the above point is, it would help to explain how the often seen definition of a stochastic process as "an indexed family of random variables" agrees with the idea of "random function". This would involve explaining that an indexing set can be finite, countable or uncountable. That fact may surprise someone who hasn't studied abstract mathematics. :This article needs input from otehr people, please add this definition if you'd like. -- User:Miguel There is some inconsistency with the way books treat the term "stochastic process". Some (such as Gardiner) restrict it to a process in time. This appears to be in the same spirit as author's who say, for a vector, "In this book W will be a finite dimensional vector space" in order to set a context that is more specific than the general definition of "vector". There are books which use "Random Field" to include processes that take place in time. These are arbitrary conventions. It would help the reader to mention that he may encounter these inconsistencies. :I think it is best to give the broadest possible definition to which the basic techniques can be applied, and list special cases or narrower definitions later in the article. If you can add other definitions ''with references to the literature'' that will greatly improve the article. -- User:Miguel ::I'll attempt to do that, but first I must study how the Wikipedia does things. Is there a way to use standard html editors like Netscape Composer or Mozilla's composer to create Wikipedia pages? User:Stephen Tashiro :::Just go ahead and edit the page. Under the editing window there is an ''editing help'' link you can follow to get instructions on how to format the content and include mathematics. And remember the wikipedia policy: be bold in updating pages. -- User:Miguel I can't yet expound upon the QM approach to stochastic processes. Below is what I propose for the traditional approach. If I put it all in at the top of the page, I get a warning about a 33 Kb limit, so I didn't edit the entry yet. User:Tashiro :Wow, that's pretty impressive. Why don't you add the content to the article a bit at a time? That way you will get around the 33Kb limit, and if you use sectioning (see the editing help about this) appropriately you will never hit the limit. Also, the rest of us will be able to work on your edits as you add them to the article more easily than if you added the whole thing at once. -- User:Miguel OK, I pasted the draft into the beginning of the article. I confess that I don't know html or the wikipedia editing conventions very well. My method is compose the pages with Mozilla, view source, cut and paste into the Wiki editing window. Some spurious blank lines have been introduced and people who use the Wiki editor may not like the symbols introduced by Mozilla. Is there a consensus about whether using html editors to compose Wiki pages is a good or bad thing? User:Tashiro :You really have to read the Wikipedia:How does one edit a page. The point of wiki is that ho HTML is necessary to create a page, you just write text and add very little wiki-specific markup. :I have moved your discussion to the end of the article because it does not really have the structure of an Encyclopedia article, although there is very good content in it. I will work on merging your discussion into the existing content. The first thing I'll do is section your discussion. -- User:Miguel == Modified Definition == I don't find the given definition very rigourous - it is more of a description than a definition. Heres the beginings of a new version. I still need to do some work on this as it tails off a bit towards the end... :A one dimensional stochastic process consists of a set of random variables together with a bijective map assigning one of these random variables to each member of an index set . All of the random variables share the same codomain (a probability space ) and the same domain (a measurable space). Thus each point corresponds to a value for each random variable and hence a function , known as a ''realisation'' of the stochastic process. :Technically a stochastic process is not a function, only a particular realisation of it, and the mapping can be described as such. Despite this, the term ''random function'' and notations such as are convenient abbreviations. :Stochastic processes can then be discrete or continuous: a discrete stochastic process is an indexed collection of random variables : :where ''i'' runs over the countable index set ''I'', while a continuous stochastic process is an uncountably infinite set of random variables, with an uncountable index set. :A particular stochastic process is determined by specifying the joint probability distributions of the various random variables ''f''(''x''). Comments and opinions please...! User:SgtThroat 15:21, 12 Dec 2004 (UTC) * In an encyclopedia article, clarity should come first and rigor later. User:Michael Hardy 19:56, 12 Dec 2004 (UTC) I quite agree, but in a mathematical article such as this anything that claims to be a definition should be rigourous; compare for example the articles on ring_(mathematics) or sigma-algebras, bith of which start off with a clear definition. I suppose that what I am really saying is that I find the definition given unclear as well as un-rigorous, perhaps with the first as a consequence of the second. As an alternative suggestion how about calling the 'definition' a 'description' (although I still feel that it needs work), then placing a proper definition later on..?User:SgtThroat 13:37, 13 Dec 2004 (UTC) : But should one ''always'' begin with a definition? Sometimes clarity might be better served by beginning with an intuitive description and putting the rigorous definition after some discussion that enables the reader to understand the definition. User:Michael Hardy 22:33, 13 Dec 2004 (UTC) Once again I was not precise enough with my language - I meant 'further down' when I wrote 'later on', so I think we are in agreement. Thus I propose putting the definition further down. This necessitates renaming the existing 'definition' (which I don't think is appropriate anyway) - any suggestions? Also I'd like comment on the definition above - is it completely correct?User:SgtThroat 01:01, 14 Dec 2004 (UTC) I have now changed the "definition" section to "Common stochastic processes" and removed some of what I felt was rather confusing language. I've put in an example, probably in the wrong place, which I hope gives a feel for the index set and the domain of the random variables. I plan to rework the above "modified definition" to remove some stuff that is probably redundant due to overlap with the reworked first section of the article, then include it towards the end of the article. I'd like to hear opinions and comments on both parts. User:SgtThroat 20:30, 15 Dec 2004 (UTC) :Moved the Brownian motion example into the following section, which I renamed "examples". I then removed the empty examples section that was already there. Forgot to add an edit summary... User:SgtThroat 20:51, 15 Dec 2004 (UTC) ==The last section== The last section is too long. How about spinning it off as a seperate article? -User:Wshun 03:07, 14 Nov 2003 (UTC) I'm putting it here. for the time being. This will allow us to fix the formatting problems, and also to move content gradually from here to the main text. I don't think a new article is necessary, but maybe the need for it will become apparent in the process. -- User:Miguel 17:17, 14 Nov 2003 (UTC) === Informal Discussion === Several different definitions are found for "stochastic process" in mathematical literature. This is not particularly scandalous. Some mathematical definitions describe things which have many specific properties that are crucial in writing mathematical proofs. (For an example of such a definition, see the entry for vector space in the Wikipedia.) Other mathematical definitions do not provide much specific information. The traditional definitions of "stochastic process" fall in this latter category. Even a book whose subject is "stochastic processes", may treat the definition rather casually. Such texts chose to add details later by defining special cases of the general concept. In order to explain the distinctions among the various defintiions of "stochastic process" it is best to begin with an informal discussion. One purpose of defining the term "stochastic process" is to create terminology that is broad enough to describe a random phenomenon that produces an infinite amount of data each time it occurs. Another goal is to have terminology that is narrow enough to describe situations where the data is , in a manner of speaking, all of the same format (e.g. it might be all be prices in dollars or it might be all be measurements of air pressure in millibars ). We give some examples of some physical situations that can be viewed as a stochastic process. Example 1: Consider air pressures in millibars at the local airport airport from 6:00AM to 7:00AM. Assuming that time is a continuum, there are an infinite number of times between 6:00AM and 7:00AM. A single occurrence ("realization") of the process is this infinite set of pressures that occurs on a particular day. Each of these pressures is a datum that is in the same format as the others. Example 2: Suppose we hand out 8.5" by 11" sheets
of white paper to each member of an audience and ask them to draw a
picture. Let us take the simplistic view that underlying
process that creates the picture is the same for each member
of the audience. Each picture we receive is a realization of
this random phenomena. Each picture contains an infinite amout
of data if we take the idea of the space as a continuum
seriously. At each location (x,y)
in the picture there is a certain color value that is part of the
realized data. There are an infinite number of such locations
(x,y) on a sheet of 8.5" by 11" paper.
To describe a color we may need more than a single number.
Suppose the color data at location (x,y) is
given by a triplet of numbers (r,g,b) that measure the
red,blue and green intensities. Since the data has this form
at all locations, we can think of it as being "all of the same
format". It is important to understand that the same practical problem can be described in mathematics in different ways. Mathematics itself does not specify a unique translation of a physical situation into mathematical terms. We give some examples of such ambiguity. Example 3: Suppose we measure the height of a randomly selected person, we may think of this as a process that produces a single datum, the persons height, each time we perform it The most common mathematical treatment of this situation is to view it as a realization of a random variable . It is also possible to view this as a stochastic process that produces 1 datum on each realization. However it is not customary to do this. Example 4: Suppose we perform an experiment where we measure the weight, height and temperature of a randomly selected person. This is usually viewed as a realization of a multivariate random variable which has three components. It is also possible to view this as a stochastic process that produces 1 datum , the triplet of (weight, height , temperature), on each realization. However it is not customary to do this. Some authors [ Neftci] view a stochastic process as a "random function" in the following manner. A stochastic process is considered to be a function of two variables f(t,w). The first variable describes which datum we wish to examine from all the data produced by one realization of the process. The second variable w represents which specific realization of the process occurred. In example 1, We may think of w is a datum
that specifies information such as "January 3, 2002
6:00AM to 7:00AM". The variable t
would be used to indicate a specific time. For example, one
possible value of t is 6:03 AM. From this
point of view, the realization of the process consists of
picking a specific value of w. Then, with w
being fixed, the function f(t,w) becomes a
function of t alone. So a single realization of
the process is a specific function of t. The values of most of the variables in the above examples are familiar mathematical quantites, such as numbers or vectors. But the reader may it have difficulty conceptualizing the nature of the variable w and stating exactly what possible values it make have. The possible values of w are taken from a probability space . Roughly speaking, the "probability space" refers to three things: 1) a set of things that we may think of as "primitive" events, 2) a collection of subsets of the primitive events and 3) a function or algorithm that is able to assign a probablity to each of these subsets. In the simple examples above, we can only give a partial description of the probability space. In example 1, the primitive events can be described as "all possible 6 to 7 AM time periods at the airport". In example 2, the primitive events can be described as "all possible pictures that people might draw on 8.5" by 11" paper". These descriptions dodge the question of which subsets of primitive events can be assigned a probability and how this might be done. A primitive event in the probability space should determine all the values of the random phenomenon. For instance, in example 1 it would not be correct to say that a pressure reading of 1013.25 millibars at 6:03 AM is a primitive event. Giving the air pressure at a single time would not determine it at the others. In example 2, a primitive event would not be "all the colors a person might draw at some location on the paper" or "all the images a person might draw in the upper left hand corner of the picture". A primitive event should determine the whole picture. In practical applications of stochastic processes , there is often a quantitative description of the probability space. For example, one may assume a specific formula or algorithm generates the occurrence of the process. The algorithm will usually involve taking realizations of random variables and doing certain computations with the results to arrive at the realization of the stochastic process. In such a case, the primitive events are the set of all possible realizations of the random variables employed by the algorithm. The probabilities involved are computed from the joint distribution of these random variables. Example 5: For the sake of having a simple example, assume that nature generates the air pressures of example 1 according to the following scheme. Pick two air pressures in millibars by selecting a two random numbers number p0 and p1 from a probability distribution on the interval -0.2.to 0.2. Let the resulting pressure readings be given by a pressure-vs-time graph that is a straight line connecting the points ( 6:00 AM, 1013.25 + p0) with (7:00, 1013.25 + p1). We may view stochastic process of example 5 as a function f(t,w) where w is the vector (p0,p1). A primitive event is the selection of specific values for p0 and p1. The probabilities of various subsets of primitive events can be computed from the joint distribution of (p0,p1), which is the product of two uniform distributions since we have assumed p0 and p1 are independent. For example, we can compute the probability of the subset "p1 > 0.0 and p2 > 0.1".
In example 1, t is a time and the index set is the set of all times between 6:00 AM and 7:00 AM (One should not assume that an "index set " must be a set of integers. Indexing via a set of integers is often used computer programming, as when we index an array A by integers in order to refer to A[1], A[2], etc. However the concept of "index set" in a stochastic process is more general than this. The "index set" can be any set at all.) The pressure at a specific time, such as 6:03 AM can be viewed as a single random variable since the pressure at this time will be different on different days. We think of the stochastic process as an infinite family of random variables X(t) that are indexed by the time t. A random variable has a domain and a range. A realization of a variable like X(6:03 AM) is a single real number. So we may say its range is the set of numbers that are possible pressure readings. The domain of the random variable X(603 AM) ) is not the set of times, even though the notation X(t) makes it appear this way. The domain of X(6:03 AM) is the probability space for the phenomena. In this example, the domain is a datum that would describe a specific date at the airport such as "January 3, 2002 6:00AM to 7:00AM"
See other meanings of words starting from letter: SSB | SC | SD | SE | SF | SG | SH | SI | SJ | SK | SL | SM | SN | SO | SP | SR | SS | ST | SU | SW | SX | SY | SZ |Words begining with Stochastic_process: Stochastic_process Stochastic_process Stochastic_processes Stochastic_processes Stochastic_processes Stochastic_process_algebra |
These materials are based on Wikipedia and licensed under the GNU FDL
YouTube.com videos better site than Turbo Tax 2007 |
|
|