Geometric distribution

From Maths
Revision as of 03:17, 16 January 2018 by Alec (Talk | contribs) (Overhaul of page, marked out work to do, good progress made - STILL TO DO pull infobox out and put in /infobox - SAVING WORK)

Jump to: navigation, search
Stub grade: A*
This page is a stub
This page is a stub, so it contains little or minimal information and is on a to-do list for being expanded.The message provided is:
It's crap, look at it.
  • Dire notice: "The content is iffy, and uses a weird convention where geometric is time to first failure. New page content at Geometric distribution2"
    • Dire notice removed Alec (talk) 03:35, 15 January 2018 (UTC)
  • Partial expectation proof to be found at Geometric distribution2 page.
Geometric Distribution
[ilmath]X\sim\text{Geo}(p)[/ilmath]

for [ilmath]p[/ilmath] the probability of each trials' success

[ilmath]X\eq k[/ilmath] means that the first success occurred on the [ilmath]k^\text{th} [/ilmath] trial, [ilmath]k\in\mathbb{N}_{\ge 1} [/ilmath]
Definition
Defined over [ilmath]X[/ilmath] may take values in [ilmath]\mathbb{N}_{\ge 1}\eq\{1,2,\ldots\} [/ilmath]
p.m.f [ilmath]\mathbb{P}[X\eq k]:\eq (1-p)^{k-1}p[/ilmath]
c.d.f / c.m.f[Note 1] [ilmath]\mathbb{P}[X\le k]\eq 1-(1-p)^k[/ilmath]
cor: [ilmath]\mathbb{P}[X\ge k]\eq (1-p)^{k-1} [/ilmath]
Properties
Expectation: [math]\mathbb{E}[X]\eq\frac{1}{p} [/math][1]
Variance:
TODO: Unknown
[Note 2]
[ilmath]\newcommand{\P}[2][]{\mathbb{P}#1{\left[{#2}\right]} } \newcommand{\Pcond}[3][]{\mathbb{P}#1{\left[{#2}\!\ \middle\vert\!\ {#3}\right]} } \newcommand{\Plcond}[3][]{\Pcond[#1]{#2}{#3} } \newcommand{\Prcond}[3][]{\Pcond[#1]{#2}{#3} }[/ilmath]
[ilmath]\newcommand{\E}[1]{ {\mathbb{E}{\left[{#1}\right]} } } [/ilmath][ilmath]\newcommand{\Mdm}[1]{\text{Mdm}{\left({#1}\right) } } [/ilmath][ilmath]\newcommand{\Var}[1]{\text{Var}{\left({#1}\right) } } [/ilmath][ilmath]\newcommand{\ncr}[2]{ \vphantom{C}^{#1}\!C_{#2} } [/ilmath]

Definition

Consider a potentially infinite sequence of [ilmath]\text{Borv} [/ilmath] variables, [ilmath] ({ X_i })_{ i = 1 }^{ n } [/ilmath], each independent and identically distributed (i.i.d) with [ilmath]X_i\sim[/ilmath][ilmath]\text{Borv} [/ilmath][ilmath](p)[/ilmath], so [ilmath]p[/ilmath] is the probability of any particular trial being a "success".

The geometric distribution models the probability that the first success occurs on the [ilmath]k^\text{th} [/ilmath] trial, for [ilmath]k\in\mathbb{N}_{\ge 1} [/ilmath].

As such:

  • [ilmath]\P{X\eq k} :\eq (1-p)^{k-1}p[/ilmath] - pmf / pdf - Claim 1 below
  • [ilmath]\mathbb{P}[X\le k]\eq 1-(1-p)^k[/ilmath] - cdf - Claim 2 below
    • [ilmath]\mathbb{P}[X\ge k]\eq (1-p)^{k-1} [/ilmath] - an obvious extension.

Convention notes

Grade: A**
This page requires some work to be carried out
Some aspect of this page is incomplete and work is required to finish it
The message provided is:
If [ilmath]X\sim\text{Geo}(p)[/ilmath] is defined as above then there are 3 other conventions I've seen:
  1. [ilmath]X_1\sim\text{Geo}(1-p)[/ilmath] in our terminology, they would write [ilmath]\text{Geo}(p)[/ilmath], which measures "trials until first failure" instead of success as we do
  2. [ilmath]X_2:\eq X-1[/ilmath] - the number of trials BEFORE first success
  3. [ilmath]X_3:\eq X_1-1[/ilmath] - the number of trials BEFORE first failure
Document and explain Alec (talk) 03:17, 16 January 2018 (UTC)

Warning:That grade doesn't exist!

Properties

For [ilmath]p\in[0,1]\subseteq\mathbb{R} [/ilmath] and [ilmath]X\sim\text{Geo}(p)[/ilmath] we have the following results about the geometric distribution:

  • [ilmath]\E{X}\eq\frac{1}{p} [/ilmath] for [ilmath]p\in(0,1][/ilmath] and is undefined or tentatively defined as [ilmath]+\infty[/ilmath] if [ilmath]p\eq 0[/ilmath]

To do:

  1. Variance of the geometric distribution
  2. Mdm of the geometric distribution

Proof of claims

Claim 1: [ilmath]\P{X\eq k}\eq (1-p)^{k-1} p [/ilmath]

TODO: This requires improvement, it was copy and pasted from some notes
  • [ilmath]\P{X\eq k} :\eq (1-p)^{k-1}p[/ilmath] - which is derived as folllows:
    • [ilmath]\P{X\eq k} :\eq \Big(\P{X_1\eq 0}\times\Pcond{X_2\eq 0}{X_1\eq 0}\times\cdots\times \Pcond{X_{k-1}\eq 0}{X_1\eq 0\cap X_2\eq 0\cap\cdots\cap X_{k-2}\eq 0}\Big)\times\Pcond{X_k\eq 1}{X_1\eq 0\cap X_2\eq 0\cap\cdots\cap X_{k-1}\eq 0} [/ilmath]
      • Using that the [ilmath]X_i[/ilmath] are independent random variables we see:
        • [math]\P{X\eq k}\eq \left(\prod^{k-1}_{i\eq 1}\P{X_i\eq 0}\right)\times \P{X_k\eq 1} [/math]
          [math]\eq (1-p)^{k-1} p[/math] as they all have the same distribution, namely [ilmath]X_i\sim\text{Borv}(p)[/ilmath]

Claim 2: [ilmath]\mathbb{P}[X\le k]\eq 1-(1-p)^k[/ilmath]

Grade: A**
This page requires one or more proofs to be filled in, it is on a to-do list for being expanded with them.
Please note that this does not mean the content is unreliable. Unless there are any caveats mentioned below the statement comes from a reliable source. As always, Warnings and limitations will be clearly shown and possibly highlighted if very important (see template:Caution et al).
The message provided is:
Trivial to do, direct application of Geometric series result Alec (talk) 03:17, 16 January 2018 (UTC)

See also

Distributions

Notes

  1. Do we make this distinction for cumulative distributions?
  2. Due to different conventions on the definition of geometric (for example [ilmath]X':\eq X-1[/ilmath] for my [ilmath]X[/ilmath] and another's [ilmath]X'\sim\text{Geo}(p)[/ilmath]) or even differing by using [ilmath]1-p[/ilmath] in place of [ilmath]p[/ilmath] in the [ilmath]X[/ilmath] and [ilmath]X'[/ilmath] just mentioned - I cannot be sure without working it out that it's [math]\frac{1-p}{p^2} [/math] - I record this value only for a record of what was once there with the correct expectation - DO NOT USE THIS EXPRESSION

References

  1. See Expectation of the geometric distribution