Misplaced Pages

Zero-truncated Poisson distribution

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Conditional Poisson distribution restricted to positive integers
This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.
Find sources: "Zero-truncated Poisson distribution" – news · newspapers · books · scholar · JSTOR (August 2013) (Learn how and when to remove this message)

In probability theory, the zero-truncated Poisson distribution (ZTP distribution) is a certain discrete probability distribution whose support is the set of positive integers. This distribution is also known as the conditional Poisson distribution or the positive Poisson distribution. It is the conditional probability distribution of a Poisson-distributed random variable, given that the value of the random variable is not zero. Thus it is impossible for a ZTP random variable to be zero. Consider for example the random variable of the number of items in a shopper's basket at a supermarket checkout line. Presumably a shopper does not stand in line with nothing to buy (i.e., the minimum purchase is 1 item), so this phenomenon may follow a ZTP distribution.

Since the ZTP is a truncated distribution with the truncation stipulated as k > 0, one can derive the probability mass function g(k;λ) from a standard Poisson distribution f(k;λ) as follows:

g ( k ; λ ) = P ( X = k X > 0 ) = f ( k ; λ ) 1 f ( 0 ; λ ) = λ k e λ k ! ( 1 e λ ) = λ k ( e λ 1 ) k ! {\displaystyle g(k;\lambda )=P(X=k\mid X>0)={\frac {f(k;\lambda )}{1-f(0;\lambda )}}={\frac {\lambda ^{k}e^{-\lambda }}{k!\left(1-e^{-\lambda }\right)}}={\frac {\lambda ^{k}}{(e^{\lambda }-1)k!}}}

The mean is

E [ X ] = λ 1 e λ = λ e λ e λ 1 {\displaystyle \operatorname {E} ={\frac {\lambda }{1-e^{-\lambda }}}={\frac {\lambda e^{\lambda }}{e^{\lambda }-1}}}

and the variance is

Var [ X ] = λ + λ 2 1 e λ λ 2 ( 1 e λ ) 2 = E [ X ] ( 1 + λ E [ X ] ) {\displaystyle \operatorname {Var} ={\frac {\lambda +\lambda ^{2}}{1-e^{-\lambda }}}-{\frac {\lambda ^{2}}{(1-e^{-\lambda })^{2}}}=\operatorname {E} (1+\lambda -\operatorname {E} )}

Parameter estimation

The method of moments estimator λ ^ {\displaystyle {\widehat {\lambda }}} for the parameter λ {\displaystyle \lambda } is obtained by solving

λ ^ 1 e λ ^ = x ¯ {\displaystyle {\frac {\widehat {\lambda }}{1-e^{-{\widehat {\lambda }}}}}={\bar {x}}}

where x ¯ {\displaystyle {\bar {x}}} is the sample mean.

This equation has a solution in terms of the Lambert W function. In practice, a solution may be found using numerical methods.

Examples

Insurance claims:

Imagine navigating the intricate landscape of auto insurance claims, where each claim signifies a unique event – an accident or damage occurrence. The ZTP distribution seamlessly aligns with this scenario, excluding the possibility of policyholders with zero claims.

Let X denote the random variable representing the number of insurance claims. If λ is the average rate of claims, the ZTP probability mass function takes the form:

P ( X = k ) = λ k e λ k ! ( 1 e λ ) {\displaystyle P(X=k)={\frac {\lambda ^{k}e^{-\lambda }}{k!\left(1-e^{-\lambda }\right)}}} for k= 1,2,3,...

This formula encapsulates the probability of observing k claims given that at least one claim has transpired. The denominator ensures the exclusion of the improbable zero-claim scenario. By utilizing the zero-truncated Poisson distribution, the manufacturing company can analyze and predict the frequency of defects in their products while focusing on instances where defects exist. This distribution helps in understanding and improving the quality control process, especially when it's crucial to account for at least one defect.

Generating zero-truncated Poisson-distributed random variables

Random variables sampled from the zero-truncated Poisson distribution may be achieved using algorithms derived from Poisson distribution sampling algorithms.

init:
    Let k ← 1, t ← e / (1 - e) * λ, s ← t.
    Generate uniform random number u in .
while s < u do:
    k ← k + 1.
    t ← t * λ / k.
    s ← s + t.
return k.

The cost of the procedure above is linear in k, which may be large for large values of λ {\displaystyle \lambda } . Given access to an efficient sampler for non-truncated Poisson random variates, a non-iterative approach involves sampling from a truncated exponential distribution representing the time of the first event in a Poisson point process, conditional on such an event existing. A simple NumPy implementation is:

def sample_zero_truncated_poisson(rate):
    u = np.random.uniform(np.exp(-rate), 1)
    t = -np.log(u)
    return 1 + np.random.poisson(rate - t)

References

  1. ^ Cohen, A. Clifford (1960). "Estimating parameters in a conditional Poisson distribution". Biometrics. 16 (2): 203–211. doi:10.2307/2527552. JSTOR 2527552.
  2. Singh, Jagbir (1978). "A characterization of positive Poisson distribution and its application". SIAM Journal on Applied Mathematics. 34: 545–548. doi:10.1137/0134043.
  3. "Stata Data Analysis Examples: Zero-Truncated Poisson Regression". UCLA Institute for Digital Research and Education. Retrieved 7 August 2013.
  4. Johnson, Norman L.; Kemp, Adrianne W.; Kotz, Samuel (2005). Univariate Discrete Distributions (third ed.). Hoboken, NJ: Wiley-Interscience.
  5. Borje, Gio (2016-06-01). "Zero-Truncated Poisson Distribution Sampling Algorithm". Archived from the original on 2018-08-26.
  6. Hardie, Ted (1 May 2005). "[R] simulate zero-truncated Poisson distribution". r-help (Mailing list). Retrieved 27 May 2022.
Probability distributions (list)
Discrete
univariate
with finite
support
with infinite
support
Continuous
univariate
supported on a
bounded interval
supported on a
semi-infinite
interval
supported
on the whole
real line
with support
whose type varies
Mixed
univariate
continuous-
discrete
Multivariate
(joint)
Directional
Univariate (circular) directional
Circular uniform
Univariate von Mises
Wrapped normal
Wrapped Cauchy
Wrapped exponential
Wrapped asymmetric Laplace
Wrapped Lévy
Bivariate (spherical)
Kent
Bivariate (toroidal)
Bivariate von Mises
Multivariate
von Mises–Fisher
Bingham
Degenerate
and singular
Degenerate
Dirac delta function
Singular
Cantor
Families
Categories: