Logarithmic distribution

Discrete probability distribution
Logarithmic
Probability mass function
Plot of the logarithmic PMF
Plot of the logarithmic PMF
The function is only defined at integer values. The connecting lines are merely guides for the eye.
Cumulative distribution function
Plot of the logarithmic CDF
Plot of the logarithmic CDF
Parameters 0 < p < 1 {\displaystyle 0<p<1}
Support k { 1 , 2 , 3 , } {\displaystyle k\in \{1,2,3,\ldots \}}
PMF 1 ln ( 1 p ) p k k {\displaystyle {\frac {-1}{\ln(1-p)}}{\frac {p^{k}}{k}}}
CDF 1 + B ( p ; k + 1 , 0 ) ln ( 1 p ) {\displaystyle 1+{\frac {\mathrm {B} (p;k+1,0)}{\ln(1-p)}}}
Mean 1 ln ( 1 p ) p 1 p {\displaystyle {\frac {-1}{\ln(1-p)}}{\frac {p}{1-p}}}
Mode 1 {\displaystyle 1}
Variance p 2 + p ln ( 1 p ) ( 1 p ) 2 ( ln ( 1 p ) ) 2 {\displaystyle -{\frac {p^{2}+p\ln(1-p)}{(1-p)^{2}(\ln(1-p))^{2}}}}
MGF ln ( 1 p e t ) ln ( 1 p )  for  t < ln p {\displaystyle {\frac {\ln(1-pe^{t})}{\ln(1-p)}}{\text{ for }}t<-\ln p}
CF ln ( 1 p e i t ) ln ( 1 p ) {\displaystyle {\frac {\ln(1-pe^{it})}{\ln(1-p)}}}
PGF ln ( 1 p z ) ln ( 1 p )  for  | z | < 1 p {\displaystyle {\frac {\ln(1-pz)}{\ln(1-p)}}{\text{ for }}|z|<{\frac {1}{p}}}

In probability and statistics, the logarithmic distribution (also known as the logarithmic series distribution or the log-series distribution) is a discrete probability distribution derived from the Maclaurin series expansion

ln ( 1 p ) = p + p 2 2 + p 3 3 + . {\displaystyle -\ln(1-p)=p+{\frac {p^{2}}{2}}+{\frac {p^{3}}{3}}+\cdots .}

From this we obtain the identity

k = 1 1 ln ( 1 p ) p k k = 1. {\displaystyle \sum _{k=1}^{\infty }{\frac {-1}{\ln(1-p)}}\;{\frac {p^{k}}{k}}=1.}

This leads directly to the probability mass function of a Log(p)-distributed random variable:

f ( k ) = 1 ln ( 1 p ) p k k {\displaystyle f(k)={\frac {-1}{\ln(1-p)}}\;{\frac {p^{k}}{k}}}

for k ≥ 1, and where 0 < p < 1. Because of the identity above, the distribution is properly normalized.

The cumulative distribution function is

F ( k ) = 1 + B ( p ; k + 1 , 0 ) ln ( 1 p ) {\displaystyle F(k)=1+{\frac {\mathrm {B} (p;k+1,0)}{\ln(1-p)}}}

where B is the incomplete beta function.

A Poisson compounded with Log(p)-distributed random variables has a negative binomial distribution. In other words, if N is a random variable with a Poisson distribution, and Xi, i = 1, 2, 3, ... is an infinite sequence of independent identically distributed random variables each having a Log(p) distribution, then

i = 1 N X i {\displaystyle \sum _{i=1}^{N}X_{i}}

has a negative binomial distribution. In this way, the negative binomial distribution is seen to be a compound Poisson distribution.

R. A. Fisher described the logarithmic distribution in a paper that used it to model relative species abundance.[1]

See also

References

  1. ^ Fisher, R. A.; Corbet, A. S.; Williams, C. B. (1943). "The Relation Between the Number of Species and the Number of Individuals in a Random Sample of an Animal Population" (PDF). Journal of Animal Ecology. 12 (1): 42–58. doi:10.2307/1411. JSTOR 1411. Archived from the original (PDF) on 2011-07-26.

Further reading

  • Johnson, Norman Lloyd; Kemp, Adrienne W; Kotz, Samuel (2005). "Chapter 7: Logarithmic and Lagrangian distributions". Univariate discrete distributions (3 ed.). John Wiley & Sons. ISBN 978-0-471-27246-5.
  • Weisstein, Eric W. "Log-Series Distribution". MathWorld.
  • v
  • t
  • e
Discrete
univariate
with finite
support
with infinite
support
Continuous
univariate
supported on a
bounded interval
supported on a
semi-infinite
interval
supported
on the whole
real line
with support
whose type varies
Mixed
univariate
continuous-
discrete
Multivariate
(joint)DirectionalDegenerate
and singular
Degenerate
Dirac delta function
Singular
Cantor
Families
  • Category
  • Commons