Half-normal distribution

Probability distribution

Half-normal distribution
Probability density function $\sigma =1$
Cumulative distribution function $\sigma =1$
Parameters	$\sigma >0$ — (scale)
Support	$x\in [0,\infty )$
PDF	$f(x;\sigma )={\frac {\sqrt {2}}{\sigma {\sqrt {\pi }}}}\exp \left(-{\frac {x^{2}}{2\sigma ^{2}}}\right)\quad x>0$
CDF	$F(x;\sigma )=\operatorname {erf} \left({\frac {x}{\sigma {\sqrt {2}}}}\right)$
Quantile	$Q(F;\sigma )=\sigma {\sqrt {2}}\operatorname {erf} ^{-1}(F)$
Mean	${\frac {\sigma {\sqrt {2}}}{\sqrt {\pi }}}\approx 0.797885\sigma$
Median	$\sigma {\sqrt {2}}\operatorname {erf} ^{-1}(1/2)\approx 0.674490\sigma$
Mode	$0$
Variance	$\sigma ^{2}\left(1-{\frac {2}{\pi }}\right)$
Skewness	${\frac {{\sqrt {2}}(4-\pi )}{(\pi -2)^{3/2}}}\approx 0.9952717$
Excess kurtosis	${\frac {8(\pi -3)}{(\pi -2)^{2}}}\approx 0.869177$
Entropy	${\frac {1}{2}}\log _{2}\left(2\pi e\sigma ^{2}\right)-1$

In probability theory and statistics, the half-normal distribution is a special case of the folded normal distribution.

Let $X$ follow an ordinary normal distribution, $N(0,\sigma ^{2})$ . Then, $Y=|X|$ follows a half-normal distribution. Thus, the half-normal distribution is a fold at the mean of an ordinary normal distribution with mean zero.

Properties

Using the $\sigma$ parametrization of the normal distribution, the probability density function (PDF) of the half-normal is given by

f_{Y}(y;\sigma )={\frac {\sqrt {2}}{\sigma {\sqrt {\pi }}}}\exp \left(-{\frac {y^{2}}{2\sigma ^{2}}}\right)\quad y\geq 0,

where $E[Y]=\mu ={\frac {\sigma {\sqrt {2}}}{\sqrt {\pi }}}$ .

Alternatively using a scaled precision (inverse of the variance) parametrization (to avoid issues if $\sigma$ is near zero), obtained by setting $\theta ={\frac {\sqrt {\pi }}{\sigma {\sqrt {2}}}}$ , the probability density function is given by

f_{Y}(y;\theta )={\frac {2\theta }{\pi }}\exp \left(-{\frac {y^{2}\theta ^{2}}{\pi }}\right)\quad y\geq 0,

where $E[Y]=\mu ={\frac {1}{\theta }}$ .

The cumulative distribution function (CDF) is given by

F_{Y}(y;\sigma )=\int _{0}^{y}{\frac {1}{\sigma }}{\sqrt {\frac {2}{\pi }}}\,\exp \left(-{\frac {x^{2}}{2\sigma ^{2}}}\right)\,dx

Using the change-of-variables $z=x/({\sqrt {2}}\sigma )$ , the CDF can be written as

F_{Y}(y;\sigma )={\frac {2}{\sqrt {\pi }}}\,\int _{0}^{y/({\sqrt {2}}\sigma )}\exp \left(-z^{2}\right)dz=\operatorname {erf} \left({\frac {y}{{\sqrt {2}}\sigma }}\right),

where erf is the error function, a standard function in many mathematical software packages.

The quantile function (or inverse CDF) is written:

Q(F;\sigma )=\sigma {\sqrt {2}}\operatorname {erf} ^{-1}(F)

where $0\leq F\leq 1$ and $\operatorname {erf} ^{-1}$ is the inverse error function

The expectation is then given by

E[Y]=\sigma {\sqrt {2/\pi }},

The variance is given by

\operatorname {var} (Y)=\sigma ^{2}\left(1-{\frac {2}{\pi }}\right).

Since this is proportional to the variance σ² of X, σ can be seen as a scale parameter of the new distribution.

The differential entropy of the half-normal distribution is exactly one bit less the differential entropy of a zero-mean normal distribution with the same second moment about 0. This can be understood intuitively since the magnitude operator reduces information by one bit (if the probability distribution at its input is even). Alternatively, since a half-normal distribution is always positive, the one bit it would take to record whether a standard normal random variable were positive (say, a 1) or negative (say, a 0) is no longer necessary. Thus,

h(Y)={\frac {1}{2}}\log _{2}\left({\frac {\pi e\sigma ^{2}}{2}}\right)={\frac {1}{2}}\log _{2}\left(2\pi e\sigma ^{2}\right)-1.

Applications

The half-normal distribution is commonly utilized as a prior probability distribution for variance parameters in Bayesian inference applications.^[1]^[2]

Parameter estimation

Given numbers $\{x_{i}\}_{i=1}^{n}$ drawn from a half-normal distribution, the unknown parameter $\sigma$ of that distribution can be estimated by the method of maximum likelihood, giving

{\hat {\sigma }}={\sqrt {{\frac {1}{n}}\sum _{i=1}^{n}x_{i}^{2}}}

The bias is equal to

b\equiv \operatorname {E} {\bigg [}\;({\hat {\sigma }}_{\mathrm {mle} }-\sigma )\;{\bigg ]}=-{\frac {\sigma }{4n}}

which yields the bias-corrected maximum likelihood estimator

{\hat {\sigma \,}}_{\text{mle}}^{*}={\hat {\sigma \,}}_{\text{mle}}-{\hat {b\,}}.

Related distributions

The distribution is a special case of the folded normal distribution with μ = 0.
It also coincides with a zero-mean normal distribution truncated from below at zero (see truncated normal distribution)
If Y has a half-normal distribution, then (Y/σ)² has a chi square distribution with 1 degree of freedom, i.e. Y/σ has a chi distribution with 1 degree of freedom.
The half-normal distribution is a special case of the generalized gamma distribution with d = 1, p = 2, a = ${\sqrt {2}}\sigma$ .
If Y has a half-normal distribution, Y^-2 has a Levy distribution
The Rayleigh distribution is a moment-tilted and scaled generalization of the half-normal distribution.
Modified half-normal distribution^[3] with the pdf on $(0,\infty )$ is given as $f(x)={\frac {2\beta ^{\frac {\alpha }{2}}x^{\alpha -1}\exp(-\beta x^{2}+\gamma x)}{\Psi {\left({\frac {\alpha }{2}},{\frac {\gamma }{\sqrt {\beta }}}\right)}}}$ , where $\Psi (\alpha ,z)={}_{1}\Psi _{1}\left({\begin{matrix}\left(\alpha ,{\frac {1}{2}}\right)\\(1,0)\end{matrix}};z\right)$ denotes the Fox–Wright Psi function.

References

^ Gelman, A. (2006), "Prior distributions for variance parameters in hierarchical models", Bayesian Analysis, 1 (3): 515–534, doi:10.1214/06-ba117a
^ Röver, C.; Bender, R.; Dias, S.; Schmid, C.H.; Schmidli, H.; Sturtz, S.; Weber, S.; Friede, T. (2021), "On weakly informative prior distributions for the heterogeneity parameter in Bayesian random‐effects meta‐analysis", Research Synthesis Methods, 12 (4): 448–474, arXiv:2007.08352, doi:10.1002/jrsm.1475, PMID 33486828, S2CID 220546288
^ Sun, Jingchao; Kong, Maiying; Pal, Subhadip (22 June 2021). "The Modified-Half-Normal distribution: Properties and an efficient sampling scheme". Communications in Statistics - Theory and Methods. 52 (5): 1591–1613. doi:10.1080/03610926.2021.1934700. ISSN 0361-0926. S2CID 237919587.

External links

Half-Normal Distribution at MathWorld

(note that MathWorld uses the parameter

\theta ={\frac {1}{\sigma }}{\sqrt {\pi /2}}

Probability distributions (list)

Discrete
univariate

with finite support	Benford Bernoulli beta-binomial binomial categorical hypergeometric negative Poisson binomial Rademacher soliton discrete uniform Zipf Zipf–Mandelbrot
with infinite support	beta negative binomial Borel Conway–Maxwell–Poisson discrete phase-type Delaporte extended negative binomial Flory–Schulz Gauss–Kuzmin geometric logarithmic mixed Poisson negative binomial Panjer parabolic fractal Poisson Skellam Yule–Simon zeta

Continuous
univariate

supported on a bounded interval	arcsine ARGUS Balding–Nichols Bates beta beta rectangular continuous Bernoulli Irwin–Hall Kumaraswamy logit-normal noncentral beta PERT raised cosine reciprocal triangular U-quadratic uniform Wigner semicircle
supported on a semi-infinite interval	Benini Benktander 1st kind Benktander 2nd kind beta prime Burr chi chi-squared noncentral inverse scaled Dagum Davis Erlang hyper exponential hyperexponential hypoexponential logarithmic F noncentral folded normal Fréchet gamma generalized inverse gamma/Gompertz Gompertz shifted half-logistic half-normal Hotelling's T-squared inverse Gaussian generalized Kolmogorov Lévy log-Cauchy log-Laplace log-logistic log-normal log-t Lomax matrix-exponential Maxwell–Boltzmann Maxwell–Jüttner Mittag-Leffler Nakagami Pareto phase-type Poly-Weibull Rayleigh relativistic Breit–Wigner Rice truncated normal type-2 Gumbel Weibull discrete Wilks's lambda
supported on the whole real line	Cauchy exponential power Fisher's z Kaniadakis κ-Gaussian Gaussian q generalized normal generalized hyperbolic geometric stable Gumbel Holtsmark hyperbolic secant Johnson's S_U Landau Laplace asymmetric logistic noncentral t normal (Gaussian) normal-inverse Gaussian skew normal slash stable Student's t Tracy–Widom variance-gamma Voigt
with support whose type varies	generalized chi-squared generalized extreme value generalized Pareto Marchenko–Pastur Kaniadakis κ-exponential Kaniadakis κ-Gamma Kaniadakis κ-Weibull Kaniadakis κ-Logistic Kaniadakis κ-Erlang q-exponential q-Gaussian q-Weibull shifted log-logistic Tukey lambda