2.5.10.2 Algorithm: Parametric Distribution Analysis (Arbitrary Censoring)
Time data are input as pairs \( (t1_i , t2_i) \) where \( t1_i \le t2_i \). The following input formats correspond to the four supported data types, exact, right-censored, left-censored, and interval-censored observations.
| Data Type | Input Format |
|---|---|
| Exact | \[ t1_i = t2_i \] |
| Right-censored | \( t2_i \) is missing |
| Left-censored | \( t1_i \) is missing |
| Interval-censored | \[ t1_i < t2_i \] |
The MLE (maximum likelihood estimation) and LS (least squares) methods are used to estimate parameters for each distribution.
Under the MLE method:
- First calculate the likelihood function \(\mathcal{L}( \theta )\) for the distribution, which is the product of Probability Density Function for failure time data and the Survival Density Function for censored time data.
- Then maximize the logarithm of the likelihood function \(\ell ( \theta )\) by setting its partial derivatives equal to zero with respect to parameters.
- Use Newton-Raphson method to solve these parameters in step 2.
Under the LS method:
- First plot the failure times (or log failure times) against the transformed cumulative probabilities on a probability plot. Note that the Turnbull method is used to estimate the cumulative distribution function for arbitrarily censored data.
- Then fit a straight line by minimizing the sum of squared deviations
- Use the slope and scale from the fitted line to calculate parameter estimates
Contents
Distributions
Supported distributions include:
| Distribution | Parameters | Probability density function (PDF) | Cumulative distribution function (CDF) | Inverse cumulative distribution function |
|---|---|---|---|---|
| Normal | location: \(\mu\) scale: \(\sigma\) |
\[f(x)=\frac{1}{ \sqrt{2 \pi} \sigma } e^{ -\frac{( x - \mu )^2}{2 \sigma^2} }, \; \sigma > 0\] | \[P(x) = \Phi \left( \frac{x - \mu}{\sigma} \right)\] | \[x = \mu + \sigma \Phi^{-1}(p)\] |
| Lognormal | location: \(\mu\) scale: \(\sigma\) |
\(f(x)=\frac{1}{ \sqrt{2 \pi} \sigma x } e^{ -\frac{( \text{ln}(x) - \mu )^2}{2 \sigma^2} }, \)
\[ x>0 , \; \sigma > 0\] |
\[P(x) = \Phi \left( \frac{\text{ln}(x) - \mu}{\sigma} \right)\] | \[\text{ln}(x) = \mu + \sigma \Phi^{-1}(p)\] |
| Exponential | scale: \(\theta\) | \[f(x) = \frac{ 1 }{ \theta } e^{-\frac{x}{\theta}}, \; x > 0, \; \theta > 0\] | \[P(x) = 1- e^{ -\frac{x}{\theta} }\] | \[ x = \theta ( - \text{ln}(1-p) ) \] |
| Smallest Extreme Value | location: \(\alpha\) scale: \(\beta\) |
\(f(x) = \frac{1}{\beta } e^{ \frac{x- \alpha}{\beta}} e^{ - e^{ \frac{x- \alpha}{\beta} } }, \)
\[\beta > 0\] |
\[P(x) = 1- e^{-e^{\frac{x-\alpha}{\beta}}}\] | \[ x = \alpha + \beta \text{ln}( - \text{ln}(1-p) ) \] |
| Weibull | scale: \(a\) shape: \(b\) |
\(f(x) = \frac{b}{a} \left(\frac{ x }{ a }\right)^{b - 1} e^{ -(\frac{ x }{ a })^b }, \)
\[x > 0, \; a > 0, \; b > 0\] |
\[P(x) = 1 - e^{ -(\frac{ x }{ a })^b }\] | \[ \text{ln}(x) = \text{ln}(a) + \frac{1}{b} \text{ln}( - \text{ln}(1-p) ) \] |
| Logistic | location: \(\mu\) scale: \(\sigma\) |
\[f(x) = \frac{e^{\frac{x- \mu}{\sigma}}}{\sigma(1+e^{\frac{x- \mu}{\sigma}})^2 },\sigma > 0\] | \[P(x) = \frac{1}{1+e^{-\frac{x- \mu}{\sigma}}} \] | \[x =\mu+ \sigma\text{ln}(\frac{p}{1-p}) \] |
| Loglogistic | location: \(\mu\) scale: \(\sigma\) |
\[\begin{array}{l}f(x) = \frac{e^{\frac{\text{ln}(x)- \mu}{\sigma}}}{x\sigma(1+e^{\frac{\text{ln}(x)- \mu}{\sigma}})^2 } , \\ x>0, \sigma > 0\end{array}\] | \[P(x) = \frac{1}{1+e^{-\frac{\text{ln}(x)- \mu}{\sigma}}} \] | \[x =e^{\mu+ \sigma\text{ln}(\frac{p}{1-p})} \] |
where Φ is the CDF function for the standard normal distribution.
Hazard Function
The hazard function is calculated as follows:
\[ h(x) = \frac{f(x)}{1 - P(x)} \]
- where \(f(x)\) is the PDF function and \(P(x)\) is the CDF function.
Estimate Parameter Standard Errors
Once parameters are estimated by MLE method, the Fisher information matrix (FIM) can be calculated by Hessian matrix:
Covariance matrix of parameters can be expressed as:
where n is the number of points.
Parameter's standard error is the square root of the diagonal elements in the covariance matrix C.
| Distribution | Parameters | Lower Confidence Limit | Upper Confidence Limit |
|---|---|---|---|
| Normal, Lognormal, Smallest Extreme Value, Logistic, Loglogistic | location: \(\mu\) | \[ \hat{\mu} - Z_{\alpha}{SE_{\hat{\mu}}} \] | \[ \hat{\mu} + Z_{\alpha}{SE_{\hat{\mu}}} \] |
| scale: \(\sigma\) | \[ \frac{\hat{\sigma}}{\exp\!\left(\dfrac{Z_{\alpha}\,SE_{\hat{\sigma}}}{\hat{\sigma}}\right)} \] | \[ \hat{\sigma}{\exp\!\left( \, \frac{Z_{\alpha}SE_{\hat{\sigma}}}{\hat{\sigma}} \right)} \] | |
| Exponential | scale: \(\theta\) | \[\frac{ \hat{\theta} }{ \exp\!\left( \, \frac{Z_{\alpha}SE_{\hat{\theta}}}{\hat{\theta}} \right) }\] | \[ \hat{\theta}{\exp\!\left( \, \frac{Z_{\alpha}SE_{\hat{\theta}}}{\hat{\theta}} \right)}\] |
| Weibull | scale: \(a\) | \[ \frac{\hat{a}}{\exp\!\left(\dfrac{Z_{\alpha}\,SE_{\hat{a}}}{\hat{a}}\right)} \] | \[ \hat{a}{\exp\!\left( \, \frac{Z_{\alpha}SE_{\hat{a}}}{\hat{a}} \right)} \] |
| shape: \(b\) | \[ \frac{\hat{b}}{\exp\!\left(\dfrac{Z_{\alpha}\,SE_{\hat{b}}}{\hat{b}}\right)} \] | \[ \hat{b}{\exp\!\left( \, \frac{Z_{\alpha}SE_{\hat{b}}}{\hat{b}} \right)} \] |
where \( Z_\alpha = Z_{1-\alpha/2} = \Phi^{-1}(1 - \tfrac{\alpha}{2}) \)
Variance estimates for left and interval-censored data in an exponential model with fixed parameter may be unstable when based solely on observed information. To improve numerical stability, a weighted combination of observed and expected (Fisher) information contributions may be constructed.
Quantities of Distribution
Once parameters are estimated using either the MLE or LS method, quantities of interest can be calculated
- Mean
- Standard Deviation
- Q1, Median, Q3
- IQR = Q3 - Q1
Formulas to calculate the distribution Mean
| Distribution | Mean | Lower Confidence Level | Upper Confidence Level |
|---|---|---|---|
| Normal | \[\hat{\mu}\] | \[\hat{\mu} - Z_{\alpha}SE_{\hat{\mu}}\] | \[\hat{\mu} + Z_{\alpha}SE_{\hat{\mu}}\] |
| Lognormal | \[ \exp(\mu + \frac{1}{2}\sigma^2)\] | \[\hat{(Mean)} / {\exp}(Z_{\alpha}{SE_{\hat{(Mean)}}})\] | \[\hat{(Mean)}{\exp}(Z_{\alpha}{SE_{\hat{(Mean)}}}) \] |
| Exponential | \[\hat{\theta}\] | \[\hat{\theta} / \exp({\frac{Z_{\alpha} SE_{\hat{\theta}}}{\hat{\theta}}})\] | \[\hat{\theta} \exp({\frac{Z_{\alpha} SE_{\hat{\theta}}}{\hat{\theta}}})\] |
| Smallest Extreme Value | \[\hat{\mu} - EC\,\hat{\sigma}\] | \[\hat{(Mean)} - Z_{\alpha}SE_{\hat{(Mean)}}\] | \[\hat{(Mean)} + Z_{\alpha}SE_{\hat{(Mean)}}\] |
| Weibull | \[\hat{a}\,\Gamma\!\left(1+\tfrac{1}{\hat{b}}\right)\] | \[ \exp\!\left( \log(\hat{\text{Mean}}) - Z_{\alpha}\, SE_{\log(\hat{\text{Mean}})} \right) \] | \[ \exp\!\left( \log(\hat{\text{Mean}}) + Z_{\alpha}\, SE_{\log(\hat{\text{Mean}})} \right) \] |
| Logistic | \[\hat{\mu}\] | \[\hat{\mu} - Z_{\alpha}SE_{\hat{\mu}}\] | \[\hat{\mu} + Z_{\alpha}SE_{\hat{\mu}}\] |
| Loglogistic | \[ \exp(\hat{\mu})\Gamma(1+\hat{\sigma})\Gamma(1-\hat{\sigma}) \] | \[\hat{(Mean)} / {\exp}(Z_{\alpha}{SE_{\hat{(Mean)}}})\] | \[\hat{(Mean)}{\exp}(Z_{\alpha}{SE_{\hat{(Mean)}}}) \] |
where \(EC\) is Euler's constant and \( Z_\alpha = Z_{1-\alpha/2} = \Phi^{-1}(1 - \tfrac{\alpha}{2}) \)
Formulas to calculate the distribution Standard Deviation
| Distribution | Standard Deviation | Lower Confidence Level | Upper Confidence Level |
|---|---|---|---|
| Normal | \[\hat{\sigma}\] | \[\hat{\sigma}/\exp\!\left( \,\dfrac{Z_{\alpha}}{\hat{\sigma}SE_{\hat{\sigma}}} \right) \] | \[\hat{\sigma}\exp\!\left( \,\dfrac{Z_{\alpha}}{\hat{\sigma}SE_{\hat{\sigma}}} \right)\] |
| Lognormal | \[ \sqrt{ \exp\left( \hat{\mu} + \tfrac{1}{2}\hat{\sigma}^{2} \right) \left[ \exp\left( \hat{\sigma}^{2} \right) - 1 \right] } \] | \[\hat{(SD)} / {\exp}(Z_{\alpha}{SE_{\hat{(SD)}}})\] | \[\hat{(SD)}{\exp}(Z_{\alpha}{SE_{\hat{(SD)}}}) \] |
| Exponential | \[\hat{\theta}\] | \[\hat{\theta} / \exp({\frac{Z_{\alpha} SE_{\hat{\theta}}}{\hat{\theta}}})\] | \[\hat{\theta} \exp({\frac{Z_{\alpha} SE_{\hat{\theta}}}{\hat{\theta}}})\] |
| Smallest Extreme Value | \[\hat{\sigma}\sqrt{\tfrac{\pi^{2}}{6}}\] | \[\hat{(SD)}/\exp\!\left( \,\dfrac{Z_{\alpha}}{\hat{(SD)}SE_{\hat{(SD)}}} \right) \] | \[\hat{(SD)}\exp\!\left( \,\dfrac{Z_{\alpha}}{\hat{(SD)}SE_{\hat{(SD)}}} \right)\] |
| Weibull | \[\hat{a}\sqrt{\!\left[\Gamma\!\left(1+\tfrac{2}{\hat{b}}\right)-\Gamma^{2}\!\left(1+\tfrac{1}{\hat{b}}\right)\right]}\] | \[ \exp\!\Big(\log \hat{\sigma} - Z_{\alpha}\,SE_{\log \hat{\sigma}}\Big) \] | \[ \exp\!\Big(\log \hat{\sigma} + Z_{\alpha}\,SE_{\log \hat{\sigma}}\Big) \] |
| Logistic | \[\hat{\sigma}\sqrt{\tfrac{\pi^{2}}{3}}\] | \[\hat{(SD)}/\exp\!\left( \,\dfrac{Z_{\alpha}}{\hat{(SD)}SE_{\hat{(SD)}}} \right) \] | \[\hat{(SD)}\exp\!\left( \,\dfrac{Z_{\alpha}}{\hat{(SD)}SE_{\hat{(SD)}}} \right)\] |
| Loglogistic | \[\sqrt{\,\exp(2\hat{\mu})\Big[\Gamma(1+2\hat{\sigma})\,\Gamma(1-2\hat{\sigma}) - \Gamma^2(1+\hat{\sigma})\,\Gamma^2(1-\hat{\sigma})\Big]\,}\] | \[\hat{(SD)} / {\exp}(Z_{\alpha}{SE_{\hat{(SD)}}})\] | \[\hat{(SD)}{\exp}(Z_{\alpha}{SE_{\hat{(SD)}}}) \] |
where \( Z_\alpha = Z_{1-\alpha/2} = \Phi^{-1}(1 - \tfrac{\alpha}{2}) \)
Percentiles
Calculate the time at which a prespecified percent of the population fails.
| Distribution | Percentile (\( \hat{x_p} \)) | Variance \( \left( \text{Var} \left( \hat{x_p} \right ) \right )\) |
|---|---|---|
| Normal | \[ \hat{\mu} + Z_p\hat{\sigma} \] | \[ \text{Var}(\hat{\mu}) + 2Z_{p} \text{Cov}(\hat{\mu}, \hat{\sigma}) + Z_{p}^{2} \text{Var}(\hat{\sigma}) \] |
| Lognormal | \[ \exp(\hat{\mu} + Z_p\hat{\sigma}) \] | \[ {\hat{x_p}}^2(\text{Var}(\hat{\mu}) + 2Z_{p} \text{Cov}(\hat{\mu}, \hat{\sigma})) + Z_{p}^{2} \text{Var}(\hat{\sigma})) \] |
| Exponential | \[ -\ln(1-p)\hat{\theta}\] | \[ (-\ln(1-p))^2 \text{Var}(\hat{\theta}) \] |
| Smallest Extreme Value | \[ \hat{\mu} + Z_p\hat{\sigma} \] | \[ \text{Var}(\hat{\mu}) + 2z_p\, \text{Cov}(\hat{\mu},\hat{\sigma}) + z_p^2\, \text{Var}(\hat{\sigma}) \] |
| Weibull | \[ \hat{a} \,\big[-\ln(1-p)\big]^{1/\hat{b}} \] | \[ \left(\frac{\hat{x}_p^{2}}{\hat{a}^{2}}\right)\, \text{Var}(\hat{a}) \;-\; 2\left(\frac{Z_{p}\,\hat{x}_p^{2}}{\hat{a}\,\hat{b}^2}\right)\, \text{Cov}(\hat{a},\hat{b}) \;+\; \left(\frac{\hat{x}_p^{2}}{\hat{b}^{4}}\right) Z_{p}^{2}\, \text{Var}(\hat{b}) \] |
| Logistic | \[ \hat{\mu} + z_p\hat{\sigma}\ \] | \[ \text{Var} (\hat{\mu}) + 2z_p\, \text{Cov}(\hat{\mu},\hat{\sigma}) + z_p^2\, \text{Var}(\hat{\sigma}) \] |
| Loglogistic | \[ \exp\!\big(\hat{\mu} + Z_p\hat{\sigma}\big) \] | \[ \hat{x}_p^{\,2}\!\left[ \text{Var}(\hat{\mu}) + 2z_p\, \text{Cov}(\hat{\mu},\hat{\sigma}) + z_p^2\, \text{Var}(\hat{\sigma}) \right] \] |
For Normal, Smallest Extreme Value, and Logistic distributions:
Lower confidence limit: \( \hat{x}_p - Z_{\alpha}\sqrt{ \text{Var}(\hat{x}_p)} \)
Upper confidence limit: \( \hat{x}_p + Z_{\alpha}\sqrt{ \text{Var}(\hat{x}_p)} \)
For Lognormal, Exponential, Weibull, Loglogistic distributions:
Lower confidence limit: \( \exp\!\left( \ln(\hat{x}_p) \;-\; Z_{\alpha}\,\frac{\sqrt{\, \text{Var}(\hat{x}_p)\,}}{\hat{x}_p} \right) \)
Upper confidence limit: \( \exp\!\left( \ln(\hat{x}_p) \;+\; Z_{\alpha}\,\frac{\sqrt{\, \text{Var}(\hat{x}_p)\,}}{\hat{x}_p} \right) \)
Survival Probabilities
Calculate the percent of the population that has not yet failed at a pre-specified time point.
| Distribution | Survival Probability (\( \hat{z} \)) | Variance of Survival Probability (\( Var(\hat{z}) \)) |
|---|---|---|
| Normal | \[ \frac{x - \hat{\mu}}{\hat{\sigma}} \] | \[ \frac{Var(\hat{\mu}) + 2\hat{z} Cov(\hat{\mu}, \hat{\sigma}) + \hat{z}^{2} Var(\hat{\sigma})}{\sigma^2} \] |
| Lognormal | \[ \frac{ln(x) - \hat{\mu}}{\hat{\sigma}} \] | \[ \frac{Var(\hat{\mu}) + 2\hat{z} Cov(\hat{\mu}, \hat{\sigma}) + \hat{z}^{2} Var(\hat{\sigma})}{\sigma^2} \] |
| Exponential | \[ ln(x) - ln(\hat{\theta}) \] | \[ \frac{Var(\hat{\theta})}{\hat{\theta}^2} \] |
| Smallest Extreme Value | \[ \frac{x - \hat{\mu}}{\hat{\sigma}} \] | \[ \frac{Var(\hat{\mu}) + 2\hat{z} Cov(\hat{\mu}, \hat{\sigma}) + \hat{z}^{2} Var(\hat{\sigma})}{\sigma^2} \] |
| Weibull | \[ \hat{b}\,(\ln x - \ln a) \] | \[ \frac{\hat{z}^{2}\,Var(\hat{b})}{\hat{b}^{2}} \;-\; 2\,\frac{\hat{z}\,Cov(\hat{a},\hat{b})}{\hat{a}} \;+\; \frac{\hat{b}^{2}\,Var(\hat{a})}{\hat{a}^{2}} \] |
| Logistic | \[ \frac{x - \hat{\mu}}{\hat{\sigma}} \] | \[ \frac{Var(\hat{\mu}) + 2\hat{z} Cov(\hat{\mu}, \hat{\sigma}) + \hat{z}^{2} Var(\hat{\sigma})}{\sigma^2} \] |
| Loglogistic | \[ \frac{ln(x) - \hat{\mu}}{\hat{\sigma}} \] | \[\frac{Var(\hat{\mu}) + 2\hat{z} Cov(\hat{\mu}, \hat{\sigma}) + \hat{z}^{2} Var(\hat{\sigma})}{\sigma^2} \] |
Lower bound = \( \hat{z} - Z_{\alpha}\sqrt{Var(\hat{z})} \)
Upper bound = \( \hat{z} + Z_{\alpha}\sqrt{Var(\hat{z})} \)
For Normal and Lognormal distributions:
Lower confidence limit: \( 1 - \frac{1}{2}\left(1 + \mathrm{erf}\left(\frac{UB}{\sqrt{2}}\right)\right) \)
Upper confidence limit: \( 1 - \frac{1}{2}\left(1 + \mathrm{erf}\left(\frac{LB}{\sqrt{2}}\right)\right) \)
For Exponential, Smallest Extreme Value, and Weibull distributions:
Lower confidence limit: \( \exp(-\exp(UB)) \)
Upper confidence limit: \( \exp(-\exp(LB)) \)
For Logistic and Loglogistic distributions:
Lower confidence limit: \( \frac{1}{1 + \exp(UB)} \)
Upper confidence limit: \( \frac{1}{1 + \exp(LB)} \)
Probability Plot
Points in the probability plot are sorted in ascending order. For arbitrarily censored data, the cumulative distribution function is estimated using the Turnbull method, and the resulting probabilities are used as plotting positions.
The middle line is the expected percentiles for given probabilities in terms of the inverse cumulative distribution function using parameters from the MLE or LS method.
To estimate confidence limits of percentiles, variance of percentiles is calculated by propagation of error:
where xp is the percentile for a given probability p, and (.)T denotes the transpose of a vector.
- Confidence limits of percentiles can be calculated as below:
\[x_{pL} = x_p - Z_{\alpha} \sigma_{x_p}\]
\[x_{pU} = x_p + Z_{\alpha} \sigma_{x_p}\]
where \(Z_{\alpha} = \Phi^{-1}(1- \alpha / 2)\) .
- For some positive random variables, e.g. in Lognormal, Exponential, Weibull, and Loglogistic distributions, confidence limits are:
\[x_{pL} = e^{ \text{ln}(x_p) - Z_{\alpha} \frac{\sigma_{x_p}}{x_p} }\]
\[x_{pU} = e^{ \text{ln}(x_p) + Z_{\alpha} \frac{\sigma_{x_p}}{x_p} }\]
The fitted line is constructed as follows for each distribution:
| Distribution | x-axis | y-axis |
|---|---|---|
| Normal | \[ t_p \] | \[ \phi^{-1}(p) \] |
| Lognormal | \[ ln(t_p) \] | \[ \phi^{-1}(p) \] |
| Exponential | \[ ln(t_p) \] | \[ ln(-ln(1-p)) \] |
| Smallest Extreme Value | \[ t_p \] | \[ ln(-ln(1-p)) \] |
| Weibull | \[ ln(t_p) \] | \[ ln(-ln(1-p)) \] |
| Logistic | \[ t_p \] | \[ ln(\frac{p}{1-p}) \] |
| Loglogistic | \[ ln(t_p) \] | \[ ln(\frac{p}{1-p}) \] |
where Φ is the CDF function for the standard normal distribution.
The probability plot is another tool that can be used to assess fit. The probability plot shows an approximately straight line if the assumed distribution is a good fit.
Goodness of Fit
Anderson-Darling Test
The hypotheses for the Anderson-Darling test are:
- H0: Input data follows the distribution.
- H1: Input data doesn't follow the distribution.
- Anderson-Darling Statistic
- Input data (or interval endpoints for censored observations) are sorted first. For arbitrarily censored data, the empirical cumulative distribution function is estimated using the Turnbull method. For each observation, the fitted cumulative distribution function is evaluated as: \( Z_i = F(x_i;\hat{\theta}) \) where \( F(\cdot) \) is the assumed distribution with parameters estimated by the MLE or LS method.
The Anderson-Darling statistic A2 is calculated as:
- \( \text{PartC}_i = F_n(Z_{i-1})^2 \, \ln \! \left(\dfrac{Z_i(1 - Z_{i-1})}{Z_{i-1}(1 - Z_i)}\right) \), \(Z_0 = 0, \, F_n(Z_0) = 0, \, \ln(Z_0) = 0, Z_{n+1} = 1 - 10^{-12}\).
\( A^2 = n \sum_{i=1}^{n+1} \Bigl[ \text{PartA}_i + \text{PartB}_i + \text{PartC}_i \Bigr]\),
where \( \text{PartA}_i = (Z_{i-1} - Z_i) \; + \; \ln\! \left(\frac{1 - Z_{i-1}}{1 - Z_i} \right) \), \( \text{PartB}_i = 2 F_n(Z_{i-1}) \, \ln \! \left(\dfrac{1 - Z_i}{1 - Z_{i-1}} \right) \),
When there is no censoring in maximum likelihood estimation method,
\[ A^2 = -n -(1/n) \sum_{i=1}^n \Bigl[ (2i-1) \text{ln} (Z_i) + ( 2n+1-i ) \text{ln} (1 - Z_i) \Bigr]\]
A2 value can be used to assess the fit. The smaller the value is, the better the model will be.
Pearson Correlation Coefficient
The Pearson correlation coefficient is calculated under the Least Squares estimation method. It measures how well the straight line fits the input data. A high value indicates the least squares regression lines explains most of the variation in Y through X.
Reference
- R.A. Lockhart and M.A. Stephens (1994). "Estimation and Tests of Fit for the Three-parameter Weibull Distribution". Journal of the Royal Statistical Society, Vol. 56, No. 3, pp. 491-500.
- Ralph B. D'Agostino, Michael A. Stephens (Eds.) (1986). Goodness-of-Fit Techniques. New York: Marcel Dekker
- Hartigan J A (1975). Clustering Algorithms. Wiley
- W. Stute, W. G. Manteiga, and M. P. Quindimil (1993). “Bootstrap based goodness-of-fit-tests”. Metrika, 40.1: pp. 243-256.
- W. Q. Meeker, L. A. Escobar, and F. G. Pascual (2021). Statistical Methods for Reliability Data, 2nd Edition. New York: John Wiley & Sons.
- B. W. Turnbull (1976). "The Empirical Distribution Function with Arbitrarily Grouped, Censored and Truncated Data". Journal of the Royal Statistical Society, Vol. 38, pp. 290-295.