The probability plot is used to test whether a dataset follows a given distribution. It shows a graph with an observed cumulative percentage on the X axis and an expected cumulative percentage on the Y axis. If all the scatter points are close to the reference line, we can say that the dataset follows the given distribution.
A Q-Q (Quantile-Quantile) plot is another graphic method for testing whether a dataset follows a given distribution. It differs from the probability plot in that it shows observed and expected values instead of percentages on the X and Y axes. If all the scatter points are close to the reference line, we can say that the dataset follows the given distribution.
Origin supports five given distributions (Normal, Lognormal, Exponential, Weibull and Gamma), and five methods for plotting percentile approximations (Blom, Benard, Hazen, Van der Waerden, and Kaplan-Meier).
To create a probability plot or Q-Q plot:
As you can see, in this example,

|
Input Data |
Specify the input data. You can select multiple columns as inout variables. |
|---|---|
|
Group |
Specify the grouping column(s) in order to seperate the input variable(s) into multiple different plots. |
|
Graph Arrangement |
The controls under this brach will help you arrange the multiple input variables and groups, split the graph into multiple panels and pages.
|
|
Share X Scales |
Specify whether share X scales for all layers on same graph. This option is only available when Separate Layers on Same Graph selected in Multiple Data and Multiple Groups or grouping columns selected in Split Panels by box. |
|
Share Y Scales |
Specify whether share Y scales for all layers on same graph. This option is only available for Q-Q plot when Separate Layers on Same Graph selected in Multiple Data and Multiple Groups or grouping columns selected in Split Panels by box. |
|
Distribution |
Select a distribution type for your data. For more information about distributions, please refer to Distributions section.
|
|
Score Method |
Select a method for plotting percentile approximations. For more information about methods, please refer to Score Methods section.
|
|
Confidence Band |
Specify whether to output the confidence band in probability plot. For computation details, see Algorithms. |
|
Confidence Level(%) |
Only available when Confidence Band is selected. Specify the confidence level in percentage for the chosen distribution. |
| Exchange X-Y Axes |
Specify whether to switch X and Y axis positions. |
| X Minimum X Maximum |
By default, Auto boxes are checked, which means the minimum and maximum X values of the Reference Line columns in the result sheet "PlotData#" will be used to created the distribution curves. Uncheck the Auto check box, you can enter the minimum and/or maximum of the X values for all distribution reference lines. All refence lines will be plotted within this X range. |
|
Output Range |
This determines where the calculated data for the graph is stored. |
|
Output Graphs |
This determines where the result graphs are stored. |
Origin includes four distributions for Probability and Q-Q plots. The following table lists their density functions:
| Distribution | Density Function p(x) | Range | Parameters |
|---|---|---|---|
|
Normal |
|
all
|
|
|
Lognormal |
|
|
|
|
Exponential |
|
|
is the scale parameter.
|
|
Weibull |
|
|
|
|
Gamma |
|
|
|
To construct a probability plot, sort the observed dataset from smallest to largest:
,
is the total number of the observed dataset.The sorted observed values are represented on the plot by points whose X-coordinates are
and whose Y-coordinates are calculated using the Score Method.

Scale types of probability plot are different according to the distributions
| Distribution | X Scale Type | Y Scale Type |
|---|---|---|
|
Normal |
Linear |
Probability |
|
Lognormal |
Ln |
Probability |
|
Exponential |
Ln |
Double Log Reciprocal |
|
Weibull |
Log10 |
Double Log Reciprocal |
|
Gamma |
Log10 |
Probability |
To construct a Q-Q plot,sort the observed dataset from smallest to largest:
, where
is the total number of observed values.The Y values are the inverse cumulative distribution functions of the score method used.

Input data is ordered from smallest to largest, and then the serial number of the sorted data is scored using one of the methods listed below. In this table,
is the serial number and
is the total number of the nonmissing input data.
| Methods | Plotting Position
|
|---|---|
|
Blom |
|
|
Benard |
|
|
Hazen |
|
|
Van der Waerden |
|
|
Kaplan-Meier |
|