An outlier is statistically an observation which is numerically distant from the rest of the data. Origin provides methods and tools to help finding and testing for outliers.
To determine whether there is an outlier in a data set from repeated measurements, tools for Grubbs test and Dixon's Q test are available, and it's also possible to roughly visualize the outlier using the Q-Q plot.
To detect an outlier from regression, you may use the standardized residuals.
Once you’ve statistically determined if a point is an outlier, you can then mask the point using the Regional Mask Tool button,
, on the Tools toolbar.
For a series of repeated measured data listed in a column, in order to detect if there is an outlier or not with Grubbs Test:
or
The result will be output to both Result Log and Command Window, a report sheet will also be generated and if Outlier Plot is selected, a worksheet with plot data will also be generated:
| ox | The value of the suspected point |
|---|---|
| index | Row index of suspected point |
| gstat | The calculated g value from suspected point |
| critical | The critical g value at the specified significance level |
| pval | The p value for the test |
| sig | sig=1 means there is an outlier, sig=0 means there is no outlier |
| conclusion | A statement of conclusion indicating the statistical result. |
| rd | The worksheet range to put the plot data for outlier plot, if the Outlier Plot option is selected. |
| rt | The worksheet range to put the report table. |
For a series of repeated measured data listed in a column (Sample size from 3 to 10), in order to detect if there is an outlier or not with Dixon's Q Test:
or
The result will be output to both Result Log and Command Window:
| ox | The value of the suspected point |
|---|---|
| index | Row index of suspected point |
| qstat | The calculated Q value from suspected point |
| critical | The critical Q value at the specified significance level |
| sig | sig=1 means there is an outlier, sig=0 means there is no outlier |
| conclusion | A statement of conclusion indicating the statistical result. |
| rd | The worksheet range to put the plot data for outlier plot, if the Outlier Plot option is selected.. |
| rt | The worksheet range to put the report table. |
You can perform a regression (Linear, Polynomial or Nonlinear Curve Fitting), and then use the standardized residuals to determine which data points are outliers.
The following short tutorial will show you how to make use of residual plot to detect outlier: