Reversed inverse regression for the univariate linear calibration and its statistical properties derived using a new methodology

Pilsang Kang; Changhoi Koo; Hokyu Roh

doi:10.1051/ijmqe/2017021

All issues

Volume 8 (2017)

Int. J. Metrol. Qual. Eng., 8 (2017) 28

Full HTML

Open Access

Issue		Int. J. Metrol. Qual. Eng. Volume 8, 2017


Article Number		28
Number of page(s)		10
DOI		https://doi.org/10.1051/ijmqe/2017021
Published online		27 November 2017

Int. J. Metrol. Qual. Eng. 8, 28 (2017)

Research Article

Reversed inverse regression for the univariate linear calibration and its statistical properties derived using a new methodology

Pilsang Kang^*, Changhoi Koo and Hokyu Roh

Quality Management Center, KEPCO NF, 242, Daedeok-daero 989 beon-gil Daejeon 34057, Korea

^* pskang@knfc.co.kr

Received: 12 March 2017
Accepted: 28 September 2017

Abstract

Since simple linear regression theory was established at the beginning of the 1900s, it has been used in a variety of fields. Unfortunately, it cannot be used directly for calibration. In practical calibrations, the observed measurements (the inputs) are subject to errors, and hence they vary, thus violating the assumption that the inputs are fixed. Therefore, in the case of calibration, the regression line fitted using the method of least squares is not consistent with the statistical properties of simple linear regression as already established based on this assumption. To resolve this problem, “classical regression” and “inverse regression” have been proposed. However, they do not completely resolve the problem. As a fundamental solution, we introduce “reversed inverse regression” along with a new methodology for deriving its statistical properties. In this study, the statistical properties of this regression are derived using the “error propagation rule” and the “method of simultaneous error equations” and are compared with those of the existing regression approaches. The accuracy of the statistical properties thus derived is investigated in a simulation study. We conclude that the newly proposed regression and methodology constitute the complete regression approach for univariate linear calibrations.

Key words: bias / classical regression / error propagation / mean-data-point-based variance / population-regression-line-based variance / reversed inverse regression / simultaneous error equations / Taylor approximation

© P. Kang et al., published by EDP Sciences, 2017

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

1 Introduction

Simple linear regression is a model with a single independent variable in which a regression line is fitted through n data points such that the sum of squared errors (SSE), i.e., the vertical distances between the data points and the fitted line, is as small as possible. The statistical properties of this model have been established as theorems and are presented in many statistics textbooks, e.g., the textbook written by Walpole and Myers [1]. In this model, a regression line of y on x is fitted based on the assumption that x is fixed but y varies according to a normal distribution. This model is called “basic regression” throughout the remainder of this study. Unfortunately, when calibrating an instrument such as a chemical analyzer using basic regression, a problem arises. In practical calibrations, the observed measurements (the x values) are subject to errors, and hence they vary, thus violating the assumption of fixed inputs. As a result, in the case of calibration, the regression line fitted using the method of least squares is not consistent with the statistical properties of basic regression as already established based on this assumption.

Two approaches have been considered as possible solutions for this problem. In the first approach [2], called classical regression, the “standards” (the x values) are treated as the inputs, and the observed measurements (the y values) are treated as the response; these values are used to fit a regression line of y on x. This regression approach is consistent with the assumption that x is fixed. The problem with this approach is that estimating the x value for a new observed measurement involves the reciprocal of the estimated slope. Williams [3] demonstrated that the reciprocal of the slope has an infinite variance, which indicates that classical regression has an infinite variance and, hence, an infinite mean squared error. Nevertheless, Parker et al. [4] obtained an asymptotic approximation of the variance of the prediction interval using a formula derived by Casella and Berger [5] using the Delta Method. However, Parker et al.'s approach still has limitations. Even if we rely on this approximation, we cannot determine a prediction interval with a given confidence level because the approximation cannot be used to express the prediction interval as a t_n₋₂ distribution.

In the second approach [6], called inverse regression, the standards (the x values) are treated as the response, the observed measurements (the y values) are treated as the inputs, and these values are used to fit a regression line of x on y. This regression approach is inconsistent with the assumption that the inputs are fixed. Shukla and Datta [7] and Oman [8] derived expressions for the mean and mean squared error of predicted x value based on multiple measurements taken during the prediction stage of the calibration process. Fuller [9] made a similar suggestion regarding the derivation of both the predicted x value and the prediction interval. Fuller's approach requires that the variance of the observed measurements is known. In his approach, it is necessary to measure a standard multiple times independently to estimate the variance. Parker et al. [4] derived the bias in prediction using a formula established by Pham-Gia et al. [10] with the aid of the Delta Method. Parker et al. [4] also showed through several simulation studies that inverse regression is preferable to classical regression in terms of bias and mean squared error. However, to derive the statistical properties of inverse regression, Parker et al. were obliged to borrow their estimate for the variance of the slope from “reversed basic regression” because of technical difficulties, which devalues their approach. (Reversed basic regression is basic regression in which the roles of x and y have merely been reversed.)

As a fundamental solution for the calibration problem, which has not yet been resolved completely, the current study introduces “reversed inverse regression” along with a new methodology for deriving its statistical properties. (Simply put, “fundamental solution for the univariate linear calibration problem” = “reversed inverse regression” + “new methodology for deriving the statistical properties of the regression”.) In the proposed regression approach, the observed measurements (the x values) are treated as the inputs, and the standards (the y values) are treated as the response; these values are used to fit a regression line of y on x. The statistical properties of this regression are derived using the “error propagation rule” and the “method of simultaneous error equations”. In this regression approach, it is not necessary to measure any standards multiple times independently. We present an example of practical calibration. Each of three types of regression (i.e., classical regression, inverse regression and reversed inverse regression) is applied to the calibration example, and the corresponding calibration results, including the subsequently calculated estimates for the variance of the prediction interval, are compared. In addition, the accuracy of the statistical properties derived using the new methodology is investigated in a Monte Carlo simulation study.

2 Regression and methodology

If the roles of x and y are reversed, then inverse regression becomes reversed inverse regression. Reversed inverse regression is more convenient to use for calibration than inverse regression because the reversed roles are consistent with the convention that the variable x represents the inputs, whereas the variable y represents the response. This regression approach also violates the assumption that the inputs are fixed. It is modeled as follows. (It may be desirable to use some other term than “reversed inverse regression”, e.g., “pseudo-basic regression”, to eliminate potential confusion in terminology.)

–
There is a linear relationship between x and y.
–
The observed measurements (the x values) are treated as the inputs, the standards (the y values) are treated as the response, and these values are used to fit a regression line of y on x.
–
For the fitting of the regression line, n data points of the form (x_i, y_i) (i = 1, …, n) are used. The x_i value varies according to a normal distribution, whereas the y_i value is fixed; y_i = α + βx_i + ε_i, ε_i ∼ N(0, σ²).
–
The x_i's (i.e., x₁, …, x_n) are treated as variables. The variables x_i and x_j (i ≠ j) are independent of each other: cov[x_i, x_j] = 0, i ≠ j.
–
The regression line $\hat{y} = \hat{α} + \hat{β} x$ is fitted such that SSE is minimized.
- •
  $S S E = \sum {(y_{i} - \hat{α} - \hat{β} x_{i})}^{2}$ , $\hat{β} = S_{x y} / S_{x x}$ , $S_{x x} = \sum {(x_{i} - \bar{x})}^{2}$ , $S_{x y} = \sum (x_{i} - \bar{x}) (y_{i} - \bar{y})$
–
The variance of x_i is uniform for all i (i = 1, …, n). In other words, the variance of the observed measurements is equal over the entire calibration range of interest.
- •
  $σ_{x i}^{2}$ denotes the variance of the variable x_i; $σ_{x 1}^{2} = \dots = σ_{x n}^{2} (= σ_{x}^{2})$ .
–
The population regression line y = α + βx is defined as follows:
- •
  $β = \sum (x_{i 0} - {\bar{x}}_{0}) (y_{i} - \bar{y}) / \sum {(x_{i 0} - {\bar{x}}_{0})}^{2}$ , $α = \bar{y} - β {\bar{x}}_{0}$ , and $σ_{x}^{2} β^{2} = σ^{2}$ .
- •
  $\bar{x} = (\sum x_{i}) / n$ , $\bar{y} = (\sum y_{i}) / n$ , x_i₀ is the mean of x_i, and ${\bar{x}}_{0} = (\sum x_{i 0}) / n$ .
- •
  All points (x_i₀, y_i) (i = 1, …, n) lie on the population regression line. In this study, we call these points the “mean data points”.

(∑ denotes summation from i = 1 to n throughout this study.)

In reversed inverse regression, the assumption that the observed measurements (the x values), despite being the inputs, vary according to normal distributions is very important. Suppose that the regression line fitting is repeated an infinite number of times using a “new set of n different standards (or reference solutions)” each time. Here, this “new set of n different standards” refers to newly prepared standards whose nominal y values (or target y values) and confidence levels are identical to those of the previous set of standards. In this case, the x_i's (i.e., x₁, …, x_n) will be observed to vary according to normal distributions. The standards are subject to errors that may arise when preparing or manufacturing them. However, such errors will appear as variations in the x_i's after being combined with random measurement errors. If the “same set of n different standards” is measured repeatedly, we will only observe the variance associated with the random measurement errors; the errors of the standards themselves will not be reflected. Such a variance should not be treated as the variance needed to derive the statistical properties of linear regression. In this respect, Fuller [9] is incorrect, because his approach requires a standard to be independently measured multiple times to estimate the variance. As previously mentioned, reversed inverse regression does not require any such separate prior measurements.

The slope of the regression line that is fitted on the basis of reversed inverse regression is: $\hat{β} = S_{x y} / S_{x x} = \sum (x_{i} - \bar{x}) (y_{i} - \bar{y}) / \sum {(x_{i} - \bar{x})}^{2} .$

Unfortunately, it is technically difficult to derive the variance of the slope directly from the definition of the variance, i.e., var[f(x₁, …, x_n)] = E[{f(x₁, …, x_n)−E[f(x₁, …, x_n)]}²], because $\hat{β}$ is a fractional expression that contains “∑(x_i − $\bar{x}$ )²” in the denominator and the x_i's vary rather than being fixed. Because of this difficulty, we directly treat the x_i's as variables and derive the variance of the slope based on the first-order Taylor approximation as follows: $\begin{array}{l} f (x_{1}, ..., x_{n}) = f (x_{10}, ..., x_{n 0}) + \sum (x_{i} - x_{i 0}) {[\partial f / \partial x_{i}]}^{*} + Remainder, \\ var [f (x_{1}, ..., x_{n})] = E [{f (x_{1}, ..., x_{n}) - E [f (x_{1}, ..., x_{n})]}^{2}] \\ \approx E [{f (x_{1}, ..., x_{n}) - f (x_{10}, ..., x_{n 0})}^{2}] \\ \approx {\sum {(\partial f / \partial x_{i})}^{2} {σ_{x i}}^{2} + 2 \sum \sum (\partial f / \partial x_{i}) (\partial f / \partial x_{j}) cov [x_{i}, x_{j}]}^{*} . \\ Note : E [{f (x_{1}, ..., x_{n}) - f (x_{10}, ..., x_{n 0})}^{2}] \\ = E [{f (x_{1}, ..., x_{n}) - E [f (x_{1}, ..., x_{n})]}^{2}] + {bias in f (x_{1}, ..., x_{n})}^{2}, \end{array}$ where the notation [ ]^* or { }^* indicates that the value of the function contained within the bracket is determined using the mean values of the variables, i.e., x₁₀, …, x_n₀ [11]. Even in the case of derivation of expectations, this notation is often used for the same purpose. In particular, we define the expectation E[{f(x₁, ..., x_n) − f(x₁₀, …, x_n₀)}²] as the “mean-data-point-based variance”. The approximation method for deriving the variance described herein is commonly referred to as the “error propagation rule”, and only the first-order partial derivatives are included in its derivation. To derive the variance of the slope, var $\hat{[β]}$ , after the partial differentiation of $\hat{β}$ with respect to the x_i's, the variances of the x_i's, including the covariances of x_i and x_j (j > i), are combined in accordance with the error propagation rule. The final result obtained from this combination process is the approximate variance of the slope. The same method can be used to derive the variance of the intercept and the variance of the predicted y value. All other statistical properties of reversed inverse regression, such as the expectation and bias of the slope and the expectation of the mean squared error, are derived by utilizing another special method, called the “method of simultaneous error equations” in this study, in combination with the error propagation rule. When we need to derive another statistical property from the primary expressions already obtained using the error propagation rule, the first-order Taylor approximation is mainly used. Error terms of orders higher than (σ_x/A)² are discarded during or after the approximation calculations. For example, (σ_x/A)⁴ (=1/10⁸) is very small and can be neglected in comparison with (σ_x/A)² (=1/10⁴).

The Delta Method is also an asymptotic approximation method based on Taylor approximation [12]. Parker et al. [4] used the Delta Method to derive the variance of the prediction interval for classical regression. When the Delta Method is applied to the inverted equation x = − ${\hat{α}}^{'}$ / ${\hat{β}}^{'}$ + (1/ ${\hat{β}}^{'}$ )y, the x_i's and y_i's are not directly treated as variables. Instead, U (= $- {\hat{α}}^{'}$ + y₀ − ϵ₀) and V ( $= {\hat{β}}^{'}$ ) are treated as the variables [4,5,10]. This is the most notable difference between the Delta Method and the approximation method used in this study.

3 Statistical properties of reversed inverse regression

The variance and bias of the slope and the expectation of the mean squared error are the statistical properties that are primarily required in linear regression because other properties, such as the variance and bias of the intercept and the variance of the prediction interval, depend on them. Therefore, the variance of the slope, var[ $\hat{β}$ ], is first derived using the error propagation rule as follows (see supplementary material): $var [\hat{β}] = {σ (\hat{β})}^{2} \approx \sum {(\partial \hat{β} / \partial x_{i})}^{2} σ {x_{i}}^{2} = {[S_{y y} / S_{x y}^{2}]}^{*} σ^{2} .$ (1)

To investigate the accuracy of the variance obtained using equation (1), we should consider two factors. One is that error terms of orders higher than $σ_{x}^{2}$ are not included in the derivation. The other is that because equation (1) represents the population-regression-line-based variance, the bias in $\hat{β}$ is not reflected in the calculation of [S_yy/ $S_{x y}^{2}$ ]^*σ² (=[S_yy/ $S_{x y}^{2}$ ]^* $σ_{x}^{2}$ β²). The bias in $\hat{β}$ depends on $σ_{x}^{2}$ and n. The details of the effects of these two factors are explained based on the simulation results in Section 5. For reference, the variance of $\hat{β}$ for basic regression is [1/S_xx]^*σ², and this variance is not an approximation but an exact expression. The relationship between the estimates of var $\hat{[β]}$ _{reversed inverse} and var $\hat{[β]}$ _basic for a given set of data points is as follows: $Estimate for var {[\hat{β}]}_{revrsed inverse} = {1 / r^{2} (x, y)} (Estimate for var {[\hat{β}]}_{basic}),$ where r(x, y) is the estimated correlation coefficient between x and y, i.e., r(x, y) = S_xy/(S_xxS_yy)^1/2, and r²(x, y) is typically very close to 1 in linear calibrations.

The variance of the intercept, var $[\hat{α}]$ , is also derived using the error propagation rule as follows (see supplementary material): $\begin{array}{l} \hat{α} = (\sum y_{i}) / n - {\sum (x_{i} - \bar{x}) (y_{i} - \bar{y}) / \sum {(x_{i} - \bar{x})}^{2}} (\sum x_{i}) / n, \\ var [\hat{α}] = {σ (\hat{α})}^{2} \approx \sum {(\partial \hat{α} / \partial x_{i})}^{2} σ_{x i}^{2} = {[1 / n + {\bar{x}}^{2} (S_{y y} / S_{x y}^{2})]}^{*} σ^{2} . \end{array}$ (2)

Separately from the previous derivation process, another equation for deriving var[ $\hat{α}$ ] can be obtained by applying the error propagation rule to $\hat{α}$ = $\bar{y}$ − $\hat{β} \bar{x}$ : $\begin{array}{l} var [\hat{α}] = {σ (\hat{α})}^{2} \approx {(\partial \hat{α} / \partial \hat{β})}^{2} {σ (\hat{β})}^{2} + {(\partial \hat{α} / \partial \bar{x})}^{2} {σ (\bar{x})}^{2} + 2 (\partial \hat{α} / \partial \hat{β}) (\partial \hat{α} / \partial \bar{x}) σ (\hat{β}) σ (\bar{x}) r (\hat{β}, \bar{x}) \\ \approx [1 / n + {\bar{x}}^{2} (S_{y y} / S_{x y}^{2})] σ^{2} + 2 (- x) (- \hat{β}) {(S_{y y} / S_{x y}^{2}) σ_{x} {\hat{β}}^{2}}^{1 / 2} (σ_{x}^{2} / n)^{1 / 2} r (\hat{β}, \bar{x}) . \end{array}$ (3)

From equations (2) and (3), we can see that r $\hat{(β}$ , $\bar{x)}$ ≈ 0, and hence, $\hat{β}$ and $\bar{x}$ are nearly independent of each other. In equation (2), var[ $\hat{α}$ ] is derived by treating $\hat{α}$ as a function of x_i's (i = 1, …, n), whereas in equation (3), var $\hat{[α]}$ is derived by treating $\hat{α}$ as a function of $\hat{β}$ and $\bar{x}$ . In this way, by formulating two separate equations to obtain the variance of a statistic using the error propagation rule, we can derive the covariance or correlation coefficient between any two statistics. This method is called the “method of simultaneous error equations” in this study. Nearly all of the covariances (or correlation coefficients) in a linear regression problem can be derived using this method. In addition, the derived covariances can be further used to derive other statistical properties. However, we should note that the covariances thus derived are typically approximations, not exact expressions.

A predicted y value is the y value of a point (x, y) on the fitted regression line and is determined by substituting x into $\hat{y}$ = $\hat{α}$ + $\hat{β}$ x. The variance of such a predicted y value, var $\hat{[y]}$ , is derived using the error propagation rule as follows: $\begin{array}{l} \hat{y} = \bar{y} - {\sum (x_{i} - \bar{x}) y_{i} / \sum {(x_{i} - \bar{x})}^{2}} {\sum x_{i} / n} + {\sum (x_{i} - \bar{x}) y_{i} / \sum {(x_{i} - \bar{x})}^{2}} x, \\ var [\hat{y}] = {σ (\hat{y})}^{2} \approx \sum {(\partial \hat{y} / \partial x_{i})}^{2} σ_{x i}^{2} = {[1 / n + {(x - \bar{x})}^{2} (S_{y y} / S_{x y}^{2})]}^{*} σ^{2} . \end{array}$ (4)

Separately from equation (4), another equation for deriving var $\hat{[y]}$ $\hat{(α}$ can be obtained by applying the error propagation rule to $\hat{y}$ = $\hat{α}$ + $\hat{β}$ x: $\begin{array}{l} var [\hat{y}] \approx {(\partial \hat{y} / \partial \hat{α})}^{2} {σ (\hat{α})}^{2} + {(\partial \hat{y} / \partial \hat{β})}^{2} {σ (\hat{β})}^{2} + 2 (\partial \hat{y} / \partial \hat{α}) (\partial \hat{y} / \partial \hat{β}) {σ (\hat{α})} {σ (\hat{β})} r (\hat{α}, \hat{β}) \\ \approx [1 / n + {\bar{x}}^{2} (S_{y y} / S_{x y}^{2}) {\hat{β}}^{2} σ_{x}^{2} + x^{2} (S_{y y} / S_{x y}^{2})] {\hat{β}}^{2} σ_{x}^{2} \\ + 2 x {(S_{y y} / S_{x y}^{2})}^{1 / 2} \hat{β} σ_{x} {[1 / n + {(\bar{x})}^{2} (S_{y y} / S_{x y}^{2})]}^{1 / 2} \hat{β} σ_{x} r (\hat{α}, \hat{β}) . \end{array}$ (5)

From equations (4) and (5), the correlation coefficient r $\hat{(α,}$ $\hat{β)}$ can be determined as follows: $r (\hat{α}, \hat{β}) \approx - \bar{x} S_{y y}^{1 / 2} / S_{x y} {[1 / n + {\bar{x}}^{2} (S_{y y} / S_{x y}^{2})]}^{1 / 2} .$

As the next step, we derive the expectations of $\hat{β}$ and $\hat{α}$ , and the biases in $\hat{β}$ , $\hat{α}$ and $\hat{y}$ . For this purpose, the following statistical properties are derived in advance using the method of simultaneous error equations (see supplementary material): $\begin{array}{l} E [\sum {(x_{i} - \bar{x})}^{2}] = \sum {(x_{i 0} - {\bar{x}}_{0})}^{2} + (n - 1) σ_{x}^{2}, \\ cov [{\sum (x_{i} - \bar{x})}^{2}, 1 / {\sum (x_{i} - \bar{x})}^{2}] \approx - [4 / S_{x x}] σ_{x}^{2}, \\ cov [\sum (x_{i} - \bar{x}) y_{i}, 1 / {\sum (x_{i} - \bar{x})}^{2}] \approx - [2 / S_{x y}] β^{2} σ_{x}^{2} . \end{array}$ E[1] = E[∑(x_i− $\bar{x}$ )²/∑(x_i− $\bar{x}$ )²] = E[∑(x_i− $\bar{x}$ )²] ∙ E[1/∑(x_i− $\bar{x}$ )²] + cov[∑(x_i − $\bar{x}$ )², 1/∑(x_i− $\bar{x}$ )²], and hence, E[1/∑(x_i− $\bar{x}$ )²] ≈ {1 + [4/S_xx] $σ_{x}^{2}$ }/{∑(x_i₀ − ${\bar{x}}_{0}$ )² + (n − 1) $σ_{x}^{2}$ }. Therefore, the expectation of the slope, β_E, can be derived as follows (see supplementary material for more details): $\begin{array}{l} β_{E} = E [\sum (x_{i} - \bar{x}) y_{i} / \sum {(x_{i} - \bar{x})}^{2}] \\ = E [\sum (x_{i} - \bar{x}) y_{i}] E [1 / {\sum (x_{i} - \bar{x})}^{2}] + cov [\sum (x_{i} - \bar{x}) y_{i}, 1 / \sum {(x_{i} - \bar{x})}^{2}] \\ \approx S_{x y} {1 + [4 / S_{x x}] σ_{x}^{2}} / {\sum {(x_{i 0} - {\bar{x}}_{0})}^{2} + (n - 1) σ_{x}^{2}} - [2 / S_{x y}] β^{2} σ_{x}^{2} . \end{array}$

If we apply the first-order Taylor approximation to simplify the expression S_xy{1 + [4/S_xx] $σ_{x}^{2}$ }/{∑(x_i₀ − ${\bar{x}}_{0}$ )² + (n − 1) $σ_{x}^{2}$ }, we obtain the following expressions for β_E and α_E: $\begin{array}{l} β_{E} = E [\hat{β}] \approx β - β {[1 / S_{x x}]}^{*} (n - 3) σ_{x}^{2}, \\ α_{E} = E [\hat{α}] = E [\bar{y} - \hat{β} \bar{x}] \approx α + {\bar{x}}_{0} β {[1 / S_{x x}]}^{*} (n - 3) σ_{x}^{2} . \end{array}$

Accordingly, the biases in $\hat{β}$ , $\hat{α}$ and $\hat{y}$ are as follows: $bias [\hat{β}] \approx - β {[1 / S_{x x}]}^{*} (n - 3) {σ_{x}}^{2},$ (6) $\begin{array}{l} bias [\hat{α}] \approx + {\bar{x}}_{0} β {[1 / S_{x x}]}^{*} (n - 3) {σ_{x}}^{2}, \\ bias [\hat{y}] = E [\hat{α} + \hat{β} x] - (α + β x) = E [\hat{α}] + x E [\hat{β}] - (α + β x) \\ \approx - (n - 3) (x - {\bar{x}}_{0}) {[1 / S_{x y}]}^{*} σ^{2} . \end{array}$ (7)

Based on these biases, we can see that β and α are not the mean, median, or mode of the $\hat{β}$ and $\hat{α}$ distributions. However, we can say that $\hat{β}$ and $\hat{α}$ , despite being slightly skewed, follow approximately normal distributions centered at β and α respectively, because the terms β[1/S_xx]^*(n − 3) $σ_{x}^{2}$ and ${\bar{x}}_{0}$ β[1/S_xx]^*(n − 3) $σ_{x}^{2}$ are each very small in magnitude in practical calibrations. (When n is 3, β coincides with β_E. The same can be said of α and α_E.)

To show that the slope, intercept and predicted y value in reversed inverse regression can be expressed as t_n₋₂ distributions, it is necessary to know the statistical properties of the mean squared error (MSE). The expectation of MSE is first derived (see supplementary material for more details): $\begin{array}{l} cov [\hat{β}, \bar{x}] \approx 0, \\ \sum cov [\hat{β}, x_{i}] \approx 0, \\ cov [{\hat{β}}^{2}, {\sum (x_{i} - \bar{x})}^{2}] \approx - 4 {\hat{β}}^{2} σ_{x}^{2}, \\ cov [\hat{β}, \sum (x_{i} - \bar{x}) y_{i}] \approx (S_{y y} S_{x x} / S_{x y}^{2} - 2) {\hat{β}}^{2} σ_{x}^{2}, \\ \sum var [x_{i} - \bar{x}] = (n - 1) σ_{x}^{2}, \end{array}$ $\begin{array}{l} E^{2} [\hat{β}] \approx {β - β {[1 / S_{x x}]}^{*} (n - 3) σ_{x}^{2}}^{2} \\ \approx β^{2} - 2 β^{2} {[1 / S_{x x}]}^{*} (n - 3) σ_{x}^{2}, \end{array}$ $\begin{array}{l} S S E = \sum {(y_{i} - \hat{α} - \hat{β} x_{i})}^{2} = \sum {y_{i} - (\bar{y} - \hat{β} \bar{x}) - \hat{β} x_{i}}^{2} = \sum {(y_{i} - \bar{y}) - (\hat{β} x_{i} - \hat{β} \bar{x})}^{2} \\ = \sum {(y_{i} - \bar{y})}^{2} - 2 \sum (y_{i} - \bar{y}) \hat{β} (x_{i} - \bar{x}) + \sum {\hat{β}}^{2} {(x_{i} - \bar{x})}^{2} {Typically, y_{i} - \bar{y} \neq \hat{β} (x_{i} - \bar{x})}, \end{array}$ $\begin{array}{l} E [S S E] = E [\sum {(y_{i} - \bar{y})}^{2}] - 2 {E [\hat{β}] E [\sum (x_{i} - \bar{x}) y_{i}] + cov [\hat{β}, \sum (x_{i} - \bar{x}) y_{i}] \\ - \bar{y} E [\hat{β}] E [\sum (x_{i} - \bar{x})]} + {var [\hat{β}] + E^{2} [\hat{β}]} {\sum var [x_{i} - \bar{x}] + \sum E^{2} [x_{i} - \bar{x}]} \\ + cov [{\hat{β}}^{2}, {\sum (x_{i} - \bar{x})}^{2}] \\ \approx (n - 1) - {[S_{y y} S_{x x} / S_{x y}^{2}]}^{*} σ_{x}^{2} β^{2} = (n - 2) σ^{2} . \end{array}$ $\begin{array}{l} M S E = \sum {(y_{i} - \hat{α} - \hat{β} x_{i})}^{2} / (n - 2), \\ ∴ E [M S E] \approx σ^{2} (= σ_{x}^{2} β^{2}) . \end{array}$ (8)

To investigate the accuracy of the expectation of MSE obtained using equation (8), we should consider the same factors taken into account in the case of the variance of $\hat{β}$ . The accuracy of the derived E[MSE] is discussed in detail based on simulation results in Section 5.

The correlation coefficient between the slope and the mean squared error, r $\hat{(β,}$ MSE), is derived using the method of simultaneous error equations. Let K = $\hat{β}$ ∑(y_i− $\hat{α}$ − $\hat{β}$ x_i)² = (S_xxS_yyS_xy − $S_{x y}^{3}$ )/ $S_{x x}^{2}$ , A = $\hat{β}$ = S_xy/S_xx, and F = ∑(y_i − $\hat{α}$ − $\hat{β}$ x_i)² = (S_xxS_yy − $S_{x y}^{2}$ )/S_xx. Then, two separate equations for deriving the variance of K can be established. The correlation coefficient r $\hat{(β,}$ MSE) is obtained from these two equations: $\begin{array}{l} K = \hat{β} \sum {(y_{i} - \hat{α} - \hat{β} x_{i})}^{2} = (S_{x x} S_{y y} S_{x y} - S_{x y}^{3}) / S_{x x}^{2}, \\ K = A F, \end{array}$ $\begin{array}{l} σ_{K}^{2} \approx \sum {(\partial K / \partial x_{i})}^{2} σ_{x i}^{2}, \\ σ_{K}^{2} \approx {(\partial K / \partial A)}^{2} σ_{A}^{2} + {(\partial K / \partial F)}^{2} σ_{F}^{2} + 2 (\partial K / \partial A) (\partial K / \partial F) σ_{A} σ_{F} r (A, F), \end{array}$ $\begin{array}{l} r (A, F) = r (\hat{β}, \sum {(y_{i} - \hat{α} - \hat{β} x_{i})}^{2}) = r (\hat{β}, S S E) \\ \approx - {(S_{x x} S_{y y} S_{x y} - S_{x y}^{3}) / S_{x x}} {S_{x x} / (S_{x x} S_{y y}^{2} S_{x y}^{2} - S_{y y} S_{x y}^{4})}^{1 / 2} \\ = - {1 - r^{2} (x, y)}^{1 / 2} \approx 0, \end{array}$ $∴ r (\hat{β}, M S E) \approx 0 .$

Additionally, $\hat{β}$ and $\bar{x}$ are independent of each other and $\bar{x}$ and MSE are also independent of each other, then r( $\hat{α}$ , MSE) = r( $\bar{y}$ − $\hat{β} \bar{x}$ , MSE) ≈ 0.

In the expression ∑(y_i − $\hat{α}$ − $\hat{β}$ x_i)²/(n − 2), the y_i's are constant, $\hat{β}$ and $\hat{α}$ follow approximately normal distributions, and the x_i's also follow normal distributions. Therefore, (n − 2)MSE/σ² approximately follows a χ² distribution with n − 2 degrees of freedom. In addition, both $\hat{β}$ and $\hat{α}$ are nearly independent of MSE. Based on these facts, the following expressions can be obtained (see equations. (1), (2), (4) and (8)): $\begin{array}{l} T_{1} = (\hat{β} - β) / {[S_{y y} / S_{x y}^{2}]}^{1 / 2} \hat{σ} \sim t_{n - 2}, \\ T_{2} = (\hat{α} - α) / {[1 / n + {\bar{x}}^{2} (S_{y y} / S_{x y}^{2})]}^{1 / 2} \hat{σ} \sim t_{n - 2}, \\ T_{3} = {\hat{y} - (α + β x)} / {[1 / n + {(x - \bar{x})}^{2} (S_{y y} / S_{x y}^{2})]}^{1 / 2} \hat{σ} \sim t_{n - 2}, \\ T_{4} = {y_{0} - (\hat{α} + \hat{β} x)} / {[1 + 1 / n + {(x - \bar{x})}^{2} (S_{y y} / S_{x y}^{2})]}^{1 / 2} \hat{σ} \sim t_{n - 2}, \end{array}$ where $\hat{σ}$ is the square root of MSE and y₀ is the nominal y value of a newly prepared standard. The T's are all approximate t_n₋₂ distributions. Although ${\bar{x}}^{2}$ , (x − $\bar{x}$ )² and S_yy/ $S_{x y}^{2}$ , which appear in the T's, are functions of x_i (i = 1, …, n), the t_n₋₂ distributions are not greatly deformed by these functions because the fluctuations of S_yy/ $S_{x y}^{2}$ (or [1/n + ${\bar{x}}^{2}$ (S_yy/ $S_{x y}^{2}$ )]) corresponding to the variations of the x_i's are typically very small compared with the magnitude of S_yy/ $S_{x y}^{2}$ (or [1/n + ${\bar{x}}^{2}$ (S_yy/ $S_{x y}^{2}$ )]) itself. Based on these t_n_–2 distributions, we can evaluate the uncertainty (or confidence interval) of a measurement value determined based on the fitted regression line.

4 Comparison of regression approaches

Krutchkoff [6,13] compared classical regression and inverse regression using Monte Carlo simulations and recommended inverse regression based on the mean squared error. However, Berkson [14] and Halpern [15] presented significant criticisms of Krutchkoff's work. Parker et al. [4] also conducted several simulation studies and concluded that inverse regression performs better than classical regression. It seems that such debates arise because the existing regression approaches and accompanying methodologies are theoretically incomplete. Unusually, we compare different linear regression approaches using a practical calibration example. Each of three types of regression (classical, inverse and reversed inverse) is applied to the calibration scenario. In practical calibrations, the variance of the prediction interval is one of the most important statistical properties. Therefore, we identify the differences among the three regressions based on a comparison of the variances of the prediction interval estimated using the three regression approaches. For the fitting of a regression line as an example of practical calibration, we use a set of data points collected by Suh [16] while evaluating the uncertainty in the measurements recorded by an absorption spectrometer. The spectrometer determines the chemical concentrations (ppm) in a sample by measuring the absorbances (%) due to the corresponding chemical elements. Suh measured five different Cd (cadmium) standards. The data points collected by Suh and the calibration results are as follows: $\begin{array}{l} (0.1 ppm, 0.028 %), (0.3 ppm, 0.084 %), (0.5 ppm, 0.135 %), \\ (0.7 ppm, 0.180 %), (0.9 ppm, 0.215 %) . \end{array}$

4.1 Classical regression

–
x: Cd concentration (ppm), y: absorbance (%).
–
$\bar{x} = 0 .5$ , $\bar{y} = 0.1284$ , S_xx = 0.4, S_yy = 0.02225, S_xy = 0.094, r(x, y) = S_xy/(S_xxS_yy)^1/2 = 0.9964.
–
$M S E ({\hat{σ}}^{2}) = \sum {(y_{i} - {\hat{α}}^{'} - {\hat{β}}^{'} x_{i})}^{2} / (5 - 2) = 0.000056$ , ${\hat{β}}^{'} = S_{x y} / S_{x x} = 0.235$ , ${\hat{α}}^{'} = 0.0109 .$
–
Regression line: $\hat{x} = - {\hat{α}}^{'} / {\hat{β}}^{'} + (1 / {\hat{β}}^{'}) y .$
–
Estimator for the variance of the prediction interval (EV_C): $[1 + 1 / n + {(x - \bar{x})}^{2} / S_{x x})] {\hat{σ}}^{2} {(1/ {\hat{β}}^{'})}^{2} .$
- •
  $\hat{x}$ = −0.04638 + 4.25532y. (Measurement equation)
- •
  EV_C = {1 + 1/5 + (0.8685 − 0.5)²/0.4}0.000056(1/0.235)² = 0.0015611 (at x = 0.8685 ppm).

Note: −0.04638 + 4.25532 × 0.215(%) = 0.8685 (ppm).

4.2 Inverse regression

–
x: Cd concentration (ppm), y: absorbance (%).
–
$\bar{x}$ = 0.5, $\bar{y}$ = 0.1284, S_xx = 0.4, S_yy = 0.02225, S_xy = 0.094, r(x, y) = S_xy/(S_xxS_yy)^1/2 = 0.9964.
–
MSE( ${\hat{σ}}^{2}$ ) = ∑(x_i − ${γ^{'}}_{0}$ − ${γ^{'}}_{1} y_{i}$ )²/(5 − 2) = 0.001, ${γ^{'}}_{1}$ = S_xy/S_yy = 4.22472, ${γ^{'}}_{0}$ = −0.04245.
–
Regression line: $\hat{x}$ = ${γ^{'}}_{0}$ + ${γ^{'}}_{1} y .$
–
Estimator for the variance of the prediction interval (EV_I): [1 + 1/n + (y − $\bar{y}$ )²/S_yy] ${\hat{σ}}^{2}$ .
- •
  $\hat{x}$ = −0.04245 + 4.22472y. (Measurement equation)
- •
  EV_I = {1 + 1/5 + (0.215 − 0.1284)²/0.02225} × 0.001 = 0.0015371 (at y = 0.215%).

4.3 Reversed inverse regression

–
x: absorbance (%), y: Cd concentration (ppm).
–
$\bar{x}$ = 0.1284, $\bar{y}$ = 0.5, S_xx = 0.02225, S_yy = 0.4, S_xy = 0.094, r(x, y) = S_xy/(S_xxS_yy)^1/2 = 0.9964.
–
MSE( ${\hat{σ}}^{2}$ ) = ∑(y_i − $\hat{α}$ − $\hat{β}$ x_i)²/(5 − 2) = 0.001, $\hat{β}$ = S_xy/S_xx = 4.22472, $\hat{α}$ = −0.04245.
–
Regression line: $\hat{y}$ = $\hat{α}$ + $\hat{β}$ x.
–
Estimator for the variance of the prediction interval (EV_RI): [1 + 1/n + (x − $\bar{x}$ )²(S_yy/ $S_{x y}^{2}$ )] ${\hat{σ}}^{2}$ .
- •
  $\hat{y}$ = −0.04245 + 4.22472x. (Measurement equation)
- •
  EV_RI = {1 + 1/5 + (0.215 − 0.1284)²(0.4/0.094²)} × 0.001 = 0.0015395 (at x = 0.215%).

The estimate EV_RI derived via reversed inverse regression at x = 0.215% (the upper end of the calibration range) is compared with the estimate EV_C derived via classical regression at x = 0.8685 ppm and with the estimate EV_I derived via inverse regression at y = 0.215%. All three estimates are different from one another. Classical regression yields the largest estimate, and inverse regression yields the smallest one. This can be explained by rewriting and comparing the following three estimators. (Both EV_C and EV_I are those derived by Parker et al. [4].) When rewriting EV_C and EV_I, the roles of x and y were reversed to facilitate comparison. In addition, ${\hat{σ}}_{y}^{2} {(1 / {\hat{β}}^{'})}^{2}$ in the expression for classical regression was changed to ${\hat{σ}}_{x}^{2} {(1 / {\hat{β}}^{'})}^{2} .$ $\begin{array}{l} E V_{C} = [1 + 1 / n + {(\hat{y} - \bar{y})}^{2} / S_{y y}] {\hat{σ}}_{x}^{2} {(1 / {\hat{β}}^{'})}^{2}, \\ E V_{I} = [1 + 1 / n + {(x - \bar{x})}^{2} / S_{x x}] {\hat{σ}}^{2}, \\ E V_{R I} = [1 + 1 / n + {(x - \bar{x})}^{2} (S_{y y} / S_{x y}^{2})] {\hat{σ}}^{2} \end{array}$

The correlation coefficient r(x, y) {=S_xy/(S_xxS_yy)^1/2} is very close to, but always smaller than, 1 in linear calibrations. In addition, $S_{y y} / S_{x y}^{2} = (1 / S_{x x}) {1 / r^{2} (x, y)}$ and ${(\hat{y} - \bar{y})}^{2} / S_{y y} = {(x - \bar{x})}^{2} (S_{y y} / S_{x y}^{2}) .$ The term ${\hat{σ}}_{x}^{2} {(1 / {\hat{β}}^{'})}^{2}$ is greater than ${\hat{σ}}^{2} .$ Therefore, the estimates can be arranged in order of increasing magnitude as follows: “inverse”, “reversed inverse” and then “classical”. This ordering holds for all linear calibrations. The differences among the three estimates depend on r(x, y). In Suh's measurement experiment, r(x, y) is 0.9964 (n = 5), the estimate derived via classical regression at the upper end of the calibration range is approximately 1.5% greater than that derived via inverse regression, and the estimate derived via reversed inverse regression is approximately 0.15% greater than that derived via inverse regression. If Suh had repeated this measurement experiment, the results would have been similar to those of this calibration. Regarding these calibration results, we should remind ourselves that although we rely on the estimate derived via classical regression, we cannot determine the prediction interval with a given confidence level because the estimate cannot be used to express the prediction interval as a t_n₋₂ distribution. In addition, we should remind ourselves that the estimate derived via inverse regression is not a theoretically correct one.

5 Simulation study

We conducted a Monte Carlo simulation study to investigate the accuracy of the statistical properties derived using the error propagation rule and the method of simultaneous error equations based on the first-order Taylor approximation. var $[β]$ , bias $\hat{[β}]$ and E[MSE] were the main targets of investigation because the accuracy of other properties, such as var[ $\hat{α}$ ], bias[ $\hat{α}$ ], var[ $\hat{y}$ ], bias[ $\hat{y}$ ] and var[prediction interval], depends on the accuracy of these three properties. We designed a simulation of regression line fitting using five data points based on reversed inverse regression. We first created five intended mean data points (x_i₀, y_i) (i = 1, …, 5) that were needed for the simulation as follows: $\begin{array}{l} (x_{1,0} = 412, y_{1} = 10), (x_{2,0} = 812, y_{2} = 20), (x_{3,0} = 1212, y_{3} = 30), \\ (x_{4,0} = 1612, y_{4} = 40), (x_{5,0} = 2012, y_{5} = 50) . \end{array}$

${\bar{x}}_{0} = 1212, \bar{y} = 30 .$
Intended population regression line: y = −0.3 + 0.025x (β = 0.025, α = −0.3).

Depending on the intended variance $σ_{x}^{2}$ , the simulation study was organized into five simulation groups, SG1, SG2, SG3, SG4 and SG5, and the intended variances assigned to the five groups were 90², 60², 24², 12² and 6², respectively. Five simulations per group were conducted (25 simulations in total). In every simulation, the regression line fitting was repeated 50 000 times using independent random numbers generated from normal distributions using the program “Minitab 15”. The results of the conducted simulations are presented along with the corresponding theoretically derived properties in Tables 1 and 2. (Even if different parameters, such as a different number of data points, a different ratio of $\bar{y}$ to ${\bar{x}}_{0}$ , or non-equal distances between the x_i₀'s, were applied in a simulation study, such a simulation study would yield conclusions essentially similar to those of this study.)

In Tables 1 and 2, the ratio of Svar[ $\hat{β}$ ] to Dvar[ $\hat{β}$ ] ranges from 0.971 to 1.017 and the ratio of SE[MSE] to DE[MSE] ranges from 0.983 to 1.002. In addition, the two derived variances ^*Dvar[ $\hat{β}$ ] and Dvar[ $\hat{β}$ ] are very close to each other. Therefore, we can conclude that the variance of the slope and the expectation of the mean squared error derived using the error propagation rule and the method of simultaneous error equations largely coincide with the simulation results.

According to Table 1, when $σ_{x}^{2}$ is 6², the ratio of bias[ $\hat{β}$ ] to {var[ $\hat{β}$ ]}^1/2 is approximately −0.01, and when $σ_{x}^{2}$ is 90², the ratio is approximately −0.14. These two ratios are very different from each other in magnitude. In the case of either simulation or derivation, as the variance $σ_{x}^{2}$ increases, both the absolute value of the bias in $\hat{β}$ and the variance of $\hat{β}$ increase. The rate of increase of the absolute value of the bias in $\hat{β}$ is equal to the rate of increase of $σ_{x}^{2}$ (see Eq. (6)), whereas the rate of increase of {var[ $\hat{β}$ ]}^1/2 is the square root of the rate of increase of $σ_{x}^{2}$ (see Eq. (1)). This indicates that as $σ_{x}^{2}$ increases, the $\hat{β}$ distribution becomes more skewed. In Tables 1 and 2, the derived values of the bias in $\hat{β}$ largely coincide with the simulation results regardless of $σ_{x}^{2}$ . This indicates that although the first-order Taylor approximation is used to derive the bias in $\hat{β}$ , the derived bias does not greatly differ from the simulation result. The bias in $\hat{β}$ plays an important role in analyzing the accuracy of other derived statistical properties.

When $σ_{x}^{2}$ is small, the derived variance of $\hat{β}$ exactly coincides with the simulation result; however, when $σ_{x}^{2}$ is large, the derived variance of $\hat{β}$ is generally slightly greater than the simulation result. When the variance of $\hat{β}$ (i.e., $var [\hat{β}] = {[S_{y y} / S_{x y}^{2}]}^{*} σ_{x}^{2} β^{2}$ ) is derived using the error propagation rule, the partial derivatives of orders higher than the first are not included in the derivation, and the approximation var [f (x₁, … , x_n)] ≈ E [{f (x₁, … , x_n) − f (x₁, … , x_n)} ²] is used instead of the exact definition var [f (x₁, … , x_n)] = E [{f (x₁, … , x_n) − E [f (x₁, … , x_n)]} ²] to derive the variance. This results in two phenomena. The first phenomenon is that error terms of orders higher than $σ_{x}^{2}$ are excluded from the derivation, and the second phenomenon is that the bias in $\hat{β}$ is not reflected in the derivation. The bias in $\hat{β}$ depends on $σ_{x}^{2}$ and n (see equation (6)). In this simu lation study, n is 5. The first phenomenon typically causes the derived variance of $\hat{β}$ (i.e., Dvar[ $\hat{β}$ ]) to decrease, whereas the second phenomenon tends to cause it to increase. If $σ_{x}^{2}$ is small, both effects are trivial, and Dvar[ $\hat{β}$ ] is nearly equal to Svar[ $\hat{β}$ ]. If $σ_{x}^{2}$ is large, both of these effects are also large. However, the effect of the second phenomenon is much greater than that of the first. As a result, if $σ_{x}^{2}$ is large, then Dvar[ $\hat{β}$ ] is greater than Svar[ $\hat{β}$ ]. If we substitute β_E (SMean[ $\hat{β}$ ] in Tab. 2) into equation (1) in place of β, we can obtain a variance of $\hat{β}$ that is much closer to the simulation result. For example, for SG1-1, we can obtain (1000/40 000²) × 90² × 0.0247465² = 0.0017607² by substituting β_E (= 0.0247465) into ${[S_{y y} / S_{x y}^{2}]}^{*} σ_{x}^{2} β_{E}^{2}$ . (The difference between ${[S_{y y} / S_{x y}^{2}]}^{*} σ_{x}^{2} β^{2}$ and ${[S_{y y} / S_{x y}^{2}]}^{*} σ_{x}^{2} β_{E}^{2}$ is approximately equal to the square of bias[ $\hat{β}$ ].) This value is very close to the simulation result. The difference that still remains can be regarded as the effect of the first phenomenon.

With regard to the expectation of the mean squared error, a similar explanation is possible. Even in this case, the effect of the second phenomenon is greater than that of the first phenomenon, and hence, DE[MSE] is generally greater than SE[MSE]. In particular, let us attempt to approximately calculate the effect of the first phenomenon using another expression for the expectation of MSE. Namely, in the equation $E [M S E] = E [(S_{x x} S_{y y} - S_{x y}^{2}) / (n - 2) S_{x x}] \approx E [σ_{x}^{2} {\hat{β}}^{2}] = σ_{x}^{2} {E^{2} [\hat{β}] + var [\hat{β}]} \approx σ_{x}^{2} β_{E}^{2} + β^{2} {[S_{y y} / S_{x y}^{2}]}^{*} σ_{x}^{4}$ , the last term on the right-hand side reflects the effect of the first phenomenon to a certain extent. This equation helps us understand the two phenomena.

In Table 2, if $σ_{x}^{2}$ is large, then ^*Dvar[ $\hat{β}$ ] is generally greater than Dvar[ $\hat{β}$ ]. In every simulation, the estimate for the variance of the slope, i.e., (S_yy/ $S_{x y}^{2}$ ) ${\hat{σ}}^{2}$ , was calculated for each regression line. ^*Dvar[ $\hat{β}$ ] is the mean of the 50,000 estimates thus calculated. We can also obtain ^*Dvar[ $\hat{β}$ ] using another method, as follows: $\begin{array}{l} {}^{*}{Dvar [\hat{β}]} = E [(S_{y y} / S_{x y}^{2}) (S_{x x} S_{y y} - S_{x y}^{2}) / S_{x x} (n - 2)] = E [{(S_{y y} / S_{x y}^{2}) \hat{σ}}^{2}] \\ \approx E [(S_{y y} / S_{x y}^{2}) σ_{x}^{2} {\hat{β}}^{2}] = E [(S_{y y} / S_{x x}^{2}) σ_{x}^{2}] \\ = S_{y y} σ_{x}^{2} E [1 / S_{x x}^{2}] = S_{y y} σ_{x}^{2} {E^{2} [1 / S_{x x}] + var [1 / S_{x x}]} \\ \approx {S_{y y} σ_{x}^{2} / S_{x x}^{2} + 2 (7 - n) S_{y y} σ_{x}^{4} / S_{x x}^{3}}^{*} . \end{array}$

The term ${2 (7 - n) S_{y y} σ_{x}^{4} / S_{x x}^{3}}^{*}$ reflects the difference between ^*Dvar[ $\hat{β}$ ] and Dvar[ $\hat{β}$ ]. The difference depends on $σ_{x}^{4}$ and n.

In this section, we investigated the accuracy of the statistical properties of reversed inverse regression as derived using the error propagation rule and the method of simultaneous error equations through comparisons with simulation results. However, it should also be noted that the main target that calibration experts wish to obtain (or approach) by means of regression line fitting is the population regression line y = α + βx, not the average regression line y = α_E + β_Ex. In this respect, it is recommended that after the physical or chemical value of a sample is determined based on the fitted regression line, the determined value be corrected taking into account the bias in predicted y value (see Eq. (7)); such a bias correction will lead us closer to the true value.

Table 1

Simulation results and theoretically derived properties.

Table 2

Ratios of the simulation results to the corresponding derived properties.

6 Conclusion

From Osborne [17], it can be seen that considerable effort has been made to resolve the linear calibration problem since the 1930s. Most representatively, Eisenhart [2] suggested classical regression as a solution for the problem, and Krutchkoff [6] suggested inverse regression as another solution. Later, Parker et al. [4] derived the variances of the prediction interval and the biases in $\hat{x}$ for these two types of regression using the Delta Method. However, it can be said that the problem has not yet been resolved completely. As a fundamental solution for this problem, the current study introduced reversed inverse regression along with a methodology for deriving its statistical properties. In this study, the statistical properties of reversed inverse regression, such as the variance and bias of the slope, the expectation of the mean squared error, and the variance of the predicted y value, were derived using the error propagation rule and the method of simultaneous error equations. The method of simultaneous error equations, which was introduced for the first time in this study, is a useful tool for deriving the covariance of any two statistics. As another example of its use, all of the statistical properties of basic regression can be derived much more easily with the aid of this method. Even in the case of weighted linear regression, this method can be used to derive its statistical properties.

We presented an example of practical calibration. Each of the three types of regression (i.e., classical, inverse and reversed inverse) was applied to this calibration example. As a result, we found that the estimates of the variance of the prediction interval can be arranged in order of increasing magnitude as follows: “inverse,” “reversed inverse” and then “classical”. This ordering holds for all linear calibrations. The differences among the three estimates depend on r(x, y). As the next step, to investigate the accuracy of the three derived statistical properties of reversed inverse regression, i.e., Dvar[ $\hat{β}$ ], Dbias[ $\hat{β}$ ] and DE[MSE], a Monte Carlo simulation study was conducted. Through this simulation study, we found that when the variance of the observed measurements, i.e., $σ_{x}^{2}$ , is small, the theoretically derived variance and bias of the slope as well as the theoretically derived expectation of the mean squared error coincide with the simulation results. However, when $σ_{x}^{2}$ is large, there are small differences between the derived properties and the simulation results. Such differences are caused by two phenomena. The first phenomenon is that error terms of orders higher than $σ_{x}^{2}$ are excluded from the derivation, and the second phenomenon is that the bias in $\hat{β}$ is not reflected in the derivation. The first phenomenon typically causes the derived statistical properties to decrease, whereas the second phenomenon tends to cause them to increase (when n is greater than 3). The effect of the second phenomenon is larger than that of the first phenomenon, and hence, the values of the derived properties are typically slightly greater than the simulation results. In this way, after performing simulations we could investigate and analyze the differences between the derived statistical properties and the simulation results. This is another benefit of the new methodology used to derive the statistical properties of reversed inverse regression.

7 Implications and influences

Lwin and Maritz [18] suggested that regression models do not require the assumption of fixed inputs. In other words, regardless of whether the regression model of interest is consistent with this assumption, the method of least squares can be applied to fit a regression line. In that sense, it is meaningless to identify whether the line fitted using one regression approach is preferable to that fitted using another regression approach. However, it is nevertheless essential to know the statistical properties of the type of regression used for fitting. Unfortunately, the known statistical properties of the existing regression approaches are not without flaw. By contrast, all of the statistical properties of reversed inverse regression can be derived using the newly proposed methodology, and the statistical properties derived in this manner are theoretically correct and sufficiently accurate. In this respect, we claim that reversed inverse regression and the new methodology for deriving its statistical properties together serve as a fundamental solution for the univariate linear calibration problem, which had not previously been completely resolved. Finally, we expect this new methodology to be widely used in the field of calibration.

Supplementary Material

Derivations of the statistical properties of reversed inverse regression. Access here

Acknowledgments

The study reported in this paper was conducted as part of a plan to improve the quality assurance and control system of KEPCO Nuclear Fuel. The authors would like to express their thanks for the support from their company, without which the study could not have been successfully completed. In particular, the authors would like to express special thanks to President & CEO, Jaehee Lee; Executive Vice President & Chief Production Officer, Sundoo Kim; and Ex-Executive Vice President & Chief Production Officer, Chuljoo Park, who cordially supported and encouraged the authors in their study on the statistical theory and development of a new calibration approach using a regression model.

References

R.E. Walpole, R.H. Myers, Probability and Statistics for Engineers and Scientists, 5th edn. (Macmillan Publishing Company, London, 1993) [Google Scholar]
C. Eisenhart, The interpretation of certain regression methods and their use in biological and industrial research, Ann. Math. Stat. 10, 162–186 (1939) [CrossRef] [Google Scholar]
E.J. Williams, A note on regression methods in calibration, Technometrics 11, 189–192 (1969) [Google Scholar]
P.A. Parker, G.G. Vining, S.R. Wilson, J.L. Szarka III, N.G. Johnson, The prediction properties of inverse and reverse regression for the simple linear calibration problem, J. Qual. Technol. 42, 332–347 (2010) [CrossRef] [Google Scholar]
G. Casella, R.L. Berger, Statistical Inference, 2nd edn. (Duxbury, Pacific Grove, 2002) [Google Scholar]
R.G. Krutchkoff, Classical and inverse regression methods, Technometrics 9, 425–439 (1967) [Google Scholar]
G.K. Shukla, P. Datta, Comparison of the inverse estimator with the classical estimator subject to a preliminary test in linear calibration, J. Stat. Plan. Inference 12, 93–102 (1985) [Google Scholar]
S.D. Oman, An exact formula for the M.S.E. of the inverse estimator in the linear calibration problem, J. Stat. Plan. Inference 11, 189–196 (1985) [Google Scholar]
W. Fuller, Measurement Error Models (John Wiley & Sons, Hoboken, 1987) [CrossRef] [Google Scholar]
T. Pham-Gia, N. Turkkan, E. Marchand, Density of the ratio of two normal random variables and applications, Commun. Stat. Theory Methods 35, 1569–1591 (2006) [Google Scholar]
N. Tsoulfanidis, Measurement and Detection of Radiation (Hemisphere Publishing Corporation, Washington, 1983) [Google Scholar]
A. Papanicolaou, Taylor Approximation and the Delta Method (coursehero, Stanford, 2009), 103 p. [Google Scholar]
R.G. Krutchkoff, Classical and inverse regression methods in extrapolation, Technometrics 11, 605–608 (1969) [Google Scholar]
J. Berkson, Estimation of a linear function for a calibration line; consideration of a recent proposal, Technometrics 11, 647–660 (1969) [Google Scholar]
M. Halpern, On inverse estimation in linear regression, Technometrics 12, 727–736 (1970) [Google Scholar]
M.Y. Suh, Methods for the Calculation of Uncertainty in Analytical Chemistry, KAERI/TR1602/2000 Korean Language (Korea Atomic Energy Research Institute, Daejeon, 2000) [Google Scholar]
C. Osborne, Statistical calibration: a review, Int. Stat. Rev. 59, 309–336 (1991) [Google Scholar]
T. Lwin, J.S. Maritz, An analysis of the linear calibration controversy from the perspective of compound estimation, Technometrics 24, 235–242 (1982) [Google Scholar]

Cite this article as: Pilsang Kang, Changhoi Koo, Hokyu Roh, Reversed inverse regression for the univariate linear calibration and its statistical properties derived using a new methodology, Int. J. Metrol. Qual. Eng. 8, 28 (2017)

All Tables

Table 1

Simulation results and theoretically derived properties.

In the text

Table 2

Ratios of the simulation results to the corresponding derived properties.

In the text

Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.

Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.

Initial download of the metrics may take a while.

[1] R.E. Walpole, R.H. Myers, Probability and Statistics for Engineers and Scientists, 5th edn. (Macmillan Publishing Company, London, 1993) [Google Scholar]

[2] C. Eisenhart, The interpretation of certain regression methods and their use in biological and industrial research, Ann. Math. Stat. 10, 162–186 (1939) [CrossRef] [Google Scholar]

[3] E.J. Williams, A note on regression methods in calibration, Technometrics 11, 189–192 (1969) [Google Scholar]

[4] P.A. Parker, G.G. Vining, S.R. Wilson, J.L. Szarka III, N.G. Johnson, The prediction properties of inverse and reverse regression for the simple linear calibration problem, J. Qual. Technol. 42, 332–347 (2010) [CrossRef] [Google Scholar]

[5] G. Casella, R.L. Berger, Statistical Inference, 2nd edn. (Duxbury, Pacific Grove, 2002) [Google Scholar]

[6] R.G. Krutchkoff, Classical and inverse regression methods, Technometrics 9, 425–439 (1967) [Google Scholar]

[7] G.K. Shukla, P. Datta, Comparison of the inverse estimator with the classical estimator subject to a preliminary test in linear calibration, J. Stat. Plan. Inference 12, 93–102 (1985) [Google Scholar]

[8] S.D. Oman, An exact formula for the M.S.E. of the inverse estimator in the linear calibration problem, J. Stat. Plan. Inference 11, 189–196 (1985) [Google Scholar]

[9] W. Fuller, Measurement Error Models (John Wiley & Sons, Hoboken, 1987) [CrossRef] [Google Scholar]

[10] T. Pham-Gia, N. Turkkan, E. Marchand, Density of the ratio of two normal random variables and applications, Commun. Stat. Theory Methods 35, 1569–1591 (2006) [Google Scholar]

[11] N. Tsoulfanidis, Measurement and Detection of Radiation (Hemisphere Publishing Corporation, Washington, 1983) [Google Scholar]

[12] A. Papanicolaou, Taylor Approximation and the Delta Method (coursehero, Stanford, 2009), 103 p. [Google Scholar]

[13] R.G. Krutchkoff, Classical and inverse regression methods in extrapolation, Technometrics 11, 605–608 (1969) [Google Scholar]

[14] J. Berkson, Estimation of a linear function for a calibration line; consideration of a recent proposal, Technometrics 11, 647–660 (1969) [Google Scholar]

[15] M. Halpern, On inverse estimation in linear regression, Technometrics 12, 727–736 (1970) [Google Scholar]

[16] M.Y. Suh, Methods for the Calculation of Uncertainty in Analytical Chemistry, KAERI/TR1602/2000 Korean Language (Korea Atomic Energy Research Institute, Daejeon, 2000) [Google Scholar]

[17] C. Osborne, Statistical calibration: a review, Int. Stat. Rev. 59, 309–336 (1991) [Google Scholar]

[18] T. Lwin, J.S. Maritz, An analysis of the linear calibration controversy from the perspective of compound estimation, Technometrics 24, 235–242 (1982) [Google Scholar]