Possible bug in regress when y and x are complex
4 views (last 30 days)
Show older comments
I am looking for feedback on whether regress produces incorrect confidence intervals when x and y are complex.
The following code
rng(2);
% Model equation: y = x*b + error, where b = (1 + 1j).
% The 95% confidence interval the imaginary part of b has zero width.
x = randn(N,1) + 1j*randn(N,1);
y = x*(1+1j) + sigma*(randn(N,1) + 1j*randn(N,1));
[b,bint] = regress(y,x)
produces
b = 1.0120 + 0.9942i
bint = 0.9520 + 0.9942i 1.0720 + 0.9942i
My interpretation is
real(b) = 1.012 with a 95% confidence interval of [0.95, 1.07]
imag(b) = 0.9942 with a 95% confidence interval of [.9942, .9942]
But imag(b) should have a 95% confidence interval that is similar in width to that for real(b).
To get the expected answer, I do the regression without complex numbers:
% Emulate how the regression is done complex x and y. We get the
% same result for b and the correct 95% confidence intervals on
% both the real and imaginary parts of b. (I have verified via
% simulation that this gives confidence intervals for both components
% are consistent with a 95% confidence intervals.)
y = [real(y);imag(y)];
x = [real(x),-imag(x);imag(x),real(x)];
[b,bint] = regress(y,x);
b = b(1) + 1j*b(2)
bint = [bint(1,1)+1j*bint(2,1),bint(1,2)+1j*bint(2,2)]
In this case I get the same answer for b and confidence intervals that are consistent with what is expected, but different from the first example:
b = 1.0120 + 0.9942i
bint = 0.9726 + 0.9548i 1.0514 + 1.0336i
My interpretation of this result is
real(b) = 1.012 with a 95% confidence interval of [0.9726, 1.514]
imag(b) = 0.9942 with a 95% confidence interval of [.9548, 1.0336]
Note that the condidence interval width for real(b) here is smaller than found in the first example. It seems possible that the confidence interval for real(b) for the first example is based on the confidence interval of abs(b).
0 Comments
Answers (1)
Aditya
on 26 Feb 2024
The MATLAB function `regress` is not designed to handle complex numbers directly. When you pass complex numbers to `regress`, it only operates on the real part of the data, which is why you observe the zero width for the confidence interval of the imaginary part of `b`. This is consistent with the documentation, which does not specify support for complex numbers.
Your interpretation of the results from the first example is correct; however, the confidence interval for the imaginary part is incorrect because `regress` is not handling the complex data properly.
The second approach you've taken is the correct way to perform linear regression with complex numbers. By separating the real and imaginary parts and stacking them, you can use `regress` to estimate the coefficients for both the real and imaginary parts. This method treats the problem as a multivariate regression with two independent variables for each observation: one for the real part and one for the imaginary part of `x`.
The confidence intervals you get from the second approach are correct for both real and imaginary parts of `b`. The smaller confidence interval width for the real part in the second example compared to the first is likely due to the proper handling of the variance in both real and imaginary parts, which affects the estimated standard errors used to calculate the confidence intervals.
In summary, to perform regression with complex numbers and obtain correct confidence intervals, you should use the second approach, which correctly accounts for the complex nature of the data. The first approach using `regress` directly with complex numbers will not provide accurate confidence intervals for the imaginary part.
See Also
Categories
Find more on Matrix Indexing in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!