Very small feasibility value but very large first-order optimality value

I am solving a large-scale optimization problem (over 8000 decision variables) with fmincon using the interior-point algorithm. My problem consists of a nonlinear objective function together with linear and nonlinear equality constraints. Upon termination, I get a solution that satisfies all my constraints but has really poor first-order optimality. Upon further digging, I found that the large optimality value is due to the Lagrange multipliers (for both the linear and nonlinear equality constraints) being very large themselves (over 1e28).
Can somebody please help me understand why this is the case? And is there any way to improve the first-order optimality value here? I would appreciate any pointers in this regard. I have attached a screenshot showing the problem. Thank you for your time.
Best,
Chaitanya

Accepted Answer

Alan Weiss
Alan Weiss on 8 Apr 2022
This is a very interesting issue; I have never seen anything quite like it before. Apparently, you are providing gradients for both the objective and nonlinear constraints. I would first look there to see whether those nonlinear functions are correct; it is notoriously difficult to get gradients right without using a symbolic approach.
If everything is correct, then you might try rerunning the optimization from the solution point, but this time with modified options. I'd try HessianApproximation = 'finite-difference', maybe set ScaleProblem to true, and certainly set CheckGradients to true.
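In option syntax, those suggestions might look like the sketch below. Note that myObj, myNonlcon, x0, Aeq, and beq are placeholder names for the poster's own functions and data, not anything from the thread, and the 'finite-difference' Hessian approximation requires SubproblemAlgorithm to be 'cg'.

```matlab
% Sketch of the suggested option changes (placeholder problem data).
opts = optimoptions('fmincon', ...
    'Algorithm','interior-point', ...
    'SpecifyObjectiveGradient',true, ...     % user-supplied objective gradient
    'SpecifyConstraintGradient',true, ...    % user-supplied constraint gradients
    'HessianApproximation','finite-difference', ...
    'SubproblemAlgorithm','cg', ...          % required with 'finite-difference'
    'ScaleProblem',true, ...
    'CheckGradients',true);                  % compare supplied gradients to finite differences

[x,fval,exitflag,output,lambda] = ...
    fmincon(@myObj,x0,[],[],Aeq,beq,[],[],@myNonlcon,opts);
```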
Alan Weiss
MATLAB mathematical toolbox documentation
  1 Comment
Chaitanya Awasthi
Chaitanya Awasthi on 9 Apr 2022
Edited: Chaitanya Awasthi on 9 Apr 2022
Hi Alan,
You are indeed correct in saying that I am providing the gradients for the cost function and the nonlinear constraints. To the best of my ability, I can also attest that my constraint functions are coded correctly.
Also, prior to asking my question, I had already set ScaleProblem to true and checked my gradients using CheckGradients. However, the one thing I did not think of was using a different Hessian approximation (I had been using lbfgs due to the large-scale nature of my problem). As soon as I did that, not only did I get a much better solution, but my first-order optimality decreased enormously too (please see the screenshot). I got similarly good results when I instead used the bfgs approximation. Thank you so much for helping me out by pointing me in this direction.
However, a question still remains as to why the lbfgs approximation performed so badly. Do you have any thoughts on this matter?
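Not an explanation, but one thing that might be worth experimenting with here: lbfgs keeps only a limited memory of curvature pairs (10 by default), and fmincon's option syntax lets you enlarge that memory via a cell array. This is only a hedged suggestion, not something anyone in the thread tried:

```matlab
% Sketch: increase the lbfgs memory from the default of 10 stored pairs to 50.
% Whether this helps for this particular problem is untested.
opts = optimoptions('fmincon', ...
    'Algorithm','interior-point', ...
    'HessianApproximation',{'lbfgs',50});
```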


More Answers (1)

Matt J
Matt J on 9 Apr 2022
That can happen if your objective function or constraints are not continuously differentiable, e.g.,
fmincon(@(x)sqrt(abs(x)),1,[],[],[],[],[],[],[],...
optimoptions('fmincon','Display','iter','Algorithm','interior-point'))
                                            First-order      Norm of
 Iter F-count            f(x)  Feasibility   optimality         step
    0       2    1.000000e+00    0.000e+00    5.000e-01
    1       4    7.071068e-01    0.000e+00    7.071e-01    5.000e-01
    2       8    6.195833e-01    0.000e+00    8.070e-01    8.839e-01
    3      10    2.953197e-01    0.000e+00    1.693e+00    4.711e-01
    4      13    2.688914e-01    0.000e+00    1.859e+00    1.595e-01
    5      15    1.057901e-01    0.000e+00    4.726e+00    8.349e-02
    6      19    6.155009e-02    0.000e+00    8.123e+00    1.498e-02
    7      22    3.076759e-02    0.000e+00    1.625e+01    4.735e-03
    8      25    2.513630e-02    0.000e+00    1.989e+01    1.578e-03
    9      27    1.539175e-02    0.000e+00    3.248e+01    8.687e-04
   10      30    5.700689e-03    0.000e+00    8.770e+01    2.694e-04
   11      34    4.080285e-03    0.000e+00    1.225e+02    4.915e-05
   12      36    3.463278e-03    0.000e+00    1.443e+02    2.864e-05
   13      38    1.870291e-03    0.000e+00    2.671e+02    1.549e-05
   14      41    1.237146e-03    0.000e+00    4.032e+02    5.029e-06
   15      43    1.222445e-03    0.000e+00    4.080e+02    3.025e-06
   16      45    1.645466e-04    0.000e+00    2.707e+03    1.521e-06
   17      56    1.193397e-04    0.000e+00    3.448e+03    4.132e-08
   18      58    9.435727e-05    0.000e+00    4.022e+03    2.315e-08
   19      60    5.965666e-05    0.000e+00    5.114e+03    1.246e-08
   20      62    5.845842e-05    0.000e+00    5.160e+03    6.976e-09
   21      64    9.283106e-06    0.000e+00    7.593e+03    3.504e-09
   22      74    6.648107e-06    0.000e+00    7.758e+03    1.304e-10
   23      76    4.657401e-06    0.000e+00    7.885e+03    6.589e-11

Local minimum possible. Constraints satisfied.

fmincon stopped because the size of the current step is less than the value of the step size tolerance and constraints are satisfied to within the value of the constraint tolerance.
ans = 2.1691e-11
  3 Comments
Matt J
Matt J on 9 Apr 2022
Edited: Matt J on 9 Apr 2022
I don't think that's the issue. If you were to use, as your initial x0, the solution that you originally obtained with lbfgs, I'd be willing to bet that the algorithm stays there, no matter which Hessian approximation you use.
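Matt's experiment could be set up roughly as follows; xLbfgs, myObj, myNonlcon, Aeq, and beq are placeholders for the poster's earlier lbfgs solution and problem data, which are not shown in the thread:

```matlab
% Sketch: restart from the previous lbfgs solution and check whether the
% solver moves away from it under a different Hessian approximation.
optsFD = optimoptions('fmincon', ...
    'Algorithm','interior-point', ...
    'SpecifyObjectiveGradient',true, ...
    'SpecifyConstraintGradient',true, ...
    'HessianApproximation','finite-difference', ...
    'SubproblemAlgorithm','cg', ...          % required with 'finite-difference'
    'Display','iter');

[x2,fval2,~,output2] = ...
    fmincon(@myObj,xLbfgs,[],[],Aeq,beq,[],[],@myNonlcon,optsFD);
% If output2.iterations stays at 0 and x2 == xLbfgs, the point was already
% (approximately) stationary; otherwise the lbfgs run had stopped early.
```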
Chaitanya Awasthi
Chaitanya Awasthi on 10 Apr 2022
Hi Matt,
I have some very surprising results for you. As you suggested, I used the lbfgs solution as the initial point, not just for the finite-difference Hessian approximation but also for (another round of) the lbfgs Hessian approximation. Interestingly, in both cases the solver does not stop at the first step; rather, it decreases the cost function further. I was less surprised that this happens with the finite-difference Hessian, but quite surprised that it happened even for lbfgs. Please see the screenshots below. Do you have any thoughts on what is going on here?
Hessian approximation: lbfgs
Hessian approximation: finite-difference


Release

R2021a
