Error using parpool inside a SC with MCR v98 (R2020a), with SLURM as the job scheduler
Hello all,
I was running a compiled standalone application that uses the Parallel Computing Toolbox with MCR v98 (R2020a) inside a SC, and it worked normally, that is, I got the results I wanted. After some other tests, and without modifying anything in the compiled standalone app, I am now getting this error in the output file:
Parallel pool failed to start with the following error.
Error in StackCurrentF/OpenParPool (line 551)
Error in StackCurrentF (line 87)
Caused by:
Error using parallel.internal.pool.InteractiveClient>iThrowWithCause (line 670)
Failed to locate and destroy old interactive jobs.
Error using parallel.Cluster/findJob (line 74)
Unknown type: concurrentconcurrent.
parallel:cluster:PoolCreateFailed
So, no parallel computation. This happens even when I run a small interactive job with srun that only opens the pool, waits, and then closes it.
What could be the problem?
Any insights, or past experiences with similar problems, would be of great help.
Thank you!
1 Comment
Edric Ellis
on 2 Apr 2024
I suggest contacting MathWorks support who should be able to help resolve this.
Accepted Answer
R
on 8 May 2024
I ran into this error before. In my case it was caused by multiple jobs/users accessing the local job storage location at the same time. I resolved it by implementing the solution provided in the following MATLAB Answer:
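For readers who cannot follow the link, here is a minimal sketch of one common way to avoid that kind of clash. It is not necessarily the exact fix from the linked answer. The idea is to give every job its own job storage folder before the pool is opened. The function names `openPoolWithPrivateStorage` and `privateJobStorageDir` and the folder prefix `pct_jobs_` are hypothetical, and the sketch assumes the app uses the `local` cluster profile and that `tempdir` is writable on the compute node.

```matlab
function pool = openPoolWithPrivateStorage(numWorkers)
% Hypothetical helper: open a pool on the 'local' profile using a job
% storage folder that no other job or user shares.
    storageDir = privateJobStorageDir();

    c = parcluster('local');             % assumes the 'local' profile is used
    c.JobStorageLocation = storageDir;   % keep this job's files separate
    pool = parpool(c, numWorkers);
end

function storageDir = privateJobStorageDir()
% Hypothetical helper: build and create a per-job folder under tempdir.
    jobId = getenv('SLURM_JOB_ID');      % set by SLURM inside a job
    if isempty(jobId)
        % Outside SLURM: fall back to a unique name generated by MATLAB.
        [~, jobId] = fileparts(tempname);
    end

    storageDir = fullfile(tempdir, ['pct_jobs_' jobId]);
    if ~exist(storageDir, 'dir')
        mkdir(storageDir);
    end
end
```

In the compiled app this would replace a bare `parpool(...)` call, for example `pool = openPoolWithPrivateStorage(4);`. The folder can be deleted once the pool is closed. If stale files from earlier runs are already in the default storage location, clearing them may also be needed.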