Do worker nodes in a parallel pool need direct connectivity to the client?
Show older comments
I'm trying to start a parallel pool on a HPC cluster using a windows client. I've installed the third-party SLURM tool add-on and configured a cluster. All validation checks pass except the last one for parpool:
it says:
Error Report: Failed to initialize the interactive session.
Caused by:
Error using parallel.internal.pool.AbstractInteractiveClient>iThrowIfBadParallelJobStatus (line 423)
The interactive communicating job errored with the following message: MatlabPoolPeerInstance{fLabIndex=1, fNumberOfLabs=2, fUuid=0c40d093-9791-4944-8fe5-30feb46c98d5} was unable to find the host for workstation:27370 due to a JVM UnknownHostException: null
"workstation" being the hostname of my local machine.
It looks like the worker nodes are trying to reach my local machine, which of course is not possible, as they are isolated. I feel they should be maybe contacting the ClusterHost to which MATLAB connets via SSH and from which the jobs are submitted?
Accepted Answer
More Answers (0)
Categories
Find more on Third-Party Cluster Configuration in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!