Cluster Computer Day 2
This is where I'm at. Close, but not there yet.
$ mpirun -np 8 -hostfile ~/mpi_hosts ./mpi_program
ssh: Could not resolve hostname master: Name or service not known
ORTE was unable to reliably start one or more daemons.
This usually is caused by:
* not finding the required libraries and/or binaries on
one or more nodes. Please check your PATH and LD_LIBRARY_PATH
settings, or configure OMPI with --enable-orterun-prefix-by-default
* lack of authority to execute on one or more specified nodes.
Please verify your allocation and authorities.
* the inability to write startup files into /tmp (--tmpdir/orte_tmpdir_base).
Please check with your sys admin to determine the correct location to use.
* compilation of the orted with dynamic libraries when static are required
(e.g., on Cray). Please check your configure cmd line and consider using
one of the contrib/platform definitions for your system type.
* an inability to create a connection back to mpirun due to a
lack of common network interfaces and/or no route found between
them. Please check network connectivity (including firewalls
and network routing requirements).
ORTE does not know how to route a message to the specified daemon
located on the indicated node:
my node: raspberry
target node: slave1
This is usually an internal programming error that should be
reported to the developers. In the meantime, a workaround may
be to set the MCA param routed=direct on the command line or
in your environment. We apologize for the problem.
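The first line of that output is the real clue: ssh could not resolve the name "master", so mpirun never got as far as starting its daemons. Here is a rough sketch for checking which names in the hostfile resolve on a given node. It assumes ~/mpi_hosts has one host per line with the name in the first column, and that getent is installed (it normally is on Debian-based systems). The function names hostfile_names and check_hosts are made up for this example.

```shell
#!/bin/sh
# Print the host names from an Open MPI hostfile: the first field of
# every line that is not blank and does not start with '#'.
hostfile_names() {
    awk 'NF && $1 !~ /^#/ { print $1 }' "$1"
}

# Report, for each name in the hostfile, whether this machine can resolve it.
# getent asks the system name service (normally /etc/hosts, then DNS).
check_hosts() {
    for host in $(hostfile_names "$1"); do
        if getent hosts "$host" >/dev/null 2>&1; then
            echo "ok       $host"
        else
            echo "MISSING  $host"
        fi
    done
}

# Usage (run it on every node, not just the master):
#   check_hosts ~/mpi_hosts
```

Any name reported as MISSING needs an entry in /etc/hosts (or in DNS) on that node before ssh, and therefore mpirun, can reach it.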
$ cd ~/.ssh
The next step, after installing SSH on all nodes, is to copy the SSH key from here to all the slave computers.
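A minimal sketch of that copy step, assuming a key pair already exists (ssh-keygen creates one) and that the public key is ~/.ssh/id_rsa.pub; the actual file name depends on the key type, so adjust it. The function name copy_key_to_hosts and the DRY_RUN switch are made up for this example. By default it only prints the ssh-copy-id commands so they can be checked first.

```shell
#!/bin/sh
# Copy a public key to every host listed in an Open MPI hostfile.
#   $1 = hostfile (host name in the first column, '#' starts a comment)
#   $2 = public key file (assumed default: ~/.ssh/id_rsa.pub)
# With DRY_RUN=1 (the default) the commands are printed, not executed.
copy_key_to_hosts() {
    hostfile=$1
    pubkey=${2:-$HOME/.ssh/id_rsa.pub}
    for host in $(awk 'NF && $1 !~ /^#/ { print $1 }' "$hostfile"); do
        if [ "${DRY_RUN:-1}" = 1 ]; then
            echo "ssh-copy-id -i $pubkey $host"
        else
            # Prompts for that host's password once, then appends the key
            # to ~/.ssh/authorized_keys on the remote side.
            ssh-copy-id -i "$pubkey" "$host"
        fi
    done
}

# Usage:
#   copy_key_to_hosts ~/mpi_hosts              # show what would run
#   DRY_RUN=0 copy_key_to_hosts ~/mpi_hosts    # actually copy the key
```

Once the key is on every node, ssh to each host should log in without asking for a password, which is what mpirun needs to start processes remotely.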