This is the command pdsh that can be run in the OnWorks free hosting provider using one of our multiple free online workstations such as Ubuntu Online, Fedora Online, Windows online emulator or MAC OS online emulator
PROGRAM:
NAME
pdsh - issue commands to groups of hosts in parallel
SYNOPSIS
pdsh [options]... command
DESCRIPTION
pdsh is a variant of the rsh(1) command. Unlike rsh(1), which runs commands on a single
remote host, pdsh can run multiple remote commands in parallel. pdsh uses a "sliding
window" (or fanout) of threads to conserve resources on the initiating host while allowing
some connections to time out.
When pdsh receives SIGINT (ctrl-C), it lists the status of current threads. A second
SIGINT within one second terminates the program. Pending threads may be canceled by
issuing ctrl-Z within one second of ctrl-C. Pending threads are those that have not yet
been initiated, or are still in the process of connecting to the remote host.
If a remote command is not specified on the command line, pdsh runs interactively,
prompting for commands and executing them when terminated with a carriage return. In
interactive mode, target nodes that time out on the first command are not contacted for
subsequent commands, and commands prefixed with an exclamation point will be executed on
the local system.
The core functionality of pdsh may be supplemented by dynamically loadable modules. The
modules may provide a new connection protocol (replacing the standard rcmd(3) protocol
used by rsh(1)), filtering options (e.g. removing hosts that are "down" from the target
list), and/or host selection options (e.g., -a selects all hosts from a configuration
file.). By default, pdsh must have at least one "rcmd" module loaded. See the RCMD MODULES
section for more information.
RCMD MODULES
The method by which pdsh runs commands on remote hosts may be selected at runtime using
the -R option (See OPTIONS below). This functionality is ultimately implemented via
dynamically loadable modules, and so the list of available options may be different from
installation to installation. A list of currently available rcmd modules is printed when
using any of the -h, -V, or -L options. The default rcmd module will also be displayed
with the -h and -V options.
A list of rcmd modules currently distributed with pdsh follows.
rsh Uses an internal, thread-safe implementation of BSD rcmd(3) to run commands using
the standard rsh(1) protocol.
exec Executes an arbitrary command for each target host. The first of the pdsh remote
arguments is the local command to execute, followed by any further arguments. Some
simple parameters are substitued on the command line, including %h for the target
hostname, %u for the remote username, and %n for the remote rank [0-n] (To get a
literal % use %%). For example, the following would duplicate using the ssh
module to run hostname(1) across the hosts foo[0-10]:
pdsh -R exec -w foo[0-10] ssh -x -l %u %h hostname
and this command line would run grep(1) in parallel across the files
console.foo[0-10]:
pdsh -R exec -w foo[0-10] grep BUG console.%h
ssh Uses a variant of popen(3) to run multiple copies of the ssh(1) command.
mrsh This module uses the mrsh(1) protocol to execute jobs on remote hosts. The mrsh
protocol uses a credential based authentication, forgoing the need to allocate
reserved ports. In other aspects, it acts just like rsh. Remote nodes must be
running mrshd(8) in order for the mrsh module to work.
qsh Allows pdsh to execute MPI jobs over QsNet. Qshell propagates the current working
directory, pdsh environment, and Elan capabilities to the remote process. The
following environment variable are also appended to the environment: RMS_RANK,
RMS_NODEID, RMS_PROCID, RMS_NNODES, and RMS_NPROCS. Since pdsh needs to run setuid
root for qshell support, qshell does not directly support propagation of
LD_LIBRARY_PATH and LD_PREOPEN. Instead the QSHELL_REMOTE_LD_LIBRARY_PATH and
QSHELL_REMOTE_LD_PREOPEN environment variables will may be used and will be
remapped to LD_LIBRARY_PATH and LD_PREOPEN by the qshell daemon if set.
mqsh Similar to qshell, but uses the mrsh protocol instead of the rsh protocol.
krb4 The krb4 module allows users to execute remote commands after authenticating with
kerberos. Of course, the remote rshd daemons must be kerberized.
xcpu The xcpu module uses the xcpu service to execute remote commands.
OPTIONS
The list of available options is determined at runtime by supplementing the list of
standard pdsh options with any options provided by loaded rcmd and misc modules. In some
cases, options provided by modules may conflict with each other. In these cases, the
modules are incompatible and the first module loaded wins.
Standard target nodelist options
-w TARGETS,...
Target and or filter the specified list of hosts. Do not use with any other node
selection options (e.g. -a, -g, if they are available). No spaces are allowed in
the comma-separated list. Arguments in the TARGETS list may include normal host
names, a range of hosts in hostlist format (See HOSTLIST EXPRESSIONS), or a single
`-' character to read the list of hosts on stdin.
If a host or hostlist is preceded by a `-' character, this causes those hosts to be
explicitly excluded. If the argument is preceded by a single `^' character, it is
taken to be the path to file containing a list of hosts, one per line. If the item
begins with a `/' character, it is taken as a regular expression on which to
filter the list of hosts (a regex argument may also be optionally trailed by
another '/', e.g. /node.*/). A regex or file name argument may also be preceeded
by a minus `-' to exclude instead of include thoses hosts.
A list of hosts may also be preceded by "user@" to specify a remote username other
than the default, or "rcmd_type:" to specify an alternate rcmd connection type for
these hosts. When used together, the rcmd type must be specified first, e.g.
"ssh:user1@host0" would use ssh to connect to host0 as user "user1."
-x host,host,...
Exclude the specified hosts. May be specified in conjunction with other target node
list options such as -a and -g (when available). Hostlists may also be specified to
the -x option (see the HOSTLIST EXPRESSIONS section below). Arguments to -x may
also be preceeded by the filename (`^') and regex ('/') characters as described
above, in which case the resulting hosts are excluded as if they had been given to
-w and preceeded with the minus `-' character.
Standard pdsh options
-S Return the largest of the remote command return values.
-h Output usage menu and quit. A list of available rcmd modules will also be printed
at the end of the usage message.
-s Only on AIX, separate remote command stderr and stdout into two sockets.
-q List option values and the target nodelist and exit without action.
-b Disable ctrl-C status feature so that a single ctrl-C kills parallel job. (Batch
Mode)
-l user
This option may be used to run remote commands as another user, subject to
authorization. For BSD rcmd, this means the invoking user and system must be listed
in the userĀ“s .rhosts file (even for root).
-t seconds
Set the connect timeout. Default is 10 seconds.
-u seconds
Set a limit on the amount of time a remote command is allowed to execute. Default
is no limit. See note in LIMITATIONS if using -u with ssh.
-f number
Set the maximum number of simultaneous remote commands to number. The default is
32.
-R name
Set rcmd module to name. This option may also be set via the PDSH_RCMD_TYPE
environment variable. A list of available rcmd modules may be obtained via the -h,
-V, or -L options. The default will be listed with -h or -V.
-M name,...
When multiple misc modules provide the same options to pdsh, the first module
initialized "wins" and subsequent modules are not loaded. The -M option allows a
list of modules to be specified that will be force-initialized before all others,
in-effect ensuring that they load without conflict (unless they conflict with
eachother). This option may also be set via the PDSH_MISC_MODULES environment
variable.
-L List info on all loaded pdsh modules and quit.
-N Disable hostname: prefix on lines of output.
-d Include more complete thread status when SIGINT is received, and display connect
and command time statistics on stderr when done.
-V Output pdsh version information, along with list of currently loaded modules, and
exit.
qsh/mqsh module options
-n tasks_per_node
Set the number of tasks spawned per node. Default is 1.
-m block | cyclic
Set block versus cyclic allocation of processes to nodes. Default is block.
-r railmask
Set the rail bitmask for a job on a multirail system. The default railmask is 1,
which corresponds to rail 0 only. Each bit set in the argument to -r corresponds to
a rail on the system, so a value of 2 would correspond to rail 1 only, and 3 would
indicate to use both rail 1 and rail 0.
machines module options
-a Target all nodes from machines file.
genders module options
In addition to the genders options presented below, the genders attribute pdsh_rcmd_type
may also be used in the genders database to specify an alternate rcmd connect type than
the pdsh default for hosts with this attribute. For example, the following line in the
genders file
host0 pdsh_rcmd_type=ssh
would cause pdsh to use ssh to connect to host0, even if rsh were the default. This can
be overridden on the commandline with the "rcmd_type:host0" syntax.
-A Target all nodes in genders database. The -A option will target every host listed
in genders -- if you want to omit some hosts by default, see the -a option below.
-a Target all nodes in genders database except those with the "pdsh_all_skip"
attribute. This is shorthand for running "pdsh -A -X pdsh_all_skip ..."
-g attr[=val][,attr[=val],...]
Target nodes that match any of the specified genders attributes (with optional
values). Conflicts with the -a option. If used in combination with other node
selection options like -w, the -g option will select from the supplied node list,
instead of from the genders file as a whole. Otherwise, This option targets the
alternate hostnames in the genders database by default. The -i option provided by
the genders module may be used to translate these to the canonical genders
hostnames. If the installed version of genders supports it, attributes supplied to
-g may also take the form of genders queries. Genders queries will query the
genders database for the union, intersection, difference, or complement of genders
attributes and values. The set operation union is represented by two pipe symbols
('||'), intersection by two ampersand symbols ('&&'), difference by two minus
symbols ('--'), and complement by a tilde ('~'). Parentheses may be used to change
the order of operations. See the nodeattr(1) manpage for examples of genders
queries.
-X attr[=val][,attr[=val],...]
Exclude nodes that match any of the specified genders attributes (optionally with
values). This option may be used in combination with any other of the node
selection options (e.g. -w, -g, -a, -X may also take the form of genders queries.
Please see documentation for the genders -g option for more information about
genders queries.
-i Request translation between canonical and alternate hostnames.
-F filename
Read genders information from filename instead of the system default genders file.
If filename doesn't specify an absolute path then it is taken to be relative to the
directory specified by the PDSH_GENDERS_DIR environment variable (/etc by default).
An alternate genders file may also be specified via the PDSH_GENDERS_FILE
environment variable.
nodeupdown module options
-v Eliminate target nodes that are considered "down" by libnodeupdown.
slurm module options
The slurm module allows pdsh to target nodes based on currently running SLURM jobs. The
slurm module is typically called after all other node selection options have been
processed, and if no nodes have been selected, the module will attempt to read a running
jobid from the SLURM_JOBID environment variable (which is set when running under a SLURM
allocation). If SLURM_JOBID references an invalid job, it will be silently ignored.
-j jobid[,jobid,...]
Target list of nodes allocated to the SLURM job jobid. This option may be used
multiple times to target multiple SLURM jobs. The special argument "all" can be
used to target all nodes running SLURM jobs, e.g. -j all.
-P partition[,partition,...]
Target list of nodes containing in the SLURM partition partition. This option may
be used multiple times to target multiple SLURM partitions and/or partitions may be
given in a comma-delimited list.
torque module options
The torque module allows pdsh to target nodes based on currently running Torque/PBS jobs.
Similar to the slurm module, the torque module is typically called after all other node
selection options have been processed, and if no nodes have been selected, the module will
attempt to read a running jobid from the PBS_JOBID environment variable (which is set when
running under a Torque allocation).
-j jobid[,jobid,...]
Target list of nodes allocated to the Torque job jobid. This option may be used
multiple times to target multiple Torque jobs.
rms module options
The rms module allows pdsh to target nodes based on an RMS resource. The rms module is
typically called after all other node selection options, and if no nodes have been
selected, the module will examine the RMS_RESOURCEID environment variable and attempt to
set the target list of hosts to the nodes in the RMS resource. If an invalid resource is
denoted, the variable is silently ignored.
SDR module options
The SDR module supports targeting hosts via the System Data Repository on IBM SPs.
-a Target all nodes in the SDR. The list is generated from the "reliable hostname" in
the SDR by default.
-i Translate hostnames between reliable and initial in the SDR, when applicable. If
the a target hostname matches either the initial or reliable hostname in the SDR,
the alternate name will be substitued. Thus a list composed of initial hostnames
will instead be replaced with a list of reliable hostnames. For example, when used
with -a above, all initial hostnames in the SDR are targeted.
-v Do not target nodes that are marked as not responding in the SDR on the targeted
interface. (If a hostname does not appear in the SDR, then that name will remain in
the target hostlist.)
-G In combination with -a, include all partitions.
nodeattr module options
The nodeattr module supports access to the genders database via the nodeattr(1) command.
See the genders section above for a list of support options with this module. The option
usage with the nodeattr module is the same as genders, above, with the exception that the
-i option may only be used with -a or -g. NOTE: This module will only work with very old
releases of genders where the nodeattr(1) command supports the -r option, and before the
libgenders API was available. Users running newer versions of genders will need to use the
genders module instead.
dshgroup module options
The dshgroup module allows pdsh to use dsh (or Dancer's shell) style group files from
/etc/dsh/group/ or ~/.dsh/group/. The default search path may be overridden with the
DSHGROUP_PATH environment variable, a colon-separated list of directories to search. The
default value for DSHGROUP_PATH is /etc/dsh/group.
-g groupname,...
Target nodes in dsh group file "groupname" found in either ~/.dsh/group/groupname
or /etc/dsh/group/groupname.
-X groupname,...
Exclude nodes in dsh group file "groupname."
As an enhancement in pdsh, dshgroup files may optionally include other dshgroup files via
a special #include STRING syntax. The argument to #include may be either a file path, or
a group name, in which case the path used to search for the group file is the same as if
the group had been specified to -g.
netgroup module options
The netgroup module allows pdsh to use standard netgroup entries to build lists of target
hosts. (/etc/netgroup or NIS)
-g groupname,...
Target nodes in netgroup "groupname."
-X groupname,...
Exclude nodes in netgroup "groupname."
ENVIRONMENT VARIABLES
PDSH_RCMD_TYPE
Equivalent to the -R option, the value of this environment variable will be used to
set the default rcmd module for pdsh to use (e.g. ssh, rsh).
PDSH_SSH_ARGS
Override the standard arguments that pdsh passes to the ssh(1) command ("-2 -a -x
-l%u %h"). The use of the parameters %u, %h, and %n (as documented in the rcmd/exec
section above) is optional. If these parameters are missing, pdsh will append them
to the ssh commandline because it is assumed they are mandatory.
PDSH_SSH_ARGS_APPEND
Append additional options to the ssh(1) command invoked by pdsh. For example,
PDSH_SSH_ARGS_APPEND="-q" would run ssh in quiet mode, or "-v" would increase the
verbosity of ssh. (Note: these arguments are actually prepended to the ssh
commandline to ensure they appear before any target hostname argument to ssh.)
WCOLL If no other node selection option is used, the WCOLL environment variable may be
set to a filename from which a list of target hosts will be read. The file should
contain a list of hosts, one per line (though each line may contain a hostlist
expression. See HOSTLIST EXPRESSIONS section below).
DSHPATH
If set, the path in DSHPATH will be used as the PATH for the remote processes.
FANOUT Set the pdsh fanout (See description of -f above).
HOSTLIST EXPRESSIONS
As noted in sections above pdsh accepts lists of hosts the general form: prefix[n-m,l-
k,...], where n < m and l < k, etc., as an alternative to explicit lists of hosts. This
form should not be confused with regular expression character classes (also denoted by
``[]''). For example, foo[19] does not represent an expression matching foo1 or foo9, but
rather represents the degenerate hostlist: foo19.
The hostlist syntax is meant only as a convenience on clusters with a "prefixNNN" naming
convention and specification of ranges should not be considered necessary -- this
foo1,foo9 could be specified as such, or by the hostlist foo[1,9].
Some examples of usage follow:
Run command on foo01,foo02,...,foo05
pdsh -w foo[01-05] command
Run command on foo7,foo9,foo10
pdsh -w foo[7,9-10] command
Run command on foo0,foo4,foo5
pdsh -w foo[0-5] -x foo[1-3] command
A suffix on the hostname is also supported:
Run command on foo0-eth0,foo1-eth0,foo2-eth0,foo3-eth0
pdsh -w foo[0-3]-eth0 command
As a reminder to the reader, some shells will interpret brackets ('[' and ']') for pattern
matching. Depending on your shell, it may be necessary to enclose ranged lists within
quotes. For example, in tcsh, the first example above should be executed as:
pdsh -w "foo[01-05]" command
ORIGIN
Originally a rewrite of IBM dsh(1) by Jim Garlick <garlick@llnl.gov> on LLNL's ASCI Blue-
Pacific IBM SP system. It is now used on Linux clusters at LLNL.
LIMITATIONS
When using ssh for remote execution, expect the stderr of ssh to be folded in with that of
the remote command. When invoked by pdsh, it is not possible for ssh to prompt for
passwords if RSA/DSA keys are configured properly, etc.. For ssh implementations that
suppport a connect timeout option, pdsh attempts to use that option to enforce the timeout
(e.g. -oConnectTimeout=T for OpenSSH), otherwise connect timeouts are not supported when
using ssh. Finally, there is no reliable way for pdsh to ensure that remote commands are
actually terminated when using a command timeout. Thus if -u is used with ssh commands may
be left running on remote hosts even after timeout has killed local ssh processes.
Output from multiple processes per node may be interspersed when using qshell or mqshell
rcmd modules.
The number of nodes that pdsh can simultaneously execute remote jobs on is limited by the
maximum number of threads that can be created concurrently, as well as the availability of
reserved ports in the rsh and qshell rcmd modules. On systems that implement Posix
threads, the limit is typically defined by the constant PTHREADS_THREADS_MAX.
Use pdsh online using onworks.net services