GNU Parallel - Demo, design decisions and debugging
By Ole Tange
Date: Sunday, 19 June 2011 14:00
Duration: 40 minutes
Target audience: Any
Language: English
Tags: bash parallel usage
GNU Parallel is written in perl and an example of a generally useful tool for UNIX systems. In this talk we will look at some of the problems and design decisions when doing a UNIX tool. A few examples of GNU Parallel will be shown, too.
GNU parallel is a shell tool for executing jobs in parallel locally or using remote machines. A job is typically a single command or a small script that has to be run for each of the lines in the input. The typical input is a list of files, a list of hosts, a list of users, a list of URLs, or a list of tables.
If you use xargs today you will find GNU parallel very easy to use as GNU parallel is written to have the same options as xargs. If you write loops in shell, you will find GNU parallel may be able to replace most of the loops and make them run faster by running several jobs in parallel. If you use ppss or pexec you will find GNU parallel will often make the command easier to read.
GNU parallel makes sure output from the commands is the same output as you would get had you run the commands sequentially. This makes it possible to use output from GNU parallel as input for other programs.
For each line of input GNU parallel will execute command with the line as arguments. If no command is given, the line of input is executed. Several lines will be run in parallel. GNU parallel can often be used as a substitute for xargs or cat | sh.
Attended by: Ole Tange, Henrik Tougaard (htoug), Felix Antonius Wilhelm Ostmann (Sadrak), Lars Thegler (tagg), Kaare Rasmussen, Anton Berezin (Grrrr), Søren Lund (slu),