burnthewhich

What’s wrong with whichcraft?

The which command is a relic of a bygone Unix era that should have died a long time ago.

When I’ve suggested to “experienced programmers” that they should stop using which, and use something more reliable, they look at me like I’m the crazy one, and point out that “it’s more portable than all the alternatives” (or just insult my parentage).

Typically what they really mean is “it’s portable to some ancient pre-POSIX version of Solaris”, or “it’s portable to BusyBox”. OK, so if you care so much about portability to those crippled non-standard platforms that you’re willing to forego portability to more standard platforms, you’re welcome to go play in the cesspit on your own; have fun.

If you only use which from the interactive command line, then you’re doing nothing wrong, and you can happily ignore the rest of this report. But if you train, coach, or mentor other programmers, or review their work, or if you’re just curious, then please do read on.

What is which?

which is supposed to tell you which program would be run if you simply typed its name into the shell. What it actually does is tell you where it might find the command in $PATH, which means it’s OK for interactive use as long as you understand that limitation.

The thing it’s not designed to do, but which is by far its most common use-case, is to tell a script what program would be invoked by the script if it invoked the program by name later on. POSIX does not require the output of which to be usable for this or any particular purpose; merely that it “indicates” the path of the program corresponding to each argument.

What’s a good replacement?

For interactive use, it’s fine to keep using which if that’s what you’re used to, but you might like to consider command -v, or type, or type -a, or whence. type and comand -v should work in all POSIX shells; check your shell’s manual for other options.

How else can I get the path to a command?

Bzzzt! Wrong question!

What’s the right question?

[hint: keep reading]

How do I stop using it?

There are broadly two reasons why people use which in their scripts.

  1. The “good” reason is simply to check, before starting, that all the required commands will be available.
  2. The “bad” reason is to grab the path to each program, and use that – instead of the bare command name – to invoke it.

If you’re checking that a command is available, then you don’t need to know the path, you just need to know that it exists.

In that case the fix is to use command -v and discard its output, because POSIX requires it to have a useful exit status. (And it also requires its output to be exactly the path to the program, no more, no less.)

command -v foo 2>/dev/null ||
 die EX_OSFILE 'You need to install the "foo" command, or adjust your $PATH if "foo" is already installed'

If you’re recording the path so that you can use that to invoke it later, then simply don’t.

A badly outdated but still common idiom is to write CAT=`which cat` and then use $CAT rather than cat. Please don’t.

Using cat will work just as well, be just as fast, and be more reliable than $CAT.

But what’s the problem?

The first thing to know about which is that it’s a separate binary program, not part of whichever shell you’re using, either interactively, or as the interpreter for your script. Therefore it does not a priori know the internal state of your shell, and that means it’s always going to be an approximation.

A basic version of which will simply assume you want a program that’s found in $PATH. Most of the time that’ll be true, but this approach will forceably bypass anything that’s not an external binary:

It will also fail to take into account things that might change between when you invoke which and when you actually want to invoke the command:

There are also enhanced versions of which that have even more problems.