Things Unix can do atomically (2010)

This is a catalog of things UNIX-like/POSIX-compliant operating systems can do atomically, making them useful as building blocks for thread-safe and multi-process-safe programs without mutexes or read/write locks. The list is by no means exhaustive and I expect it to be updated frequently for the foreseeable future.

The philosophy here is to let the kernel do as much work as possible. At my most pessimistic, I trust the kernel developers more than a trust myself. More practically, it’s stupid to spend CPU time locking around an operation that’s already atomic. Added 2010-01-07.

Operating on a pathname

The operations below are best left to local filesystems. More than a few people have written in crying foul if any of these techniques are used on an NFS mount. True. When there are multiple kernels involved, the kernel can’t very well take care of all the locking for us. Added 2010-01-06.

mv -T atomically changes the target of to the directory pointed to by and is indispensable when deploying new code. Updated 2010-01-06: both operands are symlinks. (So this isn’t a system call, it’s still useful.) A reader pointed out that ln -Tfs accomplishes the same thing without the second symlink. Added 2010-01-06. Deleted 2010-01-06: strace(1) shows that ln -Tfs actually calls symlink(2) , unlink(2) , and symlink(2) once more, disqualifying it from this page. mv -T ends up calling rename(2) which can atomically replace . Caveat 2013-01-07: this does not apply to Mac OS X, whose mv(1) doesn’t call rename(2) . mv(1) .

atomically changes the target of to the directory pointed to by and is indispensable when deploying new code. Updated 2010-01-06: both operands are symlinks. (So this isn’t a system call, it’s still useful.) Deleted 2010-01-06: shows that actually calls , , and once more, disqualifying it from this page. ends up calling which can atomically replace . Caveat 2013-01-07: this does not apply to Mac OS X, whose doesn’t call . . link(oldpath, newpath) creates a new hard link called newpath pointing to the same inode as oldpath and increases the link count by one. This will fail with the error code EEXIST if newpath already exists, making this a useful mechanism for locking a file amongst threads or processes that can all agree upon the name newpath . I prefer this technique for whole-file locking because the lock is visible to ls(1) . link(2) .

creates a new hard link called pointing to the same inode as and increases the link count by one. This will fail with the error code if already exists, making this a useful mechanism for locking a file amongst threads or processes that can all agree upon the name . I prefer this technique for whole-file locking because the lock is visible to . . symlink(oldpath, newpath) operates very much like link(2) but creates a symbolic link at a new inode rather than a hard link to the same inode. Symbolic links can point to directories, which hard links cannot, making them a perfect analogy to link(2) when locking entire directories. This will fail with the error code EEXIST if newpath already exists, making this a perfect analogy to link(2) that works for directories, too. Be careful of symbolic links whose target inode has been removed ("dangling" symbolic links) — open(2) will fail with the error code ENOENT . It should be mentioned that inodes are a finite resource (this particular machine has 1,245,184 inodes). symlink(2) . Added 2010-01-07

operates very much like but creates a symbolic link at a new inode rather than a hard link to the same inode. Symbolic links can point to directories, which hard links cannot, making them a perfect analogy to when locking entire directories. This will fail with the error code if already exists, making this a perfect analogy to that works for directories, too. Be careful of symbolic links whose target inode has been removed ("dangling" symbolic links) — will fail with the error code . It should be mentioned that inodes are a finite resource (this particular machine has 1,245,184 inodes). . Added 2010-01-07 rename(oldpath, newpath) can change a pathname atomically, provided oldpath and newpath are on the same filesystem. This will fail with the error code ENOENT if oldpath does not exist, enabling interprocess locking much like link(oldpath, newpath) above. I find this technique more natural when the files in question will be unlink ed later. rename(2) .

can change a pathname atomically, provided and are on the same filesystem. This will fail with the error code if does not exist, enabling interprocess locking much like above. I find this technique more natural when the files in question will be ed later. . open(pathname, O_CREAT | O_EXCL, 0644) creates and opens a new file. (Don’t forget to set the mode in the third argument!) O_EXCL instructs this to fail with the error code EEXIST if pathname exists. This is a useful way to decide which process should handle a task: whoever successfully creates the file. open(2) .

creates and opens a new file. (Don’t forget to set the mode in the third argument!) instructs this to fail with the error code if exists. This is a useful way to decide which process should handle a task: whoever successfully creates the file. . mkdir(dirname, 0755) creates a new directory but fails with the error code EEXIST if dirname exists. This provides for directories the same mechanism link(2) open(2) with O_EXCL provides for files. mkdir(2) . Added 2010-01-06; edited 2013-01-07.

... continue reading