[PATCH] locking/atomic: Make test_and_*_bit() ordered on failure