On Thu, Jun 11, 2020 at 11:06:31AM +0000, Sargun Dhillon wrote:
On Thu, Jun 11, 2020 at 12:01:14PM +0200, Christian Brauner wrote:
On Wed, Jun 10, 2020 at 07:59:55PM -0700, Kees Cook wrote:
On Wed, Jun 10, 2020 at 08:12:38AM +0000, Sargun Dhillon wrote:
As an aside, all of this junk should be dropped:
- ret = get_user(size, &uaddfd->size);
- if (ret)
return ret;
- ret = copy_struct_from_user(&addfd, sizeof(addfd), uaddfd, size);
- if (ret)
return ret;
and the size member of the seccomp_notif_addfd struct. I brought this up off-list with Tycho that ioctls have the size of the struct embedded in them. We should just use that. The ioctl definition is based on this[2]: #define _IOC(dir,type,nr,size) \ (((dir) << _IOC_DIRSHIFT) | \ ((type) << _IOC_TYPESHIFT) | \ ((nr) << _IOC_NRSHIFT) | \ ((size) << _IOC_SIZESHIFT))
We should just use copy_from_user for now. In the future, we can either introduce new ioctl names for new structs, or extract the size dynamically from the ioctl (and mask it out on the switch statement in seccomp_notify_ioctl.
Yeah, that seems reasonable. Here's the diff for that part:
Why does it matter that the ioctl() has the size of the struct embedded within? Afaik, the kernel itself doesn't do anything with that size. It merely checks that the size is not pathological and it does so at compile time.
#ifdef __CHECKER__ #define _IOC_TYPECHECK(t) (sizeof(t)) #else /* provoke compile error for invalid uses of size argument */ extern unsigned int __invalid_size_argument_for_IOC; #define _IOC_TYPECHECK(t) \ ((sizeof(t) == sizeof(t[1]) && \ sizeof(t) < (1 << _IOC_SIZEBITS)) ? \ sizeof(t) : __invalid_size_argument_for_IOC) #endif
The size itself is not verified at runtime. copy_struct_from_user() still makes sense at least if we're going to allow expanding the struct in the future.
Right, but if we simply change our headers and extend the struct, it will break all existing programs compiled against those headers. In order to avoid that, if we intend on extending this struct by appending to it, we need to have a backwards compatibility mechanism. Just having copy_struct_from_user isn't enough. The data structure either must be fixed size, or we need a way to handle multiple ioctl numbers derived from headers with different sized struct arguments
The two approaches I see are:
- use more indirection. This has previous art in drm[1]. That's look
something like this:
struct seccomp_notif_addfd_ptr { __u64 size; __u64 addr; }
... And then it'd be up to us to dereference the addr and copy struct from user.
Which isn't great but could do.
- Expose one ioctl to the user, many internally
e.g., public api:
struct seccomp_notif { __u64 id; __u64 pid; struct seccomp_data; __u64 fancy_new_field; }
#define SECCOMP_IOCTL_NOTIF_RECV SECCOMP_IOWR(0, struct seccomp_notif)
internally: struct seccomp_notif_v1 { __u64 id; __u64 pid; struct seccomp_data; }
struct seccomp_notif_v2 { __u64 id; __u64 pid; struct seccomp_data; __u64 fancy_new_field; }
and we can switch like this: switch (cmd) { /* for example. We actually have to do this for any struct we intend to * extend to get proper backwards compatibility */ case SECCOMP_IOWR(0, struct seccomp_notif_v1) return seccomp_notify_recv(filter, buf, sizeof(struct seccomp_notif_v1)); case SECCOMP_IOWR(0, struct seccomp_notif_v2) return seccomp_notify_recv(filter, buf, sizeof(struct seccomp_notif_v3)); ... case SECCOMP_IOCTL_NOTIF_SEND: return seccomp_notify_send(filter, buf); case SECCOMP_IOCTL_NOTIF_ID_VALID: return seccomp_notify_id_valid(filter, buf); default: return -EINVAL; }
This has the downside that programs compiled against more modern kernel headers will break on older kernels.
- We can take the approach you suggested.
#define UNSIZED(cmd) (cmd & ~(_IOC_SIZEMASK << _IOC_SIZESHIFT) static long seccomp_notify_ioctl(struct file *file, unsigned int cmd, unsigned long arg) { struct seccomp_filter *filter = file->private_data; void __user *buf = (void __user *)arg; int size = _IOC_SIZE(cmd); cmd = UNSIZED(cmd);
switch (cmd) { /* for example. We actually have to do this for any struct we intend to * extend to get proper backwards compatibility */ case UNSIZED(SECCOMP_IOCTL_NOTIF_RECV): return seccomp_notify_recv(filter, buf, size); ... case SECCOMP_IOCTL_NOTIF_SEND: return seccomp_notify_send(filter, buf); case SECCOMP_IOCTL_NOTIF_ID_VALID: return seccomp_notify_id_valid(filter, buf); default: return -EINVAL; } }
Leaving that aside, the proposed direction here seems to mean that any change to the struct itself will immediately mean a new ioctl() but afaict, that also means a new struct. Since when you simply extend the struct for the sake of the new ioctl you also change the size for the ioctl.
Sure, you can simply treat the struct coming through the old ioctl as being "capped" by e.g. hiding the size as suggested but then the gain by having two separate ioctls is 0 compared to simply versioning the struct with an explicit size member since the size encoded in the ioctl and the actual size of the struct don't line up anymore which is the only plus I can see for relying on _IOC_SIZE(). All this manages to do then is to make it more annoying for userspace since they now need to maintain multiple ioctls(). And if you have - however unlikely - say three different ioctls all to be used with a different struct size of the same struct I now need to know which ioctl() goes with which size of the struct (I guess you could append the size to the ioctl name? *shudder*). If you have the size in the struct itself you don't need to care about any of that. Maybe I'm not making sense or I misunderstand what's going on though.
Christian
I don't understand why userspace has to have any knowledge of this. As soon as we add the code above, and we use copy_struct_from_user based on _that_ size,
Hm, which code exactly?
In the previous mail the only thing proposed was to switch to a simple copy_from_user() which effectively bars us from extending the seccomp_addfd struct which this is about, right? At that point, the only option then becomes to either introduce a new ioctl() and a new struct or to go for the hack in e.g. 3. (Afaiu, 2. is not working anymore because we break userspace as soon as we append "fancy_new_field" to the struct because it changes the ioctl() unless I'm missing something.)
Let me maybe rephrase: I'd prefer we merge something for addfd that is extensible with minimal burden on userspace. But if we are fine with saying "we don't care, let's just use copy_from_user() for addfd and if we extend we add a new struct + ioctl" then ok, sure. But I would prefer to keep dealing with new structs + ioctls (Look at the end of btrfs.h [1] unlikely to be a problem for us, but still.) as little as possible because that will be more churn in userspace code than I'd prefer.
So either [1] or - since none of the generic extensibility seems to be particularly nice - we bite the bullet and just add a:
__u64 reserved[4]
field and hope that this will carry us for a long time (probably will for quite a long time) and defer the new ioctl() problem.
[1]: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/incl...
userspace will get free upgrades. If they are compiling against an older header than the kernel, size will return a smaller number, and thus we will zero out our trailing bits, and if their number is bigger, we just check their bits are appropriately zeroed.
This approach would be forwards-and-backwards compatible.
There's a little bit of prior art here as well [2]. The approach is that we effectively do the thing we had earlier with passing a size with copy_struct_from_user, but instead of the size being embedded in the struct, it's embedded in the ioctl command itself.
That looks super sketchy. :D
Christian