On Wed, Apr 15, 2020 at 11:46:53AM +0300, Heikki Krogerus wrote:
Hi Greg,
On Wed, Apr 15, 2020 at 08:11:54AM +0200, Greg KH wrote:
diff --git a/lib/kobject.c b/lib/kobject.c index 83198cb37d8d..5921e2470b46 100644 --- a/lib/kobject.c +++ b/lib/kobject.c @@ -663,6 +663,7 @@ EXPORT_SYMBOL(kobject_get_unless_zero); */ static void kobject_cleanup(struct kobject *kobj) {
- struct kobject *parent = kobj->parent; struct kobj_type *t = get_ktype(kobj); const char *name = kobj->name;
@@ -680,6 +681,9 @@ static void kobject_cleanup(struct kobject *kobj) kobject_uevent(kobj, KOBJ_REMOVE); }
- /* make sure the parent is not released before the (last) child */
- kobject_get(parent);
- /* remove from sysfs if the caller did not do it */ if (kobj->state_in_sysfs) { pr_debug("kobject: '%s' (%p): auto cleanup kobject_del\n",
@@ -693,6 +697,8 @@ static void kobject_cleanup(struct kobject *kobj) t->release(kobj); }
- kobject_put(parent);
No, please don't do this.
A child device should have always incremented the parent already if it was correctly registered. We have had this patch been proposed multiple times over the years, and every time it was, we said no and went and fixed the real issue which was with the user of the interface.
The parent ref count is incremented by the child, that is not the problem. The problem is that when that child is released, if it's the last child of the parent, and there are no other users for the parent, then the parent is actually released _before_ the child. And that happens in the above function kobject_cleanup().
We can work around the problem by taking a reference to the parent separately, but we have to do that everywhere separately (which I guess is exactly what has been done so far). That workaroud still does not really fix the core problem. The core problem is still that lib/kboject.c is allowing the parent kobject to be released before the child kobject, and that quite simply should not be allowed to happen.
I don't have a problem if you want to have a better solution for this, but the solution really can't anymore be that we are always expected to separately increment the parent's ref count with every type of kobject.
Why is this suddenly showing up as a issue and it hasn't ever before? Because of that I would argue that the problem is not in the kobject core, but the use of it.
When a child object is created, it should have already incremented the parent reference count before it is registered, to ensure that the parent does not go away before the child is finished being registered. Then, when the child is removed, it decrements the reference count of the parent as it "knows" it is done with that pointer.
So perhaps this new swnode code is just wrong? Dealing with "raw" kobjects is hard, and not generally recommended due to issues like this. When you use the driver core, all of this logic is already handled for you...
And I just looked at the code swnode_register() needs to increment the parent kobject's reference, as it is saving a pointer to it away to be used later. That's just basic reference-counted-pointer logic here, please fix the issue there.
thanks,
greg k-h