On Tue, Apr 17, 2018 at 01:07:17PM +0200, Michal Hocko wrote:
On Tue 17-04-18 12:39:36, Greg KH wrote:
On Mon, Apr 16, 2018 at 11:28:44PM +0200, Jiri Kosina wrote:
On Mon, 16 Apr 2018, Sasha Levin wrote:
I agree that as an enterprise distro taking everything from -stable isn't the best idea. Ideally you'd want to be close to the first extreme you've mentioned and only take commits if customers are asking you to do so.
I think that the rule we're trying to agree upon is the "It must fix a real bug that bothers people".
I think that we can agree that it's impossible to expect every single Linux user to go on LKML and complain about a bug he encountered, so the rule quickly becomes "It must fix a real bug that can bother people".
So is there a reason why stable couldn't become some hybrid-form union of
- really critical issues (data corruption, boot issues, severe security issues) taken from bleeding edge upstream
- [reviewed] cherry-picks of functional fixes from major distro kernels (based on that very -stable release), as that's apparently what people are hitting in the real world with that particular kernel
It already is that :)
The problem Sasha is trying to solve here is that for many subsystems, maintainers do not mark patches for stable at all.
The way he is trying to do that is just wrong. Generate a pressure on those subsystems by referring to bug reports and unhappy users and I am pretty sure they will try harder... You cannot solve the problem by bypassing them without having deep understanding of the specific subsytem. Once you have it, just make sure you are part of the review process and make sure to mark patches before they are merged.
I think we just don't agree on how we should "pressure".
Look at the discussion I had with the XFS folks who just don't want to deal with this -stable thing because they have to much work upstream.
There wasn't a single patch in -stable coming from XFS for the past 6+ months. I'm aware of more than one way to corrupt an XFS volume for any distro that uses a kernel older than 4.15.
Sure, please buy them a beer at LSF/MM (I'll pay) and ask them to be better about it, but I don't see this changing.
The solution to this, in my opinion, is to automate the whole selection and review process. We do selection using AI, and we run every possible test that's relevant to that subsystem.
At which point, the amount of work a human needs to do to review a patch shrinks into something far more managable for some maintainers.
So real bugfixes that do hit people are not getting to those kernels, which force the distros to do extra work to triage a bug, dig through upstream kernels, find and apply the patch.
I would say that this is the primary role of the distro. To hide the jungle of the upstream work and provide the additional of bug filtering and forwarding them the right direction.
More often than triaging, you'll just be asked to upgrade to the latest version. What sort of user experience does that provide?
[snip]
So nothing "new" is happening here, EXCEPT we are actually starting to get a better kernel-wide coverage for stable fixes, which we have not had in the past. That's a good thing! The number of patches applied to stable is still a very very very tiny % compared to mainline, so nothing new is happening here.
yes I do agree, the stable process is not very much different from the past and I would tend both processes broken because they explicitly try to avoid maintainers which is just wrong.
Avoid maintainers?! We send so much "spam" trying to get maintainers more involved in the process. How is that avoiding them?
If you're a maintainer who has specific requirements for the -stable flow, or you have any automated testing you'd like to be run on these commits, or you want these mails to come in a different format, or pretty much anything else at all just shoot me a mail!
It's been almost impossible to get maintainers involved in this process.
We don't sneak anything past maintainers, there are multiple mails over multiple weeks for each commit that would go in. You don't have to review it right away either, just reply with "please don't merge until I'm done reviewing" and it'll get removed from the queue.
Oh, and if you do want to complain about huge new features being backported, look at the mess that Spectre and Meltdown has caused in the stable trees. I don't see anyone complaining about those massive changes :)
Are you serious? Are you going the compare the biggest PITA that the community had to undergo because of HW issues with random pattern matching in changelog/diffs? Come on!
HW Issues are irrelevant here. You had a bug that allowed arbitrary kernel memory access. I can easily list quite a few commits, that are not tagged for stable, that fix exactly the same thing.