On Mon, Oct 20, 2025 at 05:28:02AM -0700, Christoph Hellwig wrote:
On Fri, Oct 17, 2025 at 08:55:24AM -0300, Jason Gunthorpe wrote:
On Thu, Oct 16, 2025 at 11:32:59PM -0700, Christoph Hellwig wrote:
On Mon, Oct 13, 2025 at 06:26:10PM +0300, Leon Romanovsky wrote:
From: Leon Romanovsky leonro@nvidia.com
Make sure that all VFIO PCI devices have peer-to-peer capabilities enables, so we would be able to export their MMIO memory through DMABUF,
How do you know that they are safe to use with P2P?
All PCI devices are "safe" for P2P by spec. I've never heard of a non-complaint device causing problems in this area.
Real PCIe device, yes. But we have a lot of stuff mascquerading as such with is just emulated or special integrated. I.e. a lot of integrated Intel GPUs claim had issue there.
Sure, but this should be handled by the P2P subsystem and PCI quirks, IMHO. It isn't VFIOs job.. If people complain about broken HW then it is easy to add those things.
I think the majority of stuff is OK, there is a chunk of configurations that will have clean failures - meaning the initiating device gets an error indication and handles it. Then there is a small minority where the platform crashes with a machine check.
IDK where Intel GPU lands on this, but VFIO has always supported P2P and userspace/VMs have always been able to trigger these kinds of bugs. If nobody has complained so far I'm not inclined to do anything right now.
VFIO has always kind of come along with a footnote that if you actually want fully safe VFIO then it is up to the user to validate the SOC and device implementations are sane.
Jason