Re: [PATCH v4 02/15] genpt: Add Documentation/ files

27 Aug 2025

Hi,
On 8/26/25 10:18 AM, Jason Gunthorpe wrote:
...
Add some general description and pull in the kdoc comments from the source
file to index most of the useful functions.
Tested-by: Alejandro Jimenez alejandro.j.jimenez@oracle.com
Signed-off-by: Jason Gunthorpe jgg@nvidia.com

Documentation/driver-api/generic_pt.rst | 140 ++++++++++++++++++++++++
 Documentation/driver-api/index.rst      |   1 +
 2 files changed, 141 insertions(+)
 create mode 100644 Documentation/driver-api/generic_pt.rst

diff --git a/Documentation/driver-api/generic_pt.rst b/Documentation/driver-api/generic_pt.rst
new file mode 100644
index 00000000000000..45b05dafece814
--- /dev/null
+++ b/Documentation/driver-api/generic_pt.rst
@@ -0,0 +1,140 @@
+.. SPDX-License-Identifier: GPL-2.0



+========================
+Generic Radix Page Table
+========================



+.. kernel-doc:: include/linux/generic_pt/common.h

:doc: Generic Radix Page Table


+.. kernel-doc:: drivers/iommu/generic_pt/pt_defs.h

:doc: Generic Page Table Language


+-----
+Usage
+-----



+Generic PT is structured as a multi-compilation system. Since each format
+provides an API using a common set of names there can be only one format active
+within a compilation unit. This design avoids function pointers around the low
+level API.



+Instead the function pointers can end up at the higher level API (ie map/unmap,
(i.e.,
...
+etc) and the per-format code can be directly inlined into the per-format
etc.)
...
+compilation unit. For something like iommu each format will be compiled into a
+per-format iommu operations kernel module.



I would s/iommu/IOMMU/g when it's used in a sentence as an acronym (i.e., not
functions names, struct names, struct fields, file names, etc.).
...
+For this to work the .c file for each compilation unit will include both the
+format headers and the generic code for the implementation. For instance in an
+implementation compilation unit the headers would normally be included as
+follows::


#include <linux/generic_pt/common.h>
#include "fmt/defs_amdv1.h"
#include "pt_defs.h"
#include "fmt/amdv1.h"
#include "pt_common.h"
#include "pt_iter.h"
#include "iommut_pt.h"  /* The iommu implementation */


+iommu_pt.h includes definitions that will generate the operations functions for
+map/unmap/etc using the definitions provided by AMDv1. The resulting module
etc.
...
+will have exported symbols named like pt_iommu_amdv1_init().



+Refer to drivers/iommu/generic-pt/fmt/iommu_template.h for an example of how the
+iommu implementation uses multi-compilation to generate per-format ops structs
+pointers.



+The format code is written so that the common names arise from #defines to
+distinct format specific names. This is intended to aid debuggability by
+avoiding symbol clashes across all the different formats.



+Exported symbols and other global names are mangled using a per-format string
+via the NS() helper macro.



+The format uses struct pt_common as the top level struct for the table,
top-level
...
+and each format will have its own struct pt_xxx which embeds it to store
+format-specific information.



+The implementation will further wrapper this in its own top level struct, such
this???         top-level
oh, maybe                          wrap this               top-level
...
+as struct pt_iommu_amdv1.



+----------------------------------------------
+Format functions at the struct pt_common level
+----------------------------------------------



+.. kernel-doc:: include/linux/generic_pt/common.h

:identifiers:

+.. kernel-doc:: drivers/iommu/generic_pt/pt_common.h



+-----------------
+Iteration Helpers
+-----------------



+.. kernel-doc:: drivers/iommu/generic_pt/pt_iter.h



+----------------
+Writing a Format
+----------------



+It is best to start from a simple format that is similar to the target. x86_64
+is usually a good reference for something simple, and AMDv1 is something fairly
+complete.



+The required inline functions need to be implemented in the format header.
+These should all follow the standard pattern of::


static inline pt_oaddr_t amdv1pt_entry_oa(const struct pt_state *pts)
{
[..]
}
#define pt_entry_oa amdv1pt_entry_oa


+Where a uniquely named per-format inline function provides the implementation
where
...
+and a define maps it to the generic name. This is intended to make debug symbols
+work better. inline functions should always be used as the prototypes in
+pt_common.h will cause the compiler to validate the function signature to
+prevent errors.



+Review pt_fmt_defaults.h to understand some of the optional inlines.



+Once the format compiles then it should be run through the generic page table
+kunit test in kunit_generic_pt.h using kunit. For example::


$ tools/testing/kunit/kunit.py run --build_dir build_kunit_x86_64 --arch x86_64 --kunitconfig ./drivers/iommu/generic_pt/.kunitconfig amdv1_fmt_test.*
[...]
[11:15:08] Testing complete. Ran 9 tests: passed: 9
[11:15:09] Elapsed time: 3.137s total, 0.001s configuring, 2.368s building, 0.311s running


+The generic tests are intended to prove out the format functions and give
+clearer failures to speed up finding the problems. Once those pass then the
+entire kunit suite should be run.



+---------------------------
+IOMMU Invalidation Features
+---------------------------



+Invalidation is how the page table algorithms synchronize with a HW cache of the
+pagetable memory, typically called the TLB (or IOTLB for IOMMU cases).
page table
to match the rest of this document.
...



+The TLB can store present PTEs, non-present PTEs and table pointers, depending
+on its design. Every HW has its own approach on how to describe what has changed
+to get changed items removed from the TLB.
to have changed items removed
...



+PT_FEAT_FLUSH_RANGE
+-------------------



+PT_FEAT_FLUSH_RANGE is the easiest scheme to understand. It tries to generate a
+single range invalidation for each operation, over invalidating if there are
over-invalidating
...
+gaps of VA that don't need invalidation. This trades off impacted VA for number
+of invalidation operations. It does not keep track of what is being invalidated,
invalidated;
...
+however if pages have to be freed then page table pointers have to be cleaned
however,
...
+from the walk cache. The range can start/end at any page boundary.



+PT_FEAT_FLUSH_RANGE_NO_GAPS
+---------------------------



+PT_FEAT_FLUSH_RANGE_NO_GAPS is similar to PT_FEAT_FLUSH_RANGE however it tries
PT_FEAT_FLUSH_RANGE; however, it tries
...
+to minimize the amount of impacted VA by issuing extra flush operations. This is
+useful if the cost of processing VA is very high, for instance because a
+hypervisor is processing the page table with a shadowing algorithm.
-- 
~Randy



    

2025

2024

2023

2022

2021

2020

2019

2018

2017

Re: [PATCH v4 02/15] genpt: Add Documentation/ files