Introduction

This document describes the policy, process and criteria for updating nVidia proprietary drivers in a stable supported distro, including LTS.

nVidia proprietary GPU drivers are broadly used by gamers or for GPU compute activities (AI/ML). Support for new GPUs as well as bug fixes are regularly released by the vendor (on average 2 driver updates a month). For an improved experience or simply to support new GPUs without installing drivers from unofficial sources (for example PPAs), it is essential to release the latest version of the drivers regularly to the stable releases of Ubuntu.

What sort of updates

There are 3 levels of supported drivers:

The release targeted by the SRU are:

Short lived branches are not SRUed to non-lts releases to align the policy on the HWE stacks however this decision can be revisited depending on the demand.

When a new major version of a driver is available and uploaded, the versioning and naming scheme of the packages is so that users won’t be automatically upgraded to the latest version. They will only have updates for minor releases of a driver.

For example if version 100.1 is in the LTS.

If several versions of the driver support the GPU, ubuntu-driver will expose them all.

Release Schedule

Requesting the SRU

The SRU should be done with a single process bug, instead of individual bug reports for individual bug fixes. The one bug should have the following:

QA Process

Packaging QA

The objective of the separate packaging QA is to test:

The resulting package, with all the changes in place, must undergo and pass the following additional QA procedures:

QA tests

SRU Template

[Impact]
This release provides both bug fixes and new features and we would like to
make sure all of our users have access to these improvements.
The notable ones are:

   * <TODO: Create list with LP: # included >

See the changelog entry below for a full list of changes and bugs.

[Test Case]
The following development and SRU process was followed:
https://wiki.ubuntu.com/NVidiaUpdates

<TODO Document any QA done, automated and manual>

The QA team that executed the tests will be in charge of attaching the artifacts and console output of the appropriate run to the bug. nVidia maintainers team members will not mark ‘verification-done’ until this has happened.

[Regression Potential]
In order to mitigate the regression potential, the results of the
aforementioned system level tests are attached to this bug.

<TODO: attach nvidia-proposed test artifacts for every SRU release, not a link as links expire>


[Discussion]
<TODO: other background>


<TODO: Paste in change log entry from nVidia for this version of the driver>

Additional notes

Driver Upgrades

If an nVidia driver is updated then all nVidia user space components will stop working immediately after the respective package updates as the loaded kernel module and the user space components have a version mismatch. The consequences are not immediately visible to the user as nVidia components in memory are still properly matched and hence still work. The real issue is with new processes as for an instance no OpenGL applications or CUDA workloads can be launched anymore.

The way to fix this is to reboot immediately after an nVidia driver has been updated, including for minor version updates.

This particular issue is discussed in Debian Bug 889669

References

NVidiaUpdates (last edited 2023-01-12 15:01:13 by anthonywong)