CVE-2023-52738

Source
https://cve.org/CVERecord?id=CVE-2023-52738
Import Source
https://storage.googleapis.com/cve-osv-conversion/osv-output/CVE-2023-52738.json
JSON Data
https://api.osv.dev/v1/vulns/CVE-2023-52738
Downstream
Related
Published
2024-05-21T15:23:01.903Z
Modified
2026-04-02T09:43:15.817611Z
Severity
  • 5.3 (Medium) CVSS_V3 - CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:U/C:N/I:N/A:L CVSS Calculator
Summary
drm/amdgpu/fence: Fix oops due to non-matching drm_sched init/fini
Details

In the Linux kernel, the following vulnerability has been resolved:

drm/amdgpu/fence: Fix oops due to non-matching drm_sched init/fini

Currently amdgpu calls drmschedfini() from the fence driver sw fini routine - such function is expected to be called only after the respective init function - drmschedinit() - was executed successfully.

Happens that we faced a driver probe failure in the Steam Deck recently, and the function drmschedfini() was called even without its counter-part had been previously called, causing the following oops:

amdgpu: probe of 0000:04:00.0 failed with error -110 BUG: kernel NULL pointer dereference, address: 0000000000000090 PGD 0 P4D 0 Oops: 0002 [#1] PREEMPT SMP NOPTI CPU: 0 PID: 609 Comm: systemd-udevd Not tainted 6.2.0-rc3-gpiccoli #338 Hardware name: Valve Jupiter/Jupiter, BIOS F7A0113 11/04/2022 RIP: 0010:drmschedfini+0x84/0xa0 [gpusched] [...] Call Trace: <TASK> amdgpufencedriverswfini+0xc8/0xd0 [amdgpu] amdgpudevicefinisw+0x2b/0x3b0 [amdgpu] amdgpudriverreleasekms+0x16/0x30 [amdgpu] devmdrmdevinit_release+0x49/0x70 [...]

To prevent that, check if the drm_sched was properly initialized for a given ring before calling its fini counter-part.

Notice ideally we'd use sched.ready for that; such field is set as the latest thing on drmschedinit(). But amdgpu seems to "override" the meaning of such field - in the above oops for example, it was a GFX ring causing the crash, and the sched.ready field was set to true in the ring init routine, regardless of the state of the DRM scheduler. Hence, we ended-up using sched.ops as per Christian's suggestion [0], and also removed the no_scheduler check [1].

[0] https://lore.kernel.org/amd-gfx/984ee981-2906-0eaf-ccec-9f80975cb136@amd.com/ [1] https://lore.kernel.org/amd-gfx/cd0e2994-f85f-d837-609f-7056d5fb7231@amd.com/

Database specific
{
    "osv_generated_from": "https://github.com/CVEProject/cvelistV5/tree/main/cves/2023/52xxx/CVE-2023-52738.json",
    "cna_assigner": "Linux"
}
References

Affected packages

Git / git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git

Affected ranges

Type
GIT
Repo
https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git
Events
Introduced
067f44c8b4590c3f24d21a037578a478590f2175
Fixed
2e557c8ca2c585bdef591b8503ba83b85f5d0afd
Fixed
2bcbbef9cace772f5b7128b11401c515982de34b
Fixed
5ad7bbf3dba5c4a684338df1f285080f2588b535
Type
GIT
Repo
https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git
Events
Introduced
0 Unknown introduced commit / All previous commits are affected
Last affected
8ba968ae672b3075794c8086aa164595b0175abe

Database specific

source
"https://storage.googleapis.com/cve-osv-conversion/osv-output/CVE-2023-52738.json"