You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Bluefin Bug Report: System instability with recurring GPU errors and NVMe issues
System Information
OS: Bluefin 40 (FROM Fedora Silverblue)
Kernel: Linux 6.11.8-200.fc40.x86_64
Hardware: ASUS TUF GAMING X570-PLUS (WI-FI)
GPU: AMD Radeon Vega Series (Picasso/Raven 2)
Driver: xorg-x11-drv-amdgpu-23.0.0-3
Memory: 32GB (29Gi available)
Current Version: gts-40.20250115 (2025-01-15T01:08:05Z)
Issue Description
System experiences frequent unrecoverable hangs (that lasts minutes until the OS crashes) after running for a while.
IA analysis of the logs suggest that the crashes appear to be related to GPU driver issues and NVMe storage problems.
Critical Errors
1. GPU-related errors
amdgpu 0000:0a:00.0: amdgpu: Secure display: Generic Failure
amdgpu 0000:0a:00.0: amdgpu: SECUREDISPLAY: query securedisplay TA failed. ret 0x0
2. NVMe errors
nvme nvme0: failed to set APST feature (2)
nvme nvme1: failed to set APST feature (2)
3. System service errors
systemd[3217]: Failed to start app-gnome-gnome\x2dkeyring\x2dssh-3529.scope
systemd[3217]: Failed to start app-gnome-xdg\x2duser\x2ddirs-3556.scope
Update - System Freeze due to AMD GPU Driver Malfunction and CPU Lockup
Description
System experienced a complete freeze requiring a hard restart. Investigation revealed a cascade of failures starting with AMD GPU driver issues, leading to display controller errors and ultimately resulting in a CPU soft lockup.
System Information
Hardware:
GPU: AMD Radeon Vega
Driver: amdgpu
Display Configuration: Dual monitor setup (CRTC-0 and CRTC-1)
Timeline of Events
15:41:40 - Initial GPU graphics ring buffer timeout
15:41:45 - Display controller errors on both monitors
15:42:06 - CPU soft lockup occurred
System became unresponsive, requiring hard restart
Describe the bug
Bluefin Bug Report: System instability with recurring GPU errors and NVMe issues
System Information
Issue Description
System experiences frequent unrecoverable hangs (that lasts minutes until the OS crashes) after running for a while.
IA analysis of the logs suggest that the crashes appear to be related to GPU driver issues and NVMe storage problems.
Critical Errors
1. GPU-related errors
2. NVMe errors
3. System service errors
4. Display manager errors
System State
Installed Packages
Layered Packages
Local Packages
Steps to Reproduce
Additional Notes
Attempted Solutions
Logs and additional system information available upon request.
What did you expect to happen?
To not hang.
Output of
bootc status
Output of
groups
Extra information or context
No response
The text was updated successfully, but these errors were encountered: