You are not logged in.
Pages: 1
Hello,
I have a weird behaviour with my centos 7 fleet (different spec, hundreds of devices).
Problem:
When running xfce, and resizing windows of anything playing a video (vlc, RV, chrome), machine would freeze for a few frames (or sometimes locks up completely). At this point 1-2 reboots is guaranteed per day on average.
/var/log/messages shows multiple Xid 69 errors, sometimes Xid 31.
May 18 16:22:57 canpcanim3876 kernel: NVRM: Xid (PCI:0000:01:00): 69, pid=1340, Class Error: ChId 0009, Class 0000902d, Offset 000008b0, Data ffffffff, ErrorCode 00000004
May 18 16:22:57 canpcanim3876 kernel: NVRM: Xid (PCI:0000:01:00): 69, pid=1340, Class Error: ChId 0009, Class 0000902d, Offset 000008b0, Data ffffffff, ErrorCode 00000004
May 18 16:22:57 canpcanim3876 kernel: NVRM: Xid (PCI:0000:01:00): 69, pid=1340, Class Error: ChId 0009, Class 0000902d, Offset 000008b0, Data ffffffff, ErrorCode 00000004
May 18 16:22:57 canpcanim3876 kernel: NVRM: Xid (PCI:0000:01:00): 69, pid=1340, Class Error: ChId 0009, Class 0000902d, Offset 000008b0, Data fffffffe, ErrorCode 00000004
May 18 16:22:57 canpcanim3876 kernel: NVRM: Xid (PCI:0000:01:00): 69, pid=1340, Class Error: ChId 0009, Class 0000902d, Offset 000008b0, Data fffffffe, ErrorCode 00000004
May 18 16:22:58 canpcanim3876 kernel: NVRM: Xid (PCI:0000:01:00): 69, pid=1340, Class Error: ChId 0009, Class 0000902d, Offset 000008b0, Data ffffffff, ErrorCode 00000004
May 18 16:22:58 canpcanim3876 kernel: NVRM: Xid (PCI:0000:01:00): 69, pid=1340, Class Error: ChId 0009, Class 0000902d, Offset 000008b0, Data ffffffff, ErrorCode 00000004
May 18 16:22:58 canpcanim3876 kernel: NVRM: Xid (PCI:0000:01:00): 69, pid=1340, Class Error: ChId 0009, Class 0000902d, Offset 000008b0, Data ffffffff, ErrorCode 00000004
May 18 16:22:58 canpcanim3876 kernel: NVRM: Xid (PCI:0000:01:00): 69, pid=1340, Class Error: ChId 0009, Class 0000902d, Offset 000008b0, Data ffffffff, ErrorCode 00000004
May 18 16:22:58 canpcanim3876 kernel: NVRM: Xid (PCI:0000:01:00): 69, pid=1340, Class Error: ChId 0009, Class 0000902d, Offset 000008b0, Data ffffffff, ErrorCode 00000004
May 18 16:22:58 canpcanim3876 kernel: NVRM: Xid (PCI:0000:01:00): 69, pid=1340, Class Error: ChId 0009, Class 0000902d, Offset 000008b0, Data ffffffff, ErrorCode 00000004
May 18 16:22:58 canpcanim3876 kernel: NVRM: Xid (PCI:0000:01:00): 69, pid=1340, Class Error: ChId 0009, Class 0000902d, Offset 000008b0, Data fffffffe, ErrorCode 00000004
May 18 16:22:58 canpcanim3876 kernel: NVRM: Xid (PCI:0000:01:00): 69, pid=1340, Class Error: ChId 0009, Class 0000902d, Offset 000008b0, Data fffffffe, ErrorCode 00000004
May 18 16:22:58 canpcanim3876 kernel: NVRM: Xid (PCI:0000:01:00): 69, pid=1340, Class Error: ChId 0009, Class 0000902d, Offset 000008b0, Data ffffffff, ErrorCode 00000004
May 18 16:22:58 canpcanim3876 kernel: NVRM: Xid (PCI:0000:01:00): 69, pid=1340, Class Error: ChId 0009, Class 0000902d, Offset 000008b0, Data ffffffff, ErrorCode 00000004
May 18 16:22:58 canpcanim3876 kernel: NVRM: Xid (PCI:0000:01:00): 69, pid=1340, Class Error: ChId 0009, Class 0000902d, Offset 000008b0, Data fffffffe, ErrorCode 00000004
May 18 16:22:58 canpcanim3876 kernel: NVRM: Xid (PCI:0000:01:00): 69, pid=1340, Class Error: ChId 0009, Class 0000902d, Offset 000008b0, Data fffffffe, ErrorCode 00000004
[root@canpcanim3876 ~]# yum list installed | grep -i xfce
libxfce4ui.x86_64 4.12.1-3.el7 @epel
libxfce4util.x86_64 4.12.1-2.el7 @epel
xfce-polkit.x86_64 0.2-8.el7 @epel
xfce4-appfinder.x86_64 4.12.0-4.el7 @epel
xfce4-notifyd.x86_64 0.2.4-8.el7 @localrepo-epel
xfce4-panel.x86_64 4.12.1-4.el7 @epel
xfce4-power-manager.x86_64 1.6.0-2.el7 @epel
xfce4-pulseaudio-plugin.x86_64 0.2.5-2.el7 @epel
xfce4-screenshooter.x86_64 1.8.2-5.el7 @localrepo-epel
xfce4-session.x86_64 4.12.1-8.el7 @epel
xfce4-session-engines.x86_64 4.12.1-8.el7 @epel
xfce4-settings.x86_64 4.12.1-1.el7 @epel
xfce4-systemload-plugin.x86_64 1.1.2-3.el7 @localrepo-epel
xfce4-taskmanager.x86_64 1.2.1-1.el7 @localrepo-epel
xfce4-terminal.x86_64 0.8.7.4-2.el7 @localrepo-epel
xfce4-xkb-plugin.x86_64 0.7.1-4.el7 @localrepo-epel
kernel.x86_64 3.10.0-957.12.2.el7 @localrepo-updates
kernel.x86_64 3.10.0-1127.el7 @localrepo-base
kernel-devel.x86_64 3.10.0-957.12.2.el7 @localrepo-updates
kernel-devel.x86_64 3.10.0-1127.el7 @localrepo-base
kernel-headers.x86_64 3.10.0-1127.el7 @localrepo-base
kernel-tools.x86_64 3.10.0-1127.el7 @localrepo-base
kernel-tools-libs.x86_64 3.10.0-1127.el7 @localrepo-base
kmod-nvidia.x86_64 470.63.01-1.el7_9.elrepo @localrepo-elrepo
nvidia-detect.x86_64 440.64-1.el7.elrepo @localrepo-elrepo
nvidia-x11-drv.x86_64 470.63.01-1.el7_9.elrepo @localrepo-elrepo
nvidia-x11-drv-libs.x86_64 470.63.01-1.el7_9.elrepo @localrepo-elrepo
yum-plugin-nvidia.noarch 1.0.2-1.el7.elrepo @localrepo-elrepo
CentOS Linux release 7.8.2003 (Core)
This has been a problem since centos 7.4 (first detected).
No tied to gpu (running quadros from K to RTX A), nvidia driver version, or machine type (dell\lenovo\hp workstations of different models)
Now to complete surprise, installing gnome fixes everything, no more NVRM errors, no more lockups.
Would very much appreciate if someone could help figure this one out.
Offline
Pages: 1
[ Generated in 0.009 seconds, 7 queries executed - Memory usage: 534.28 KiB (Peak: 546.4 KiB) ]