J'ai récemment mis à jour Ubuntu 16.04 vers 18.04 sur mon PC (avec le GPU Nvidia GeForce GTX 1080 Ti) et depuis que mon système de fichiers s'est construit à partir de ces énormes fichiers journaux. Chaque fois que j'arrête mon ordinateur, je reçois un nombre infini de messages pcieport jusqu'à ce que le PC s'arrête.
Hier, j'ai laissé ma machine allumée pour la nuit et quand j'y suis revenu, il y a eu une notification que l'espace disque était épuisé, c'est-à-dire /dev/sda1
était utilisé à 100%. J'ai réussi à le découvrir via la commande du
- et j'ai identifié le problème pour enregistrer les fichiers dans /var/log/
- dossier, qui contenait des fichiers journaux de plus de 350GB
.
Actuellement, les fichiers journaux se reconstituent et occupent maintenant environ 150GB
. Les fichiers journaux à l'origine du problème sont les suivants: syslog.1, syslog
et kern.log
Ma question est: Quelle est la cause de ce problème et comment le résoudre?
J'ai répertorié ci-dessous des informations sur mon système et quelques lignes des fichiers journaux. Je vais les supprimer à nouveau, mais les supprimer sans fin ne semble pas la meilleure solution à long terme.
Distributor ID: Ubuntu
Description: Ubuntu 18.04.3 LTS
Release: 18.04
Codename: bionic
nvidia-smi
Thu Aug 15 09:25:53 2019
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 430.40 Driver Version: 430.40 CUDA Version: 10.1 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 GeForce GTX 108... Off | 00000000:01:00.0 On | N/A |
| 24% 58C P0 67W / 250W | 1373MiB / 11177MiB | 1% Default |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
| 0 1298 G /usr/lib/xorg/Xorg 89MiB |
| 0 1337 G /usr/bin/gnome-Shell 50MiB |
| 0 2258 G /usr/lib/xorg/Xorg 726MiB |
| 0 2465 G /usr/bin/gnome-Shell 189MiB |
| 0 14914 G ...e --type=gpu-process --field-trial-hand 154MiB |
| 0 18206 C /usr/lib/libreoffice/program/soffice.bin 137MiB |
+-----------------------------------------------------------------------------+
lspci -vt
-[0000:00]-+-00.0 Intel Corporation 8th Gen Core Processor Host Bridge/DRAM Registers
+-01.0-[01]--+-00.0 NVIDIA Corporation GP102 [GeForce GTX 1080 Ti]
| \-00.1 NVIDIA Corporation GP102 HDMI Audio Controller
+-02.0 Intel Corporation Device 3e92
+-14.0 Intel Corporation 200 Series/Z370 Chipset Family USB 3.0 xHCI Controller
+-16.0 Intel Corporation 200 Series PCH CSME HECI #1
+-17.0 Intel Corporation 200 Series PCH SATA controller [AHCI mode]
+-1b.0-[02]--
+-1c.0-[03]--
+-1c.4-[04]----00.0 ASMedia Technology Inc. Device 2142
+-1c.7-[05]----00.0 Realtek Semiconductor Co., Ltd. RTL8812AE 802.11ac PCIe Wireless Network Adapter
+-1d.0-[06]--
+-1f.0 Intel Corporation Z370 Chipset LPC/eSPI Controller
+-1f.2 Intel Corporation 200 Series/Z370 Chipset Family Power Management Controller
+-1f.3 Intel Corporation 200 Series PCH HD Audio
+-1f.4 Intel Corporation 200 Series/Z370 Chipset Family SMBus Controller
\-1f.6 Intel Corporation Ethernet Connection (2) I219-V
syslog.1
Aug 14 10:14:03 user kernel: [ 10.680132] pcieport 0000:00:1c.7: AER: Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 10.680135] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 10.680135] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 10.680136] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 14 10:14:03 user kernel: [ 10.680187] pcieport 0000:00:1c.7: AER: Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 10.680190] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 10.680190] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 10.680191] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 14 10:14:03 user kernel: [ 10.680281] pcieport 0000:00:1c.7: AER: Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 10.680284] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 10.680284] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 10.680285] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 14 10:14:03 user kernel: [ 10.680374] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 10.680378] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 10.680379] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 10.680380] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 14 10:14:03 user kernel: [ 10.680586] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 10.680590] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 10.680591] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 10.680591] pcieport 0000:00:1c.7: [ 0] RxErr
syslog
Aug 15 09:04:23 user kernel: [ 307.590656] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 15 09:04:23 user kernel: [ 307.590836] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 15 09:04:23 user kernel: [ 307.590841] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 15 09:04:23 user kernel: [ 307.590843] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 15 09:04:23 user kernel: [ 307.590844] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 15 09:04:23 user kernel: [ 307.591125] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 15 09:04:23 user kernel: [ 307.591134] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 15 09:04:23 user kernel: [ 307.591135] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 15 09:04:23 user kernel: [ 307.591136] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 15 09:04:23 user kernel: [ 307.591414] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 15 09:04:23 user kernel: [ 307.591419] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 15 09:04:23 user kernel: [ 307.591420] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 15 09:04:23 user kernel: [ 307.591422] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 15 09:04:23 user kernel: [ 307.591607] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 15 09:04:23 user kernel: [ 307.591614] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 15 09:04:23 user kernel: [ 307.591616] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 15 09:04:23 user kernel: [ 307.591617] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 15 09:04:23 user kernel: [ 307.591896] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 15 09:04:23 user kernel: [ 307.591901] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
kern.log
Aug 14 10:14:03 user kernel: [ 11.219257] pcieport 0000:00:1c.7: AER: Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 11.219259] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 11.219260] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 11.219260] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 14 10:14:03 user kernel: [ 11.219443] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 11.219448] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 11.219448] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 11.219449] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 14 10:14:03 user kernel: [ 11.219714] pcieport 0000:00:1c.7: AER: Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 11.219717] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 11.219718] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 11.219718] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 14 10:14:03 user kernel: [ 11.219916] pcieport 0000:00:1c.7: AER: Multiple Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 11.219922] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 11.219923] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 11.219924] pcieport 0000:00:1c.7: [ 0] RxErr
Aug 14 10:14:03 user kernel: [ 11.220101] pcieport 0000:00:1c.7: AER: Corrected error received: 0000:00:1c.7
Aug 14 10:14:03 user kernel: [ 11.220104] pcieport 0000:00:1c.7: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Aug 14 10:14:03 user kernel: [ 11.220105] pcieport 0000:00:1c.7: device [8086:a297] error status/mask=00000001/00002000
Aug 14 10:14:03 user kernel: [ 11.220105] pcieport 0000:00:1c.7: [ 0] RxErr
J'ai déjà résolu le problème (je pense) en utilisant les instructions ici:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/152117
SOLUTION: ajoutez pci = noaer à votre ligne de commande du noyau:
(1) éditez/etc/default/grub et ajoutez pci = noaer à la ligne commençant par GRUB_CMDLINE_LINUX_DEFAULT. Il ressemblera à ceci:
GRUB_CMDLINE_LINUX_DEFAULT = "splash silencieux pci = noaer"
(2) exécutez "Sudo update-grub"
(3) redémarrer
Après avoir implémenté ces étapes, je ne reçois plus les messages pcieport
- après l'arrêt de mon système et la taille des fichiers journaux a cessé d'augmenter énormément.
Cependant, je ne sais pas si cela a corrigé la cause première des messages d'erreur ...