web-dev-qa-db-fra.com

OOM-killer est invoqué sans raison Ubuntu 16.04

Je semble avoir un problème étrange avec des processus de tuer sans raison. Il s’agit d’un ordinateur Ubuntu 16.04 à jour, avec un noyau 4.4.0-62 générique, 3 ordinateurs virtuels et un BackupPC avec 16 Go RAM (l’ordinateur est un Dell t20). Les ordinateurs virtuels utilisent 256 Mo, 2 Go et 3 Go de RAM. Ubuntu est principalement configuré avec les paramètres par défaut. Les principales modifications postérieures à l’installation par défaut ont été l’installation de qemu et de backuppc après l’instant.

[    0.000000] Memory: 16298836K/16683092K available (8436K kernel code, 1291K rwdata, 3960K rodata, 1488K init, 1316K bss, 384256K reserved, 0K cma-reserved)

Les infos de sortie:

# lsb_release -a
No LSB modules are available.
Distributor ID: Ubuntu
Description:    Ubuntu 16.04.2 LTS
Release:        16.04
Codename:       xenial

Les paramètres de surengagement sont par défaut comme suit:

vm.overcommit_kbytes = 0
vm.overcommit_memory = 0
vm.overcommit_ratio = 50

Maintenant, le système tue parfois VM processus. Je ne comprends pas cela, car d'habitude, le MOO tue le processus en utilisant la majeure partie de la mémoire.

[241816.503021] Killed process 3198 (qemu-system-x86) total-vm:4181796kB, anon-rss:3324684kB, file-rss:3588kB

Le processus utilisait simplement 4 Go par vm et 3 Go rss. De plus, la machine ne s'échangeait même pas!

[241816.502934] Free swap  = 7953124kB
[241816.502935] Total swap = 8293372kB

Pouvez-vous dire pourquoi le processus d'abattage est mortel? Qu'est-ce que je rate? Parce qu'il semble que la machine utilise un total inférieur à 7 Go RAM à partir de 16 Go installé

Le journal complet est ci-dessous:

[241816.502856] cron invoked oom-killer: gfp_mask=0x26000c0, order=2, oom_score_adj=0
[241816.502859] cron cpuset=/ mems_allowed=0
[241816.502862] CPU: 0 PID: 1035 Comm: cron Not tainted 4.4.0-62-generic #83-Ubuntu
[241816.502863] Hardware name: Dell Inc. PowerEdge T20/0VD5HY, BIOS A06 01/27/2015
[241816.502864]  0000000000000286 00000000bf9ec188 ffff8800da123af0 ffffffff813f7c63
[241816.502866]  ffff8800da123cc8 ffff880405b5d400 ffff8800da123b60 ffffffff8120ad4e
[241816.502868]  0000000000000015 0000000000000000 ffff880409ac2540 ffff880407bad400
[241816.502869] Call Trace:
[241816.502873]  [<ffffffff813f7c63>] dump_stack+0x63/0x90
[241816.502876]  [<ffffffff8120ad4e>] dump_header+0x5a/0x1c5
[241816.502878]  [<ffffffff81390c14>] ? apparmor_capable+0xc4/0x1b0
[241816.502881]  [<ffffffff811926c2>] oom_kill_process+0x202/0x3c0
[241816.502882]  [<ffffffff81192ae9>] out_of_memory+0x219/0x460
[241816.502884]  [<ffffffff81198a5d>] __alloc_pages_slowpath.constprop.88+0x8fd/0xa70
[241816.502886]  [<ffffffff81198e56>] __alloc_pages_nodemask+0x286/0x2a0
[241816.502887]  [<ffffffff81198f0b>] alloc_kmem_pages_node+0x4b/0xc0
[241816.502890]  [<ffffffff8107ea5e>] copy_process+0x1be/0x1b70
[241816.502891]  [<ffffffff81213d73>] ? cp_new_stat+0x153/0x180
[241816.502893]  [<ffffffff810805a0>] _do_fork+0x80/0x360
[241816.502894]  [<ffffffff81080929>] SyS_clone+0x19/0x20
[241816.502897]  [<ffffffff818385f2>] entry_SYSCALL_64_fastpath+0x16/0x71
[241816.502898] Mem-Info:
[241816.502900] active_anon:1077377 inactive_anon:526767 isolated_anon:0
                 active_file:832229 inactive_file:670439 isolated_file:0
                 unevictable:914 dirty:0 writeback:0 unstable:0
                 slab_reclaimable:870324 slab_unreclaimable:29718
                 mapped:5481 shmem:5279 pagetables:5271 bounce:0
                 free:46803 free_pcp:0 free_cma:0
[241816.502902] Node 0 DMA free:15852kB min:64kB low:80kB high:96kB active_anon:0kB inactive_anon:0kB active_file:0kB inactive_file:0kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15936kB managed:15852kB mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB slab_reclaimable:0kB slab_unreclaimable:0kB kernel_stack:0kB pagetables:0kB unstable:0kB bounce:0kB free_pcp:0kB local_pcp:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? yes
[241816.502905] lowmem_reserve[]: 0 3376 15901 15901 15901
[241816.502907] Node 0 DMA32 free:85128kB min:14336kB low:17920kB high:21504kB active_anon:633984kB inactive_anon:650080kB active_file:994428kB inactive_file:726700kB unevictable:56kB isolated(anon):0kB isolated(file):0kB present:3578388kB managed:3497768kB mlocked:56kB dirty:0kB writeback:0kB mapped:9292kB shmem:12084kB slab_reclaimable:366052kB slab_unreclaimable:21320kB kernel_stack:1584kB pagetables:4008kB unstable:0kB bounce:0kB free_pcp:0kB local_pcp:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
[241816.502909] lowmem_reserve[]: 0 0 12524 12524 12524
[241816.502911] Node 0 Normal free:86232kB min:53180kB low:66472kB high:79768kB active_anon:3675524kB inactive_anon:1456988kB active_file:2334488kB inactive_file:1955056kB unevictable:3600kB isolated(anon):0kB isolated(file):0kB present:13088768kB managed:12825312kB mlocked:3600kB dirty:0kB writeback:0kB mapped:12632kB shmem:9032kB slab_reclaimable:3115244kB slab_unreclaimable:97552kB kernel_stack:2640kB pagetables:17076kB unstable:0kB bounce:0kB free_pcp:0kB local_pcp:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
[241816.502913] lowmem_reserve[]: 0 0 0 0 0
[241816.502915] Node 0 DMA: 1*4kB (U) 1*8kB (U) 0*16kB 1*32kB (U) 1*64kB (U) 1*128kB (U) 1*256kB (U) 0*512kB 1*1024kB (U) 1*2048kB (M) 3*4096kB (M) = 15852kB
[241816.502921] Node 0 DMA32: 15269*4kB (UME) 3028*8kB (UE) 0*16kB 0*32kB 0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 85300kB
[241816.502925] Node 0 Normal: 21214*4kB (UMEH) 28*8kB (EH) 11*16kB (H) 11*32kB (H) 4*64kB (H) 3*128kB (H) 2*256kB (H) 0*512kB 0*1024kB 0*2048kB 0*4096kB = 86760kB
[241816.502931] Node 0 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=1048576kB
[241816.502931] Node 0 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=2048kB
[241816.502932] 1532619 total pagecache pages
[241816.502933] 24066 pages in swap cache
[241816.502934] Swap cache stats: add 757347, delete 733281, find 479805/565341
[241816.502934] Free swap  = 7953124kB
[241816.502935] Total swap = 8293372kB
[241816.502935] 4170773 pages RAM
[241816.502936] 0 pages HighMem/MovableOnly
[241816.502936] 86040 pages reserved
[241816.502937] 0 pages cma reserved
[241816.502937] 0 pages hwpoisoned
[241816.502938] [ pid ]   uid  tgid total_vm      rss nr_ptes nr_pmds swapents oom_score_adj name
[241816.502941] [  397]     0   397     8819      832      20       3       36             0 systemd-journal
[241816.502942] [  435]     0   435    25742      229      17       3        0             0 lvmetad
[241816.502944] [  454]     0   454    11440      574      23       3      488         -1000 systemd-udevd
[241816.502945] [ 1020]     0  1020    68967     1031      36       3       58             0 accounts-daemon
[241816.502947] [ 1022]     0  1022     1100      317       7       3        2             0 acpid
[241816.502948] [ 1029]     0  1029     6322      605      18       3       83             0 smartd
[241816.502949] [ 1031]     0  1031     7470      190      18       3       49             0 cgmanager
[241816.502950] [ 1035]     0  1035     7252      594      21       3       40             0 cron
[241816.502951] [ 1040]     0  1040     6511      477      18       3       35             0 atd
[241816.502952] [ 1042]   107  1042    10726      580      26       3       59          -900 dbus-daemon
[241816.502953] [ 1098]     0  1098    58693      333      17       3        5             0 lxcfs
[241816.502955] [ 1100]     0  1100     7159      461      18       3       60             0 systemd-logind
[241816.502956] [ 1102]   104  1102    64099      451      28       3      213             0 rsyslogd
[241816.502957] [ 1104]     0  1104    53932     1284      29       5     1500             0 snapd
[241816.502958] [ 1189]     0  1189    16380      764      37       4      143         -1000 sshd
[241816.502959] [ 1201]     0  1201     3344       24      11       3       13             0 mdadm
[241816.502960] [ 1208]     0  1208     1306       31       9       3        0             0 iscsid
[241816.502961] [ 1209]     0  1209     1431      878       9       3        0           -17 iscsid
[241816.502963] [ 1216]     0  1216    69278      914      39       4      596             0 polkitd
[241816.502964] [ 1263]     0  1263   365148     1616     170       4     2336             0 libvirtd
[241816.502965] [ 1293]     0  1293     3985      366      13       3        0             0 agetty
[241816.502966] [ 1298]     0  1298     4868       23      14       3       41             0 irqbalance
[241816.502967] [ 1310]   116  1310    27509      654      24       3      113             0 ntpd
[241816.502968] [ 1421]   115  1421    17416     1849      37       3     2222             0 BackupPC
[241816.502969] [ 1422]   115  1422    54531    34434     112       3     9739             0 BackupPC_trashC
[241816.502970] [ 1471]     0  1471    18941      896      40       3      237             0 Apache2
[241816.502972] [ 1544]     0  1544    16352      501      24       3       96             0 master
[241816.502973] [ 1546]   114  1546    16881      469      25       3       98             0 qmgr
[241816.502974] [ 1722]   113  1722    12496      352      27       3       97             0 dnsmasq
[241816.502975] [ 1723]     0  1723    12489        1      27       3       93             0 dnsmasq
[241816.502976] [ 1800]   113  1800    12496        0      27       3       98             0 dnsmasq
[241816.502977] [ 1804]     0  1804    48439      806      52       3       13          -900 virtlogd
[241816.502978] [ 1904]   112  1904   472592   285000     721       5     7103             0 qemu-system-x86
[241816.502979] [ 1997]   112  1997   277724    85130     334       4     9316             0 qemu-system-x86
[241816.502981] [ 3198]   112  3198  1045449   832068    1880       7    14166             0 qemu-system-x86
[241816.502982] [29065]    33 29065    18941      603      39       3      243             0 Apache2
[241816.502983] [29066]    33 29066    91246      692      69       3      738             0 Apache2
[241816.502984] [29067]    33 29067   124032     1274      71       4      225             0 Apache2
[241816.502985] [ 5735]   115  5735   295501   258925     578       4    17706             0 BackupPC_dump
[241816.502986] [ 5818]   115  5818   276492   238098     539       4    18790             0 BackupPC_dump
[241816.502988] [ 7774]   114  7774    16869     1111      24       3        0             0 pickup
[241816.502989] Out of memory: Kill process 3198 (qemu-system-x86) score 137 or sacrifice child
[241816.503021] Killed process 3198 (qemu-system-x86) total-vm:4181796kB, anon-rss:3324684kB, file-rss:3588kB
[241816.703137] virbr1: port 4(vnet2) entered disabled state
[241816.704366] device vnet2 left promiscuous mode
[241816.704367] virbr1: port 4(vnet2) entered disabled state
[241819.514670] audit: type=1400 audit(1487210104.861:50): apparmor="STATUS" operation="profile_remove" profile="unconfined" name="libvirt-c0ed3084-e7d5-4165-b125-8089914fe680" pid=8265 comm="apparmor_parser"
[247217.394936] libvirt-bin invoked oom-killer: gfp_mask=0x26000c0, order=2, oom_score_adj=0
[247217.394938] libvirt-bin cpuset=/ mems_allowed=0
[247217.394943] CPU: 1 PID: 8920 Comm: libvirt-bin Not tainted 4.4.0-62-generic #83-Ubuntu
[247217.394944] Hardware name: Dell Inc. PowerEdge T20/0VD5HY, BIOS A06 01/27/2015
[247217.394945]  0000000000000286 00000000e1669350 ffff88017aaffaf0 ffffffff813f7c63
[247217.394947]  ffff88017aaffcc8 ffff8800da16e200 ffff88017aaffb60 ffffffff8120ad4e
[247217.394948]  0000000000000015 0000000000000000 ffff880409ac2540 ffff880407bad400
[247217.394950] Call Trace:
[247217.394954]  [<ffffffff813f7c63>] dump_stack+0x63/0x90
[247217.394957]  [<ffffffff8120ad4e>] dump_header+0x5a/0x1c5
[247217.394960]  [<ffffffff81390c14>] ? apparmor_capable+0xc4/0x1b0
[247217.394962]  [<ffffffff811926c2>] oom_kill_process+0x202/0x3c0
[247217.394964]  [<ffffffff8119208e>] ? oom_unkillable_task+0x9e/0xd0
[247217.394965]  [<ffffffff81192ae9>] out_of_memory+0x219/0x460
[247217.394967]  [<ffffffff81198a5d>] __alloc_pages_slowpath.constprop.88+0x8fd/0xa70
[247217.394969]  [<ffffffff81198e56>] __alloc_pages_nodemask+0x286/0x2a0
[247217.394971]  [<ffffffff81198f0b>] alloc_kmem_pages_node+0x4b/0xc0
[247217.394974]  [<ffffffff8107ea5e>] copy_process+0x1be/0x1b70
[247217.394976]  [<ffffffff811c1660>] ? handle_mm_fault+0xce0/0x1820
[247217.394979]  [<ffffffff81037eb9>] ? sched_clock+0x9/0x10
[247217.394982]  [<ffffffff810b1bcf>] ? sched_clock_cpu+0x8f/0xa0
[247217.394984]  [<ffffffff810805a0>] _do_fork+0x80/0x360
[247217.394985]  [<ffffffff81080929>] SyS_clone+0x19/0x20
[247217.394988]  [<ffffffff818385f2>] entry_SYSCALL_64_fastpath+0x16/0x71
[247217.394989] Mem-Info:
[247217.394992] active_anon:495436 inactive_anon:332110 isolated_anon:0
                 active_file:1362581 inactive_file:834329 isolated_file:0
                 unevictable:914 dirty:5499 writeback:274 unstable:0
                 slab_reclaimable:959199 slab_unreclaimable:17954
                 mapped:6609 shmem:5247 pagetables:3469 bounce:0
                 free:58696 free_pcp:115 free_cma:0
[247217.394994] Node 0 DMA free:15852kB min:64kB low:80kB high:96kB active_anon:0kB inactive_anon:0kB active_file:0kB inactive_file:0kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15936kB managed:15852kB mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB slab_reclaimable:0kB slab_unreclaimable:0kB kernel_stack:0kB pagetables:0kB unstable:0kB bounce:0kB free_pcp:0kB local_pcp:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? yes
[247217.394997] lowmem_reserve[]: 0 3376 15901 15901 15901
[247217.394999] Node 0 DMA32 free:91172kB min:14336kB low:17920kB high:21504kB active_anon:345184kB inactive_anon:361348kB active_file:1469732kB inactive_file:782520kB unevictable:56kB isolated(anon):0kB isolated(file):0kB present:3578388kB managed:3497768kB mlocked:56kB dirty:3892kB writeback:220kB mapped:11244kB shmem:12080kB slab_reclaimable:422984kB slab_unreclaimable:12256kB kernel_stack:1616kB pagetables:2184kB unstable:0kB bounce:0kB free_pcp:0kB local_pcp:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:216 all_unreclaimable? no
[247217.395002] lowmem_reserve[]: 0 0 12524 12524 12524
[247217.395004] Node 0 Normal free:127760kB min:53180kB low:66472kB high:79768kB active_anon:1636560kB inactive_anon:967092kB active_file:3980592kB inactive_file:2554796kB unevictable:3600kB isolated(anon):0kB isolated(file):0kB present:13088768kB managed:12825312kB mlocked:3600kB dirty:18104kB writeback:876kB mapped:15192kB shmem:8908kB slab_reclaimable:3413812kB slab_unreclaimable:59560kB kernel_stack:2592kB pagetables:11692kB unstable:0kB bounce:0kB free_pcp:460kB local_pcp:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
[247217.395006] lowmem_reserve[]: 0 0 0 0 0
[247217.395008] Node 0 DMA: 1*4kB (U) 1*8kB (U) 0*16kB 1*32kB (U) 1*64kB (U) 1*128kB (U) 1*256kB (U) 0*512kB 1*1024kB (U) 1*2048kB (M) 3*4096kB (M) = 15852kB
[247217.395014] Node 0 DMA32: 11405*4kB (UME) 5706*8kB (UME) 0*16kB 0*32kB 0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 91268kB
[247217.395018] Node 0 Normal: 30930*4kB (UMEH) 264*8kB (UMEH) 5*16kB (H) 5*32kB (H) 4*64kB (H) 3*128kB (H) 2*256kB (H) 1*512kB (H) 0*1024kB 0*2048kB 0*4096kB = 127736kB
[247217.395025] Node 0 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=1048576kB
[247217.395025] Node 0 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=2048kB
[247217.395026] 2234245 total pagecache pages
[247217.395027] 31364 pages in swap cache
[247217.395028] Swap cache stats: add 769200, delete 737836, find 501629/589327
[247217.395029] Free swap  = 7999552kB
[247217.395029] Total swap = 8293372kB
[247217.395030] 4170773 pages RAM
[247217.395030] 0 pages HighMem/MovableOnly
[247217.395031] 86040 pages reserved
[247217.395031] 0 pages cma reserved
[247217.395032] 0 pages hwpoisoned
[247217.395032] [ pid ]   uid  tgid total_vm      rss nr_ptes nr_pmds swapents oom_score_adj name
[247217.395040] [  397]     0   397    10970     2059      23       3       31             0 systemd-journal
[247217.395041] [  435]     0   435    25742      229      17       3        0             0 lvmetad
[247217.395044] [  454]     0   454    11440      823      23       3      396         -1000 systemd-udevd
[247217.395046] [ 1020]     0  1020    68967     1031      36       3       58             0 accounts-daemon
[247217.395047] [ 1022]     0  1022     1100      317       7       3        2             0 acpid
[247217.395048] [ 1029]     0  1029     6322      605      18       3       83             0 smartd
[247217.395050] [ 1031]     0  1031     7470      190      18       3       49             0 cgmanager
[247217.395051] [ 1035]     0  1035     7252      593      21       3       41             0 cron
[247217.395053] [ 1040]     0  1040     6511      477      18       3       35             0 atd
[247217.395054] [ 1042]   107  1042    10726      580      26       3       59          -900 dbus-daemon
[247217.395055] [ 1098]     0  1098    58693      333      17       3        5             0 lxcfs
[247217.395057] [ 1100]     0  1100     7159      461      18       3       60             0 systemd-logind
[247217.395058] [ 1102]   104  1102    64099      510      28       3      203             0 rsyslogd
[247217.395060] [ 1104]     0  1104    53932     1284      29       5     1500             0 snapd
[247217.395061] [ 1189]     0  1189    16380      764      37       4      143         -1000 sshd
[247217.395063] [ 1201]     0  1201     3344       24      11       3       13             0 mdadm
[247217.395064] [ 1208]     0  1208     1306       31       9       3        0             0 iscsid
[247217.395065] [ 1209]     0  1209     1431      878       9       3        0           -17 iscsid
[247217.395066] [ 1216]     0  1216    69278      914      39       4      596             0 polkitd
[247217.395068] [ 1263]     0  1263   365148     2482     170       4     2162             0 libvirtd
[247217.395069] [ 1293]     0  1293     3985      366      13       3        0             0 agetty
[247217.395070] [ 1298]     0  1298     4868       23      14       3       41             0 irqbalance
[247217.395072] [ 1310]   116  1310    27509      654      24       3      113             0 ntpd
[247217.395073] [ 1421]   115  1421    17416     1864      37       3     2207             0 BackupPC
[247217.395075] [ 1422]   115  1422    54531    34425     112       3     9748             0 BackupPC_trashC
[247217.395076] [ 1471]     0  1471    18941      896      40       3      237             0 Apache2
[247217.395077] [ 1544]     0  1544    16352      504      24       3       93             0 master
[247217.395078] [ 1546]   114  1546    16881      469      25       3       98             0 qmgr
[247217.395080] [ 1722]   113  1722    12496      352      27       3       97             0 dnsmasq
[247217.395081] [ 1723]     0  1723    12489        1      27       3       93             0 dnsmasq
[247217.395082] [ 1800]   113  1800    12496      419      27       3       95             0 dnsmasq
[247217.395083] [ 1804]     0  1804    48439      815      52       3       11          -900 virtlogd
[247217.395085] [ 1904]   112  1904   472592   285001     721       5     7102             0 qemu-system-x86
[247217.395086] [ 1997]   112  1997   277724    85130     334       4     9316             0 qemu-system-x86
[247217.395088] [29065]    33 29065    18941      603      39       3      243             0 Apache2
[247217.395090] [29066]    33 29066    91246      691      69       3      739             0 Apache2
[247217.395091] [29067]    33 29067   124032     1274      71       4      225             0 Apache2
[247217.395092] [ 5735]   115  5735   295501   269817     578       4     6814             0 BackupPC_dump
[247217.395094] [ 5818]   115  5818   276492   247915     539       4     9138             0 BackupPC_dump
[247217.395095] [ 8764]   114  8764    16869     1113      25       3        0             0 pickup
[247217.395097] [ 8867]     0  8867    12555      709      30       3       11             0 cron
[247217.395098] [ 8870]     0  8870     1127      189       8       3        0             0 sh
[247217.395099] [ 8871]     0  8871     1092      165       8       3        0             0 run-parts
[247217.395101] [ 8887]     0  8887     1127      441       8       3        0             0 libvirt-bin
[247217.395102] [ 8920]     0  8920     1127       27       8       3        0             0 libvirt-bin
[247217.395103] Out of memory: Kill process 1904 (qemu-system-x86) score 47 or sacrifice child
[247217.395137] Killed process 1904 (qemu-system-x86) total-vm:1890368kB, anon-rss:1136532kB, file-rss:3472kB
[247217.472809] virbr1: port 2(vnet0) entered disabled state
[247217.474014] device vnet0 left promiscuous mode
[247217.474015] virbr1: port 2(vnet0) entered disabled state

Je vois aussi les messages suivants au démarrage. Je ne suis pas sûr s'ils sont liés.

[    0.000000] mtrr_cleanup: can not find optimal value
[    0.000000] please specify mtrr_gran_size/mtrr_chunk_size

De plus, quelques erreurs de mémoire ECC ont été enregistrées dans le BIOS. Mais ils étaient d'il y a des mois. Nous avons changé la machine entière en une nouvelle machine matérielle du même modèle. BIOS mis à niveau vers la dernière version. Jusqu'à présent, l'utilisation de la mémoire flotte autour de moins de la moitié de la mémoire de la machine. Nous verrons dans un instant si le MOO tuerait encore ou non les processus. Cela prenait habituellement une semaine ou deux ...

KiB Mem : 16338936 total,   173348 free,  6812676 used,  9352912 buff/cache
KiB Swap:  8293372 total,  7672968 free,   620404 used.  9059716 avail Mem

PDATE: La machine fonctionne parfaitement pour l'instant! Donc, le problème était probablement lié aux erreurs ECC que j'ai constatées dans le système OR la mise à jour du BIOS a corrigé le problème. Je ne suis pas sûr à 100%, car l'ensemble de la boîte a été remplacé par un autre modèle d'ordinateur et le BIOS a été mis à niveau. Jusqu'ici tout va bien!

1
yurtesen

Je suis désolé de poster ceci comme une réponse. Je n'ai pas le représentant dans "AskUbuntu" pour poster un commentaire et je suis venu ici pour poster le même problème.

J'ai une configuration très similaire à celle que vous avez (16.04.2 LTS, noyau 4.4.0-62-generic) et je rencontre le même problème. J'ai remarqué que le problème a commencé il y a environ 5 jours et qu'il s'est aggravé. Aujourd'hui, oom-killer a tué 4 processus et l'utilisation actuelle de la mémoire de mon système est tombée à 650 Mo, car il ne reste presque plus rien.

Je vais mettre à jour le noyau, redémarrer le système et indiquer si le problème a été résolu.

2
SeeJayEmm