Re: mm: kswapd struggles reclaiming the pages on 64GB server

From: Andriy Tkachuk
Date: Mon Aug 22 2016 - 18:46:33 EST


On Mon, Aug 22, 2016 at 7:37 PM, Andriy Tkachuk
<andriy.tkachuk@xxxxxxxxxxx> wrote:
>
> The following patch resolved the problem:
> ...

Sorry, I was too hurry in sending good news. As appeared - the problem
is still there:

06:10:01 PM pgpgin/s pgpgout/s fault/s majflt/s pgfree/s
pgscank/s pgscand/s pgsteal/s %vmeff
06:20:01 PM 78.72 1408298.79 3394.73 0.54 384043.36
204.96 10.88 148.06 68.60
06:30:01 PM 51.71 1408936.46 3101.97 0.31 384366.70
211.51 19.30 148.59 64.38
06:40:01 PM 65.29 1366755.43 3506.76 0.65 372628.96
383.94 27.11 154.71 37.64
06:50:01 PM 66.89 1376770.04 3544.59 0.78 375676.38
344.94 12.48 149.84 41.92
07:00:01 PM 15.87 1364029.92 3030.66 0.31 371903.73
307.63 13.32 132.75 41.36
07:10:01 PM 134.24 1370087.74 3397.32 1.06 373435.85
394.16 21.16 166.62 40.12
07:20:01 PM 95.27 1379686.06 3462.59 0.74 376050.98
356.59 29.23 159.00 41.21
07:30:01 PM 12.66 1392306.35 2929.64 0.25 379662.45
395.07 28.66 138.58 32.71
07:40:01 PM 142.06 1368613.00 3486.30 0.98 372912.77
575.74 40.40 174.45 28.31
07:50:02 PM 119.77 1389287.62 3562.60 1.02 378661.56
584.00 29.36 167.92 27.38
08:00:01 PM 7.31 1368140.71 3082.65 0.50 372828.37
711.84 58.08 130.82 16.99
08:10:01 PM 174.74 1381379.24 3529.78 1.65 376567.77
1023.07 46.96 182.60 17.07
08:20:01 PM 205.05 1353845.50 3546.95 1.88 368828.37
1807.24 57.29 183.46 9.84
08:30:01 PM 86.50 1375125.95 3182.56 1.41 374676.32
10590.52 217.62 148.83 1.38
08:40:01 PM 359.38 1345209.72 4185.55 3.45 366630.54
1419665.54 43307.32 220.06 0.02
08:50:01 PM 277.27 1324076.64 3889.98 2.51 360915.41
1779132.87 410.38 200.49 0.01
09:00:02 PM 96.48 1364092.11 3235.53 1.69 372031.87
1973266.99 167.74 163.43 0.01
09:10:01 PM 298.87 1324763.68 3914.11 2.90 360957.82
1772066.29 435.17 204.08 0.01

And I also see the inactive file list bigger than the active file:

# cat /proc/vmstat | egrep
'Cached|nr_.*active_|pgsteal_.*_normal|pgscan_kswapd_normal|pgrefill_normal|nr_vmscan_write|nr_swap|pgact'
nr_inactive_anon 695534
nr_active_anon 14427464
nr_inactive_file 2786
nr_active_file 2698
nr_vmscan_write 1740097
pgactivate 115697891
pgrefill_normal 33345818
pgsteal_kswapd_normal 367908859
pgsteal_direct_normal 681266
pgscan_kswapd_normal 10255454426

So it looks like the time to check Minchan's patch really (disable the
check that inactive file list is bigger than active file)...

Andriy