The performance of two coupled Musubi solvers decreases with increasing problem sizes for a cavity setup.
Timing results from the first Musubi:
# Revision SimName DomSize nProcs MLUPs MLUPs_kernel imbalance(%) timeMusubi maxIter totalDens timeMainLoop timeLoadMesh timeInitLvlD timewRestart timeBalance timeSource timeAux timeRelax Comp(%) Comm(%) BCbuffer(%) BC(%) Intp(%)
f80edce4580b+ cavity_fluid 29791 192 200.92E+00 5.07E+03 107.10 4.3648E+00 5000 29.791000000E+03 0.7686 0.1334 0.0223 0.0000 0.0000 0.0489 0.07 0.00 3.96 18.84 1.05 0.39 0.00
f80edce4580b+ cavity_fluid 64000 192 391.42E+00 5.80E+03 105.22 3.7766E+00 5000 64.000000000E+03 0.8420 0.2910 0.0223 0.0000 0.0000 0.0986 0.11 0.01 6.74 15.83 1.77 0.37 0.00
f80edce4580b+ cavity_fluid 125000 192 539.73E+00 6.87E+03 103.52 4.6628E+00 5000 125.000000000E+03 1.1858 0.7073 0.0310 0.0000 0.0000 0.1744 0.19 0.01 7.85 12.77 1.89 0.26 0.00
f80edce4580b+ cavity_fluid 250047 192 427.97E+00 6.58E+03 111.40 6.6400E+00 5001 250.047000000E+03 2.9539 0.2247 0.0403 0.0000 0.0000 0.3337 0.33 0.01 6.51 7.54 1.10 0.12 0.00
f80edce4580b+ cavity_fluid 493039 192 686.86E+00 6.06E+03 107.08 9.4986E+00 5001 493.039000000E+03 3.6192 0.2972 0.0488 0.0000 0.0000 0.6423 0.91 0.03 11.34 7.54 1.31 0.10 0.00
f80edce4580b+ cavity_fluid 1000000 192 597.78E+00 3.60E+03 107.01 18.5579E+00 5000 1.000000000E+06 8.4201 0.2001 0.0806 0.0000 0.0000 1.4636 1.76 0.09 16.61 14.20 1.68 0.07 0.00
f80edce4580b+ cavity_fluid 2000376 192 400.90E+00 1.36E+03 104.89 44.2614E+00 5001 2.000376000E+06 25.0388 0.7530 0.1505 0.0000 0.0000 4.3404 5.92 0.28 29.50 9.84 2.19 0.03 0.00
f80edce4580b+ cavity_fluid 4019679 192 312.79E+00 1.03E+03 105.59 97.2472E+00 5000 4.019679000E+06 64.3636 0.8667 0.3023 0.0000 0.0000 13.6290 15.66 0.56 30.35 6.76 1.50 0.02 0.00
f80edce4580b+ cavity_fluid 8000000 192 285.86E+00 946.83E+00 105.85 203.4730E+00 5000 8.000000000E+06 140.0620 0.9625 0.5397 0.0000 0.0000 33.1516 38.22 2.00 30.19 10.85 1.16 0.01 0.00
f80edce4580b+ cavity_fluid 16003008 192 271.22E+00 1.01E+03 105.22 430.3999E+00 5001 16.003008000E+06 295.6390 2.3404 1.0559 0.0000 0.0000 77.7521 89.46 4.75 26.77 12.74 0.96 0.01 0.00
Timing results from the second Musubi:
f80edce4580b+ cavity_T 29791 192 218.42E+00 2.31E+03 112.99 4.3593E+00 5000 14.843709680E+03 0.7090 1.4949 0.4072 0.0000 0.0000 0.0006 0.05 0.00 9.46 12.48 1.13 2.94 0.00
f80edce4580b+ cavity_T 64000 192 491.81E+00 2.50E+03 109.97 3.7775E+00 5000 31.748950607E+03 0.6771 1.0617 0.0277 0.0000 0.0000 0.0006 0.06 0.00 19.66 13.58 2.07 4.04 0.00
f80edce4580b+ cavity_T 125000 192 661.28E+00 2.56E+03 110.56 4.6638E+00 5000 60.755966848E+03 0.9729 1.1322 0.0317 0.0000 0.0000 0.0007 0.11 0.00 25.86 10.07 2.52 4.46 0.00
f80edce4580b+ cavity_T 250047 192 420.54E+00 2.28E+03 128.81 6.6399E+00 5001 117.112515274E+03 3.0120 0.5505 0.0888 0.0000 0.0000 0.0009 0.17 0.00 18.42 3.87 1.11 2.11 0.00
f80edce4580b+ cavity_T 493039 192 861.90E+00 2.33E+03 113.28 9.4982E+00 5001 219.241497252E+03 2.8871 1.2838 0.0824 0.0000 0.0000 0.0010 0.34 0.00 37.00 14.19 1.61 3.29 0.00
f80edce4580b+ cavity_T 1000000 192 808.66E+00 2.00E+03 106.09 18.5589E+00 5000 412.217421027E+03 6.2318 1.2603 0.0899 0.0000 0.0000 0.0013 0.98 0.00 40.43 10.36 2.37 3.05 0.00
f80edce4580b+ cavity_T 2000376 192 545.95E+00 1.13E+03 104.33 44.2620E+00 5001 749.071840980E+03 18.3551 4.8015 0.1785 0.0000 0.0000 0.0023 3.72 0.00 48.25 9.60 3.06 3.05 0.00
f80edce4580b+ cavity_T 4019679 192 388.82E+00 832.38E+00 105.98 97.2482E+00 5000 1.340734242E+06 51.7386 1.8442 0.4857 0.0000 0.0000 0.0035 9.62 0.00 46.71 9.69 2.51 2.54 0.00
f80edce4580b+ cavity_T 8000000 192 398.31E+00 674.56E+00 105.25 203.4739E+00 5000 2.342155756E+06 100.7803 2.0471 0.6323 0.0000 0.0000 0.0048 23.37 0.00 59.05 2.79 2.50 2.34 0.00
f80edce4580b+ cavity_T 16003008 192 385.29E+00 619.10E+00 104.87 430.3996E+00 5001 4.076251599E+06 207.9912 4.1267 1.1955 0.0000 0.0000 0.0049 44.81 0.01 62.23 5.55 1.97 1.96 0.00
It seems that MLUPs_kernel becomes stable with the increase of elements per node.
Originally posted by @MikeW097 in #4 (comment)
The performance of two coupled Musubi solvers decreases with increasing problem sizes for a cavity setup.
Timing results from the first Musubi:
Timing results from the second Musubi:
It seems that
MLUPs_kernelbecomes stable with the increase of elements per node.Originally posted by @MikeW097 in #4 (comment)