Looking at the connections-per-input-pixel graphs in your Jupyter notebook, I see an interesting effect.
Unless it is a rendering bug, your simulations show a smaller number of connections in the central area than in the rest of the image, while Mocanu et al. show a larger number of connections in the central area than in the rest of it...
I'll try to figure out whether it is just a rendering inversion or an actual effect here...
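One way to tell the two apart without relying on the colormap is to compare the raw counts numerically. Below is a minimal sketch of that check, not code from the notebook: the function names, the `(n_hidden, n_inputs)` mask layout, the 28x28 input shape, and the 7-pixel margin are all my assumptions, and the mask here is synthetic stand-in data rather than your actual weights.

```python
import numpy as np


def connections_per_pixel(mask, image_shape=(28, 28)):
    """Count nonzero connections for each input pixel.

    Assumes `mask` has shape (n_hidden, n_inputs); transpose it first
    if the notebook stores the weights the other way around.
    """
    counts = np.count_nonzero(mask, axis=0)
    return counts.reshape(image_shape)


def center_vs_border(counts, margin=7):
    """Return (mean count in the central square, mean count elsewhere)."""
    center = np.zeros(counts.shape, dtype=bool)
    center[margin:-margin, margin:-margin] = True
    return counts[center].mean(), counts[~center].mean()


# Synthetic stand-in for the real connectivity mask (an assumption):
# connections are made denser in the centre on purpose.
rng = np.random.default_rng(0)
density = np.full((28, 28), 0.05)
density[7:-7, 7:-7] = 0.30
mask = rng.random((100, 28 * 28)) < density.reshape(-1)

counts = connections_per_pixel(mask)
center_mean, border_mean = center_vs_border(counts)
print(f"center: {center_mean:.1f}, border: {border_mean:.1f}")
```

If the two means computed from the real mask disagree with what the plot suggests, the inversion is probably in the rendering (colormap direction or axis orientation) rather than in the simulation.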
(Other than that, I am happy to report that this notebook runs successfully under PyTorch 1.0 and Python 3.7.)