This is why having language runtimes that are NUMA aware like in JVM implementations (-XX:+UseNUMA) or languages like Chapel, is much easier than requiring everyone to be a top coder.
It isn't the same as manually doing the optimizations, still half way there is better than not at all.
It isn't the same as manually doing the optimizations, still half way there is better than not at all.