Making data available to a thread on the same core has exceptionally high bandwidth. Just that alone will help with scaling.
Making data available to a thread on the same core has exceptionally high bandwidth. Just that alone will help with scaling.