I don't think we should be using numa distance to reverse
certain allocation behavior. The latency data should be truthful, but
you're right we'll need a mechanism to keep general purpose
allocations out of that range by default.
Just to clarify: Do you propose/thinking to utilize NUMA API for
such (VRAM) allocations?