NVIDIA just released [cutile](https://developer.nvidia.com/cuda/tile) There might be an opportunity to leverage this for faster `gpu-stump`
NVIDIA just released cutile
There might be an opportunity to leverage this for faster
gpu-stump