Almost 2 years at NVIDIA, and the Tile IR project has been a very large part of my time here!
So happy to see it finally coming to light. The CUDA GPU driver will now include a
#MLIR-based JIT compiler! :)
More MLIR-based announcements at GTC tomorrow in the CUTLASS 4.0 session!
We've announced cuTile, a tile programming model for CUDA!
It's an array-based paradigm where the compiler automates memory movement, pipelining & tensor core utilization, making GPU programming easier & more portable.
I'm proud of my stellar team for all their hard work on this!
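
For anyone new to the idea, here's a rough sketch of what "thinking in tiles" looks like, in plain Python. This is not the cuTile API: `TILE`, `load_tile`, `tile_mma` and `tiled_matmul` are hypothetical names made up for illustration, and everything runs on the CPU. The point is only the shape of the program: you write whole-tile loads and a tile-level multiply-accumulate, and in a tile programming model the compiler would be the one deciding how those map to memory movement, pipelining and tensor cores.

```python
# Illustrative sketch only: plain Python on the CPU, NOT the cuTile API.
# All names here are hypothetical and exist just for this example.

TILE = 2  # assumed tile edge length, chosen arbitrarily


def load_tile(mat, row, col, size=TILE):
    """Return the (up to) size x size block of mat starting at (row, col)."""
    return [r[col:col + size] for r in mat[row:row + size]]


def tile_mma(acc, a, b):
    """Tile-level multiply-accumulate: acc += a @ b, done in place."""
    for i in range(len(a)):
        for j in range(len(b[0])):
            acc[i][j] += sum(a[i][k] * b[k][j] for k in range(len(b)))


def tiled_matmul(A, B, size=TILE):
    """Compute A @ B one output tile at a time."""
    m, k, n = len(A), len(B), len(B[0])
    C = [[0] * n for _ in range(m)]
    for i in range(0, m, size):          # each (i, j) pair is one output tile
        for j in range(0, n, size):
            rows, cols = min(size, m - i), min(size, n - j)
            acc = [[0] * cols for _ in range(rows)]
            for p in range(0, k, size):  # walk the shared K dimension in tiles
                tile_mma(acc, load_tile(A, i, p, size), load_tile(B, p, j, size))
            for r, row in enumerate(acc):  # store the finished tile into C
                C[i + r][j:j + cols] = row
    return C


if __name__ == "__main__":
    A = [[1, 2, 3], [4, 5, 6]]
    B = [[7, 8], [9, 10], [11, 12]]
    print(tiled_matmul(A, B))
```

Notice there is no per-thread indexing anywhere in the sketch; the unit of work is the tile, which is the part of the paradigm the post is describing.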