Collaborator and friend Dan Alistarh talks at ETH about using the new NVFP4 and MXFP4 block formats for inference.
In some cases, accuracy goes from "terrible" to acceptable by using micro-rotations to smooth outliers within blocks.
arxiv.org/abs/2509.23202
Great collaboration and cool stuff
Nov 5, 2025 · 8:32 AM UTC




