Collaborator and friend Dan Alistarh talks at ETH about using the new NVFP4 and MXFP4 block formats for inference. In some cases accuracy goes from "terrible" to acceptable by using micro-rotations to smooth outliers within blocks. arxiv.org/abs/2509.23202 Great collaboration and cool stuff!

Nov 5, 2025 · 8:32 AM UTC
