AWS activates Project Rainier: One of the world’s largest AI compute clusters comes online.
It runs on ~500,000 Trainium2 chips, and Claude is expected to be on >1,000,000 Trainium2 chips by Dec-25, a huge jump in training and inference capacity.
AWS connected multiple US data centers into one UltraCluster so Anthropic can train larger Claude models and handle longer context and heavier workloads without slowing down.
Each Trn2 UltraServer links 64 Trainium2 chips through NeuronLink inside the node, and EFA networking connects those nodes across buildings, cutting latency and keeping the cluster flexible for massive scaling.
Trainium2 is optimized for matrix and tensor math with HBM3 memory, giving it extremely high bandwidth so huge batches and long sequences can be processed without waiting for data transfer.
The UltraServers act as powerful single compute units inside racks, while the UltraCluster spreads training across tens of thousands of these servers, using parallel processing to handle giant models efficiently.
AWS says Project Rainier is its largest training platform ever, delivering more than 5x the compute Anthropic used to train its previous models, allowing faster model training and easier large-scale experiments.
On water and energy, AWS reports a water usage effectiveness (WUE) of 0.15 L/kWh, matches 100% of the electricity it consumes with renewable power, and is investing in nuclear and batteries to keep growing while staying on track for its 2040 net-zero goal.
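A quick back-of-envelope check on the figures above, as a rough sketch only. It assumes every chip sits in a full 64-chip Trn2 UltraServer, which the announcement does not state, and the function name `ultraservers_needed` is made up for illustration.

```python
# Figures quoted in the post above.
CHIPS_NOW = 500_000          # "~500,000 Trainium2 chips"
CHIPS_TARGET = 1_000_000     # ">1,000,000 chips" expected by Dec-25
CHIPS_PER_ULTRASERVER = 64   # "Each Trn2 UltraServer links 64 Trainium2 chips"


def ultraservers_needed(chips: int, per_server: int = CHIPS_PER_ULTRASERVER) -> int:
    """Smallest number of full UltraServers that can hold `chips` chips.

    Assumption (not from the announcement): all chips are packaged
    in 64-chip UltraServers.
    """
    return -(-chips // per_server)  # ceiling division on integers


servers_now = ultraservers_needed(CHIPS_NOW)
servers_target = ultraservers_needed(CHIPS_TARGET)
growth = CHIPS_TARGET / CHIPS_NOW

print(f"UltraServers implied today:     {servers_now:,}")
print(f"UltraServers implied at target: {servers_target:,}")
print(f"Chip count growth factor:       {growth:.1f}x")
```

Under that assumption, the quoted chip counts work out to roughly 7,800 UltraServers today and about 15,600 at the 1,000,000-chip mark, so treat these as order-of-magnitude numbers rather than an official server count.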
---
aboutamazon.com/news/aws/aws-project-rainier-ai-trainium-chips-compute-cluster
About a year ago, this site near South Bend, Indiana was just cornfields. Today, it’s 1 of our U.S. data centers powering Project Rainier – one of the world’s largest AI compute clusters, built in collaboration with @AnthropicAI.
It is 70% larger than any AI computing platform in #AWS history, with nearly 500K Trainium2 chips, and is now fully operational with Anthropic actively using it to train and run inference for its industry-leading AI model, Claude (providing 5X+ the compute they used to train their previous AI models). We expect Claude to be on more than 1 million Trainium2 chips by the end of year.
Will help enable the next generation of AI innovation as we further extend our infrastructure leadership.
aboutamazon.com/news/aws/aws…