Filter
Exclude
Time range
-
Near
you could do this or just do whatever kimi team did ig
idk what the fuss is, scaling RL to 1T+ params is simple all you need is: 1. a few thousand gpus 2. the og goat of opensource RL @vwxyzjn 3. the guy who invented the attention mechanism @DBahdanau
12