Saurabh Shah
@saurabh_shah2
Nov 7
you could do this or just do whatever kimi team did ig
Rohan Pandey
@khoomeik
Oct 31
idk what the fuss is, scaling RL to 1T+ params is simple all you need is:
1. a few thousand gpus
2. the og goat of opensource RL @vwxyzjn
3. the guy who invented the attention mechanism @DBahdanau