Michael Anti · Aug 27, 2025 · 12:06 PM UTC

Michael Anti

Yudong Jin retweeted

Michael Anti

@mranti

Aug 27

凯恩在备战明年的CSP-J（今年有12岁年龄限制），找来找去，发现最好的算法书是 @krahets 的《Hello算法》，我们买的是python代码版本（网上有各语言版开源），但新C++语法其实看起来和Python没多大区别，凯恩读起来没障碍。这本书真的是大人小孩都能看。

380

MrNeRF · Jul 18, 2025 · 5:54 AM UTC

Yudong Jin retweeted

MrNeRF

@janusch_patas

Jul 18

Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models Contributions: • We introduce Diffuman4D, a novel diffusion model that generates spatio-temporally consistent and high-resolution (1024p) human videos from sparse-view video inputs. • We propose a sliding iterative denoising mechanism that enhances both the spatial and temporal consistency of generated long-term videos while maintaining efficient inference. • We design a human pose conditioning scheme to enhance the appearance quality and motion accuracy of generated human videos. • We plan to release our processed version of the DNA-Rendering dataset, which we believe will benefit future research in this area.

458

Yudong Jin · Dec 23, 2024 · 4:18 AM UTC

Yudong Jin @krahets

23 Dec 2024

Want to model reflective scenes and render them in real-time? Check out EnvGS!

MrNeRF

@janusch_patas

22 Dec 2024

EnvGS: Modeling View-Dependent Appearance with Environment Gaussian Contributions: • We propose a novel scene representation for accurately modeling complex near-field and high-frequency reflections in real-world environments. • We developed a real-time ray-tracing renderer for 2DGS, enabling joint optimization of our representation for accurate scene reconstruction while achieving real-time rendering speeds. • Extensive experiments show that EnvGS significantly outperforms previous methods. To the best of our knowledge, EnvGS is the first method to achieve real-time photorealistic specular reflections synthesis in real-world scenes.

Haotong Lin · Dec 19, 2024 · 4:12 AM UTC

Yudong Jin retweeted

Haotong Lin @HaotongLin

19 Dec 2024

Check out our new work, Prompt Depth Anything, which achieves accurate metric depth estimation at up to 4K resolution! Thanks to all our collaborators!

Bingyi Kang

@bingyikang

19 Dec 2024

Want to use Depth Anything, but need metric depth rather than relative depth? Thrilled to introduce Prompt Depth Anything, a new paradigm for accurate metric depth estimation with up to 4K resolution. 👉Key Message: Depth foundation models like DA have already internalized rich geometric knowledge of the 3D world but lack a proper way to elicit it. Inspired by the success of prompting in LLMs, we propose prompting Depth Anything with metric cues to produce metric depth. This method proves to be very effective when using a low-cost lidar (e.g., iPhone's LiDAR), which is widely available, as prompts. We believe the prompt can generalize to other forms as long as scale information is provided. Prompt Depth Anything offers 1⃣A series of models for iPhone lidars. 2⃣4D reconstruction from monocular videos (captured with iPhone). 3⃣Improved generalization ability for robot manipulation, e.g. Training on cans but generalizing on glasses. 4⃣More detailed depth annotations for the ScanNet++ dataset. The first author is our excellent intern @HaotongLin. Paper: huggingface.co/papers/2412.1… Huggingface: huggingface.co/papers/2412.1… Project Page: promptda.github.io Code: github.com/DepthAnything/Pro…

Ruiqi Gao · Nov 28, 2024 · 3:08 AM UTC

Yudong Jin retweeted

Ruiqi Gao

@RuiqiGao

28 Nov 2024

CAT3D + time => CAT4D! 🐈 Check out our latest work on turning text/image(s)/video into dynamic 3D models that one can explore in real time, led by brilliant @ChrisWu6080!

Rundi Wu @ChrisWu6080

28 Nov 2024

🚀 Introducing CAT4D! 🚀 CAT4D transforms any real or generated video into dynamic 3D scenes with a multi-view video diffusion model. The outputs are dynamic 3D models that we can freeze and look at from novel viewpoints, in real-time! Be sure to try our interactive viewer!

119

Yudong Jin · Aug 12, 2024 · 11:00 AM UTC

Yudong Jin @krahets

12 Aug 2024

Awesome! The transformer version of cnn-explainer.

Brendan Bycroft

@BrendanBycroft

2 Dec 2023

Project #2: LLM Visualization So I created a web-page to visualize a small LLM, of the sort that's behind ChatGPT. Rendered in 3D, it shows all the steps to run a single token inference. (link in bio)

Physics In History · Aug 1, 2024 · 7:16 PM UTC

Yudong Jin retweeted

Physics In History

@PhysInHistory

1 Aug 2024

139

7,021

233

50,554

Ashish · Jul 5, 2024 · 12:24 PM UTC

Yudong Jin retweeted

Ashish

@Ash_uxi

5 Jul 2024

🗣️

227

4,657

Yudong Jin · Apr 1, 2024 · 6:54 PM UTC

Yudong Jin @krahets

1 Apr 2024

今天看到了一位读者的评论，心情久久未能平复... 愿功夫不负有心人！

340

Yudong Jin · Mar 22, 2024 · 8:43 AM UTC

Yudong Jin @krahets

22 Mar 2024

附文章链接（阮老师 YYDS ！😭） ruanyifeng.com/blog/2010/08/…

Yudong Jin · Mar 22, 2024 · 8:43 AM UTC

Yudong Jin @krahets

22 Mar 2024

前段时间重读了一下阮老师 2010 年写的博客「关于 IT 出版业」，颇有感触。十五年了，文章里谈到的畅销书《C++ Primer》仍然名列前茅，可谓经久不衰。这在互联网平台上是难以想象的。 “作者版税”“译者报酬”等话题，读起来似乎“时空停滞”了。出版业是一个有钝感力的行业。

ruanyf @ruanyf

22 Mar 2024

周五分享 - Hello 算法（图一）：开源的算法入门书籍hello-algo.com/chapter_paper… - StockCake（图二）：无限的无版权AI生成图片下载stockcake.com/ - KanjiVG（图三）：汉字SVG文件下载，有笔划动画kanjivg.tagaini.net/index.ht… #科技爱好者周刊（第294期）ruanyifeng.com/blog/2024/03/…

Yudong Jin · Mar 11, 2024 · 12:53 PM UTC

Yudong Jin @krahets

11 Mar 2024

❔我们为什么要学习数据结构与算法？ 📗《Hello算法》纸质书长什么样？ 🌈为什么要做开源书？新人 UP 主，请多多关照、一键三连～ bilibili.com/video/BV1QH4y15…

243

𝗞𝗶𝘀𝗵𝗶𝗺𝗼𝘁𝗼 岸本斉史 (Parody) · Mar 8, 2024 · 11:46 AM UTC

Yudong Jin retweeted

𝗞𝗶𝘀𝗵𝗶𝗺𝗼𝘁𝗼 岸本斉史 (Parody)

@kishimotomasshi

8 Mar 2024

“Son Goku from Dragon Ball is the ultimate Shōnen Jump model that made me think ‘now THIS is a main character’ I wanted a character like Goku in my manga. It’s that clear & simple mindset that makes readers feel great. It motivated me too. That’s my image of a Hero” — Kishimoto

304

5,110

142

43,507

Massimo · Mar 4, 2024 · 2:23 PM UTC

Yudong Jin retweeted

Massimo

@Rainmaker1973

4 Mar 2024

Instant AI art x.com/i/status/1764656365107…

335

1,969

292

15,142

Stability AI · Feb 22, 2024 · 1:24 PM UTC

Yudong Jin retweeted

Stability AI

@StabilityAI

22 Feb 2024

Announcing Stable Diffusion 3, our most capable text-to-image model, utilizing a diffusion transformer architecture for greatly improved performance in multi-subject prompts, image quality, and spelling abilities. Today, we are opening the waitlist for early preview. This phase is crucial for gathering insights to improve its performance and safety ahead of open release. You can sign up to join the waitlist and learn more here: bit.ly/3OR2qQF #stablediffusion3 Prompt: Epic anime artwork of a wizard atop a mountain at night casting a cosmic spell into the dark sky that says "Stable Diffusion 3" made out of colorful energy

239

1,254

446

5,018

OpenAI · Feb 15, 2024 · 6:14 PM UTC

Yudong Jin retweeted

OpenAI

@OpenAI

15 Feb 2024

Introducing Sora, our text-to-video model. Sora can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions. openai.com/sora Prompt: “Beautiful, snowy Tokyo city is bustling. The camera moves through the bustling city street, following several people enjoying the beautiful snowy weather and shopping at nearby stalls. Gorgeous sakura petals are flying through the wind along with snowflakes.”

9,189

30,383

42,201

132,306