Digital Geometer, Assoc. Prof. of Computer Science & Robotics @CarnegieMellon @SCSatCMU and member of the @GeomCollective. There are four lights.

Pittsburgh, PA
Joined January 2008
I work on fundamental algorithms for geometric and visual computing. Here's a taste of our group's work, as a list of "explainer" threads posted on Twitter/X over the years. (🧵) For a whole lot more, check out: cs.cmu.edu/~kmcrane/ geometry.cs.cmu.edu/
“Fair dice” might make you think of perfect cubes with equal frequencies (say, 1/6 on all sides). 🎲 But “fair” really just means you get the frequencies you expect (say, 1/4, 1/4 & 1/2). We can now design fair dice with any frequencies—and any shape! 🐉 hbaktash.github.io/projects/…
I got tired of mashing together tools to write long threads with 𝐫𝐢𝐜𝐡 𝑓𝑜𝑟𝑚𝑎𝑡𝑡𝑖𝑛𝑔 and ℳα†ℏ—so I wrote La𝑇𝑤𝑒𝑒𝑡! It converts Markdown and LaTeX to Unicode that can be used in “tweets”, and automatically splits long threads. Try it out! keenancrane.github.io/LaTwee…
P.S. I should also mention that these diagrams were significantly improved via feedback from many folks from here and elsewhere. Hopefully they account for some of the gripes—if not, I'm ready for the next batch! 😉
I can't* fathom why the top picture, and not the bottom picture, is the standard diagram for an autoencoder. The whole idea of an autoencoder is that you complete a round trip and seek cycle consistency—why lay out the network linearly?
Of course, there will be those who say that the representation diagram is “obvious,” and “that's what everyone has in their head anyway.” If so… good for you! If not, I hope this alternative picture provides some useful insight as you hack in this space. [End 🧵]
If you want to use or repurpose these diagrams, the source files (as PDF) can be found at cs.cmu.edu/~kmcrane/Autoenco…
Likewise, here's a simpler “implementation” diagram that still retains the most important idea of an *auto*-encoder, namely, that you're comparing the output against *itself*.
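In code, that idea is just a reconstruction loss whose target is the input itself. Here's a minimal sketch (not from the thread; the encoder f and decoder g are placeholders for any pair of networks):

```python
import torch

def reconstruction_loss(f, g, x):
    """The 'auto' in autoencoder: the output is compared against the input itself."""
    z = f(x)                              # encode: data -> latent
    x_hat = g(z)                          # decode: latent -> reconstruction
    return torch.mean((x_hat - x) ** 2)   # no external labels, just x vs. x-hat
```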
Personally, I find both of these diagrams a little bit crowded—here's a simpler “representation” diagram, with fewer annotations (that might anyway be better explained in accompanying text).
Finally, a natural question raised by this picture is: how do I sample/generate new latents z? For a “vanilla” autoencoder, there's no simple a priori description of the high-density regions. This situation motivates *variational* autoencoders (which are a whole other story…).
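(One ad hoc workaround, purely as a sketch and not something from the thread: fit a density to the latents of the training set after the fact, then sample from that. The latents below are random stand-ins for the encoded training data.)

```python
import numpy as np
from scipy.stats import gaussian_kde

# Stand-in for latents z = f(x_i) of the training set, shape (k, n) as gaussian_kde expects.
rng = np.random.default_rng(0)
z_train = rng.normal(size=(2, 500)) * np.array([[1.0], [0.2]])

density = gaussian_kde(z_train)   # estimate the high-density regions *after* training
z_new = density.resample(100)     # draw 100 new latents, shape (k, 100)
# x_new = g(z_new.T)              # decoding these would generate new data points
```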
It should also be clear that, unless the reconstruction loss is exactly zero, the learned manifold M only approximates (rather than interpolates) the given data. For instance, x does not sit on M, even though x̂ does. (If M does interpolate all xᵢ, you're probably overfitting)
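A tiny linear toy (my example, not from the thread) makes the distinction concrete: project 2-D data onto a line M. The reconstruction x̂ lands on M, the original x generally does not, and the gap is exactly the reconstruction error.

```python
import numpy as np

u = np.array([1.0, 1.0]) / np.sqrt(2)   # unit direction spanning the "learned" manifold M
f = lambda x: x @ u                      # encoder: R^2 -> R (k = 1)
g = lambda z: np.outer(z, u)             # decoder: R -> R^2, image is the line M

x = np.array([[2.0, 0.0]])               # a data point that does not sit on M
x_hat = g(f(x))                          # its reconstruction does sit on M
print(np.linalg.norm(x - x_hat))         # nonzero: M approximates, not interpolates, the data
```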
Another thing made clear by this picture is that, no matter what the true dimension of the data might be, the manifold M predicted by the decoder generically has the same dimension as the latent space: it's the image of R^k under g. So, the latent dimension is itself a prior.
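For instance (a toy example, not from the thread), any decoder with a 1-D latent space traces out a curve, even if it lives in R^3 and the “true” data was 2-D:

```python
import numpy as np

# Toy decoder g: R^1 -> R^3. Its image is generically a 1-D curve, never a surface,
# regardless of the dimension of the data it was trained on.
g = lambda z: np.stack([np.cos(z), np.sin(z), 0.3 * z], axis=-1)

z = np.linspace(-3.0, 3.0, 1000)   # sweep the entire 1-D latent space
M = g(z)                           # shape (1000, 3): a helix; k = 1 no matter what
```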
In regions where we don't have many samples, the decoder g isn't reliable: we're basically extrapolating (i.e., guessing) what the true data manifold looks like. The diagram suggests this idea by “cutting off” the manifold—but in reality there’s no clear, hard cutoff.
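(If you wanted to make that soft cutoff computational, one crude heuristic, purely hypothetical here, is to trust the decoder at a latent z only when enough training latents landed nearby:)

```python
import numpy as np

def is_reliable(z_query, z_train, radius=0.5, min_neighbors=10):
    """Crude heuristic: trust g(z_query) only if many training latents fall within `radius`."""
    dists = np.linalg.norm(z_train - z_query, axis=1)   # z_train: (n, k), z_query: (k,)
    return np.count_nonzero(dists < radius) >= min_neighbors
```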
Here's a way of visualizing the maps *defined by* an autoencoder. The encoder f maps high-dimensional data x to low-dimensional latents z. The decoder tries to map z back to x. We *always* learn a k-dimensional submanifold M, which is reliable only where we have many samples z.
This picture is great if you want to simply close your eyes and implement something. But suppose your implementation doesn't work—or you want to squeeze more performance out of your data. Is there another picture that helps you think about what's going on? (Obviously: yes!)
With autoencoders, the first (and last) picture we see often looks like this one: a network architecture diagram, where inputs get “compressed”, then decoded. If we're lucky, someone bothers to draw arrows that illustrate the main point: we want the output to look like the input!
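For concreteness, here's a minimal PyTorch-style sketch of that picture (layer sizes and the data batch are arbitrary placeholders, not anything from the thread):

```python
import torch
import torch.nn as nn

encoder = nn.Sequential(nn.Linear(784, 64), nn.ReLU(), nn.Linear(64, 2))   # "compress" to R^2
decoder = nn.Sequential(nn.Linear(2, 64), nn.ReLU(), nn.Linear(64, 784))   # decode back to R^784

opt = torch.optim.Adam([*encoder.parameters(), *decoder.parameters()], lr=1e-3)
x = torch.rand(128, 784)                 # stand-in for a batch of training data
for _ in range(1000):
    x_hat = decoder(encoder(x))          # round trip through the bottleneck
    loss = ((x_hat - x) ** 2).mean()     # we want the output to look like the input
    opt.zero_grad(); loss.backward(); opt.step()
```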
A similar thing happens when (many) people learn linear algebra: They confuse the representation (matrices) with the objects represented by those matrices (linear maps… or is it a quadratic form?)
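The same pitfall in a few lines of numpy (my toy example): one linear map, two different matrices depending on the basis; and one matrix, two different objects depending on whether you read it as a map or a quadratic form.

```python
import numpy as np

A = np.array([[2.0, 1.0],
              [0.0, 3.0]])          # a matrix: the *representation*
P = np.array([[1.0, 1.0],
              [0.0, 1.0]])          # columns of P = a different choice of basis
A_prime = np.linalg.inv(P) @ A @ P  # same linear map, different matrix

x = np.array([1.0, 2.0])
print(A @ x)        # reading A as a linear map:      x  ->  A x
print(x @ A @ x)    # reading A as a quadratic form:  x  ->  x^T A x
```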
“Everyone knows” what an autoencoder is… but there's an important complementary picture missing from most introductory material. In short: we emphasize how autoencoders are implemented—but not always what they represent (and some of the implications of that representation).🧵
P.S. I'm sure there will also be people who comment, "But that's not the standard diagram for an autoencoder, it's actually…" Yes, please show me the better diagrams people are using! I chose this as the "standard" one because it appears on Wikipedia: en.wikipedia.org/wiki/Autoen…
I'm sure there are people who will comment, "Well actually, I prefer the top image because…" (and I'm very interested to hear those reasons). But for anyone interested in using the bottom image, I've put a PDF version here (CC0 1.0 Universal): cs.cmu.edu/~kmcrane/Autoenco… Enjoy!