She blocked me from everywhere.
Not because I cheated.
Not because I was broke.
Not even because I said her sister was more beautiful.
But because, in her words:
"No matter what you do, you never win in life."
At first, I thought she was just calling me a loser.
Then my inner math brain clicked... She was literally describing the martingale principle.
In martingale theory, a fair game is one where your expected fortune after the next round is exactly the fortune you hold right now, no matter how many times you've won or lost before.
The math says: all the history you drag with you? On average, it doesn't change what happens next.
A martingale is a stochastic process, used to model fair games, in which the conditional expected value of the next observation, given all past observations, equals the current value.
Formulas
Discrete time:
- E[X(n+1) | X(1), X(2), ..., X(n)] = X(n)
General (with filtrations):
- E[X(t) | F(s)] = X(s) for all s < t
Where
- X(n) = Value of the process at time n
- E[·|·] = Conditional expectation
- F(s) = Information (filtration) available up to time s
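Here's a minimal simulation sketch (assuming Python with numpy; the path count, seed, and starting fortune are arbitrary choices) of one consequence of that property: across many fair ±1 games, the average fortune stays flat at its starting value.

```python
# A minimal sketch, assuming numpy: simulate many fair ±1 games and check
# that the average fortune stays flat over time, i.e. E[X(n)] = X(0).
import numpy as np

rng = np.random.default_rng(42)
n_paths, n_flips, start = 100_000, 50, 50

steps = rng.choice([+1, -1], size=(n_paths, n_flips))   # fair coin: ±1 each flip
fortunes = start + np.cumsum(steps, axis=1)             # X(1), ..., X(50) per path

# Column means hover around 50 at every time step (up to Monte Carlo noise).
print(fortunes.mean(axis=0)[[0, 9, 24, 49]])
```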
Let's take an example and work through it step by step.
A gambler plays a fair coin flip game:
- Heads: Win $1
- Tails: Lose $1
- Probability: P(H) = P(T) = 0.5
- Starting fortune: X(0) = 50
We want to verify this is a martingale.
Step 1: Current fortune
After 3 flips, suppose the fortune is $52: X(3) = 52
Step 2: Possible outcomes for the next flip
- Heads → X(4) = 52 + 1 = 53
- Tails → X(4) = 52 - 1 = 51
Step 3: Compute conditional expectation
- E[X(4) | X(1), X(2), X(3)]
- = P(H) · 53 + P(T) · 51
- = 0.5 × 53 + 0.5 × 51
- = 26.5 + 25.5 = 52
Therefore, E[X(4) | past] = X(3) = 52
Conclusion: The expected fortune after the next flip equals the current fortune → this is a martingale.
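If you want to double-check this in code, here's a small sketch (assuming Python with numpy; the sample size and seed are arbitrary) that reproduces Step 3 exactly and backs it up with a Monte Carlo estimate.

```python
import numpy as np

x3 = 52                                       # current fortune after 3 flips
exact = 0.5 * (x3 + 1) + 0.5 * (x3 - 1)       # P(H)·53 + P(T)·51
print(exact)                                  # 52.0

rng = np.random.default_rng(7)
flips = rng.choice([+1, -1], size=1_000_000)  # outcomes of the next flip
print((x3 + flips).mean())                    # ≈ 52, up to Monte Carlo noise
```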
Congratulations 🎉, you've just learned Martingale Theory!
Bonus: Applications in AI/ML
Reinforcement Learning
Convergence guarantees for temporal-difference (TD) learning of value functions in Markov Decision Processes (MDPs) rest on martingale properties: at the true value function, the TD error has zero conditional mean.
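As a hedged illustration (a toy two-state Markov reward process invented just for this sketch, using plain numpy rather than any RL library), the snippet below checks that at the true value function the TD(0) error has zero conditional mean, i.e. it behaves as a martingale difference:

```python
import numpy as np

gamma = 0.9
P = np.array([[0.7, 0.3],        # transition probabilities of a toy 2-state chain
              [0.4, 0.6]])
r = np.array([1.0, -0.5])        # expected reward collected in each state

# True value function solves the Bellman equation V = r + gamma * P @ V
V = np.linalg.solve(np.eye(2) - gamma * P, r)

rng = np.random.default_rng(0)
s = 0                                                # condition on the current state
next_states = rng.choice(2, size=1_000_000, p=P[s])  # sample transitions from s
td_errors = r[s] + gamma * V[next_states] - V[s]     # TD(0) error at the true V

print(td_errors.mean())   # ≈ 0: zero conditional mean => martingale difference
```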
Stochastic Gradient Descent (SGD)
The noise in mini-batch gradient estimates forms a martingale difference sequence, enabling rigorous convergence analysis and generalization bounds in neural network training.
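A small sketch of that claim (synthetic linear-regression data and a squared loss chosen purely for illustration, using numpy): the mini-batch gradient minus the full-batch gradient averages to zero given the current parameters.

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(10_000, 5))                       # made-up features
y = X @ np.array([1.0, -2.0, 0.5, 0.0, 3.0]) + rng.normal(size=10_000)
w = rng.normal(size=5)                                 # current (fixed) parameters

def mse_grad(Xb, yb, w):
    """Gradient of the mean squared error 0.5 * ||Xb @ w - yb||^2 / n for a batch."""
    return Xb.T @ (Xb @ w - yb) / len(yb)

full_grad = mse_grad(X, y, w)
noise = []
for _ in range(2_000):
    idx = rng.choice(len(X), size=32, replace=False)   # random mini-batch
    noise.append(mse_grad(X[idx], y[idx], w) - full_grad)

print(np.mean(noise, axis=0))   # ≈ 0 in every coordinate: a martingale difference
```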
Sequential Hypothesis Testing
Under the null hypothesis, the running likelihood ratio is a martingale, which supports efficient stopping rules in A/B testing and statistical quality control (e.g., Wald's SPRT).
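And a minimal simulation sketch (Gaussian null and alternative chosen purely for illustration, using numpy): when the data really come from the null, the running likelihood ratio keeps expected value 1 at every step, which is exactly the martingale property the SPRT exploits.

```python
import numpy as np

def normal_pdf(x, mu):
    """Density of N(mu, 1) evaluated at x."""
    return np.exp(-0.5 * (x - mu) ** 2) / np.sqrt(2 * np.pi)

rng = np.random.default_rng(3)
n_paths, n_obs = 100_000, 20
x = rng.normal(loc=0.0, size=(n_paths, n_obs))   # data actually drawn from H0: N(0, 1)

# Running likelihood ratio of H1: N(0.2, 1) against H0: N(0, 1)
lr = np.cumprod(normal_pdf(x, 0.2) / normal_pdf(x, 0.0), axis=1)

print(lr.mean(axis=0)[[0, 4, 9, 19]])   # each ≈ 1 = L(0), up to Monte Carlo noise
```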