yesterday, Hugging Face dropped a 214-page MASTERCLASS on how to train LLMs
> it’s called The Smol Training Playbook
> and if you want to learn how to train LLMs,
> this GIFT is for you
> this training bible walks you through the ENTIRE pipeline
> covers every concept that matters from why you train,
> to what you train, to how you actually pull it off
> from pre-training, to mid-training, to post-training
> it turns vague buzzwords into step-by-step decisions
> architecture, tokenization, data strategy, and infra
> highlights the real-world gotchas
> instabilities, scaling headaches, debugging nightmares
> distills lessons from building actual
> state-of-the-art LLMs, not just toy models
how modern transformer models are actually built
> tokenization: the secret foundation of every LLM
> tokenizer fundamentals
> vocabulary size
> byte pair encoding
> custom vs existing tokenizers
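> to make the tokenizer piece concrete, here's a minimal sketch of training a custom byte-level bpe tokenizer with the hugging face `tokenizers` library; the corpus, vocab size, and special token are illustrative assumptions, not the playbook's exact recipe
```python
# minimal byte-level BPE tokenizer training sketch (hugging face `tokenizers`)
# corpus, vocab size, and special tokens are illustrative assumptions
from tokenizers import Tokenizer, models, pre_tokenizers, decoders, trainers

tokenizer = Tokenizer(models.BPE())
tokenizer.pre_tokenizer = pre_tokenizers.ByteLevel(add_prefix_space=False)
tokenizer.decoder = decoders.ByteLevel()

trainer = trainers.BpeTrainer(
    vocab_size=64_000,                 # vocab size is one of the key design knobs
    special_tokens=["<|endoftext|>"],  # reserve special tokens up front
)

# train from any iterator of raw text lines
corpus = ["hello world", "tokenizers turn text into integer ids"]  # placeholder corpus
tokenizer.train_from_iterator(corpus, trainer=trainer)

print(tokenizer.encode("hello world").tokens)
```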
> all the modern attention mechanisms are here
> multi-head attention
> multi-query attention
> grouped-query attention
> multi-head latent attention (mla)
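> to see what grouped-query attention actually buys you, here's a small pytorch sketch where several query heads share each key/value head, shrinking the kv cache; the model width and head counts are illustrative assumptions
```python
# grouped-query attention sketch: n_q_heads query heads share n_kv_heads k/v heads
# all sizes are illustrative assumptions; n_kv_heads == n_q_heads gives MHA, n_kv_heads == 1 gives MQA
import torch
import torch.nn as nn
import torch.nn.functional as F

class GroupedQueryAttention(nn.Module):
    def __init__(self, d_model=512, n_q_heads=8, n_kv_heads=2):
        super().__init__()
        assert n_q_heads % n_kv_heads == 0
        self.n_q, self.n_kv = n_q_heads, n_kv_heads
        self.d_head = d_model // n_q_heads
        self.wq = nn.Linear(d_model, n_q_heads * self.d_head, bias=False)
        self.wk = nn.Linear(d_model, n_kv_heads * self.d_head, bias=False)
        self.wv = nn.Linear(d_model, n_kv_heads * self.d_head, bias=False)
        self.wo = nn.Linear(n_q_heads * self.d_head, d_model, bias=False)

    def forward(self, x):
        b, t, _ = x.shape
        q = self.wq(x).view(b, t, self.n_q, self.d_head).transpose(1, 2)
        k = self.wk(x).view(b, t, self.n_kv, self.d_head).transpose(1, 2)
        v = self.wv(x).view(b, t, self.n_kv, self.d_head).transpose(1, 2)
        # replicate each kv head across its query group
        k = k.repeat_interleave(self.n_q // self.n_kv, dim=1)
        v = v.repeat_interleave(self.n_q // self.n_kv, dim=1)
        out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
        return self.wo(out.transpose(1, 2).reshape(b, t, -1))

x = torch.randn(1, 16, 512)
print(GroupedQueryAttention()(x).shape)  # torch.Size([1, 16, 512])
```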
> every positional encoding trick in the book
> absolute position embedding
> rotary position embedding
> yarn (yet another rope extension)
> ablate-by-frequency positional encoding
> no position embedding
> randomized no position embedding
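> rotary position embedding is simple enough to write out in full; a minimal sketch below (the head dim and the 10,000 base are the usual defaults, assumed here rather than taken from the playbook; yarn-style context extension works by rescaling these frequencies)
```python
# rotary position embedding (RoPE) sketch: rotate q/k feature pairs by a position-dependent angle
# head_dim and the 10_000 base frequency are common defaults, assumed here
import torch

def rope(x, base=10_000.0):
    # x: (batch, heads, seq_len, head_dim), head_dim must be even
    b, h, t, d = x.shape
    inv_freq = 1.0 / (base ** (torch.arange(0, d, 2, dtype=torch.float32) / d))
    angles = torch.arange(t, dtype=torch.float32)[:, None] * inv_freq[None, :]  # (t, d/2)
    cos, sin = angles.cos(), angles.sin()
    x1, x2 = x[..., 0::2], x[..., 1::2]
    rotated = torch.stack([x1 * cos - x2 * sin, x1 * sin + x2 * cos], dim=-1)
    return rotated.flatten(-2)  # interleave pairs back into (b, h, t, d)

q = torch.randn(1, 8, 16, 64)
print(rope(q).shape)  # torch.Size([1, 8, 16, 64])
```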
> stability hacks that actually work
> z-loss regularization
> query-key normalization
> removing weight decay from embedding layers
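> two of these stability tricks are only a few lines of pytorch; a hedged sketch below (the z-loss coefficient and the parameter-name matching are common choices, not the playbook's exact numbers)
```python
# stability sketch: z-loss on the logits + no weight decay on embeddings / norms / biases
# the 1e-4 coefficient and the name-based param grouping are illustrative assumptions
import torch

def z_loss(logits, coef=1e-4):
    # penalize the log-partition function so logit magnitudes don't drift upward
    return coef * torch.logsumexp(logits, dim=-1).pow(2).mean()

def param_groups_no_embed_decay(model, weight_decay=0.1):
    decay, no_decay = [], []
    for name, p in model.named_parameters():
        if not p.requires_grad:
            continue
        # embedding layers, biases, and norm weights are excluded from weight decay
        if "embed" in name or name.endswith("bias") or "norm" in name:
            no_decay.append(p)
        else:
            decay.append(p)
    return [
        {"params": decay, "weight_decay": weight_decay},
        {"params": no_decay, "weight_decay": 0.0},
    ]

# usage: optimizer = torch.optim.AdamW(param_groups_no_embed_decay(model), lr=3e-4)
```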
> sparse scaling, handled
> mixture-of-experts scaling
> activation ratio tuning
> choosing the right granularity
> sharing experts between layers
> load balancing across experts
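> a small sketch of what expert load balancing looks like in code: a top-k router with the standard auxiliary balancing loss (expert count, top-k, and the aux coefficient are assumptions for illustration)
```python
# mixture-of-experts routing sketch: top-k gating + load-balancing auxiliary loss
# n_experts, top_k, and aux_coef are illustrative assumptions
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKRouter(nn.Module):
    def __init__(self, d_model=512, n_experts=8, top_k=2, aux_coef=0.01):
        super().__init__()
        self.gate = nn.Linear(d_model, n_experts, bias=False)
        self.top_k, self.n_experts, self.aux_coef = top_k, n_experts, aux_coef

    def forward(self, x):
        # x: (tokens, d_model)
        probs = F.softmax(self.gate(x), dim=-1)                 # (tokens, n_experts)
        topk_probs, topk_idx = probs.topk(self.top_k, dim=-1)   # routing decisions
        # balancing loss: fraction of tokens routed to each expert times
        # the mean router probability for that expert (switch-transformer style)
        routed = F.one_hot(topk_idx, self.n_experts).float().sum(dim=1)  # (tokens, n_experts)
        tokens_per_expert = routed.mean(dim=0)
        mean_prob_per_expert = probs.mean(dim=0)
        aux_loss = self.aux_coef * self.n_experts * (tokens_per_expert * mean_prob_per_expert).sum()
        return topk_probs, topk_idx, aux_loss

router = TopKRouter()
weights, idx, aux = router(torch.randn(32, 512))
print(weights.shape, idx.shape, aux.item())
```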
> long-context handling via ssm
> hybrid models: transformer plus state space models
data curation = most of your real model quality
> data curation is the main driver of your model’s actual quality
> architecture alone won’t save you
> building the right data mixture is an art,
> not just dumping in more web scrapes
> curriculum learning, adaptive mixes, ablate everything
> you need curriculum learning:
> design data mixes that evolve as training progresses
> use adaptive mixtures that shift emphasis
> based on model stage and performance
> ablate everything: run experiments to systematically
> test how each data source or filter impacts results
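> a toy sketch of what stage-dependent mixture weights look like in code; the sources and percentages are invented for illustration, not smollm3's actual mix
```python
# curriculum data-mixture sketch: sampling weights that shift across training stages
# sources and weights are invented for illustration, not the smollm3 recipe
import random

STAGE_MIXTURES = {
    "stage1": {"web": 0.70, "code": 0.15, "math": 0.05, "multilingual": 0.10},
    "stage2": {"web": 0.50, "code": 0.25, "math": 0.10, "multilingual": 0.15},
    "stage3": {"web": 0.35, "code": 0.30, "math": 0.20, "multilingual": 0.15},
}

def sample_source(stage):
    mix = STAGE_MIXTURES[stage]
    sources, weights = zip(*mix.items())
    return random.choices(sources, weights=weights, k=1)[0]

# later stages up-weight code and math as the model matures
print([sample_source("stage3") for _ in range(5)])
```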
> smollm3 data
> the smollm3 recipe: balanced english web data,
> broad multilingual sources, high-quality code, and diverse math datasets
> without the right data pipeline,
> even the best architecture will underperform
the training marathon
> do your preflight checklist or die
> check your infrastructure,
> validate your evaluation pipelines,
> set up logging, and configure alerts
> so you don’t miss silent failures
> scaling surprises are inevitable
> things will break at scale in ways they never did in testing
> throughput lower than it should be from the start? that usually means
> you’ve got a hidden shape mismatch or
> batch dimension bug killing your GPU utilization
> sudden drops in throughput?
> check your software stack for inefficiencies,
> resource leaks, or bad dataloader code
> seeing noisy, spiky loss values?
> your data shuffling is probably broken,
> and the model is seeing repeated or ordered data
> performance worse than expected?
> look for subtle parallelism bugs
> tensor parallel, data parallel,
> or pipeline parallel gone rogue
> monitor like your GPUs depend on it (because they do)
> watch every metric, track utilization, spot anomalies fast
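> even a bare-bones monitor catches a lot; a sketch of tracking tokens/sec and flagging loss spikes (the window size and spike threshold are arbitrary assumptions, and a real run would push these alerts somewhere louder than stdout)
```python
# bare-bones training monitor sketch: throughput tracking + loss-spike alerts
# window size and spike threshold are arbitrary assumptions
import time
from collections import deque

class TrainingMonitor:
    def __init__(self, window=100, spike_factor=1.5):
        self.losses = deque(maxlen=window)
        self.spike_factor = spike_factor
        self.last_time = time.time()

    def log_step(self, step, loss, tokens_in_step):
        now = time.time()
        tokens_per_sec = tokens_in_step / max(now - self.last_time, 1e-9)
        self.last_time = now
        if self.losses and loss > self.spike_factor * (sum(self.losses) / len(self.losses)):
            print(f"[ALERT] step {step}: loss spike {loss:.3f}")  # hook a real alert in here
        self.losses.append(loss)
        print(f"step {step} | loss {loss:.3f} | {tokens_per_sec:,.0f} tok/s")

monitor = TrainingMonitor()
monitor.log_step(1, loss=2.31, tokens_in_step=524_288)
```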
> mid-training is not autopilot
> swap in higher-quality data to improve learning,
> extend the context window if you want bigger inputs,
> and use multi-stage training curricula to maximize gains
> the difference between a good model and a failed run is
> almost always vigilance and relentless debugging during this marathon
post-training
> post-training is where your raw base model
> actually becomes a useful assistant
> always start with supervised fine-tuning (sft)
> use high-quality, well-structured chat data and
> pick a solid template for consistent turns
> sft gives you a stable, cost-effective baseline
> don’t skip it, even if you plan to go deeper
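> a minimal sketch of the "pick a solid template" part: turning multi-turn chat data into training text with a chat template (the checkpoint name and the example messages are placeholder assumptions)
```python
# SFT data formatting sketch: apply a consistent chat template to multi-turn data
# the checkpoint name and messages are placeholder assumptions
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("HuggingFaceTB/SmolLM3-3B")  # placeholder checkpoint

messages = [
    {"role": "user", "content": "explain byte pair encoding in one sentence"},
    {"role": "assistant", "content": "bpe merges the most frequent symbol pairs into new tokens."},
]

# one consistent template for every turn keeps the sft data well-structured
text = tokenizer.apply_chat_template(messages, tokenize=False)
print(text)
```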
> next, optimize for user preferences
> direct preference optimization (dpo),
> or its variants like kto (kahneman-tversky optimization),
> orpo (odds ratio preference optimization), or apo (anchored preference optimization)
> these methods actually teach the model
> what “better” looks like beyond simple mimicry
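> the dpo objective itself fits in a few lines; a sketch of the standard formulation below (beta=0.1 is a common default, assumed here, and the log-probs are placeholders)
```python
# DPO loss sketch: push chosen completions above rejected ones, relative to a frozen reference model
# beta=0.1 is a common default, assumed here; the example log-probs are placeholders
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    # implicit rewards are the log-ratios of policy vs reference on each response
    chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
    rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()

loss = dpo_loss(torch.tensor([-12.0]), torch.tensor([-15.0]),
                torch.tensor([-13.0]), torch.tensor([-14.0]))
print(loss.item())
```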
> once you’ve got preference alignment, go on-policy:
> reinforcement learning from human feedback (rlhf)
> or on-policy distillation, which lets your model learn
> from real interactions or stronger models
> this is how you get reliability and sharper behaviors
> the post-training pipeline is where
> assistants are truly sculpted;
> skipping steps means leaving performance,
> safety, and steerability on the table
infra is the boss fight
> this is where most teams lose time,
> money, and sanity if they’re not careful
> inside every gpu
> you’ve got tensor cores and cuda cores for the heavy math,
> plus a memory hierarchy (registers, shared memory, hbm)
> that decides how fast you can feed data to the compute units
> outside the gpu, your interconnects matter
> pcie for gpu-to-cpu,
> nvlink for ultra-fast gpu-to-gpu within a node,
> infiniband or roce for communication between nodes,
> and gpudirect storage for feeding massive datasets
> straight from disk to gpu memory
> make your infra resilient:
> checkpoint your training constantly,
> because something will crash;
> monitor node health so you can kill or restart
> sick nodes before they poison your run
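> checkpointing really is just "save everything you'd need to resume"; a plain pytorch sketch below (the path and save interval are illustrative assumptions; large multi-node runs would use sharded/distributed checkpointing instead)
```python
# resilient checkpointing sketch: save model, optimizer, and step so a crashed run can resume
# path and save interval are illustrative assumptions
import os
import torch

def save_checkpoint(model, optimizer, step, path="ckpt/latest.pt"):
    os.makedirs(os.path.dirname(path), exist_ok=True)
    torch.save({
        "model": model.state_dict(),
        "optimizer": optimizer.state_dict(),
        "step": step,
    }, path)

def load_checkpoint(model, optimizer, path="ckpt/latest.pt"):
    ckpt = torch.load(path, map_location="cpu")
    model.load_state_dict(ckpt["model"])
    optimizer.load_state_dict(ckpt["optimizer"])
    return ckpt["step"]

# in the training loop: if step % 1000 == 0: save_checkpoint(model, optimizer, step)
```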
> scaling isn’t just “add more gpus”
> you have to pick and tune the right parallelism:
> data parallelism (dp), pipeline parallelism (pp), tensor parallelism (tp),
> or fully sharded data parallel (fsdp);
> the right combo can double your throughput,
> the wrong one can bottleneck you instantly
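> wiring up one of these, fully sharded data parallel, is only a few lines once the process group exists; a hedged sketch using pytorch's fsdp wrapper (the stand-in model, dummy loss, and launch command are assumptions)
```python
# FSDP sketch: shard parameters, gradients, and optimizer state across data-parallel ranks
# the stand-in model and dummy loss are illustrative; launch with e.g. `torchrun --nproc_per_node=8 train.py`
import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

def main():
    dist.init_process_group("nccl")
    local_rank = dist.get_rank() % torch.cuda.device_count()
    torch.cuda.set_device(local_rank)

    model = torch.nn.TransformerEncoderLayer(d_model=512, nhead=8).cuda()  # stand-in model
    model = FSDP(model)  # parameters are sharded across ranks instead of replicated

    optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)
    x = torch.randn(8, 16, 512, device="cuda")
    loss = model(x).pow(2).mean()  # dummy loss just to show the flow
    loss.backward()
    optimizer.step()
    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```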
to recap
> always start with WHY
> define the core reason you’re training a model
> is it research, a custom production need, or to fill an open-source gap?
> spec what you need: architecture, model size, data mix, assistant type
> transformer or hybrid
> set your model size
> design the right data mixture
> decide what kind of assistant or
> use case you’re targeting
> build infra for the job, plan for chaos, pick your stability tricks
> build infrastructure that matches your goals
> choose the right GPUs
> set up reliable storage
> and plan for network bottlenecks
> expect failures, weird bugs,
> and sudden bottlenecks at scale
> select your stability tricks in advance:
> know which techniques you’ll use to fight loss spikes,
> unstable gradients, and hardware hiccups
closing notes
> the pace of LLM development is relentless,
> but the underlying principles never go out of style
> and this PDF covers what actually matters
> no matter how fast the field changes
> systematic experimentation is everything
> run controlled tests, change one variable at a time, and document every step
> sharp debugging instincts will save you
> more time (and compute budget) than any paper or library
> deep knowledge of both your software stack
> and your hardware is the ultimate unfair advantage;
> know your code, know your chips
> in the end, success comes from relentless curiosity,
> tight feedback loops, and a willingness to question everything
> even your own assumptions
if i’d had this two years ago, it would have saved me so much time
> if you’re building llms,
> read this before you burn gpu months
happy hacking