recently partnered with @GergelyOrosz to write "What is good software architecture?" for The Pragmatic Engineer:
newsletter.pragmaticengineer…
the core thesis is that good architecture work involves upgrading your problems
it’s tokenization again! 🤯
did you know tokenize(detokenize(token_ids)) ≠ token_ids?
RL researchers from Agent Lightning coined the term Retokenization Drift — a subtle mismatch between what your model generated and what your trainer thinks it generated.
why? because most agents call LLMs via OpenAI-compatible APIs that only return strings, so when those strings get retokenized later, token splits may differ (HAV+ING vs H+AVING), tool-call JSON may be reformatted, or chat templates may vary.
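a minimal sketch of the drift, using a toy greedy longest-match tokenizer (the vocabulary and merge behavior here are made up for illustration, not any real model's):

```python
# Toy vocabulary: hypothetical, chosen so "HAVING" has two valid splits.
VOCAB = {"HAV": 0, "ING": 1, "H": 2, "AVING": 3}
INV = {i: t for t, i in VOCAB.items()}

def detokenize(ids):
    # Token IDs -> string: this is all an OpenAI-compatible API returns.
    return "".join(INV[i] for i in ids)

def tokenize(text):
    # Greedy longest-match from the left, standing in for a real tokenizer.
    ids, i = [], 0
    while i < len(text):
        for j in range(len(text), i, -1):
            if text[i:j] in VOCAB:
                ids.append(VOCAB[text[i:j]])
                i = j
                break
        else:
            raise ValueError(f"untokenizable at position {i}")
    return ids

sampled = [2, 3]             # the model actually sampled H + AVING
text = detokenize(sampled)   # "HAVING" is the only thing the API hands back
retok = tokenize(text)       # greedy match prefers HAV + ING -> [0, 1]
assert retok != sampled      # same string, different token IDs: drift
```

the trainer then computes log-probs on [0, 1] while the policy sampled [2, 3] — a silent off-policy update.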
→ unstable learning, off-policy updates, training chaos. 😬 (@karpathy has a great video explaining all details about tokenization 👉🏻 piped.video/watch?v=zduSFxRa… )
together with the Agent Lightning team at Microsoft Research, we’ve fixed it:
vLLM’s OpenAI-compatible endpoints can return token IDs directly.
just add "return_token_ids": true to your /v1/chat/completions or /v1/completions request, and you’ll get both prompt_token_ids and token_ids along with normal text outputs.
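a sketch of what that request body looks like (model name, prompt, and server address are placeholders, not from the blog):

```python
import json

# Hypothetical payload for a locally running vLLM server; the only change
# vs. a normal chat completion request is the "return_token_ids" flag.
payload = {
    "model": "my-model",
    "messages": [{"role": "user", "content": "hello"}],
    "return_token_ids": True,
}
body = json.dumps(payload)

# POST this body to http://localhost:8000/v1/chat/completions (e.g. via
# urllib or an OpenAI client's extra_body); the response then carries
# prompt_token_ids and token_ids alongside the usual text fields.
```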
no more drift. no more mismatch. your agent RL now trains exactly on what it sampled.
read more from the blog 👇
👉 blog.vllm.ai/2025/10/22/agen… #vLLM #AgentLightning #RL #LLMs #OpenAIAPI #ReinforcementLearning
The *typeagent* project implements long-term memory for agents that's better than RAG: we extract "knowledge" using an LLM, which gives better precision/recall. Find code and a presentation at github.com/microsoft/typeage…