I really like the cross-LM workflow. For example, you use o3 to suggest fixes, then you copy o3's proposed fix, paste it into Gemini, and ask for a full implementation. Once Gemini has implemented it, you show the updated code to o3 (along with any feedback from the runtime) and ask it to criticize, propose further fixes, or confirm that all is good. Each model, used in isolation, eventually falls into self-repetition. Through this back-and-forth between two models, I feel like I'm less stuck in such self-repetition loops. And Gemini codes so much faster than others!

Jul 5, 2025 · 6:50 AM UTC
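A minimal sketch of that loop, assuming the OpenAI and google-generativeai Python SDKs; the model names, prompts, and the APPROVED convention are illustrative assumptions, not a description of the author's exact setup:

```python
# Sketch of the cross-LM loop: o3 as critic, Gemini as implementer.
# Assumes OPENAI_API_KEY and GOOGLE_API_KEY are set; model names are illustrative.
import os
from openai import OpenAI
import google.generativeai as genai

openai_client = OpenAI()
genai.configure(api_key=os.environ["GOOGLE_API_KEY"])
gemini = genai.GenerativeModel("gemini-2.5-pro")

def criticize(code: str, runtime_feedback: str) -> str:
    """Ask o3 to criticize the code, propose fixes, or confirm all is good."""
    resp = openai_client.chat.completions.create(
        model="o3",
        messages=[{
            "role": "user",
            "content": "Review this code and the runtime feedback. Criticize it, "
                       "propose fixes, or reply APPROVED if all is good.\n\n"
                       f"Code:\n{code}\n\nRuntime feedback:\n{runtime_feedback}",
        }],
    )
    return resp.choices[0].message.content

def implement(code: str, proposed_fix: str) -> str:
    """Ask Gemini to turn the critic's proposed fix into a full implementation."""
    resp = gemini.generate_content(
        "Apply this proposed fix and return the complete updated code.\n\n"
        f"Proposed fix:\n{proposed_fix}\n\nCurrent code:\n{code}"
    )
    return resp.text

code = open("main.py").read()
feedback = ""
for _ in range(5):  # cap the back-and-forth
    review = criticize(code, feedback)
    if "APPROVED" in review:
        break
    code = implement(code, review)
    # here you would re-run the code/tests and capture new runtime feedback
```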

Replying to @burkov
with all this hoop-jumping I have to ask, "What are you building?"
It's stealth for now, but I see the light at the end of the tunnel, so stay tuned!
Replying to @burkov
I've found you can do this with the same model if you just tell the model that the content was from a different model.
Yes, but it's not as effective. If you take the copy of your code that Gemini just reproduced in its output, claiming it was "fixed", and start a new conversation with that code, it will converge again very fast.
Replying to @burkov
why don't you just write the code yourself instead of copying back and forth?
Write code? No, thanks, past this phase for good.
Replying to @burkov
you've become an agent
Always was!
Replying to @burkov
Glad to hear this insight and some positivity on your end, Andriy! Thanks for the tip, I'll try this in my workflow.
I'm always positive about good things and negative about bad ones. Unfortunately, the quantity of bad things (lies mostly) has been too large lately.
Replying to @burkov
I am amazed that these so-called steps toward AGI can't detect that they're running in circles. Being paid per API call doesn't give an incentive to fix it.
-- I need A and B.
-- I see, you need A and not B. Wait while I print for 15 minutes.
-- I said A AND B.
-- Ah, now I see. You just need B. I will print for 15 more minutes; you can go grab a cup of coffee.
-- I said BOTH A AND B.
-- [The user seems frustrated for no reason] I understand your frustration; here's your solution for C and NOT A. Give it a spin and let me know if you see any edge cases!
Replying to @burkov
This ensemble approach is a case where diversity actually is a strength
Replying to @burkov
You'd love @RepoPrompt! Check out @pvncher's latest video of exactly this workflow.
Replying to @burkov
Cool concept, I've been doing that subconsciously, especially when one model fails. You get the hang of which model to use when. I also manage context externally, in various markdown files. This is context engineering 🫡
Replying to @burkov
I added a proposal to @OpenCode_AI - seems like a perfect use case.
Replying to @burkov
I also use your approach, although all this copy and paste is kind of annoying. I don't know why no one has come up with a fully integrated approach for VS Code...
Replying to @burkov
We discovered this too in @jivaAI. Cross-checks or verification across models works well.
Replying to @burkov
Can you describe your process here re: the platforms and tools you're using? I'm currently doing this (kind of) by sharing GitHub links, but I feel like there must be a more efficient way.
Replying to @burkov
Yep I do this as well
Replying to @burkov
Why do you think that is? Token choice? Preferred CoT?
Replying to @burkov
I've been doing this multi-model approach for several months now. My feeling is that the tunnel vision most models succumb to is usually due to the extended context of the current thread. So, as someone else said, even if you don't switch models, if you just present the prompt to the same model in a new thread, the model has a more open perspective without the extended context of the original thread. It sees the code with "fresh eyes", somewhat like we do after sleeping on a problem, when we often wake up with a clear answer. But I also agree that presenting the problem to a reasoning model such as o3 or DeepSeek R1 provides more useful insights, which I can then take back to 4.1 to help me implement.
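A tiny sketch of that "fresh eyes" point, assuming a chat-completions-style message list; the variable and file names are illustrative:

```python
# Illustrative: why a new thread resets the model's tunnel vision.
code = open("main.py").read()
history = [
    {"role": "user", "content": f"Fix this code:\n{code}"},
    {"role": "assistant", "content": "Here is the fixed code ..."},
]

# Continuing the long thread: every earlier attempt rides along in the
# context, anchoring the model to its own previous answers.
long_thread = history + [{"role": "user", "content": "It still fails. Fix it again."}]

# Starting a new thread: the model sees only the code itself, with no
# record of earlier attempts to converge back to.
fresh_thread = [{"role": "user", "content": f"Review this code:\n{code}"}]
```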
Replying to @burkov
Nice. I use pastemax for "context engineering", then run the prompt through 2 or 3 LLMs (Gemini 2.5 Pro, o3, R1). I check their responses, pick the one that I think is the best approach, and then use that as the implementation plan for Gemini 2.5 or 2.0 Flash in Roo Code (blazing fast to implement).
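A rough sketch of that fan-out step, assuming the OpenAI Python SDK; the model list, file name, and prompt are illustrative (Gemini and R1 would need their own clients), and the winning plan is still picked by hand:

```python
# Fan out one prompt to several models, then eyeball the responses.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def ask(model: str, prompt: str) -> str:
    resp = client.chat.completions.create(
        model=model, messages=[{"role": "user", "content": prompt}]
    )
    return resp.choices[0].message.content

prompt = open("context.md").read()  # context assembled externally (e.g. with pastemax)
plans = {m: ask(m, prompt) for m in ["o3", "gpt-4.1"]}
for model, plan in plans.items():
    print(f"--- {model} ---\n{plan}\n")
# Pick the best plan by hand and feed it to a fast coding agent as the
# implementation plan.
```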
Replying to @burkov
Definitely. I have one o3 chat for high-level thinking, then Gemini for mid-level and Claude for tactical implementation. I often "uplevel" a chat to the next AI up in the stack when I think it's not correct. And above o3 is a deep research implementation of o3.
Replying to @burkov
This is the way.
Replying to @burkov
Good idea
Replying to @burkov
This seems like the way to go, actually: using different models together by breaking up the workflow. It also controls for errors better, as you say.
Replying to @burkov
Thank you man.
Replying to @burkov
LLM pair programming :)))
Replying to @burkov
It’s basically a community of AIs helping each other stay grounded and avoid wild hallucinations.
Replying to @burkov
What about a workflow where multiple models work together without copy-paste? Would it work?