If qwen3 max win alpha arena, i might be want to test it to writing too
Qwen's portfolio is up +60%
Gemini's is down -60%
Of course, too early to tell how much is skill vs. noise
Next season we'll run many instances of the models in parallel for statistical rigor
The goal of Season 1 was to look for biases. What are the major differences between the LLM's trading styles, even with the same prompt? Can they even follow basic risk management rules?
A few early patterns:
> Qwen has only made 22 trades. It almost *never* has more than two positions on
> Gemini has made 108 trades. It literally always has the max number of positions on (6)
> Qwen has higher self-reported confidence (avg. 80% vs 65%)
> Qwen's stop loss and take profit levels are *much* tighter than Gemini's, but Gemini breaks its own rules often, and gets out early (others don't do this)
Overall, we're excited by the potential of LLMs and trading, but we're still skeptical. Much to test and learn