1/29
@WesRothMoney
OpenAI *coding* progress:
1st reasoning model = 1,000,000th best coder in the world
o1 (Sept 2024) was ranked = 9800th
o3 (Jan 2025) was ranked = 175th
(today) internal model = 50th
superhuman coder by eoy 2025?
https://video.twimg.com/ext_tw_video/1888330009334743040/pu/vid/avc1/1280x720/JLZCr6fUNW_SGNym.mp4
2/29
@WesRothMoney
I had to edit the tweet, I put 2023 as the date for some reason
/shrug
thanks to everyone who pointed that out
3/29
@WesRothMoney
here's the full video I did with all the highlights from that talk:
https://invidious.poast.org/4Wa6St-uosY
4/29
@mikeboysen
I wonder what the 50th best coder thinks. Has anybody interviewed him?
Lol
5/29
@WesRothMoney
he's re-reading The Butlerian Jihad...
(jokes aside, I think the software engineers will benefit greatly from AI coding tools)
6/29
@circlerotator
competitive programming is more like competitive math than software engineering
something to keep in mind
7/29
@WesRothMoney
yeah, I don't think it 'replaces' great engineers.
I do think it will 'enable' great engineers.
8/29
@drjfhll
I still think anthropic is better; and Gemini catching up
9/29
@erdavtyan
Extremely tightly scoped problems with a lot of research and algo combinations published and trained on.
Superhuman coder should be able to work on complex, high-context systems that have multiple moving parts and legacy code. They should fix versioning / deployment issues.
10/29
@doeurlich50289
Hearing sama making such direct claims means they'll crush 2025, and by the end of the year, we'll enter a new world and have to accept a new reality.
11/29
@SulkaMike
A lot of interesting takes here, summarized around the question: even if it's number one on the benchmark, does that change much? And if it does induce change, why haven't 10 million people with a Plus account and the 175th-ranked programmer changed the world so far?
12/29
@OlivioSarikas
If it is that good, why does basically any coder I know tell me that AI is good at simple code, but as soon as it becomes more complex, writing the code yourself is faster than finding the AI errors in the code?
13/29
@rosdikuat
I'm quite certain this will happen by December. Even today I mostly don't code, I mostly prompt.
14/29
@JOSmithIII
Does anyone know where the o3-mini tiers rank?
15/29
@ImJayBallentine
“We have a superior coding model but we are just gonna let Sonnet keep the lead.” Got it.
16/29
@hagestev
what happened to o2??
17/29
@langdon
A single “best‐fit” exponential model through the three data points projects reaching Rank 1 around April-May 2025. The initial drop was extremely fast (Sept→Jan), while the more recent decline (Jan→Feb) was slower - so if you weigh later data more, you’d land closer to mid‐ or late Summer 2025.
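For anyone who wants to check that kind of projection, here is a minimal sketch of a log-linear (exponential) least-squares fit through the three ranks quoted in the first tweet. The month offsets (Sept 2024 = 0, Jan 2025 = 4, Feb 2025 = 5) and the function names are assumptions made for illustration, not the method @langdon actually used, and the projected date shifts noticeably depending on which dates you assign to each model.

```python
import math

# (months since Sept 2024, rank). The ranks come from the thread;
# the month offsets are an assumption for this sketch.
POINTS = [(0, 9800), (4, 175), (5, 50)]


def fit_log_linear(points):
    """Ordinary least-squares fit of ln(rank) = a + b * t."""
    n = len(points)
    ts = [t for t, _ in points]
    ys = [math.log(rank) for _, rank in points]
    t_mean = sum(ts) / n
    y_mean = sum(ys) / n
    sxx = sum((t - t_mean) ** 2 for t in ts)
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(ts, ys))
    b = sxy / sxx            # slope: change in ln(rank) per month
    a = y_mean - b * t_mean  # intercept: fitted ln(rank) at t = 0
    return a, b


def months_until_rank_one(points):
    """Solve a + b * t = ln(1) = 0 for t."""
    a, b = fit_log_linear(points)
    return -a / b


if __name__ == "__main__":
    t = months_until_rank_one(POINTS)
    print(f"Fitted curve reaches rank 1 about {t:.1f} months after Sept 2024")
```

With only three points the fit is very sensitive to the assumed dates, so treat the output as a rough extrapolation rather than a forecast.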
18/29
@0xShawnWang
source of rank?
19/29
@DavidPrice21106
This is getting crazy, Wes.
20/29
@_oddfox_
Once these coding agents are out publicly shyt is really going to take off. Seems like 2026 is the year of the intelligence explosion
21/29
@3DTechPrep
What used to be the difficult part of my projects (code) is now the easy part.
So simple now and have learned more in the last year about coding than in past 20.
It’s like having a brilliant coder always there to ask ANY question, no matter how dumb or hard, no judgement.
22/29
@ArcherNightfall
How many times does sama have to say it. How many times.
23/29
@jfp618
A test score is just one attribute of a good engineer
24/29
@unaliveolives
Try to build and maintain a real app with o3. It is, for sure, not the world’s 175th best coder.
25/29
@PaulMaddison121
Software engineering is solving problems not churning out syntax like LLMs do
For example, the trillions of reasoning models needed for AI's growth will need software engineers to create/implement.
26/29
@VojtechKulhavy
Here's the plot
27/29
@wei_andrew
Why does OpenAI still have so many programmers?
28/29
@ChefBeijing
Most top researchers at OpenAI may not be senior software engineers on real-world projects, who think of programming contests as shyt. You need a poor guy from China or India to dig into 3,000 files, each with 3,000 to 5,000 lines of code and variables, to fix a bug
29/29
@keithofaptos
If OpenAI truly wants to be on the right side of history, it would be marvelous to receive this internal model (50th best global coder) ASAP and in voice2voice, completely open sourced. That's what we non-coders are just itching for. Imagine paying $20/m for this?! 🫠


@sama