1/11
@tsarnick
OpenAI's Noam Brown says the new o1 model beats GPT-4o at math and code, and outperforms expert humans at PhD-level questions, and "these numbers, I can almost guarantee you, are going to go up over the next year or two"
https://video.twimg.com/ext_tw_video/1848118049062453249/pu/vid/avc1/720x720/Lf3_qbJ45pHTJaMI.mp4
2/11
@tsarnick
Source:
https://invidious.poast.org/watch?v=Gr_eYXdHFis
3/11
@Yossi_Dahan_
4/11
@chandan_ganwani
2+2=4 It is always 4. It is a guarantee or not a guarantee. There is no almost guarantee. Just show us the results when it starts affecting daily lives or matters in real life. The rest is just a way to sell promises and hype. Get real and stop selling the dream of flying cars!
5/11
@RachelVT42
Interesting.
What about other abilities though?
Last time I tried it, it often wasn’t as good as 4o on writing tasks, especially when used in another language.
6/11
@danielbigham
So exciting. Buckle up!
7/11
@BenjaminDEKR
I mean, if two years from now LLM performance hasn't gone up, something is wrong
8/11
@matterasmachine
Phd is not about questions, it’s about creation.
9/11
@Shawnryan96
I just want OAI to teach them how to use tools really well and make them much more useful. I know getting smarter will help but I want agents lol
10/11
@Zero04203017
The score jumps in competition math from o1 preview to o1 is amazing.
11/11
@BeyondtheCodeAI
This could have major implications for fields like education and research.
To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196