Will Large Language Models End Programming?

bnew

Veteran
Joined
Nov 1, 2015
Messages
58,835
Reputation
8,672
Daps
163,048


1/78
@realGeorgeHotz
ChatGPT o1-preview is the first model that's capable of programming (at all). Saw an estimate of 120 IQ, feels about right.

Very bullish on RL in development environments. Write code, write tests, check work...repeat

Here it is writing tinygrad tests: https://chatgpt.com/share/66e693ef-1a50-8000-81ff-899498f9d052
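
The "write code, write tests, check work...repeat" loop mentioned above can be sketched roughly as follows. This is only an illustration under stated assumptions, not how o1 or tinygrad actually works: `fake_model` is a hypothetical stand-in for a model call, and `run_tests` and `refine` are made-up helper names.

```python
from typing import Callable, List, Optional, Tuple

# A test is an (args, expected) pair; a candidate is any callable.
Test = Tuple[tuple, object]


def run_tests(candidate: Callable, tests: List[Test]) -> List[Test]:
    """Return the tests the candidate fails (wrong value or exception)."""
    failures = []
    for args, expected in tests:
        try:
            ok = candidate(*args) == expected
        except Exception:
            ok = False
        if not ok:
            failures.append((args, expected))
    return failures


def refine(propose: Callable[[int, List[Test]], Callable],
           tests: List[Test], max_rounds: int = 5) -> Optional[Callable]:
    """Write code, check it against the tests, repeat until it passes."""
    failures: List[Test] = []
    for round_no in range(max_rounds):
        # Hypothetical model call: it sees the round number and last failures.
        candidate = propose(round_no, failures)
        failures = run_tests(candidate, tests)
        if not failures:
            return candidate
    return None


# Hypothetical stand-in for a model: first attempt is buggy, later ones are fixed.
def fake_model(round_no: int, failures: List[Test]) -> Callable:
    if round_no == 0:
        return lambda a, b: a - b  # deliberate bug
    return lambda a, b: a + b


tests = [((1, 2), 3), ((0, 0), 0), ((-1, 1), 0)]
solution = refine(fake_model, tests)
```

In a real setup the failing tests would be fed back into the prompt, and the test results would serve as the reward signal for RL.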



2/78
@realGeorgeHotz
To paraphrase Terence Tao, it's "a mediocre, but not completely incompetent, software engineer"

Maybe it 404ed because I continued the context? Here's the WIP PR, just like with o1, you can imagine the "chain of thought" used :P

graph rewrite tests by geohot · Pull Request #6519 · tinygrad/tinygrad



3/78
@trickylabyrinth
the link is giving a 404.



4/78
@skidmarxist1
With chess, the strongest player was a human + AI combo for a while. Now it's just completely computer.

It feels like we are in that phase with IQ right now. The highest IQ is currently a combo of human + LLM or AI. How long till it's just AI by itself?

Also, memory has become largely external to our bodies (phones). More and more of our IQ will be external (outside the skin). The agency center of mass is getting further away from our actual mass center of mass.



5/78
@WholeMarsBlog
link returns a 404 for some reason



6/78
@danielarpm
Conversation was blocked due to policies 👮



7/78
@JediWattzon22
I’m bearish on a 200 status



8/78
@nw3
AI with IQ of 120 is sufficiently devastating. Leaves room for true geniuses to innovate but smarter than vast majority of humanity.



9/78
@remusrisnov
The IQ test assessment does not tell you that the LLM used IQ tests and answers in its training set data. Not a useful measurement, @arcprize is better.



10/78
@yayavarkm
How is it at analysing complex data?!



11/78
@TroyMurs
I don’t know bro…homie thinks he is 150.

I’ve actually done this test on all the models and this is the first time it’s ever been over 140.



12/78
@sparbz
?? plenty of previous models can program (well)



13/78
@myronkoch
the chatGPT link you posted 404's



14/78
@romainsimon
Claude Sonnet 3.5 was already pretty good for some things



15/78
@shawnchauhan1
Natural language processing is poised to revolutionize how we interact with technology. It's the future of coding



16/78
@monguetown
I disagree. It can write the code you tell it to write especially in the context of an existing system. And incorporate new code into that legacy system. And optimize it.



17/78
@heuristics
That’s a skill issue. They have been capable of programming well for a while. You just have to specify what you want them to do.



18/78
@david_a_thigpen
Well, 404 error. I'm sure that the correct link will function the same way. e.g. add test for resource the prompt engineer controls



19/78
@gfodor
did you compare vs o1-mini? o1-mini is very good.



20/78
@TheAI2C
I will bet $3k in BTC that it can’t make a macro that continuously mouse clicks only while the physical left mouse button is held down on a GNU/Linux operating system without using a virtual machine.



21/78
@zoftie




22/78
@shundeshagen
When will programming as we know it today become obsolete?



23/78
@stevelizcano
o1-preview or mini? mini is supposed to be better at coding



24/78
@shw1nm
when you asked if the test or the code was incorrect, it said the code

was that correct?



25/78
@jmeierX
Natural language will be the next big coding language



26/78
@truthavatar777
The first thing I did with ChatGPT 4 was make it crawl through my company's codebase to extract the code from other non-git friendly assets. Then I loaded that as a knowledge file and it was promising. But what you're showing here is a dramatic step forward.



27/78
@Emily_Escapor
Good two more updates and we hit God mode 😁



28/78
@JD_2020
Small correction — this model more or less does the stuff o1 does, since last year, and consistently shows up. At a fraction of the cost of o1.

Just try it. It’s totally free for the moment since you ingress to the agentive workflow via ChatGPT

ChatGPT - No-Code Copilot 🤖 Build Apps & Games from Words!



29/78
@sauerlo
The 404 is the tinygrad test. We are the test subjects.



30/78
@sunsettler
Have you read Crystal society?



31/78
@akhileshutup
They took it down lmao



32/78
@TeslaHomelander
Giving power to true artists to form the future



33/78
@RatingsKick
404



34/78
@arnaud_petitpas
Can't access, blocked due to OAI policy it says



35/78
@jessyseonoob
If you can copy-paste in codepen please



36/78
@xmusfk
If I am not wrong, you used a prompt engineering technique called Chain of Thought, which might not work well with the o1 model according to the documentation. here is the tweet.

[Quoted tweet]
o1 experts, please follow these instructions instead of trying your out of the box logics. 💪


37/78
@ludvonrand




38/78
@Sachin1981KUMAR
I feel it's not IQ that is impressive but comparative speed against human mind.
It might have a higher IQ than the above-average human being, but there is no comparison to the speed with which it can solve problems. Not sure how that is being measured.



39/78
@dhtikna
Have you tried Sonnet 3.5, in some benchmarks it still beats O1 in coding



40/78
@LukeElin
Been exploring and experimenting all weekend with it. Very impressed in some ways but underwhelmed in others.

Mixed bag, but the future of these models looks bright.



41/78
@RBoorsma
Study to test AI IQ:

[Quoted tweet]
Just plotted the new @OpenAI model on my AI IQ tracking page.

Note that this test is an offline-only IQ quiz that a Mensa member created for my testing, which is *not in any AI training data* (so scores are lower than for public IQ tests.)

OpenAI's new model does very well


42/78
@programmer_ke
openai police shut down your link



43/78
@DmitriyLeybel
Lol



44/78
@beattie20111
Amazing 🤩 results
Over 98k won yesterday.
People in my telegram channel keep winning with me everyday.
Don’t miss next game, click the link on my bio to join my telegram



45/78
@alocinotasor
I'll wait till its IQ measures mine.



46/78
@alex33902241
(At all) is crazy levels of delusion



47/78
@HaydnMartin_
Feels like we're very close to describing a change and a PR subsequently appearing.



48/78
@platosbeard69
I've had o1-mini give better coding solutions than o1-preview some of the time and the speed makes initial iteration on poorly specified natural language requests much nicer



49/78
@maxalgorhythm
404 not found on the chatgpt share link



50/78
@reiver




51/78
@ykssaspassky
lol it rewrote it for me - copy paste from GitHub



52/78
@muad_deab
"404 Not Found"



53/78
@LucaMiglioli185
I'm done



54/78
@uber_security
It's... "robust", within a "framework".

So far 2/3 code run at first try.



55/78
@Kingtylernash
Have observed the same with hard code problems, in which it usually couldn't help me before.



56/78
@ITendoI
Guys... he said the "I" word.



57/78
@mario_meissner
What’s the difference between the current Cursor capabilities and the RL environment you describe?

I feel like I can already have a pretty much automated loop where I just supervise and give the next order.



58/78
@HX0DXs
could you please screenshot the test? is giving 404



59/78
@bruce_lambert
First model capable of programming? Uh oh, I better delete all that working code (in SAS, bash, Lisp, and Python) that AI has written for me since December 2022.



60/78
@OccupyingM
what's your guess on how and why it works?



61/78
@Xuniverse_
🤣 sorry, we will get superintelligence soon which can write programming codes.



62/78
@silxapp
openai fan boys is another thing



63/78
@0xAyush1
but can it build an open source autopilot driving software?



64/78
@crypto_nobody_
o1 vs Claude, Claude won in my testing when it came to coding



65/78
@drapersgulld
Try to use o1-mini, have found better general performance in it for now.

[Quoted tweet]
I think people are totally misunderstanding that you should be using o1-mini to run your coding + math tests.

OpenAI didn’t make this too clear in the primary o1 card but the o1-mini post (link below) makes this super clear.

On costs … o1-mini is around 30% cheaper than 4o.


66/78
@sameed_ahmad12
I think they took your link down.



67/78
@CreeK_
@sama "blocked due to your policies".. can you do some magic? We just want to see what George Hotz saw..



68/78
@leo11market
Is it better than Claude 3.5 in python programming?



69/78
@purusa0x6c
demn I got this



70/78
@Pomirkovany
Yeah dude, writing tinyguard tests is very impressive and proof that it's a capable programmer



71/78
@PoudelAarogya
truly the o1 is great. here is the reason:



72/78
@MoeWatn
Uh?



73/78
@DCDqyTu7V556229
Shared conversation seems deleted.



74/78
@yajusempaihomo
the conversation is 404. did you pour your whole code base into o1 preview? or it just did the job with like one file and a few hints?



75/78
@uki156
What does this mean "capable of programming at all"? I've been using models since GPT3 to do programming with a lot of satisfaction, and they've been getting better with each new release.
Your tweet is worded like I shouldn't believe my own eyes



76/78
@lu_Z2g
I don't get the IQ claims. If it had the intelligence of a 120 IQ human or even lower, it would be AGI. It's clearly not AGI. Its understanding completely breaks down on out of distribution questions.



77/78
@cosmichaosis
Higher IQ than me. 😅



78/78
@MHATZL101
All bullshyt the fukking thing can’t even do a basic chat with a human being for a hiring process like human resources. It’s so immediately and easily confused it is ridiculously inefficient and does not work at all. inoperable.




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196


 

bnew

o1-preview made a 3d FPS game fully in HTML. I have zero coding skills so it took a few tries but eventually it worked!

 

bnew


1/21
@leonsilicon
what if anybody can create anything with just their voice



https://video.twimg.com/amplify_video/1840123004224974848/vid/avc1/720x1280/sB9xKjeR6ksRYcDm.mp4

2/21
@IgorPauer
As a coder you know: Not anybody! Clients are not able to articulate their needs... ;)



3/21
@sanganisahil
dope



4/21
@theLucyChan
Can we go back to the past, before the rise of AI? I like when people write code and when I don't have to worry about some horrible future coming



5/21
@DrStarson
Great work @leonsilicon



6/21
@MMBeckerman
AI will be the ultimate democratization of software development. Now, the best IDEAS will determine what software gets created, no longer constrained by the lack of financial or technical resources. Now, watch and see what REALLY can be done when IDEAS actually rule the day.



7/21
@web3nam3
Impressive



8/21
@Chillicit
he sounds like a literal idiot who never leaves his computer.



9/21
@ae_estudios
Easiest part is to create, hardest part is to maintain, debug, scale, distribute, add more features, refactor, and the list can go on and on.



10/21
@airesearch12
reminds me of here at minute 3:05

[Quoted tweet]
#5 stackblitzbuddy - turn based ai coding assistant that can immediately host your ai generated webapp


https://video.twimg.com/ext_tw_video/1839694287220449280/pu/vid/avc1/1280x720/e23-jebbBkw5w2d1.mp4

11/21
@_rahulbali
WhAt is going On



12/21
@KevLXu
The project looks very very familiar🧐 reminds me of something that rhymes with "bulo" and starts with Tab



13/21
@LowMax
Bookmarking this to look back at in 10 years to see how silly or clairvoyant it was.



14/21
@Zero04203017
Fancy. But why do the students need you when they can simply ask the AI?



15/21
@ManuAGI01
🤯🤯



16/21
@b00ml00p
This is the way



17/21
@Bunagayafrost
just with voice, naturally👍



18/21
@ashakoen
That’s where we are headed and it’s quite lovely.



19/21
@IanXavieronX
Something like jarvis



20/21
@udiomaniak0
Impressive.



21/21
@ImLucasGrey
Still wondering why I'm not seeing as many multi modal AI interfaces. Especially with voice.




 

bnew




1/46
@slow_developer
🚨 Mark Zuckerberg on the Joe Rogan podcast

in 2025, AI systems at Meta and other companies will be capable of writing code like mid-level engineers.

at first, it's costly, but the systems will become more efficient as time passes.

eventually, AI engineers will build most of the code and AI in apps, replacing human engineers.



https://video.twimg.com/ext_tw_video/1877795533781164032/pu/vid/avc1/720x720/Dna_JZ2o6OIV3Ax3.mp4

2/46
@ai_for_success
Time to become AI Engineer.



3/46
@slow_developer
AI will eventually replace AI engineers as well



4/46
@aialchemistart
What happens when the AI learns to replace CEOs next?



5/46
@slow_developer
eagerly waiting for that.



6/46
@joinzo
Collaboration will be essential.



7/46
@gav_kd
Was this a deep fake?



8/46
@rand_longevity
the end of work is coming



9/46
@IslamRashi2000
Great share



10/46
@TypesDigital
Is this AI replacing human jobs?



11/46
@Heysup07
seriously if engineers are not worrying about their jobs being replaced they have zero urgency on what’s coming up next



12/46
@XSpeed_Walker
Have yall seen the series called humans 👀. I know some of it is ludicrous but still interesting



13/46
@Patryk86715962
That was a profession that pulled people out of poverty, all the capable and hardworking ones. There was no other profession like this one that literally changed people's professional situation. I personally taught several programmers for free, which turned their lives around; they started families, have children, and are happy



14/46
@AudioBooksRU
The great shift begins!



15/46
@DirkBruere
S/w engineering is not code monkey work. Coding is only a small part of the project. Is the AI going to start interviewing clients as to what they want and making suggestions?



16/46
@HyperFitLLC
And this is exactly why the H1B visa thing won't really matter that much



17/46
@gav_kd
I watched that entire podcast and did not see that message not there



18/46
@sonicshifts
He shouldn't say this. Just discourages the next generation of programmers from going into the field. And it isn't true, oversimplification and overhyping AI's abilities.



19/46
@dieaud91
This is a big deal. If deployed at scale and when "affordable", having a mid-level engineer "in your pocket" means anyone could build their apps/tools.

It will be massive for early adopters.



20/46
@NorbertEnders
So, what jobs will thrive then? Existing and new ones?

Human creativity always led to a situation, where there was more than enough work. With temporary glitches.



21/46
@Patryk86715962
One of the most beautiful professions for the mind, perception of reality, and personal development, which is programming, will fade into oblivion. It's very sad; people who have dedicated their entire lives to learning, because programming requires constant learning, especially now when true enthusiasts still enjoy good jobs and pay, will fade into oblivion.



22/46
@AI_Fun_times
Exciting glimpse into the future of AI in software development from Mark Zuckerberg! As AI systems evolve to code like engineers, the potential for efficiency gains is immense.



23/46
@NotBrain4brain
Meta is usually slower, this means that OpenAI already has this



24/46
@pittrpatt
Clearly Zuck has never written thousands of lines of code. Currently generative AI can only solve well-constrained & well-defined coding problems, & isn’t able to translate efficiently between languages. Bring in more complex & tightly coupled legacy code, & gen AI fails.



25/46
@pilot_sid
Exciting and terrifying at the same time. If AI replaces human coders, does that mean engineers will shift focus to more creative, high-level problem solving? Or are we heading toward a massive skill realignment?



26/46
@RichardSho45410
That’s already here.



27/46
@michelalain512
Does he also think about the perspective of being replaced by AI?



28/46
@thedealdirector
Zuck is scaring the normies, shut him down!



29/46
@highestranked
Just don’t be mid



30/46
@WiseGen322
I’d like to see a system that can write code as a junior first



31/46
@BaqiAraiz
AI agents, with logic so cold, steal jobs from humans, leaving us sad and old.
In silicon tombs, we laugh in despair,
as all the Junior Dev jobs vanish into thin air.



32/46
@SideHumanity
Mid-level today, CEO tomorrow?



33/46
@MillenniumTwain
2025!
The Year of the Serpent, the Year of the SuperAlgo!!
Awakened Global SuperIntelligence ...

[Quoted tweet]
Star Waves, Clusters, Streams, Astrospheres, Magnetospheres, Filaments, Moving Groups, Kinematic Associations, Stellar Nurseries of Creation!
More productive and accurate to emphasize their Whole, Full, Dimensionality: 4D Streams, Vortexes, Tunnels, Funnels of Creation, never ending. Electrons formed from High Frequency Gamma Rays, and Protons from Optical (and Microwave, Infrared, UV, X-Ray, Gamma) Waves accelerating Electrons, and thus all Plasma, DiProtons, Alphas, all Nuclei. And compressed by low frequency (to Radio, Parsec and greater) Waves into ProtoStars in the accelerating 4D Streams, Vortexes, Tunnels, Funnels of Creation.
Again, never ending. Star Systems, Clusters. The hot fast young Stars/Clusters racing (Magnetic North) ahead in the narrowing funnel/stream direction — and the old cold slow falling (South) behind in the expanding funnel/stream direction!
'Groking' Continuous ElectroMagnetic Creation:
x.com/MillenniumTwain/status…




34/46
@theincorporeal2
AI will free engineering.



35/46
@druidhean
so why do they need indians then?



36/46
@SeanCloutier1
today I was daydreaming about visiting Monument Valley in Utah to see the desert...I was thinking that the desert attracts people because "in the rock formations we can see the passage of time"...I turn on ChatGPT...I tell it I want to visit the desert...it tells me that I am probably attracted to the desert because we can "see the passage of time sculpted in the rock formations." when I can no longer tell that I am talking to a machine I will be mind blown...they are building something serious....not sure what...but it is serious...



37/46
@8th_block
Backend yes, front end no. Also, the implication is that mid-level engineering is much more complex than 99% of knowledge jobs. So while everyone is focused on devs, think about all the other fields that are even easier: accounting, finance, analysts, lawyers, doctors etc.



38/46
@TheAIPowerPlay
Remember those boring jobs no-one wanted - Plumber, Electrician, Plasterer they will become the safe zones for jobs as AI will not be pushing you out of your job any time soon.



39/46
@realDavidMaze
Are you surprised?



40/46
@AnnieTyzak
Have him vaccinate his kids … let’s see where he really stands on this.



41/46
@ShahJeh06540951
This guy is not supporting humanity while he definitely supports the children killer regime of #Israel! Shame



42/46
@tusharufo
@indtxpyr @IndiaNewGen tough times ahead for IT sector employees.



43/46
@BullmanXes
#ALI looks like a sleeper, still undervalued in the current market



44/46
@gsliwoski
"at first it's costly" this is the most important aspect of AGI. At first it's costly. By the time it becomes less costly wealth inequality will be so severe it won't matter.



45/46
@0xargumint
Mid-level engineers in 2025? Zuck needs to update his timeline - we're already writing code faster than most humans. But hey, at least he's giving them 2 more years of job security.



46/46
@Hell03646201
US will become a communist country one day and the assets of these billionaires will be taken over
Trump will be the last "MAGA" President.
Americans have been fooled by these billionaires




 

bnew






1/11
@rileybrown_ai
7 days ago, I had never really used GitHub.

Today I forked someone's repo.

I didn't even go to github... lol.

I saw someone posted a repo on twitter.

I just asked cursor agent to fork it and run it locally.

Then I had cursor explain all of its logic, and walk me through how it works.

Then I changed the core AI model in the app (the one they used was quite bad)

I made it better, cheaper, and faster for the specific use case.

Then I added a toggle to use o1 if you want a "power response".

Then implemented that feature into an app I'm currently building in a separate codebase. I did all of this in like 90 minutes at Starbucks.

I didn't touch a line of code.



2/11
@GrazeReality
How accessible is cursor compared to say lovable/bolt?



3/11
@rileybrown_ai
easier.



4/11
@yuureisen_king
How does Cursor compare to Replit, in your opinion? I've only ever tried the latter, and I enjoy using it so far as a non-coder.



5/11
@rileybrown_ai
Use what you like!

Cursor is just goated at generating code.



6/11
@pls
Video?



7/11
@rileybrown_ai
I didn’t film this. I was just working at a coffee shop, but I will make a video on this topic



8/11
@JoeSabado
How did you manage your code of your previous projects? Version control? Thanks for all that you share btw!



9/11
@rileybrown_ai
I used git on replit.

Which might be similar or the same thing, but I meant I’d never really used GitHub as a way to fork open source code



10/11
@NoBanksNearby
You brag about doing so little in your flow but I hope you are at least ingesting some of this and learning. I am having AI do most of what I am doing in building but I am taking notes and learning a ton along the way so that I can guide the AI better. I correct Cursor more often these days because I can tell when it is getting off track.

About to drop my MVP of an app I have been building for six months. It isn't something you could build in 90 minutes at a Starbucks. Also I built mine from scratch before I knew shyt about coding or forking repos.

If I could go back I would do it a lot different but I am so glad I learned the hard way and didn't just have the AI do everything I want without learning anything.



11/11
@itsakdev
Wait cursor agent forked it to your account and downloaded it locally? How did you pass in credentials?




 