Is it a wrap for Software Engineers? Devin autonomous AI software engineer...

cartierhoe · Apr 9, 2024

O.T.I.S. said:
EXACTLY

These nikkas act like AI is going specifically after tech jobs

No, AI most likely will come after YOUR job. All these middle managing, or sit in the office all day just bullshyttin on TheColi… these the nikkas in danger imo.

Or even ordinary folk doing menial jobs have a higher likelihood of getting replaced and going the way of the auto-factory worker like in the past..

IT is broad af… extremely broad. Broader than what most people know/think. The Coli tends to think everyone in IT just does developing or coding or some shyt… no, they don't.

I do cyber security with a background in system maintenance and networking. No way the people I work for are trusting AI to run their systems and networks, especially with the amount of cybercrime going on. Will it be a tool? Yes. Will it be significant? Yes. Will it replace most tech workers? The only people willing to do that are greedy/public CEOs trying to cut labor costs..

but like OP said, it's the same shyt as when companies started outsourcing their entire IT departments overseas and realized it was a HUGE mistake after they lost hundreds of millions. It happened to my mom and they came running back to her, and it kinda happened to me last year when I was let go because some clown thought our roles were ineffective and were a waste of money… until shyt hit the fan, that guy got fired, and they were back on my line asking me to return and then had to pay me more to come back

nikkas need to stop worrying about what AI will do to tech jobs and start worrying about what it will do to YOUR job.. because if you just pushing papers, on the phone, and going to meetings all day then

This is facts. You can't just let these systems and networks go and be managed by AI, you're looking at a ton of vulnerabilities. AI can help with scripting for sure, but even the best script can bomb out on an unexpected issue, or something else comes up due to someone else's work and now your whole process is messed up. I just try to do my best to always be learning. Nobody can tell the future, just gotta be diligent and improve on skills.

Cobalt Sire · Apr 9, 2024

It was all good just a week ago
Now ya Black ass about to be broke
nikkas thought they were protected in tech
Others lost their jobs, you're about to be next
Couldn't happen to a better group of fools
"Not a problem for Stem gang" is no longer the rule
Wasn't tryna hear it when we warned you clowns
So get a job at Mcdonalds, how does that sound?

gho3st · Apr 9, 2024

O.T.I.S. said:
What is non-manual IT labor

IT people that don't do hardware or software support. Basically nikkas who make 6 figures lol

JT-Money · Apr 9, 2024

Amazon’s security chief says he would be ‘astonished’ if cybersecurity professionals are laid off due to AI

Amazon’s security chief says he would be ‘astonished’ if AI causes layoffs in cybersecurity

The global cybersecurity workforce headcount is the highest it’s ever been, but it’s still not enough to meet industry demands, report shows.

fortune.com

bnew · Apr 9, 2024

1/4
The AI world is truly going insane!

Just the other day, we were discussing Princeton University's open-source AI software engineer SWE-agent outperforming Devin, and now a new contender, AutoCodeRover from Singapore, has dethroned SWE-agent in just a matter of days.

This powerhouse can tackle 67 GitHub issues (bug fixes or feature additions) in under ten minutes per issue, while regular developers take an average of over 2.77 days, all at a minuscule LLM cost of ~$0.5! Truly frightening!

2/4
Check it out!

3/4
New LLMs & cost transparency, accomplished!

4/4
@goon_nguyen has just added 4 more AI models to my settings!

All models come with input & output price (per million tokens).
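For anyone wondering how a per-million-token price turns into a cost per request, here is a minimal sketch. The `ModelPrice` class, the `request_cost` helper, and the prices are all hypothetical placeholders for illustration, not any vendor's real price sheet or API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPrice:
    """USD prices per one million tokens (hypothetical values)."""
    input_per_million: float
    output_per_million: float


def request_cost(price: ModelPrice, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request under per-million-token pricing."""
    input_cost = (input_tokens / 1_000_000) * price.input_per_million
    output_cost = (output_tokens / 1_000_000) * price.output_per_million
    return input_cost + output_cost


# Made-up price sheet: $10 per million input tokens, $30 per million output tokens.
example = ModelPrice(input_per_million=10.0, output_per_million=30.0)

# 40k tokens in, 5k tokens out: 0.04 * 10 + 0.005 * 30
print(f"${request_cost(example, 40_000, 5_000):.2f}")  # prints $0.55
```

Input and output are priced separately because output tokens usually cost more, so a job that reads a lot and writes a little can stay cheap.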

1/4
it's over, and this is current generation LLMs (gpt-4)

> our approach resolved 67 GitHub issues in less than ten minutes each, whereas developers spent more than 2.77 days on average

2/4
ok not completely over for soft eng, but yea that efficiency gain is insane

3/4
these were the results posted by "devin" (they used a 25% random subset)

4/4
code is here

1/1
1. Devin (Agentic AI Software Engineer)
2. followed by Devika: GitHub - stitionai/devika: Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI. (open source Agentic AI Software Engineer)
3. followed by SWE-agent: GitHub - princeton-nlp/SWE-agent: SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run. (resolved ~12.3% of issues on SWE-bench in comparison to Devin's ~13.8%)
4. and now AutoCodeRover: GitHub - nus-apr/auto-code-rover: Autonomous program improvement. (resolved ~22% of issues on SWE-bench lite)
5. I also came across another AI backend engineer called GibsonAI (closed source).

All within weeks of each other.

1/4
Introducing AutoCodeRover
Presenting our autonomous software engineer from Singapore ! Takes in a Github issue (bug fixing or feature addition), resolves in few minutes, with minimal LLM cost ~$0.5 ! Please RT

GitHub - nus-apr/auto-code-rover: Autonomous program improvement

auto-code-rover/preprint.pdf at main · nus-apr/auto-code-rover


2/4
Absolutely free for everyone to try out ! And to improve it further!!

3/4
We prefer to run it multiple times - to cater for variations …

4/4
Try it from the following site

https://github.com/nus-apr/auto-code-rover…
https://github.com/nus-apr/auto-code-rover/blob/main/preprint.pdf…
#thursdAI
@ollama


1/2
AutoCodeRover: Autonomous Software Engineer

Resolves 22% of GitHub issues in SWE-bench lite in <10 mins at minimal LLM cost ~$0.5
Works on an Abstract Syntax Tree representation of the program, and exploits program structure in the form of classes/methods/APIs
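To give a feel for what "working on the AST" means, here is a minimal sketch using Python's standard `ast` module. It is not AutoCodeRover's actual code; the `outline` and `find_method` helpers and the sample `Cart` class are made up for illustration. The idea is that a tool can look up code by class and method name instead of treating a repo as flat text.

```python
import ast
from typing import Dict, List, Optional

# Toy source file standing in for a real project module (hypothetical).
SOURCE = '''
class Cart:
    def add(self, item):
        self.items.append(item)

    def total(self):
        return sum(i.price for i in self.items)

def checkout(cart):
    return cart.total()
'''


def outline(source: str) -> Dict[str, List[str]]:
    """Map each top-level class to its method names; free functions go under '<module>'."""
    tree = ast.parse(source)
    result: Dict[str, List[str]] = {"<module>": []}
    for node in tree.body:
        if isinstance(node, ast.ClassDef):
            result[node.name] = [
                n.name
                for n in node.body
                if isinstance(n, (ast.FunctionDef, ast.AsyncFunctionDef))
            ]
        elif isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            result["<module>"].append(node.name)
    return result


def find_method(source: str, class_name: str, method_name: str) -> Optional[str]:
    """Return the source text of one method in a top-level class, or None if absent."""
    tree = ast.parse(source)
    for node in tree.body:
        if isinstance(node, ast.ClassDef) and node.name == class_name:
            for n in node.body:
                if isinstance(n, ast.FunctionDef) and n.name == method_name:
                    return ast.get_source_segment(source, n)
    return None


print(outline(SOURCE))
print(find_method(SOURCE, "Cart", "total"))
```

A structure-aware search like this lets an agent pull just the one method an issue mentions into the LLM context, which is presumably part of how the cost per issue stays low.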

GitHub - nus-apr/auto-code-rover: A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-bench lite and 46.2% tasks (pass@1) in SWE-bench verified with each task costs le

github.com

2/2
Introducing AutoCodeRover
Presenting our autonomous software engineer from Singapore ! Takes in a Github issue (bug fixing or feature addition), resolves in few minutes, with minimal LLM cost ~$0.5 ! Please RT

GitHub - nus-apr/auto-code-rover: Autonomous program improvement
Example: Django Issue #32347
As an example, AutoCodeRover successfully fixed issue #32347 of Django. See the demo video for the full process:

https://private-user-images.githubusercontent.com/48704330/320440436-719c7a56-40b8-4f3d-a90e-0069e37baad3.mp4?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MTI2OTM5NTMsIm5iZiI6MTcxMjY5MzY1MywicGF0aCI6Ii80ODcwNDMzMC8zMjA0NDA0MzYtNzE5YzdhNTYtNDBiOC00ZjNkLWE5MGUtMDA2OWUzN2JhYWQzLm1wND9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA0MDklMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwNDA5VDIwMTQxM1omWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTExNGZkMTVmOWM4NWNhOGUzYTVlZGFjODJkNjNlN2FiNzUzN2I1M2E1MWM4ZWE0NTQ1ZDRmN2IwNjc4ZGRjMDQmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.eNRYVyfdSuwh5JPFFeNhansQypb-GimykriAzpOkcXc

Enhancement: leveraging test cases

AutoCodeRover can resolve even more issues, if test cases are available. See an example in the video:

Treblemaka · Apr 9, 2024

jdubnyce said:
Breh sounding like my RTE at work

Rhyme n Tekniq · Apr 9, 2024

gho3st said:
Nikka i make 6 figures

Uhhh hmm...

I wasn't talking to you nor did I ask about your salary, but congrats and welcome to the club, I guess..... the fukk? :what:

gho3st · Apr 9, 2024

Rhyme n Tekniq said:
Uhhh hmm...

I wasn't talking to you nor did I ask about your salary, but congrats and welcome to the club, I guess..... the fukk?

Nvm, I made a thread and it seems that sht got merged with this one :francis:

O.T.I.S. · Apr 9, 2024

gho3st said:
IT people that don't do hardware or software support. Basically nikkas who make 6 figures lol

Huh

You mean… everyone except desktop support or helpdesk :mjlol:

Ghost Utmost · Apr 9, 2024

Read a story once about an AI that diagnosed diseases. This was 10 years ago.

They put the symptoms in

and the AI reads

EVERY medical article published. Ever. In like.. 35 seconds.

Then it tells them the likely disease.

Okay. Great. This is something humans can do. Except the patient had hours to live and the humans would take days to do the research.

The computer was exactly right also.

If the AI can save countless hours of physically typing then that's actually great. Frees the humans up to imagine and create at a faster rate.

bnew · Apr 15, 2024

bnew · Jun 21, 2024

1/6
To start, if you want to see Claude 3.5 Sonnet in action solving a simple pull request, here's a quick demo video we made.

(voiceover by the one and only @sumbhavsethia)

2/6
In our internal pull request eval, Claude 3.5 Sonnet passed 64% of our test cases.

To put this in comparison, Claude 3 Opus only passed 38%.

3/6
3.5 Sonnet performed so well that it almost felt like it was playing with us on some of the test cases.

It would find the bug, fix it, and spend the rest of its output tokens going back and updating the repo documentation and code comments.

4/6
Side note: With Claude's coding skills plus Artifacts, I've already stopped using most simple chart, diagram, and visualization software.

I made the chart above in just 2 messages.

5/6
Back to PRs, Claude 3.5 Sonnet is the first model I've seen change the timelines of some of the best engineers I know.

This is a real quote from one of our engineers after Claude 3.5 Sonnet fixed a bug in an open source library they were using.

6/6
At Anthropic, everyone from non-technical people with no coding experience to tenured SWEs now use Claude to write code that saves them hours of time.

Claude makes you feel like you have superpowers, suddenly no problem is too ambitious.

The future of programming is here folks.

To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196

bnew · Jun 23, 2024

1/1
Just tried Claude 3.5 sonnet's v hyped codegen. It's great, but you still need to be a senior swe. It took me ~100 messages and 31 iterations to come up with a workout tracker where the dates weren't munged.

Hype is real, blind spots are real too. Great UX tho, love the look.


1/3

Just tested Claude 3.5 Sonnet

I love the live preview (aka Artifacts). I've built a tetris game and a resume website. This thing hits output limits rather quickly. You still get some bugs and strange decisions that ruin the app. Imho, it's overhyped and below a JR SWE atm.

2/3
What's the prompt

3/3
1. Create a fully functional Tetris game that runs in the browser. 2. Improve the pieces look by giving them more of an indepth look. Add particles when the rows get removed. I want an effect that breaks the blocks into smaller pieces.


King · Jun 23, 2024

Serious said:
There’s also an energy capacity limit to AI.

As Use of A.I. Soars, So Does the Energy and Water It Requires

Generative artificial intelligence uses massive amounts of energy for computation and data storage and millions of gallons of water to cool the equipment at data centers. Now, legislators and regulators — in the U.S. and the EU — are starting to demand accountability.

e360.yale.edu

Saudi Arabia is planning to use slave labor to turn the majority of its desert into energy farms for AI :mjlol:
