1/71
@corbtt
Just launched agent.exe, a free, open-source Mac/Windows/Linux app that lets you use Claude 3.5 Sonnet to control your computer!
This was a fun little project to explore the API and see what the model can do. Computer use is really cool—I expect 2025 will be the year of agents.
2/71
@corbtt
Here's agent.exe booking travel on Google Flights.
Claude 3.5 definitely isn't perfect—note that it confidently chooses the wrong dates!
https://video.twimg.com/ext_tw_video/1849126066667077632/pu/vid/avc1/1662x1080/VF2AJDgUzCBXmWv0.mp4
3/71
@corbtt
All the code as well as a (still minimal) README for running the app is available here with an open source Apache 2 license. This is definitely still research-project-quality, but would love to see more development happening on top!
GitHub - corbt/agent.exe
4/71
@corbtt
As a side note, the new Claude 3.5 is incredible for coding as well. This is my first Electron app, and Claude +Cursor could consistently build complex functionality across multiple files in a single shot. First time I've felt more like a manager than an engineer while coding.
5/71
@corbtt
No SoundCloud, but if your company is interested in continuously fine-tuning your LLMs for higher quality based on user feedback you are definitely gonna love @OpenPipeAI
6/71
@chickrnlittle
TBH I’m not going to download an “exe”.
Naming it that is a weird choice.
7/71
@corbtt
I mean this is explicitly an app that lets an LLM run arbitrary code on your computer so pretty valid to have security concerns.
8/71
@keremk
Great stuff! Although you may want to put in a quick warning in your README file so people understand that this is not a sandbox and their actual computer so it is a tad bit too powerful
. Most probably only highly tech-savvy people will use it but you never know...
BTW this may be useful also for UI testing, although a bit expensive.
9/71
@corbtt
I was going to implement a "semi auto" mode where you have to manually approve each action, but in practice it's mega slow to do anything, so you can just hit the "stop" button if it seems like it's turning evil.
10/71
@AllDevThings
Can it work on native apps?
11/71
@corbtt
Yep I had it message my wife on Telegram
12/71
@mlejva
This looks really cool @corbtt, we're about to release desktop support for @e2b_dev sandboxes, would you be open to integrating E2B to your project? I can make a PR ofc
[Quoted tweet]
The Anthropic's "computer use" launch gave us an idea...
13/71
@corbtt
sure if it makes things work better and doesn't make it a pain to set up!
@e2b_dev, we considered using it at @OpenPipeAI for a few flows (still might)
14/71
@king_leocifer
now add voice
15/71
@corbtt
don't tempt me I have an actual company to run
16/71
@MaxDiffusionRL
can you trust it not to hack ur computer
17/71
@corbtt
18/71
@waefrebeorn
Any shot of getting a multimodal llama model doing this? I’m interested in the possibility
19/71
@corbtt
No, it needs to be able to click on specific screen elements to work. Molmo might be able to though.
https://molmo.allenai.org/blog
20/71
@lalopenguin
I just saw agent.exe on my TLDR email newsletter!! Congrats !!!
21/71
@DarbyBaileyXO
insane!! amazing!!
22/71
@Bombachi69
I've got so many ideas on how to play with this. Thanks, great work.
23/71
@kodingnights
You never heard about Skynet huh
24/71
@xucian_
nice
I remember you had like 250 stars few days ago
>2k now. this thing's taking off
25/71
@KevinRossi
Try telling it to make the money for the pizza first.
26/71
@garyzface
Yesteryear "Can I get a refund, my kid ordered that?"
2024 "Can I get a refund, my Agent ordered that?"
27/71
@michabbb
28/71
@vinstar
@amasad @Replit
29/71
@airesearch12
haha, man I was just about to do the same.
even bought a domain to host mine.
computer use - AI-Powered Computer Control
30/71
@iamrobotbear
Nice, where at?
31/71
@Trace_Cohen
@mattshumer_
32/71
@isaiahhall0104
Wow, that was fast! Congrats on the release. Definitely going to try it out over the weekend
33/71
@dkardonsky_
34/71
@julianharris
Why not use
The Open Interpreter Project -/ open source version of a similar thing that was launched 6 months or so ago.
35/71
@connorjewiss
looks awesome! what's your favourite use case for it so far?
36/71
@BobTB12
Let us all download a random .exe and let it run our computer and click what it wants. What can go wrong??
37/71
@PrvnKalavai
How many tokens were used totally?
38/71
@yang_yi_cn
I guess nothing could go wrong from here
39/71
@geertjansloos
Works very well!
40/71
@rob_stemp
"hey claude, format my ...." ...
claude: "sure, formatting your disks".
"no, no no I meant my document"....
41/71
@i_Forget_
[Quoted tweet]
inb4 we start handing out assistant embedded vps as a service envs.
42/71
@SadiaUme1
I love the idea of simplifying computer control with AI.
Can't wait to see where this leads in 2025!
43/71
@menemazarakis
Super cool! Check out what we’re building @AnonPlatform if you want to build agents with integrations to platforms that don’t have public APIs (DoorDash one of them)
44/71
@enricoros
Link:
GitHub - corbt/agent.exe
45/71
@Will_W_Welker
Can it voice-type my X posts?
46/71
@isimmons33
Can he check the "I am not a robot" checkbox?
47/71
@deifosv
this looks pretty nice Kyle, how long did it take you to build this?
48/71
@_rchaves_
amazing name, love it!
49/71
@CherryTruthy
hhahahahahahahahha
That was so fast!
I was going to build this over this coming weekend.
Far too slow it seems.
Looks great, good job.
50/71
@fgblanch
51/71
@tylerclark
Couldn't get it working correctly. Seems like the cursor placement is off. Do I need to be at a certain resolution or something? It was clicking into the wrong areas and typing into the wrong areas on simple things like Chrome and YouTube.
52/71
@tilver
corp IT is going to love things like this. "I downloaded this EXE and let it control my computer. I can even do my banking with it!"
53/71
@X0Radi
I built something similar yesterday, can't wait to check this out. I did run into issues with 3440x1440 screen tho
54/71
@ThaKidDJL
7. Psalm 115:4-8
"But their idols are silver and gold, made by human hands. They have mouths, but cannot speak, eyes, but cannot see. They have ears, but cannot hear, noses, but cannot smell… Those who make them will be like them, and so will all who trust in them.”
55/71
@Blu3_NFT
Very impressive.
56/71
@FeatureCrewPod
the anthropic demo works pretty well:
[Quoted tweet]
New #AI Agent from @AnthropicAI can now...
Delete emails
Manage files
Try to draw
Watch the full video:
https://video.twimg.com/amplify_video/1848941581199400960/vid/avc1/1920x1080/NGOul_PPaOzxRNk9.mp4
57/71
@cometpizzo
really cool but i dont see this being a reality until image inference price comes down over the next few years
58/71
@algorusty
Imagine having a net of Claudes from different computers interacting with each other...
59/71
@ssslomp
kids in the comments "my PC won't boot"
60/71
@joshRnold
thank you for putting this together! was waiting for someone to spin something like this up
61/71
@trinket11101
I'm definitely going to double-click on agent.exe
62/71
@orodeltoro
I have to pay for the API key access to run this on my comp right? Anyone know? I'm a newbie
63/71
@bobz44
nice yes excited to dig in here
64/71
@nexvice
Has anyone tried this with GPT already?
65/71
@Mars53208096
Is it easy to setup
66/71
@AmberSahdev
Hell yeah! I built something similar earlier this year as well - open sourced, multi-platform (mac/windows/linux), and multi-LLM
GitHub - AmberSahdev/Open-Interface: Control Any Computer Using LLMs
67/71
@lordfervi
Waiting for version with Mistral!
68/71
@Elonmusk2bk
If you support President Trump, we are FAMILY! Lets win this...
69/71
@Tyzer34
Hmmm seems that mine is constantly opening the Accessibility settings instead of Firefox...
70/71
@erentrkg
Wow I looove technology
71/71
@ZappaCloud
All I want to know is if it can fish in World of Warcraft @Krillict
To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196