1/20
@adcock_brett
What a week for AI and Robotics.
As usual, I summarized everything announced by OpenAI, Google DeepMind, Amazon, Microsoft AI, Tencent, ElevenLabs, Meta, xAI, and more.
Here's everything you need to know and how to make sense out of it:
2/20
@adcock_brett
OpenAI announced the immediate launch of their latest o1 and o1 pro reasoning models on day 1 of 'shipmas'
They also introduced a new $200/month Pro tier with unlimited access to their existing model fleet and o1 pro
https://video.twimg.com/ext_tw_video/1864731897861148675/pu/vid/avc1/1920x1080/m8ewXJ3rLoQO9esP.mp4
3/20
@adcock_brett
Google announced Genie 2, a large-scale, multimodal foundation world model
It can convert single images into interactive, playable 3D environments
Excited about what this can do not just for AI gaming, but also for AI training in simulation
https://video.twimg.com/ext_tw_video/1864351674816495616/pu/vid/avc1/1280x720/Jz3t596U6zObllGO.mp4
4/20
@adcock_brett
Amazon announced Nova, a new family of AI models
The lineup includes four text models of varying capabilities (Micro, Lite, Pro, and Premier), plus Canvas (image) and Reel (video) models
Nice to see Amazon finally making some noise
https://video.twimg.com/amplify_video/1864290507184132096/vid/avc1/720x900/jQNlFahIkLUeONSX.mp4
5/20
@adcock_brett
We announced some big updates to @cover_thz, my AI security company I announced last year:
1. Our first system capable of detecting concealed weapons was completed this summer
2. Design is underway for our 2nd-gen system which will be radically better
[Quoted tweet]
Last year, I started an AI security company called Cover
Our AI hardware detects guns underneath clothes and bags
And I’m currently funding $10M into it
Here’s why this technology is critical and a 12-month update:
6/20
@adcock_brett
Meta dropped Llama 3.3, a 70B open model with similar performance to Llama 3.1 405B, but significantly cheaper
It's text-only for now. Available to download at Meta's Llama page and on Hugging Face
7/20
@adcock_brett
Google launched a new model called gemini-exp-1206, and it's currently topping the Chatbot Arena rankings
You can try it for free in Google AI Studio
[Quoted tweet]
What a way to celebrate one year of incredible Gemini progress -- #1 across the board on overall ranking, as well as on hard prompts, coding, math, instruction following, and more, including with style control on.
Thanks to the hard work of everyone in the Gemini team and elsewhere at Google!
8/20
@adcock_brett
Microsoft launched Copilot Vision in the Edge browser
You can talk to Copilot about what’s on your screen, and it knows what you’re referring to and can respond through voice
Seems like Microsoft is differentiating itself by focusing on 'AI companions'
[Quoted tweet]
EXCLUSIVE: Microsoft just launched Copilot Vision in Edge—the first AI that can navigate the internet with you in real time.
I sat down with Mustafa Suleyman (CEO of Microsoft AI) to discuss how it works, infinite memory, AI companions, agents, and more.
Timestamps:
0:00 Intro
0:58 Microsoft Copilot Vision rundown
02:35 Initial reactions and use cases
04:57 How Copilot Vision works
06:02 Teaching AI to remember
08:13 How Microsoft AI is differentiating from OpenAI
09:27 The push for the true AI companion
11:28 Living with a co-intelligence in 10+ years
14:04 How Microsoft handles your data
16:11 When and how can you try Copilot Vision?
18:15 Agentic AI that controls your computer
20:25 What's next: support, gaming, and more
22:34 Preparing the next generation with Copilot
25:11 Mustafa’s advice for students and businesses
https://video.twimg.com/amplify_video/1864696239289204736/vid/avc1/1280x720/5M0O6lm-pFsLQ_zD.mp4
9/20
@adcock_brett
DeepMind unveiled GenCast, an AI weather forecasting system that surpasses the accuracy of the world's leading forecasting model
It can reportedly produce reliable 15-day forecasts in minutes, rather than hours
https://video.twimg.com/ext_tw_video/1864340386270830592/pu/vid/avc1/1080x1080/ho_lKGo7mtklf4l3.mp4
10/20
@adcock_brett
Tencent released Hunyuan Video, a high-quality video generation model with 13 billion parameters.
This is on par with commercial competitors like Runway Gen-3 on motion quality and scene consistency
And it's open-sourced
https://video.twimg.com/ext_tw_video/1863811156478988290/pu/vid/avc1/1280x720/VWryVhZgTmigIARL.mp4
11/20
@adcock_brett
A team from Shanghai combined robotics with 3D printing to create a 6-axis 3D printer inspired by spiderwebs.
The early-stage prototype expands the horizons of what can be created by a printer
https://video.twimg.com/ext_tw_video/1863507138506575872/pu/vid/avc1/960x540/RnDn6drRbYGjySej.mp4
12/20
@adcock_brett
E11 Bio used AI to lower the cost of brain mapping by 100x
This makes mapping all the neural connections in mouse and human brains a possibility for the first time
This is one of the critical steps towards whole brain simulation
[Quoted tweet]
E11 Bio is excited to share a major step towards brain mapping at 100x lower cost, making whole-brain connectomics at human & mouse scale feasible. Critical for curing brain disorders, building human-like AI systems, and even simulating human brains. 1/N
https://video.twimg.com/ext_tw_video/1863956020701339649/pu/vid/avc1/1280x720/oJjlu6xowkNwCEmx.mp4
13/20
@adcock_brett
ElevenLabs launched a platform to build and deploy conversational AI agents
They integrated a ton of tech for developers, including Speech-to-Text, LLM integrations, Text-to-Speech, turn-taking and interruption handling
https://video.twimg.com/ext_tw_video/1864005887808897024/pu/vid/avc1/1920x1080/haZ3BVn-MjvLi4Zv.mp4
14/20
@adcock_brett
ElevenLabs also announced 'GenFM', the AI voice startup's version of Google's NotebookLM
Basically, take any URL and create a podcast in seconds, and listen directly on their iOS/Android apps
https://video.twimg.com/ext_tw_video/1865085808107008000/pu/vid/avc1/1920x1080/Ix-H76_DoUKtlwff.mp4
15/20
@adcock_brett
Clone Robotics introduced 'Clone Alpha' with a CGI video, a humanoid featuring synthetic organs and water-powered artificial muscles
279 units will reportedly be available for preorder in 2025
https://video.twimg.com/ext_tw_video/1864555673306189826/pu/vid/avc1/1280x720/GANhmAsQaeIwvfCc.mp4
16/20
@adcock_brett
World Labs revealed its first major project
It's an AI system that can transform any image into an explorable, interactive 3D environment
Users can navigate it in real-time through a web browser
https://video.twimg.com/ext_tw_video/1863617332821811200/pu/vid/avc1/1280x720/26_-thUvx9o1SnBO.mp4
17/20
@adcock_brett
OpenAI announced reinforcement finetuning on day 2 of its 12-day 'Shipmas' spree
In order to get access, you need to apply to a limited alpha program
They plan a public rollout in Q1 of 2025
18/20
@adcock_brett
xAI's Grok made some big updates this week too:
1. It's now available to free users on X, with up to 10 messages every 2 hours
2. They launched Aurora, a new AI image generator that is very good at photorealism
[Quoted tweet]
NEWS: xAI’s latest image generator ‘Aurora’ is now live!
19/20
@adcock_brett
We're hiring over 100 engineers @Figure_robot:
→ Systems Integration Engineer
→ Electrical Engineering (many)
→ Manufacturing Roles (many)
→ Controls Eng (many)
→ Embedded SW
See all open roles and apply here:
Careers | Figure
[Quoted tweet]
HIRING UPDATE: Figure must scale over 100 engineering roles
What are we hiring for? AI, Controls, Manufacturing, Fleet Operations, Software, Mechanical, Electrical, System Integration & business roles (legal, recruiting)
Link below + some details
20/20
@adcock_brett
That's it for this week's AI and Robotics breakdown.
I share the latest research every week, so follow me @adcock_brett for more.
If you found this valuable, consider a like/retweet to spread the word.