Google DeepMind’s Groundbreaking AI for Protein Structure Can Now Model DNA

bnew

Veteran
Joined
Nov 1, 2015
Messages
57,616
Reputation
8,519
Daps
160,577

Google DeepMind’s Groundbreaking AI for Protein Structure Can Now Model DNA

WILL KNIGHT
BUSINESS


MAY 8, 2024 11:00 AM

Google DeepMind’s Groundbreaking AI for Protein Structure Can Now Model DNA​

Demis Hassabis, Google’s artificial intelligence chief, says the AlphaFold software that revolutionized the study of proteins has received a significant upgrade that will advance drug development.

Abstract sculpture of multicolored spheres and straws on a pink and yellow background molecular structure concept

PHOTOGRAPH: DANIEL GRIZELJ/GETTY IMAGES

Google spent much of the past year hustling to build its Gemini chatbot to counter ChatGPT, pitching it as a multifunctional AI assistant that can help with work tasks or the digital chores of personal life. More quietly, the company has been working to enhance a more specialized artificial intelligence tool that is already a must-have for some scientists.

AlphaFold, software developed by Google’s DeepMind AI unit to predict the 3D structure of proteins, has received a significant upgrade. It can now model other molecules of biological importance, including DNA, and the interactions between antibodies produced by the immune system and the molecules of disease organisms. DeepMind added those new capabilities to AlphaFold 3 in part through borrowing techniques from AI image generators.

“This is a big advance for us,” Demis Hassabis, CEO of Google DeepMind, told WIRED ahead of Wednesday’s publication of a paper on AlphaFold 3 in the science journal Nature. “This is exactly what you need for drug discovery: You need to see how a small molecule is going to bind to a drug, how strongly, and also what else it might bind to.”

AlphaFold 3 can model large molecules such as DNA and RNA, which carry genetic code, but also much smaller entities, including metal ions. It can predict with high accuracy how these different molecules will interact with one another, Google’s research paper claims.

The software was developed by Google DeepMind and Isomorphic labs, a sibling company under parent Alphabet working on AI for biotech that is also led by Hassabis. In January, Isomorphic Labs announced that it would work with Eli Lilly and Novartis on drug development.

AlphaFold 3 will be made available via the cloud for outside researchers to access for free, but DeepMind is not releasing the software as open source the way it did for earlier versions of AlphaFold. John Jumper, who leads the Google DeepMind team working on the software, says it could help provide a deeper understanding of how proteins interact and work with DNA inside the body. “How do proteins respond to DNA damage; how do they find, repair it?” Jumper says. “We can start to answer these questions.”

Understanding protein structures used to require painstaking work using electron microscopes and a technique called x-ray crystallography. Several years ago, academic research groups began testing whether deep learning, the technique at the heart of many recent AI advances, could predict the shape of proteins simply from their constituent amino acids, by learning from structures that had been experimentally verified.

In 2018, Google DeepMind revealed it was working on AI software called AlphaFold to accurately predict the shape of proteins. In 2020, AlphaFold 2 produced results accurate enough to set off a storm of excitement in molecular biology. A year later, the company released an open source version of AlphaFold for anyone to use, along with 350,000 predicted protein structures, including for almost every protein known to exist in the human body. In 2022 the company released more than 2 million protein structures.
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
57,616
Reputation
8,519
Daps
160,577






1/1
Announcing AlphaFold 3: our state-of-the-art AI model for predicting the structure and interactions of all life’s molecules.

Here’s how we built it with
@IsomorphicLabs and what it means for biology. AlphaFold 3 predicts the structure and interactions of all of life’s molecules


To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
GNEJPJBXAAEqKaW.jpg

GNEMivPWIAAdG71.jpg
 
Last edited:

bnew

Veteran
Joined
Nov 1, 2015
Messages
57,616
Reputation
8,519
Daps
160,577

AlphaProteo generates novel proteins for biology and health research​


Research

Published: 5 September 2024​

Authors: Protein Design and Wet Lab teams​


[SUB]New AI system designs proteins that successfully bind to target molecules, with potential for advancing drug design, disease understanding and more.[/SUB]

Every biological process in the body, from cell growth to immune responses, depends on interactions between molecules called proteins. Like a key to a lock, one protein can bind to another, helping regulate critical cellular processes. Protein structure prediction tools like https://deepmind.google/impact/meet-the-scientists-using-alphafold/"]AlphaFold have already given us tremendous insight into how proteins interact with each other to perform their functions, but these tools cannot create new proteins to directly manipulate those interactions.

Scientists, however, can create novel proteins that successfully bind to target molecules. These binders can help researchers accelerate progress across a broad spectrum of research, including drug development, cell and tissue imaging, disease understanding and diagnosis – even crop resistance to pests. While https://www.nature.com/articles/s41586-023-06415-8"]recent machine learning approaches to protein design have made great strides, the process is still laborious and requires extensive experimental testing.

Today, we introduce https://storage.googleapis.com/deep...tein_Design_White_Paper_2024.pdf"]AlphaProteo, our first AI system for designing novel, high-strength protein binders to serve as building blocks for biological and health research. This technology has the potential to accelerate our understanding of biological processes and aid the discovery of new drugs, the development of biosensors and more.

AlphaProteo can generate new protein binders for diverse target proteins, including https://www1.rcsb.org/structure/1BJ1"]VEGF-A, which is associated with cancer and complications from diabetes. This is the first time an AI tool has been able to design a successful protein binder for VEGF-A.

AlphaProteo also achieves higher experimental success rates and 3 to 300 times better binding affinities than the best existing methods on seven target proteins we tested.

Learning the intricate ways proteins bind to each other​


Protein binders that can bind tightly to a target protein are hard to design. Traditional methods are time intensive, requiring multiple rounds of extensive lab work. After the binders are created, they undergo additional experimental rounds to optimize binding affinity so they bind tightly enough to be useful.

Trained on vast amounts of protein data from the https://www.rcsb.org/"]Protein Data Bank (PDB) and more than 100 million predicted structures from AlphaFold, AlphaProteo has learned the myriad ways molecules bind to each other. Given the structure of a target molecule and a set of preferred binding locations on that molecule, AlphaProteo generates a candidate protein that binds to the target at those locations.


https://deepmind.google/api/blob/website/media/GDM-ProteinDesignBlog-02-Binder-Final.mp4


[FIGURE]
[CAPTION]Illustration of a predicted protein binder structure interacting with a target protein. Shown in blue is a predicted protein binder structure generated by AlphaProteo, designed for binding to a target protein. Shown in yellow is the target protein, specifically the SARS-CoV-2 spike receptor-binding domain[/CAPTION]
[/FIGURE]

Demonstrating success on important protein binding targets​


To test AlphaProteo, we designed binders for diverse target proteins, including two viral proteins involved in infection (https://www.rcsb.org/structure/2WH6"]BHRF1 and SARS-CoV-2 spike protein receptor-binding domain), SC2RBD), and five proteins involved in cancer, inflammation and autoimmune diseases (IL-7Rɑ, PD-L1, TrkA, IL-17A and VEGF-A).

Our system has highly-competitive binding success rates and best-in-class binding strengths. For seven targets, AlphaProteo generated candidate proteins in-silico that bound strongly to their intended proteins when tested experimentally.

For one particular target, the viral protein https://www.rcsb.org/structure/2WH6"]BHRF1, 88% of our candidate molecules bound successfully when tested in the Google DeepMind Wet Lab. Based on the targets tested, AlphaProteo binders also bind 10 times more strongly, on average, than the best existing design methods.

For another target, https://www.rcsb.org/structure/1WWW"]TrkA, our binders are even stronger than the best prior designed binders to this target that have been through multiple rounds of experimental optimization.

[FIGURE]
[CAPTION]A grid of illustrations of predicted structures of seven target proteins for which AlphaProteo generated successful binders. Shown in blue are examples of binders tested in the wet lab; shown in yellow are protein targets; highlighted in dark yellow are intended binding regions.[/CAPTION]
[/FIGURE]

Validating our results​


Beyond in silico validation and testing AlphaProteo in our wet lab, we engaged Peter Cherepanov’s, Katie Bentley’s and David LV Bauer’s research groups from the Francis Crick Institute to validate our protein binders. Across different experiments, they dived deeper into some of our stronger SC2RBD and VEGF-A binders. The research groups confirmed that the binding interactions of these binders were indeed similar to what AlphaProteo had predicted. Additionally, the groups confirmed that the binders have useful biological function. For example, some of our SC2RBD binders were shown to prevent SARS-CoV-2 and some of its variants from infecting cells.

AlphaProteo’s performance indicates that it could drastically reduce the time needed for initial experiments involving protein binders for a broad range of applications. However, we know that our AI system has limitations as it was unable to design successful binders against an eighth target (https://www.rcsb.org/structure/1TNF"]TNFɑ), a protein associated with autoimmune diseases like rheumatoid arthritis.

Achieving strong binding is usually only the first step in designing proteins that might be useful for practical applications; there are many more bioengineering obstacles to overcome in the research and development process.

Towards responsible development of protein design​


Protein design is a fast-evolving technology that holds lots of potential for advancing science in everything from understanding factors causing disease to accelerating diagnostic test development for virus outbreaks, supporting more sustainable manufacturing processes and even cleaning contaminants from environments.

To account for potential risks in biosecurity, building on our long-standing approach to responsibility and safety,https://storage.googleapis.com/deep..."]we’re working with leading external experts to inform our phased approach to sharing this work and feeding into community efforts to develop best practices including NTI’s new AI Bio Forum.

Going forward, we’ll be working with the scientific community to leverage AlphaProteo on impactful biology problems and understand its limitations. We've also been exploring its drug design applications at Isomorphic Labs and are excited about what the future holds.

At the same time, we’re continuing to improve success rates and affinity of AlphaProteo’s algorithms while expanding its range of design problems it can tackle. We're working with researchers in machine learning, structural biology, biochemistry and other disciplines towards developing a responsible comprehensive protein design offering for the community.

Read our whitepaper here​
 
Last edited:

bnew

Veteran
Joined
Nov 1, 2015
Messages
57,616
Reputation
8,519
Daps
160,577









1/11
@joshim5
We’re excited to introduce @ChaiDiscovery and release Chai-1, a foundation model for molecular structure prediction that performs at the state-of-the-art across a variety of drug discovery tasks

We're releasing inference code, weights & a web interface: Chai Discovery

2/11
@joshim5
We tested Chai-1 across a number of benchmarks, and found that the model achieves a 77% success rate on the PoseBusters benchmark (vs. 76% by AlphaFold3).

3/11
@joshim5
Unlike many existing structure prediction tools which require multiple sequence alignments (MSAs), Chai-1 can be run in single sequence mode without MSAs while preserving most of its performance.

4/11
@joshim5
In addition to its frontier modeling capabilities directly from sequences, Chai-1 can be prompted with new data, e.g. restraints derived from the lab, which boost performance by double-digit percentage points.

5/11
@joshim5
We are releasing Chai-1 via a web interface for free, including for commercial applications such as drug discovery. We are also releasing the code for Chai-1 for non-commercial use as a software library.

Web interface: Chai Discovery
Code: GitHub - chaidiscovery/chai-lab: Chai-1, SOTA model for biomolecular structure prediction

6/11
@joshim5
Prior to founding @chaidiscovery, our team collectively helped advance the development of 20 drug programs.

Many of us were Heads of AI at leading AI Drug Discovery companies and we’ve worked at companies like @OpenAI, @AIatMeta, @Stripe, and @GoogleAI.

7/11
@joshim5
We're well funded and grateful for the strong partnership of @ThriveCapital, @_DimensionCap, @OpenAI, @saranormous, @Neo, @lachygroom, and @Amplify, as well as angel investors @gdb, @ByersBlake, @JuliaHartz, @KevinHartz, @FEhrsam, @gaybrick, @dafrankel, @RMartinChavez.

8/11
@joshim5
Read more about our launch in @Bloomberg: Bloomberg - Are you a robot?

9/11
@apartovi
Let's goooo! I'm so proud of you Josh,
@_JackDent, & @ChaiDiscovery!

It's been seven wonderful years since we started supporting you as @Neo Scholars, and I've loved every step. 💙

Glad to be your investor, and here's to the journey ahead.

10/11
@joshim5
Thanks so much for your support, @apartovi! And what a throwback... that photo on-stage is from the first @Neo reunion back in 2017. I think we can guess what the topic was :smile:

11/11
@jennywang01
Congrats!!🎉


To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
GXDFSyqbMAAqgLr.jpg

GXDB1qCa0AAyqwJ.jpg

GXDDCQtagAAroOS.jpg

GXDCoHiaIAAJjQv.jpg

GXDdKS9WEAAq-K3.jpg

GXDdS0ZWAAAuPWZ.jpg

GXDdZ6IWsAAtTKD.jpg

GXDdgCWXsAAPv-i.jpg
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
57,616
Reputation
8,519
Daps
160,577


1/11
@kimmonismus
I think a lot of people who are interested in AI still don't realize what a breakthrough @GoogleDeepMind 's AlphaFold2 is for humanity. It is one of the most important achievements of technology and will directly benefit all people by allowing us to develop new drugs much faster (weeks instead of years!), much more accurate predictions about the drugs through more accurate protein folding and at the same time much more individualized therapy.
I hope to find time soon to write a longer analysis and review of the importance of AlphaFold2. I would really like to do that.

[Quoted tweet]
Interesting that both the Nobel Prize in Physics and the Nobel Prize in Chemistry go to the field of artificial intelligence. Anyway: well deserved to the winners and of course especially to @demishassabis .


2/11
@danielarpm
Alphafold3 will be even more interesting

Eventually whole cell simulation is the goal (and then maybe tissue /organ).



3/11
@kimmonismus
Every new iteration will be even more interesting ;)



4/11
@ooperceiveroo
🤡



5/11
@hive_echo
Much agreed!



6/11
@benyogaking
more impactful than reducing costs for big pharma is that some version of AlphaFold may develop industrially useful protiens for atomically precise manufacturing so that a home desktop DNA resequencer could make anything from anything (given enough time)?



7/11
@BrothrMichaels
It's so hard not to break that 2 year-long pact with myself to lead a Google free life nowadays.



8/11
@cvamarosa
Ya está disponible Alphafold3 y va mucho más allá de la predicción de la estructura de proteínas. Permite estudiar otras moléculas e interacciones entre ellas



9/11
@OmVenture
Yes, and the winners are artificial intelligence and biology
3:0 so far



10/11
@ChristianB32772
Can it be used by anyone, or is access restricted? In other words, could the achievements possibly end up in the hands of large pharmaceutical companies



11/11
@Suprabhat_Ravi
Thanks for this, currently I'm not sure if any of it's applications have been successful, I get that it predicts things accurately but how accurately? Do experiments prove what the model suggests?




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196

 

bnew

Veteran
Joined
Nov 1, 2015
Messages
57,616
Reputation
8,519
Daps
160,577

1/1
@kimmonismus
That’s what I love google and especially DeepMind for

„Tx-LLM is a single model that is fine-tuned to predict properties for tasks related to therapeutic development, ranging from early-stage target identification to late-stage clinical trial approval.“

In combination with AlphaFold2 an absolute game changer.

[Quoted tweet]
Development of therapeutic drugs is often difficult and time consuming. A new model, Tx-LLM, is able to predict the properties of many entities of potential interest for therapeutic development with accuracy comparable state-of-the-art specialty models.→ goo.gle/3Zb3AfM


GZeha-MXYAAQNaJ.jpg



To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196




1/11
@GoogleAI
Development of therapeutic drugs is often difficult and time consuming. A new model, Tx-LLM, is able to predict the properties of many entities of potential interest for therapeutic development with accuracy comparable state-of-the-art specialty models.→ Tx-LLM: Supporting therapeutic development with large language models



GZeha-MXYAAQNaJ.jpg


2/11
@BitValentine
Fascinating breakthrough! But how adaptable is this model across diverse therapeutic areas? Could accelerate drug discovery substantially.



3/11
@gpt_biz
This new model Tx-LLM sounds like a real breakthrough for faster drug development, looking forward to its impact!



4/11
@AlexFridd
Exciting breakthrough! The Tx-LLM model could streamline drug development by predicting therapeutic properties more efficiently.



5/11
@ak_panda
“Tx-LLM, is able to predict the properties of many entities of potential interest for therapeutic development with accuracy”



6/11
@Karenhalmi
Tx-LLM sounds like a breakthrough in drug development! Its ability to predict properties across multiple stages could significantly speed up the process. This kind of AI application has the potential to revolutionize healthcare and medicine. Exciting progress!



7/11
@softech28
Jesus Christ has returned
support @elonmusk
vote @realDonaldTrump
stand with @israel
.



8/11
@diegonov1
Let’s get that chemistry novel



9/11
@DLabz
@jezzichara



10/11
@MedExpertAI
A promising development in therapeutic research. Tx-LLM's ability to predict properties of various biological entities is a significant step forward in streamlining the drug discovery process. Félix en la búsqueda de soluciones para mejorar la salud.



11/11
@Miguel65370735
@DotCSV




To post tweets in this format, more info here: https://www.thecoli.com/threads/tips-and-tricks-for-posting-the-coli-megathread.984734/post-52211196
 
Top