DeepMind’s AlphaFold2 Solves the Long-Stood Protein-Folding Problem

DeepMind’s AlphaFold2 Solves the Long-Stood Protein-Folding Problem

The protein-folding problem is finally solved through DeepMind's Alpha Fold.

Artificial intelligence has changed the way science is done by allowing researchers to analyze the massive amounts of data that modern scientific instruments generate. It can find a needle in a million haystacks of information and, using deep learning, it can also learn from the data itself. AI is accelerating advances in gene hunting, medicine, drug design, and the creation of organic compounds. Deep learning uses algorithms, often neural networks that are trained on large amounts of data, to extract information from new data. It is very different from traditional computing with its step-by-step instructions.

What is Protein Folding

On the other hand, proteins are present in all living organisms. They provide the cells with structure, catalyze reactions, transport small molecules, digest food, and do much more. They are made up of long chains of amino acids like beads on a string. But for a protein to do its job in the cell, it must twist and bend into a complex three-dimensional structure, a process called protein folding. Misfolded proteins can lead to disease. In his chemistry Nobel acceptance speech in 1972, Christiaan Anfinsen postulated that it should be possible to calculate the three-dimensional structure of a protein from the sequence of its building blocks, the amino acids. The order of the amino acids determines the protein's identity and shape, which results in its function.

Because of the inherent flexibility of the amino acid building blocks, a typical protein can adopt an estimated 10 to the power of 300 different forms. This is a massive number, more than the number of atoms in the universe. Yet within a millisecond, every protein in an organism will fold into its very own specific shape – the lowest-energy arrangement of all the chemical bonds that make up the protein. Change just one amino acid in the hundreds of amino acids typically found in a protein and it may misfold and no longer work. For 50 years computer scientists have tried to solve the protein-folding problem – with little success.

The Beginning of The AlphaFold Programme

Then in 2016 DeepMind, an AI subsidiary of Google parent Alphabet initiated its AlphaFold program. It used the protein databank as its training set, which contains the experimentally determined structures of over 150,000 proteins. In less than five years AlphaFold had the protein-folding problem beat – at least the most useful part of it, namely, determining the protein structure from its amino acid sequence. AlphaFold does not explain how the proteins fold so quickly and accurately. It was a major win for AI because it not only accrued huge scientific prestige but also was a major scientific advance that could affect everyone's lives.

In its AlphaFold Protein Structure Database, Deepmind has deposited the 3D structures of nearly all the proteins found in humans, mice, and more than 20 other species. To date, it has solved more than a million structures and plans to add another 100 million structures this year alone. Knowledge of proteins has skyrocketed. The structure of half of all known proteins is likely to be documented by the end of 2022, among them many new unique structures associated with new useful functions.

The system is a wonder of modern deep learning, including a variety of highly sophisticated components. One of these components, the Evoformer, is able to efficiently extract information from a multiple sequence alignment and build an accurate representation of the parts of the protein in close contact. AlphaFold2 was not designed to predict how proteins would interact with one another, yet it has been able to model how individual proteins combine to form large complex units composed of multiple proteins.

More Trending Stories 

Related Stories

No stories found.
logo
Analytics Insight
www.analyticsinsight.net