What are intrinsically disordered proteins (IDPs)
In the last decade the research surrounding intrinsically disordered proteins (IDPs) has dramatically increased. This increased attention is reflected in the rise of articles being published that discuss how attractive IDPs are as drug targets1. But, despite the knowledge we have gained, there’s still a lot of mystery surrounding IDPs. In this article we explain what IDPs are and how they differ from ordered globular proteins, both in terms of structure and their key characteristics.
Figure 1: A graph showing the number of publications covering IDPs over time. Data sourced from: https://www.ncbi.nlm.nih.gov/pubmed/?term=Intrinsically+disordred+proteins
What are intrinsically disordered proteins (IDPs)?
IDPs and proteins with intrinsically disordered protein regions (IDPRs) make up around 30% of the human proteome. The key characteristic of IDPs and IDPRs is that they are unable to fold spontaneously into single, well-defined 3D structures, but instead fluctuate between multiple conformations. IDPs cannot be characterized by the traditional methods of structural biology very easily and so new technologies to investigate their structure and behaviour are being sought after.
The classical structure-function paradigm where two proteins (with highly structured states determined by their respective amino acid sequences) bind together via a “lock and key” model has been taught for more than a century. However, many proteins have been discovered that do not require a unique structure to carry out their function2. These types of proteins, that carry structure independent functions, are classified as IDPs. There are also proteins that exist which contain both ordered domains and intrinsically disordered protein regions (IDRs). One example is p53, which has IDRs in its N- and C-terminus. These IDRs have binding sites for many partner proteins.
Figure 2: A comparison of the key characteristics between structured ordered domains versus intrinsically disordered regions. The inclusion of disordered regions and structured domains is shown to increase the versatility of proteins in eukaryota. Source: Babu et al. 20123 .
How prevalent are IDPs?
IDPs an IDPRs have now been found throughout nature in proteomes of organisms in all kingdoms of life4. Interestingly, it has now been shown through various computational analyses, that the abundance of disorder increases proportionally with organism complexity4,5. It is through these computational analyses that the (presumed) number of sequences with predicted IDPRs (>30 residues) is roughly the same in bacteria/archaea, but is significantly higher in eukaryota6,7. This increase is estimated to be due to the higher number of cell signalling pathways that rely on IDPs/IDPRs.
Almost immediately after the biological world began to recognize IDPs/IDPRs as a new class of biologically active proteins, it became clear that they were not rare exceptions but highly prevalent in nature. The first collection of IDPRs included 67 disordered regions found in 61 PDB proteins8. At present, researchers have shown that there are now approximately 1,150 proteins in the list of validated IDPs9. Similarly, the Swiss Protein Database showed that long IDPRs of at least 40 consecutive residues are predicted to be in over 15,000 proteins10.
The characteristics of IDPs that inform their structure and function
IDPs do not follow the rules established by ordered globular proteins and domains. These structural differences appear to be rooted in the peculiarities of the amino acid sequences in IDPs. So called “order promoting residues” such as Tryptophan, Cysteine, Tyrosine and Leucine are depleted in IDPs. On the other hand, “disorder promoting residues” such as Arginine, Proline, Glycine and Serine are found to be relatively higher in IDPs compared to ordered proteins11.
Just as with ordered proteins, whose biological structures are formed based on their amino acid sequences, the ability of IDPs to not fold and still be functional in the absence of unique structures is also encoded into the amino acid sequences of IDPs/IDPRs. In extended IDPs, these features include multiple charged groups (typically negative) that give the IDP its characteristic high net charge at neutral pH and their extreme isoelectric values12,13,14. It is these features that allow IDP/IDPRs to continue to function in conditions that ordered proteins would find too harsh such as environments with high pH.
These characteristics above, coupled with their low number of hydrophobic amino acid residues13, allow for IDPs to be dis/ordered at one moment in time, but able to change state at a future point in time. Therefore, IDPs are not homogeneous, but represent a very complex mixture of partially foldable, potentially foldable, differently foldable, or completely unfoldable segments15.
In this article we have covered what IDP/IDPRs are and how they differ from ordered proteins in both their structure and their characteristics. There is still lots of mystery surrounding IDP/IDPRs given that they contradict the basic logic of the “lock-and-key” theory of protein functionality and that they break multiple rules devised by the researchers studying structure, folding, and functions of ordered proteins. An accurate understanding of multilevel complexity of IDPs/IDPRs will require the development of new rules and new technologies to study them.
- Dobrev, V.S., Fred, L.M., Gerhart, K.P. and Metallo, S.J. (2018) Characterization of the binding of small molecules to intrinsically disordered proteins. In Methods in enzymology 611, pp. 677.
- Uversky, V.N. (2019) Intrinsically disordered proteins and their ‘mysterious’(meta) physics. Frontiers in Physics, 7, p.10.
- Babu, M.M., Kriwacki, R.W. and Pappu, R.V. (2012) Versatility from protein disorder. Science, 337(6101), p1460.
- Xue, B., Dunker, A.K. and Uversky, V.N. (2012) Orderly order in protein intrinsic disorder distribution: disorder in 3500 proteomes from viruses and the three domains of life. Journal of Biomolecular Structure and Dynamics, 30(2), p.137.
- Uversky, V.N. (2009) The mysterious unfoldome: structureless, underappreciated, yet vital part of any given proteome. BioMed Research International, 2010.
- Peng, Z., Yan, J., Fan, X., Mizianty, M.J., Xue, B., Wang, K., Hu, G., Uversky, V.N. and Kurgan, L. (2015) Exceptionally abundant exceptions: comprehensive characterization of intrinsic disorder in all domains of life. Cellular and Molecular Life Sciences, 72(1), p.137.
- Ward, J.J., Sodhi, J.S., McGuffin, L.J., Buxton, B.F. and Jones, D.T. (2004) Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. Journal of molecular biology, 337(3), p.635.
- Romero, P., Obradovic, Z., Kissinger, C., Villafranca, J.E. and Dunker, A.K. (1997) Identifying disordered regions in proteins from amino acid sequence. In Proceedings of International Conference on Neural Networks 1, p. 90.
- DeForte, S. and Uversky, V.N. (2016) Intrinsically disordered proteins in PubMed: what can the tip of the iceberg tell us about what lies below? RSC Advances, 6(14), p11513.
- Romero, P., Obradovic, Z., Kissinger, C.R., Villafranca, J.E., Garner, E., Guilliot, S.. and Dunker, A.K. (1998) Thousands of proteins likely to have long disordered regions. In Pacific Symposium on Biocomputing, 3, p 437.
- Jorda, J., Xue, B., Uversky, V.N. and Kajava, A.V. (2010) Protein tandem repeats–the more perfect, the less structured. The FEBS journal, 277(12), p2673.
- Weinreb, P.H., Zhen, W., Poon, A.W., Conway, K.A. and Lansbury, P.T. (1996) NACP, a protein implicated in Alzheimer's disease and learning, is natively unfolded. Biochemistry, 35(43), p13709.
- Gast, K., Damaschun, H., Eckert, K., Schulze-Forster, K., Maurer, H.R., Mueller-Frohne, M., Zirwer, D., Czarnecki, J. and Damaschun, G. (1995) Prothymosin. alpha: A biologically active protein with random coil conformation. Biochemistry, 34(40), p13211.
- Hemmings, H.C., A.C. Nairn, D.W. Aswad, and P. Greengard. (1984) DARPP-32, a dopamine-and adenosine 3': 5'-monophosphate-regulated phosphoprotein enriched in dopamine-innervated brain regions. II. Purification and characterization of the phosphoprotein from bovine caudate nucleus. Journal of Neuroscience, 4(1), p99.
- Uversky, V.N. (2016) Paradoxes and wonders of intrinsic disorder: Complexity of simplicity. Intrinsically disordered proteins, 4(1), p1135015.