Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Unifying structural descriptors for biological and bioinspired nanoscale complexes

Abstract

Biomimetic nanoparticles are known to serve as nanoscale adjuvants, enzyme mimics and amyloid fibrillation inhibitors. Their further development requires better understanding of their interactions with proteins. The abundant knowledge about protein–protein interactions can serve as a guide for designing protein–nanoparticle assemblies, but the chemical and biological inputs used in computational packages for protein–protein interactions are not applicable to inorganic nanoparticles. Analysing chemical, geometrical and graph-theoretical descriptors for protein complexes, we found that geometrical and graph-theoretical descriptors are uniformly applicable to biological and inorganic nanostructures and can predict interaction sites in protein pairs with accuracy >80% and classification probability ~90%. We extended the machine-learning algorithms trained on protein–protein interactions to inorganic nanoparticles and found a nearly exact match between experimental and predicted interaction sites with proteins. These findings can be extended to other organic and inorganic nanoparticles to predict their assemblies with biomolecules and other chemical structures forming lock-and-key complexes.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: The concept of the distance matrix of a protein complex and the introduction of descriptors.
Fig. 2: Analysis of descriptors.
Fig. 3: Construction of distance-based feature matrices for prediction of protein complexes.
Fig. 4: Prediction of protein–protein complexes with the DNN model trained on the GE + GT descriptors.
Fig. 5: The prediction of complexes between protein and nanocarbons.

Similar content being viewed by others

Data availability

Our Code Ocean Capsule66 contains all the associated data for PPI training and PPI/protein–NP interaction testing. The source data for Figs. 15 and Table 1 are provided with this paper.

Code availability

All Python codes associated with this study are deposited in the Code Ocean capsule66 at https://doi.org/10.24433/CO.7800040.v1.

References

  1. Morrison, J. L., Breitling, R., Higham, D. J. & Gilbert, D. R. A lock-and-key model for protein–protein interactions. Bioinformatics 22, 2012–2019 (2006).

    Article  Google Scholar 

  2. Baspinar, A., Cukuroglu, E., Nussinov, R., Keskin, O. & Gursoy, A. PRISM: a web server and repository for prediction of protein–protein interactions and modeling their 3D complexes. Nucleic Acids Res. 42, W285 (2014).

    Article  Google Scholar 

  3. Murakami, Y. & Mizuguchi, K. Applying the naïve Bayes classifier with kernel density estimation to the prediction of protein–protein interaction sites. Bioinformatics 26, 1841–1848 (2010).

    Article  Google Scholar 

  4. Gainza, P. et al. Deciphering interaction fingerprints from protein molecular surfaces using geometric deep learning. Nat. Methods 17, 184–192 (2020).

    Article  Google Scholar 

  5. Montoya, M. A PrePPI way to make predictions. Nat. Struct. Mol. Biol. 19, 1067 (2012).

    Article  Google Scholar 

  6. Northey, T. C., Bareši, A. & Martin, A. C. R. IntPred: a structure-based predictor of protein–protein interaction sites. Bioinformatics 34, 223–229 (2018).

    Article  Google Scholar 

  7. Baranwal, M. et al. Struct2Graph: a graph attention network for structure based predictions of protein–protein interactions. Preprint at bioRxiv https://doi.org/10.1101/2020.09.17.301200 (2020).

  8. Chen, K.-H., Wang, T.-F. & Hu, Y.-J. Protein–protein interaction prediction using a hybrid feature representation and a stacked generalization scheme. BMC Bioinformatics 20, 308 (2019).

    Article  Google Scholar 

  9. Sarkar, D. & Saha, S. Machine-learning techniques for the prediction of protein–protein interactions. J. Biosci. 44, 104 (2019).

    Article  Google Scholar 

  10. Wang, Y. et al. Predicting protein interactions using a deep learning method-stacked sparse autoencoder combined with a probabilistic classification vector machine. Complexity 2018, 4216813 (2018).

    Article  Google Scholar 

  11. Kotov, N. A. Inorganic nanoparticles as protein mimics. Science 330, 188–189 (2010).

    Article  Google Scholar 

  12. Pinals, R. L., Chio, L., Ledesma, F. & Landry, M. P. Engineering at the nano–bio interface: harnessing the protein corona towards nanoparticle design and function. Analyst 145, 5090–5112 (2020).

    Article  Google Scholar 

  13. Govan, J. & Gun’ko, Y. K. Recent progress in chiral inorganic nanostructures. Nanoscience 3, 1–30 (2016).

    Article  Google Scholar 

  14. Weininger, D. SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. J. Chem. Inf. Model. 28, 31–36 (1988).

    Article  Google Scholar 

  15. Xu, L. et al. Enantiomer-dependent immunological response to chiral nanoparticles. Nature 601, 366–373 (2022).

    Article  Google Scholar 

  16. Cha, S.-H. et al. Shape-dependent biomimetic inhibition of enzyme by nanoparticles and their antibacterial activity. ACS Nano 9, 9097–9105 (2015).

    Article  Google Scholar 

  17. Ravikumar, K. M., Huang, W. & Yang, S. Coarse-grained simulations of protein–protein association: an energy landscape perspective. Biophys. J. 103, 837–845 (2012).

    Article  Google Scholar 

  18. Kmiecik, S. et al. Coarse-grained protein models and their applications. Chem. Rev. 116, 7898–7936 (2016).

    Article  Google Scholar 

  19. Wang, Y. et al. Anti-biofilm activity of graphene quantum dots via self-assembly with bacterial amyloid proteins. ACS Nano 13, 4278–4289 (2019).

    Article  Google Scholar 

  20. Acosta-Tapia, N., Galindo, J. F. & Baldiris, R. Insights into the effect of Lowe syndrome-causing mutation p.Asn591Lys of OCRL-1 through protein–protein interaction networks and molecular dynamics simulations. J. Chem. Inf. Model. 60, 1019–1027 (2020).

    Article  Google Scholar 

  21. Verma, M. K. & Shakya, S. LRP-1 mediated endocytosis of EFE across the blood–brain barrier; protein–protein interaction and molecular dynamics analysis. Int. J. Pept. Res. Ther. 27, 71–81 (2021).

    Article  Google Scholar 

  22. Li, Z. L. & Buck, M. Modified potential functions result in enhanced predictions of a protein complex by all-atom molecular dynamics simulations, confirming a stepwise association process for native protein–protein interactions. J. Chem. Theory Comput. 15, 4318–4331 (2019).

    Article  Google Scholar 

  23. Liu, Y. et al. A compact biosensor for binding kinetics analysis of protein–protein interaction. IEEE Sens. J. 19, 11955–11960 (2019).

    Article  Google Scholar 

  24. Moscetti, I., Cannistraro, S. & Bizzarri, A. R. Surface plasmon resonance sensing of biorecognition interactions within the tumor suppressor P53 network. Sensors https://doi.org/10.3390/s17112680 (2017).

  25. Verboven, C. et al. Actin-DBP: the perfect structural fit? Acta Crystallogr. D 59, 263–273 (2003).

    Article  Google Scholar 

  26. Dolinsky, T. J. et al. PDB2PQR: expanding and upgrading automated preparation of biomolecular structures for molecular simulations. Nucleic Acids Res. 35, 522–525 (2007).

    Article  Google Scholar 

  27. Kawabata, T. Detection of multiscale pockets on protein surfaces using mathematical morphology. Proteins 78, 1195–1211 (2010).

    Article  Google Scholar 

  28. Osipov, M. A., Pickup, B. T. & Dunmur, D. A. A new twist to molecular chirality: intrinsic chirality indices. Mol. Phys. 84, 1193–1206 (1995).

    Article  Google Scholar 

  29. May, A. et al. Coarse-grained versus atomistic simulations: realistic interaction free energies for real proteins. Bioinformatics 30, 326–334 (2014).

    Article  Google Scholar 

  30. Vishveshwara, S., Brinda, K. V. & Kannan, N. Protein structure: insights from graph theory. J. Theor. Comput. Chem. 1, 187–211 (2002).

    Article  Google Scholar 

  31. Bahar, I., Atilgan, A. R. & Erman, B. Direct evaluation of thermal fluctuations in proteins using a single-parameter harmonic potential. Fold. Des. 2, 173–181 (1997).

    Article  Google Scholar 

  32. Haliloglu, T., Bahar, I. & Erman, B. Gaussian dynamics of folded proteins. Phys. Rev. Lett. 79, 3090–3093 (1997).

    Article  Google Scholar 

  33. Levy, E. D., Pereira-Leal, J. B., Chothia, C. & Teichmann, S. A. 3D complex: a structural classification of protein complexes. PLoS Comput. Biol. 2, 1395–1406 (2006).

    Article  Google Scholar 

  34. Gavin, A. C. et al. Proteome survey reveals modularity of the yeast cell machinery. Nature 440, 631–636 (2006).

    Article  Google Scholar 

  35. Ye, Q., West, A. M. V., Silletti, S. & Corbett, K. D. Architecture and self-assembly of the SARS-CoV-2 nucleocapsid protein. Protein Sci. 29, 1890–1901 (2020).

    Article  Google Scholar 

  36. Romei, M. G., Lin, C., Mathews, I. I. & Boxer, S. G. Electrostatic control of photoisomerization pathways in proteins. Science 367, 76–79 (2020).

    Article  Google Scholar 

  37. Sachpatzidis, A. et al. Crystallographic studies of phosphonate-based α-reaction transition-state analogues complexed to tryptophan synthase. Biochemistry 38, 12665–12674 (1999).

    Article  Google Scholar 

  38. Ju, J., Regmi, S., Fu, A., Lim, S. & Liu, Q. Graphene quantum dot based charge-reversal nanomaterial for nucleus-targeted drug delivery and efficiency controllable photodynamic therapy. J. Biophoton. 12, e201800367 (2019).

    Article  Google Scholar 

  39. Ahmed, K. B. A., Raman, T. & Veerappan, A. Future prospects of antibacterial metal nanoparticles as enzyme inhibitor. Mater. Sci. Eng. C 68, 939–947 (2016).

    Article  Google Scholar 

  40. Unal, M. A. et al. Graphene oxide nanosheets interact and interfere with SARS-CoV-2 surface proteins and cell receptors to inhibit infectivity. Small 17, 2101483 (2021).

    Article  Google Scholar 

  41. Blanco-López, M. C. & Rivas, M. Nanoparticles for bioanalysis. Anal. Bioanal. Chem. 411, 1789–1790 (2019).

    Article  Google Scholar 

  42. Ma, W. et al. Attomolar DNA detection with chiral nanorod assemblies. Nat. Commun. 4, 2689 (2013).

    Article  Google Scholar 

  43. Kagan, V. E. et al. Carbon nanotubes degraded by neutrophil myeloperoxidase induce less pulmonary inflammation. Nat. Nanotechnol. 5, 354–359 (2010).

    Article  Google Scholar 

  44. Pinals, R. L. et al. Quantitative protein corona composition and dynamics on carbon nanotubes in biological environments. Angew. Chem. Int. Ed. 59, 23668–23677 (2020).

    Article  Google Scholar 

  45. Monopoli, M. P., Pitek, A. S., Lynch, I. & Dawson, K. A. Formation and characterization of the nanoparticle–protein corona. Methods Mol. Biol. 1025, 137–155 (2013).

    Article  Google Scholar 

  46. Madathiparambil Visalakshan, R. et al. The influence of nanoparticle shape on protein corona formation. Small https://doi.org/10.1002/smll.202000285 (2020).

  47. Faridi, A. et al. Graphene quantum dots rescue protein dysregulation of pancreatic β-cells exposed to human islet amyloid polypeptide. Nano Res. 12, 2827–2834 (2019).

    Article  Google Scholar 

  48. Wang, M. et al. Graphene quantum dots against human IAPP aggregation and toxicity: in vivo. Nanoscale 10, 19995–20006 (2018).

    Article  Google Scholar 

  49. Lin, W. et al. Control of protein orientation on gold nanoparticles. J. Phys. Chem. C 119, 21035–21043 (2015).

    Article  Google Scholar 

  50. Ma, C. D., Wang, C., Acevedo-Vélez, C., Gellman, S. H. & Abbott, N. L. Modulation of hydrophobic interactions by proximally immobilized ions. Nature 517, 347–350 (2015).

    Article  Google Scholar 

  51. Horovitz, A. Non-additivity in protein–protein interactions. J. Mol. Biol. 196, 733–735 (1987).

    Article  Google Scholar 

  52. Batista, C. A. S. et al. Nonadditivity of nanoparticle interactions. Science 350, https://doi.org/10.1126/science.1242477 (2015).

  53. Qiao, Y., Xiong, Y., Gao, H., Zhu, X. & Chen, P. Protein–protein interface hot spots prediction based on a hybrid feature selection strategy. BMC Bioinformatics 19, 14 (2018).

    Article  Google Scholar 

  54. Kyte, J. & Doolittle, R. F. A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157, 105–132 (1982).

    Article  Google Scholar 

  55. Jumper, J. M., Faruk, N. F., Freed, K. F. & Sosnick, T. R. Accurate calculation of side chain packing and free energy with applications to protein molecular dynamics. PLoS Comput. Biol. 14, e1006342 (2018).

    Article  Google Scholar 

  56. Chakrabarty, B., Naganathan, V., Garg, K., Agarwal, Y. & Parekh, N. NAPS update: network analysis of molecular dynamics data and protein–nucleic acid complexes. Nucleic Acids Res. 47, W462–W470 (2019).

    Article  Google Scholar 

  57. Chakraborty, S., Venkatramani, R., Rao, B. J., Asgeirsson, B. & Dandekar, A. M. Protein structure quality assessment based on the distance profiles of consecutive backbone Cα atoms. F1000Res. 2, 1–12 (2013).

    Article  Google Scholar 

  58. Brancolini, G. & Tozzini, V. Multiscale modeling of proteins interaction with functionalized nanoparticles. Curr. Opin. Colloid Interface Sci. 41, 66–73 (2019).

    Article  Google Scholar 

  59. Hazarika, Z. & Jha, A. N. Computational analysis of the silver nanoparticle–human serum albumin complex. ACS Omega 5, 170–178 (2020).

    Article  Google Scholar 

  60. Samal, A. et al. Comparative analysis of two siscretizations of Ricci curvature for complex networks. Sci. Rep. 8, 8650 (2018).

    Article  Google Scholar 

  61. Eidi, M. & Jost, J. Ollivier Ricci curvature of directed hypergraphs. Sci. Rep. 10, 12466 (2020).

    Article  Google Scholar 

  62. Yang, R. & Bogdan, P. Controlling the multifractal generating measures of complex networks. Sci. Rep. 10, 5541 (2020).

    Article  Google Scholar 

  63. Xiao, X., Chen, H. & Bogdan, P. Deciphering the generating rules and functionalities of complex networks. Sci. Rep. 11, 22964 (2021).

    Article  Google Scholar 

  64. Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).

    MathSciNet  MATH  Google Scholar 

  65. Chen, T. & Guestrin, C. XGBoost: a scalable tree boosting system. In Proc. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 785–794 (ACM, 2016).

  66. Cha, M. et al. Unifying structural descriptors for biological and bioinspired nanoscale complexes [source code]. Code Ocean https://doi.org/10.24433/CO.7800040.V1 (2022).

Download references

Acknowledgements

We thank the University of Michigan College of Engineering for support through the BlueSky Initiative and the University of Michigan for access to the HPC resources of the Great Lakes Cluster. Support from the Vannevar Bush DoD Fellowship to N.A.K. (‘Engineered Chiral Ceramics’ ONR N000141812876, ONR COVID-19 Newton Award ‘Pathways to Complexity with ‘Imperfect’ Nanoparticles’ HQ00342010033 and AFOSR FA9550-20-1-0265 ‘Graph Theory Description of Network Material’) is gratefully acknowledged. X.X. and P.B. gratefully acknowledge the support by the National Science Foundation Career award under grant number CPS/CNS-1453860, the NSF awards under grant numbers CCF-1837131, MCB-1936775, CNS-1932620 and CMMI-1936624, the Okawa Foundation research award, the Defense Advanced Research Projects Agency (DARPA) Young Faculty Award and DARPA Director Award under grant number N66001-17-1-4044, a 2021 USC Stevens Center Technology Advancement Grant (TAG) award, an Intel faculty award and a Northrop Grumman grant. The views, opinions, and/or findings contained in this article are those of the authors and should not be interpreted as representing the official views or policies, either expressed or implied by the Defense Advanced Research Projects Agency, the Department of Defense or the National Science Foundation.

Author information

Authors and Affiliations

Authors

Contributions

N.A.K conceived the project. M.C., E.S.T.E. and N.A.K. designed the descriptor sets and the workflow. E.S.T.E. collected and curated the protein complex dataset. M.C. analysed the protein complex data and computed the CH, GE and GT descriptors. J.-Y.K. contributed to OPD index calculation. X.X. and P.B. contributed to the computation of MFD in GT features. M.C., X.X. and P.B. designed and trained the DNN model and carried out comparative studies of different ML models. M.C. visualized the analysed data. M.C., E.S.T.E. and N.A.K co-wrote the paper. All authors contributed to data analysis, discussion and writing.

Corresponding author

Correspondence to Nicholas A. Kotov.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Computational Science thanks Ning Gu, Häkkinen Hannu and Açelya Yilmazer for their contribution to the peer review of this work. Primary Handling Editor: Ananya Rastogi, in collaboration with the Nature Computational Science team.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Figs. 1–28, Tables 1–6, Sections 1–3 and References.

Supplementary Data 1

Source data for Table 1.

Source data

Source Data Fig. 1.

Source data for distance matrix of protein 1ma9, chain A and B. Source data for computed feature values per protein (1ma9, chain A) residues.

Source Data Fig. 2.

Source data for correlation plot (a,b). Source data for distribution plot (c–e).

Source Data Fig. 3.

Source data for ML performance plot. Tenfold validation for true-positive rate data for Fig. 3b are included in the folder Fig3b_DNN_TPR_SOURCE.

Source Data Fig. 4.

Source data for true and predicted interface residue number (A and B indicate each chain of a protein in a protein complex).

Source Data Fig. 5.

Source data for predicted interface residues for protein and NPs.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cha, M., Emre, E.S.T., Xiao, X. et al. Unifying structural descriptors for biological and bioinspired nanoscale complexes. Nat Comput Sci 2, 243–252 (2022). https://doi.org/10.1038/s43588-022-00229-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/s43588-022-00229-w

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing