Limited diffusion of scientific knowledge forecasts collapse

Kang, Donghyun; Danziger, Robert S.; Rehman, Jalees; Evans, James A.

doi:10.1038/s41562-024-02041-0

Article
Published: 02 December 2024

Limited diffusion of scientific knowledge forecasts collapse

Nature Human Behaviour volume 9, pages 268–276 (2025)Cite this article

4946 Accesses
42 Altmetric
Metrics details

Subjects

Abstract

Market bubbles emerge when asset prices are driven unsustainably higher than asset values, and shifts in belief burst them. We demonstrate an analogous phenomenon in the case of biomedical knowledge, when promising research receives inflated attention. We introduce a diffusion index that quantifies whether research areas have been amplified within social and scientific bubbles, or have diffused and become evaluated more broadly. We illustrate the utility of our diffusion approach in tracking the trajectories of cardiac stem cell research (a bubble that collapsed) and cancer immunotherapy (which showed sustained growth). We then trace the diffusion of 28,504 subfields in biomedicine comprising nearly 1.9 M papers and more than 80 M citations to demonstrate that limited diffusion of biomedical knowledge anticipates abrupt decreases in popularity. Our analysis emphasizes that restricted diffusion, implying a socio-epistemic bubble, leads to dramatic collapses in relevance and attention accorded to scientific knowledge.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on SpringerLink
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 1: Representation of different diffusion levels and contrasting diffusion trajectories.**

**Fig. 2: Survival probability against bubble bursting as a function of knowledge diffusion in social space.**

**Fig. 3: Comparison of author productivity in collapsed subfields 5 and 10 years post collapse.**

**Fig. 4: The average number of new grants acknowledged in collapsed subfields by years relative to burst.**

Addressing biomedical data challenges and opportunities to inform a large-scale data lifecycle for enhanced data sharing, interoperability, analysis, and collaboration across stakeholders

Article Open access 21 February 2025

Calibrating tumor growth and invasion parameters with spectral spatial analysis of cancer biopsy tissues

Article Open access 02 October 2024

Developing and evaluating non-invasive healthcare technologies for a group of female participants from a socioeconomically disadvantaged area

Article Open access 13 December 2021

Data availability

This work uses the PubMed Knowledge Graph²⁶ (http://er.tacc.utexas.edu/datasets/ped) and the replication data from ref. ³¹ (https://www.openicpsr.org/openicpsr/project/116188/version/V1/view;jsessionid=EA1E1E5A6DAB42737EE54A5F5DD4B069). Source data are provided with this paper.

Code availability

The data and code used for the figures and tables are available in GitHub⁵⁵.

References

Arthur, W. B. Complexity in economic and financial markets. Complexity 1, 20–25 (1995).
Google Scholar
Harras, G. & Sornette, D. How to grow a bubble: a model of myopic adapting agents. J. Econ. Behav. Organ. 80, 137–152 (2011).
Google Scholar
Goldman, A. I. & Shaked, M. An economic model of scientific activity and truth acquisition. Philos. Stud. 63, 31–55 (1991).
Google Scholar
Pedersen, D. B. & Hendricks, V. F. Science bubbles. Philos. Technol. 27, 503–518 (2014).
Google Scholar
Evans, J. P., Meslin, E. M., Marteau, T. M. & Caulfield, T. Genomics. Deflating the genomic bubble. Science 331, 861–862 (2011).
CAS PubMed Google Scholar
Fortunato, S. et al. Science of science. Science 359, eaao0185 (2018).
Partha, D. & David, P. A. Toward a new economics of science. Res. Policy 23, 487–521 (1994).
Google Scholar
Small, H., Boyack, K. W. & Klavans, R. Identifying emerging topics in science and technology. Res. Policy 43, 1450–1467 (2014).
Google Scholar
Funk, R. J. & Owen-Smith, J. A dynamic network measure of technological change. Manage. Sci. 63, 791–817 (2016).
Google Scholar
Klavans, R., Boyack, K. W. & Murdick, D. A. A novel approach to predicting exceptional growth in research. PLoS ONE 15, e0239177 (2020).
CAS PubMed PubMed Central Google Scholar
Weis, J. W. & Jacobson, J. M. Learning on knowledge graph dynamics provides an early warning of impactful research. Nat. Biotechnol. 39, 1300–1307 (2021).
CAS PubMed Google Scholar
Lin, Y., Evans, J. A. & Wu, L. New directions in science emerge from disconnection and discord. J. Informetr. 16, 101234 (2022).
Google Scholar
Petersen, A. M., Pan, R. K., Pammolli, F. & Fortunato, S. Methods to account for citation inflation in research evaluation. Res. Policy 48, 1855–1865 (2019).
Google Scholar
Hutchins, B. I., Yuan, X., Anderson, J. M. & Santangelo, G. M. Relative Citation Ratio (RCR): a new metric that uses citation rates to measure influence at the article level. PLoS Biol. 14, e1002541 (2016).
PubMed PubMed Central Google Scholar
Taylor, M. & Heath, B. Years after Brigham–Harvard scandal, U.S. pours millions into tainted stem-cell field. Reuters (21 June 2022).
Anversa, P., Kajstura, J., Leri, A. & Bolli, R. Life and death of cardiac stem cells: a paradigm shift in cardiac biology. Circulation 113, 1451–1463 (2006).
PubMed Google Scholar
2009 Current Fiscal Year Report: Board of Scientific Counselors, National Institute on Aging. The Federal Advisory Committee Act (FACA) Database (Department of Health and Human Services, 2009); https://www.facadatabase.gov/FACA/apex/FACACommitteeLevelReportAsPDF?id=a10t0000001h2ObAAI
Murry, C. E. et al. Haematopoietic stem cells do not transdifferentiate into cardiac myocytes in myocardial infarcts. Nature 428, 664–668 (2004).
CAS PubMed Google Scholar
Vrotsos, L. W. Harvard Medical School requests retractions for former professor’s research. The Harvard Crimson (16 October 2018).
Oransky, I. & Marcus, A. Harvard and the Brigham call for more than 30 retractions of cardiac stem cell research. STAT News (14 October 2018).
Davis, D. R. Cardiac stem cells in the post-Anversa era. Eur. Heart J. 40, 1039–1041 (2019).
PubMed Google Scholar
Osafune, K. et al. Marked differences in differentiation propensity among human embryonic stem cell lines. Nat. Biotechnol. 26, 313–315 (2008).
CAS PubMed Google Scholar
Harris, R. Rigor Mortis: How Sloppy Science Creates Worthless Cures, Crushes Hope, and Wastes Billions (Basic Books, 2017).
Neimark, J. Line of attack. Science 347, 938–940 (2015).
CAS PubMed Google Scholar
Hughes, P., Marshall, D., Reid, Y., Parkes, H. & Gelber, C. The costs of using unauthenticated, over-passaged cell lines: how much more data do we need? Biotechniques 43, 575–586 (2007).
Google Scholar
Xu, J. et al. Building a PubMed knowledge graph. Sci. Data 7, 205 (2020).
PubMed PubMed Central Google Scholar
Teplitskiy, M., Acuna, D., Elamrani-Raoult, A., Körding, K. & Evans, J. The sociology of scientific validity: how professional networks shape judgement in peer review. Res. Policy 47, 1825–1841 (2018).
Google Scholar
Belikov, A. V., Rzhetsky, A. & Evans, J. Prediction of robust scientific facts from literature. Nat. Mach. Intell. 4, 445–454 (2022).
Google Scholar
Quaini, F. et al. Chimerism of the transplanted heart. N. Engl. J. Med. 346, 5–15 (2002).
PubMed Google Scholar
Freeman, G. J. et al. Engagement of the PD-1 immunoinhibitory receptor by a novel B7 family member leads to negative regulation of lymphocyte activation. J. Exp. Med. 192, 1027–1034 (2000).
CAS PubMed PubMed Central Google Scholar
Azoulay, P., Fons-Rosen, C. & Zivin, J. S. G. Does science advance one funeral at a time? Am. Econ. Rev. 109, 2889–2920 (2019).
PubMed PubMed Central Google Scholar
Le, Q. & Mikolov, T. Distributed representations of sentences and documents. In Proc. 31st International Conference on Machine Learning (eds Xing, E. P. & Jebara, T.) 1188–1196 (PMLR, 2014).
Laflamme, M. A. & Murry, C. E. Regenerating the heart. Nat. Biotechnol. 23, 845–856 (2005).
CAS PubMed Google Scholar
van Berlo, J. H. et al. C-kit+ cells minimally contribute cardiomyocytes to the heart. Nature 509, 337–341 (2014).
PubMed PubMed Central Google Scholar
Chien, K. R. et al. Regenerating the field of cardiovascular cell therapy. Nat. Biotechnol. 37, 232–237 (2019).
CAS PubMed Google Scholar
Mellman, I., Coukos, G. & Dranoff, G. Cancer immunotherapy comes of age. Nature 480, 480–489 (2011).
CAS PubMed PubMed Central Google Scholar
Finck, A., Gill, S. I. & June, C. H. Cancer immunotherapy comes of age and looks for maturity. Nat. Commun. 11, 3325 (2020).
CAS PubMed PubMed Central Google Scholar
Smyth, M. J. & Teng, M. W. 2018 Nobel Prize in physiology or medicine. Clin. Transl. Immunol. 7, e1041 (2018).
Google Scholar
Lin, J. & Wilbur, W. J. PubMed related articles: a probabilistic topic-based model for content similarity. BMC Bioinformatics 8, 423 (2007).
PubMed PubMed Central Google Scholar
Azoulay, P., Bonatti, A. & Krieger, J. L. The career effects of scandal: evidence from scientific retractions. Res. Policy 46, 1552–1569 (2017).
Google Scholar
Myers, K. The elasticity of science. Am. Econ. J. Appl. Econ. 12, 103–134 (2020).
Google Scholar
Reschke, B. P., Azoulay, P. & Stuart, T. E. Status spillovers: the effect of status-conferring prizes on the allocation of attention. Adm. Sci. Q. 63, 819–847 (2018).
Google Scholar
Danchev, V., Rzhetsky, A. & Evans, J. A. Centralized scientific communities are less likely to generate replicable results. eLife 8, e43094 (2019).
PubMed PubMed Central Google Scholar
Bourdieu, P. The specificity of the scientific field and the social conditions of the progress of reason. Soc. Sci. Inf. 14, 19–47 (1975).
Google Scholar
Kim, J., Wang, Z., Shi, H., Ling, H.-K. & Evans, J. Individual misinformation tagging reinforces echo chambers; collective tagging does not. Preprint at https://arxiv.org/abs/2311.11282 (2023).
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S. & Dean, J. Distributed representations of words and phrases and their compositionality. Adv. Neural Inf. Process. Syst. 26, 3111–3119 (2013).
Google Scholar
Kozlowski, A. C., Taddy, M. & Evans, J. A. The geometry of culture: analyzing the meanings of class through word embeddings. Am. Sociol. Rev. 84, 905–949 (2019).
Google Scholar
Garg, N., Schiebinger, L., Jurafsky, D. & Zou, J. Word embeddings quantify 100 years of gender and ethnic stereotypes. Proc. Natl Acad. Sci. USA 115, E3635–E3644 (2018).
CAS PubMed PubMed Central Google Scholar
Perozzi, B., Al-Rfou, R. & Skiena, S. DeepWalk: online learning of social representations. In Proc. 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 701–710 (Association for Computing Machinery, 2014).
Grover, A. & Leskovec, J. node2vec: scalable feature learning for networks. KDD 2016, 855–864 (2016).
PubMed PubMed Central Google Scholar
Rehurek, R. & Sojka, P. Software framework for topic modelling with large corpora. In Proc. LREC 2010 Workshop on New Challenges for NLP Frameworks 45–50 (Univ. of Malta, 2010).
Foster, J. G., Rzhetsky, A. & Evans, J. A. Tradition and innovation in scientists’ research strategies. Am. Sociol. Rev. 80, 875–908 (2015).
Google Scholar
Azoulay, P., Furman, J. L. & Murray, F. Retractions. Rev. Econ. Stat. 97, 1118–1136 (2015).
Google Scholar
de Solla Price, D. J. Little Science, Big Science—and Beyond (Columbia Univ. Press, 1963).
Kang, D. Limited diffusion of scientific knowledge forecasts collapse. GitHub https://github.com/Donghyun-Kang-Soc/limited_diffusion (2024).

Download references

Acknowledgements

We acknowledge funding from the Fetzer Franklin Fund in association with the 2019 MetaScience Symposium (R.S.D., J.A.E., D.K.), the Air Force Office of Scientific Research (AFOSR: FA9550-19-1-0354 and FA9550-15-1-0162) (J.A.E., D.K.) and the National Science Foundation (NSF: 1829366 and 1800956) (J.A.E., D.K.). The funders have/had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript. This work was completed in part with resources provided by the University of Chicago’s Research Computing Center. We also appreciate the support from J. Xu and Y. Ding in facilitating access to PubMed Knowledge Graph.

Author information

Authors and Affiliations

Department of Sociology, University of Chicago, Chicago, IL, USA
Donghyun Kang & James A. Evans
Knowledge Lab, University of Chicago, Chicago, IL, USA
Donghyun Kang & James A. Evans
Division of Cardiology, Department of Medicine, University of Illinois College of Medicine, Chicago, IL, USA
Robert S. Danziger & Jalees Rehman
Department of Pharmacology, University of Illinois at Chicago, Chicago, IL, USA
Robert S. Danziger
Department of Physiology and Biophysics, University of Illinois at Chicago, Chicago, IL, USA
Robert S. Danziger
Department of Biochemistry and Molecular Genetics, University of Illinois, College of Medicine, Chicago, IL, USA
Jalees Rehman
University of Illinois Cancer Center, Chicago, IL, USA
Jalees Rehman
Santa Fe Institute, Santa Fe, NM, USA
James A. Evans
Paradigms of Intelligence Team, Google, Mountain View, CA, USA
James A. Evans

Authors

Donghyun Kang
View author publications
Search author on:PubMed Google Scholar
Robert S. Danziger
View author publications
Search author on:PubMed Google Scholar
Jalees Rehman
View author publications
Search author on:PubMed Google Scholar
James A. Evans
View author publications
Search author on:PubMed Google Scholar

Contributions

D.K., R.S.D., J.R. and J.A.E. conceptualized the project. D.K. and J.A.E. developed the methodology. D.K. performed visualization. R.S.D. and J.A.E. acquired funding. D.K. and J.A.E. wrote the original draft. D.K., R.S.D., J.R. and J.A.E. reviewed and edited the manuscript.

Corresponding author

Correspondence to James A. Evans.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Human Behaviour thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Complementary 2D heatmaps for the upper panels of Fig. 1.

Values are derived from the kernel density estimations graphed in Fig. 1 for the distribution of diffusion indices in scientific (panels a and b) and social space (panels c and d), respectively.

Source data

Extended Data Fig. 2 Distribution of z_i,t from 28,504 subfields.

We set the cutoff value for bubble bursting to −2.64, the bottom 0.5% percentile. The range of \({z}_{i,t}\) is [−5.2, 5.52].

Source data

Extended Data Fig. 3 Six examples of subfields.

Annual citation counts aggregated at the subfield level, using forward citations to related publications. Top panels (a, b, c): Subfields represented by three PMIDs, illustrating cases without bubble bursting events. Bottom panels (d, e, f): Subfields that experienced bubble bursts, corresponding to cutoffs closest to the 0.5%, 0.25%, and 0.1% thresholds of the \({z}_{i,t}\) value.

Source data

Extended Data Table 1 Pairwise t-test comparing average productivity differences between near-collapse active scientists (≤2 years before collapse) and early entrants

Full size table

Extended Data Table 2 Proportion of subfields with newly acknowledged grants after collapse, and the Mean, 1st Quantile, Median, and 3rd Quartile of the number of new grants post-collapse

Full size table

Supplementary information

Supplementary Information

Supplementary Tables 1.1–1.3, 3.1–3.11 and 4.1–4.4; Figs. 2.1, 2.2, 3.1–3.3 and 4.1–4.4; and Discussion.

Reporting Summary

Peer Review File

Source data

Source Data Fig. 1

Source data.

Source Data Fig. 2

Source data.

Source Data Fig. 3

Source data.

Source Data Fig. 4

Source data.

Source Data Extended Data Fig. 1

Source data.

Source Data Extended Data Fig. 2

Source data.

Source Data Extended Data Fig. 3

Source data.

Source Data Extended Data Table 1

Source data.

Source Data Extended Data Table 2

Source data.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Kang, D., Danziger, R.S., Rehman, J. et al. Limited diffusion of scientific knowledge forecasts collapse. Nat Hum Behav 9, 268–276 (2025). https://doi.org/10.1038/s41562-024-02041-0

Download citation

Received: 15 April 2023
Accepted: 01 October 2024
Published: 02 December 2024
Issue Date: February 2025
DOI: https://doi.org/10.1038/s41562-024-02041-0

This article is cited by

Limited diffusion of scientific knowledge forecasts collapse
- Donghyun Kang
- Robert S. Danziger
- James A. Evans
Nature Human Behaviour (2024)

Subjects

Abstract

Access options

Similar content being viewed by others

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Extended Data Fig. 1 Complementary 2D heatmaps for the upper panels of Fig. 1.

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links