Fig. 1: Structure of HUPO’s Human Proteome Project.

a The HPP matrix formed by creating two major initiatives (C-HPP and B/D-HPP). The initiatives and their teams are underpinned by 4 Resource Pillars (AB, MS, KB and pathology). b The HPP KB pipeline demonstrates how MS, AB and other biological data are collected, processed, re-analysed and presented annually for FAIR (see below) use by the scientific community. MS datasets are deposited, tagged with a PXD identifier, and stored by PX repositories (PRIDE, PeptideAtlas, MassIVE, Panorama, iProX, JPOST). Data selection, extraction and re-analysis by PeptideAtlas and MassIVE results in processed data that is transmitted to neXtProt. Subsequently, neXtProt annotates and curates other biological data (like Sanger sequencing, protein : protein interaction and other structural/crystallographic data) that is aggregated, integrated and then disseminated to the community. The HUPO HPP KB uses reverse date versions (e.g., the latest 2020 neXtProt HPP reference release 17-01-2020).