Prosit-XL: enhanced cross-linked peptide identification by fragment intensity prediction to study protein interactions and structures

Kalhor, Mostafa; Saylan, Cemil Can; Picciani, Mario; Fischer, Lutz; Schimweg, Falk Boudewijn; Lapin, Joel; Rappsilber, Juri; Wilhelm, Mathias

doi:10.1038/s41467-025-61203-4

Article
Open access
Published: 01 July 2025

Nature Communications volume 16, Article number: 5429 (2025)

Abstract

It has been shown that integrating peptide property predictions such as fragment intensity into the scoring process of peptide spectrum match can greatly increase the number of confidently identified peptides compared to using traditional scoring methods. Here, we introduce Prosit-XL, a robust and accurate fragment intensity predictor covering the cleavable (DSSO/DSBU) and non-cleavable cross-linkers (DSS/BS3), achieving high accuracy on various holdout sets with consistent performance on external datasets without fine-tuning. Due to the complex nature of false positives in XL-MS, an approach to data-driven rescoring was developed that benefits from Prosit-XL’s predictions while limiting the overestimation of the false discovery rate (FDR). After validating this approach using two ground truth datasets consisting of synthetic peptides and proteins, we applied Prosit-XL on a proteome-scale dataset, demonstrating an up to ~3.4-fold improvement in PPI discovery compared to classic approaches. Finally, Prosit-XL was used to increase the coverage and depth of a spatially resolved interactome map of intact human cytomegalovirus virions, leading to the discovery of previously unobserved interactions between human and cytomegalovirus proteins.

Introduction

Crosslinking mass spectrometry (XL-MS) has emerged as a critical technology for analyzing protein complexes and protein-protein interactions (PPIs) by providing distance restraints between protein residues through the identification of cross-linked peptides (XL-peptides)¹. However, this process poses several computational challenges. The tandem mass spectrum (MS2) of an XL-peptide is more complex than that of a linear peptide because two peptides are present, which makes sequence assignment challenging. Unequal fragmentation of the two peptides can bias the total cross-linked peptide spectrum match (CSM) score². Another challenge is the large search space that results from considering all possible peptide pairs (the n-square problem), which increases the chance of false positives before false discovery rate (FDR) estimation. To maintain a fixed FDR, stricter score thresholds are required, which leads to more false negatives after FDR filtering and ultimately reduces the number of true positives. In addition, FDR estimation using the target-decoy approach is complicated by the presence of two peptides and remains a challenge in XL-MS research^3,4,5. Numerous crosslinking database search engines (XL-DBSEs) have been developed over the past decade to address these challenges, including xiSEARCH⁶, pLink 2⁷, XlinkX⁸, Kojak⁹, and Scout¹⁰. However, identifying XL-peptides remains challenging, especially for inter-protein crosslinks (between-links), where the search space is significantly larger than for intra-protein crosslinks (self-links)¹¹. One approach to improving DBSE scoring for linear peptide identification is to use post-processing rescoring tools such as Percolator¹² and PeptideProphet^13,14. These tools integrate multiple DBSE features (e.g., DBSE scores, peptide length) into a single score for FDR calculation¹⁵. The XL-MS field is no exception, with XL-DBSEs like XlinkX and Kojak applying Percolator, while pLink 2 uses a built-in SVM classifier as a rescoring tool. However, evaluating these XL-DBSEs on multiple ground truth datasets has revealed that the applied rescoring tools often suffer from overfitting, leading to suboptimal accuracy in FDR calculation^10,16,17.

In addition to DBSE features, integrating accurate peptide property predictors, such as Prosit^18,19, a fragment intensity predictor, and DeepLC²⁰, a retention time (RT) predictor, into the rescoring process has been shown to substantially increase the number of confidently identified linear peptides compared to relying only on DBSE features. This improvement is particularly evident in challenging fields such as immunopeptidomics¹⁹, single-cell proteomics²¹, and metaproteomics¹⁸. In the XL-MS field, tools like xiRT²², a retention time predictor for XL-peptides, and pDeepXL²³, a predictor of fragment ion intensity for XL-peptides, have been developed to enhance XL-peptide identifications. However, these tools have limitations, including (1) often requiring fine-tuning for rescoring implementation, (2) yielding modest improvement in CSM and PPI level identification rates after rescoring, and (3) needing a more user-friendly design for rescoring tasks. Additionally, further investigation using ground truth datasets is necessary to fully validate their effectiveness.

In this study, we introduce Prosit-XL, an expanded version of Prosit¹⁹, developed through transfer learning for fragment intensity prediction of XL-peptides. We treat each XL-peptide as two separate peptides, which allows for data augmentation by using each CSM twice during training. Additionally, Prosit-XL inherits its collision energy (CE) awareness from Prosit, circumventing the need for transfer learning on unseen data. We have integrated Prosit-XL into our user-friendly, data-driven rescoring pipeline Oktoberfest²⁴. It has been shown that the lower score of the two peptides in an XL-peptide can serve as a strong parameter for distinguishing correct from incorrect matches^2,9. Here, we adopt a similar approach but introduce a novel aspect by running Percolator on the peptide spectrum match (PSM) level, here referring to an individual peptide of an XL-peptide, rather than on the CSM level, referring to the entire XL-peptide. We aggregate the intensity-based scores into a single score by using the minimum Percolator-optimized PSM-level score of the two PSMs associated with a CSM as a proxy for its quality, a strategy that more effectively separates correct from incorrect matches and substantially increases the number of identified cross-linked peptides. Prosit-XL and the rescoring pipeline are validated on full and partial ground truth datasets containing synthetic peptides and proteins, respectively. Next, we benchmark Prosit-XL’s performance on a combined large-scale dataset containing E. coli and M. pneumoniae, comparing it to xiSEARCH/xiFDR. Finally, we apply Prosit-XL and the rescoring pipeline to real-world data from intact human cytomegalovirus to demonstrate their capability to increase the depth and coverage of XL-peptides required for comprehensive protein structure and PPI discovery and analysis.

Results

Accurate fragment intensity prediction by Prosit-XL

Due to the absence of synthetic data on the scale required for our applications, we had to rely on public data. Multiple public XL-MS datasets from PRIDE²⁵ were downloaded, focusing on cleavable (disuccinimidyl sulfoxide, DSSO^26,27,28; disuccinimidyl dibutyric urea, DSBU²⁹) and non-cleavable (disuccinimidyl suberate, DSS^{26,28,30,31,32}; bissulfosuccinimidyl suberate, BS3³⁰) cross-linkers. Datasets containing MS2 spectra of cleavable (CMS2) and non-cleavable (NMS2) XL-peptides were analyzed by pLink 2, while XlinkX was used for MS3 spectra of cleavable XL-peptides (CMS3). This resulted in 125,727 CMS2, 70,320 NMS2, and 37,938 CMS3 identified high-quality spectra containing ~31,000, ~17,000, and ~9000 unique XL-peptides (peptide pairs), respectively (Fig. 1a). The CMS2 and NMS2 spectra were acquired by higher-energy collisional dissociation (HCD) fragmentation, while CMS3 spectra were obtained using collision-induced dissociation (CID).

**Fig. 1: Data collection and collision energy calibration for refining Prosit to Prosit-XL.**

As highlighted earlier³³, a main factor that can substantially affect fragment ion intensities in HCD fragmentation is the normalized CE (NCE). The effective NCE can drift, so it can vary even on the same mass spectrometer despite using the same ostensible CE. To enable NCE-dependent prediction of fragment intensities, the acquired data need to be calibrated. This is particularly difficult for XL-MS datasets because replicate peptide spectra across datasets, which could have been used to detect drifts in NCE, are lacking. We therefore propose detecting shifts in NCE using contaminant linear peptides within the datasets. These identified peptides are used to estimate the NCE at which the predictions best match the acquired spectra (NCE calibration). The estimate is obtained by comparing the spectra of the highest-scoring PSMs to predictions made by the HCD Prosit 2020 model at different NCEs (Methods). The NCE at which the highest normalized spectral angle (SA) is observed is taken as the optimal NCE for prediction (Fig. 1b). The optimal NCEs for NMS2 and CMS2 spectra fall within the ranges of 28 to 40 and 17 to 37, respectively (Fig. 1c, d). Note that NCE calibration is not necessary for CMS3 spectra, which were acquired using CID fragmentation.
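The calibration idea can be illustrated with a minimal sketch. It assumes the commonly used definition of the normalized spectral angle, SA = 1 − 2·arccos(cos θ)/π on L2-normalized intensity vectors, and it aggregates by the mean SA per NCE; both choices, as well as the names `spectral_angle`, `calibrate_nce`, and the stand-in predictor `toy_predict`, are assumptions for illustration and not the implementation described in Methods.

```python
import numpy as np

def spectral_angle(observed, predicted):
    """Normalized spectral angle in [0, 1]; 1 means identical intensity patterns."""
    obs = np.asarray(observed, dtype=float)
    pred = np.asarray(predicted, dtype=float)
    obs_norm = np.linalg.norm(obs)
    pred_norm = np.linalg.norm(pred)
    if obs_norm == 0 or pred_norm == 0:
        return 0.0
    cosine = np.clip(np.dot(obs, pred) / (obs_norm * pred_norm), -1.0, 1.0)
    return float(1.0 - 2.0 * np.arccos(cosine) / np.pi)

def calibrate_nce(observed_spectra, predict, nce_grid):
    """Return the NCE with the highest mean SA, plus the full NCE -> mean SA curve.

    `predict(index, nce)` is a hypothetical callable returning the predicted
    intensity vector for spectrum `index` at collision energy `nce`.
    """
    mean_sa = {}
    for nce in nce_grid:
        angles = [spectral_angle(obs, predict(i, nce))
                  for i, obs in enumerate(observed_spectra)]
        mean_sa[nce] = float(np.mean(angles))
    best_nce = max(mean_sa, key=mean_sa.get)
    return best_nce, mean_sa

def toy_predict(index, nce):
    """Toy stand-in for a real predictor: the intensity pattern changes with NCE."""
    scale = nce / 30.0
    return np.array([1.0, scale, scale ** 2]) * (index + 1)

# Simulate spectra that were effectively acquired at NCE 30.
observed = [toy_predict(i, 30) for i in range(3)]
best_nce, curve = calibrate_nce(observed, toy_predict, range(20, 41, 2))
print(best_nce)  # 30 for this toy setup
```

In practice the grid would be scanned with a real intensity predictor in place of `toy_predict`, and the selected NCE would then be passed as a model input for the corresponding MS file.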

The collected XL-MS data is substantially smaller than the dataset used for training Prosit, which was trained using the ProteomeTools synthetic peptide library³⁴. This is in part due to the complex nature of analyzing XL-peptides, e.g., challenging fragmentation, identification, and lower abundance in samples. The collected CSMs are equivalent to only ~2 % of the PSMs used for training the HCD Prosit 2020 model¹⁹ (Supplementary Fig. 1a). This is not sufficient to train a model from scratch because that would require a substantial reduction in model size. We propose using transfer learning, the same approach used in developing pDeepXL, where a pre-trained model on a similar task is adapted and further trained for a new, related task. Here, we use Prosit¹⁹ as a starting point to extend it and apply transfer learning for XL-peptides.

To make an informed decision on the required adjustments to the Prosit model, two crucial questions need to be answered: Are the fragment ion intensities of peptide A influenced by a cross-linked peptide B, and, if so, do they depend on the sequence of peptide B? To address this, we first compared Prosit’s predictions to the fragment ion intensities of XL-peptides. The results (Supplementary Fig. 1b) show that Prosit’s predictions agree somewhat with intensities extracted from CMS3 spectra (e.g., median SA of ~0.51 for Alkene-CMS3) but agree very poorly with those from CMS2 (e.g., median SA of ~0.35 for DSSO-CMS2) and NMS2 spectra (e.g., median SA of ~0.40 for DSS/BS3-NMS2). This suggests that the presence of a peptide B influences the fragmentation characteristics more than the presence of a crosslinker. However, Prosit’s performance is still better than random, as estimated by Prosit’s performance on decoy XL-peptides (median SA of ~0.01, Supplementary Fig. 1c). Second, we assessed the similarity of MS2 fragment intensity patterns of XL-peptides that share the same peptide A but differ in peptide B (Fig. 1e), for many different peptides A. Taken together, the two peptides of an XL-peptide do indeed influence each other’s fragmentation characteristics, which implies that peptide B must be considered as a separate input to a model in order to reach good prediction performance. This is not the case for CMS3, since only one of the two peptides is fragmented to acquire a CMS3 spectrum.

In order to accurately reflect the requirements of the various XL-peptides, various adjustments to the architecture of Prosit are required (Fig. 2a). One additional encoder (Encoder 2) was added to handle the input of peptide B for both Prosit-XL-CMS2 and Prosit-XL-NMS2 models. Moreover, an extra decoder (Decoder 2) was added for Prosit-XL-CMS2 since DSSO and DSBU have two unstable sites, resulting in two distinct types of XL fragments: b-short and y-short, with a shorter segment of the crosslinker, and b-long and y-long, with a longer segment of the crosslinker. In order to maximize the benefits of transfer learning, Prosit-XL is trained to predict only the fragment ion intensities of peptide A, whereas the encoder for peptide B only “modulates” the intensities of peptide A. This further allowed us to utilize each spectrum in the training and test set twice, resulting in a twofold increase in the effective training dataset size. To obtain predictions for peptide B, peptides A and B are swapped. A schematic representation of the best-performing Prosit-XL-CMS2 and Prosit-XL-NMS2 architecture with more details is shown in Supplementary Fig. 2a. Since CMS3 does not require any modifications to the base architecture, the architecture of Prosit-XL-CMS3 is the same as that of CID Prosit 2020 model.
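The swap-based reuse of each spectrum can be sketched as follows. This is a minimal illustration rather than the actual training code: the record type `TrainingExample`, the function `expand_csm`, and the toy sequences and intensities are hypothetical.

```python
from typing import List, NamedTuple, Sequence, Tuple

class TrainingExample(NamedTuple):
    peptide_a: str               # peptide whose fragment intensities are the target
    peptide_b: str               # partner peptide, used only as a modulating input
    target: Tuple[float, ...]    # observed fragment intensities of peptide_a

def expand_csm(seq_1: str, seq_2: str,
               intensities_1: Sequence[float],
               intensities_2: Sequence[float]) -> List[TrainingExample]:
    """Turn one CSM into two examples by swapping the roles of the two peptides."""
    return [
        TrainingExample(seq_1, seq_2, tuple(intensities_1)),
        TrainingExample(seq_2, seq_1, tuple(intensities_2)),
    ]

# Toy CSM with made-up sequences and intensities.
examples = expand_csm("AKDEFR", "GHKLMR", [0.2, 1.0, 0.4], [0.9, 0.1, 0.3])
for example in examples:
    print(example.peptide_a, example.peptide_b, example.target)
```

The same swap applies at inference time: predictions for the second peptide are obtained by presenting it as peptide A with the first peptide as peptide B.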

**Fig. 2: Accurate fragment ion intensity prediction of XL-peptides by Prosit-XL.**

After finalizing the architecture of these models, we initialized the Prosit-XL-CMS2 model with the weights from the HCD Prosit 2020 model and trained it on the collected and calibrated CMS2 data. Subsequently, the Prosit-XL-NMS2 model was initialized with the weights from Prosit-XL-CMS2 and trained on the collected and calibrated NMS2 data (Supplementary Fig. 2b). The Prosit-XL-CMS3 model was initialized using the CID Prosit 2020 model weights and trained on the corresponding CMS3 data. After training, the median SA on the holdout set improved by 0.31 to 0.48, depending on the type of spectra (Fig. 2b). It should be emphasized that SAs are measured separately for peptides A and B. Interestingly, the performance of the Prosit-XL models on peptide B is slightly better than on peptide A, likely due to differences in peptide length: on average, peptide B is shorter than peptide A, and shorter peptides are generally less challenging for Prosit-XL to predict (Supplementary Fig. 2c). To benchmark Prosit-XL’s generalization, the Prosit-XL-CMS2 and Prosit-XL-NMS2 models were applied, without any additional transfer learning, to two distinct external datasets of synthetic peptides^16,17 that were cross-linked by DSSO and DSS. Initially, xiSEARCH and pLink were used to identify CSMs. To ensure optimal prediction performance, NCE calibration was performed to find the optimal NCE for each MS file, which was then used as input for the model (Methods). Prosit-XL-CMS2 achieved a median SA of 0.82 and a median Pearson correlation (PCC) of 0.95, showing remarkably consistent performance compared to the holdout set (Fig. 2c). The performance of Prosit-XL-NMS2 is slightly below that on the holdout set (median SA: 0.77, median PCC: 0.89), which may indicate additional experimental factors not considered in the proposed architecture of Prosit-XL. These results suggest that no additional transfer learning is needed for Prosit-XL to achieve state-of-the-art performance.

Further, pDeepXL was applied to these two datasets to compare the performance of Prosit-XL and pDeepXL. On CSMs supported by both models, Prosit-XL was more accurate on the dataset that used DSSO as a crosslinker, achieving an SA of 0.82 and a PCC of 0.95, compared to an SA of 0.74 and a PCC of 0.85 for pDeepXL. However, both models showed almost identical performance on the synthetic dataset that used DSS as a crosslinker, where Prosit-XL achieved an SA of 0.75 and a PCC of 0.90, while pDeepXL achieved an SA of 0.76 and a PCC of 0.87 (Supplementary Fig. 2d, Methods). To visually demonstrate Prosit-XL’s performance, mirror plots of a cleavable and a non-cleavable XL-peptide are displayed in Fig. 2d, e, showing strong agreement between experimental (top spectra) and predicted fragment ion intensities of the annotated b- and y-ions (bottom spectra).

Evaluating Prosit-XL with ground truth benchmark datasets

In line with observations on linear peptides^18,19,35,36, we hypothesized that integrating fragment intensity predictions into CSM scoring would improve the differentiation between true and false positive CSMs in target-target (TT) identifications, leading to a substantial increase in confidently identified CSMs at a given FDR threshold compared to using XL-DBSE scores alone. To test this, we extended our open-source data-driven rescoring pipeline, Oktoberfest²⁴, for CSM rescoring. It consumes predictions from the Prosit-XL models, which are served by Koina³⁷, an open-source online prediction service. Briefly, the rescoring process (Fig. 3a) starts with reading MS files and unfiltered search results of supported XL-DBSEs (Methods). Using unfiltered search results allows Oktoberfest to reassess all provided matches, including previously rejected (non-confidently identified) but potentially true positive identifications. After NCE calibration, Oktoberfest calculates a range of intensity-based features, separately for peptides A and B (Supplementary Table 1), that assess the similarity between spectra predicted at the optimal NCE and the corresponding experimental spectra.

**Fig. 3: Overview of the rescoring pipeline and its results on ground truth datasets.**

However, for FDR estimation in XL-MS, a single score is required that reflects the quality of a CSM in order to effectively separate correct from incorrect matches. While tools such as Percolator are optimized to combine multiple features, as generated by, e.g., Oktoberfest, into a single score, the complex nature of false positives (e.g., target-decoy matches) and often an insufficient number of matches for robust machine learning in XL-MS prevent its direct application.

We propose an approach in which we use Percolator solely to generate an optimized score for each peptide precursor in an XL-peptide separately by running it on the PSM level rather than on the CSM level, which is possible because Prosit-XL generates predictions for each peptide separately. When CSMs are split into two separate PSMs (one each for peptides A and B), the clear notion of a target and a decoy match remains. Further, the overall PSM-level score distribution of matches follows the behavior expected for linear peptides (Supplementary Fig. 3a). The result is a score that is optimized to separate correct from incorrect PSMs. Because a CSM is incorrect when at least one of its two PSMs is incorrect, we take the minimum of the Percolator-optimized PSM-level scores of the two PSMs associated with a CSM as a proxy for the quality of that CSM (Supplementary Note 1 and Supplementary Fig. 4). Finally, the CSMs and their corresponding scores are submitted to xiFDR for FDR estimation (Methods).
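The aggregation step described above reduces to taking a minimum over the two PSM-level scores. The sketch below assumes the PSM scores have already been produced by Percolator; the function name `csm_scores` and the toy identifiers and values are hypothetical.

```python
from typing import Dict, Iterable, Tuple

def csm_scores(psm_pairs: Iterable[Tuple[str, float, float]]) -> Dict[str, float]:
    """Map each CSM id to the lower of its two PSM-level scores.

    `psm_pairs` yields (csm_id, score_peptide_a, score_peptide_b). A CSM is only
    as good as its weaker peptide, so the minimum serves as the CSM-level score.
    """
    return {csm_id: min(score_a, score_b) for csm_id, score_a, score_b in psm_pairs}

# Toy scores: csm_2 has one well-matched and one poorly matched peptide.
toy_pairs = [
    ("csm_1", 2.4, 1.9),
    ("csm_2", 3.1, -0.8),
    ("csm_3", -1.2, -0.5),
]
scores = csm_scores(toy_pairs)
print(scores)  # {'csm_1': 1.9, 'csm_2': -0.8, 'csm_3': -1.2}
```

With a sum or mean, `csm_2` would rank well despite its poorly matched peptide; the minimum pushes it toward the low-scoring region where incorrect matches are expected.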

We evaluated our rescoring pipeline on two recently published, distinct full and partial ground truth XL-MS datasets to verify whether the FDR estimates are well calibrated. One of these datasets contains synthetic peptides, which are grouped and cross-linked by DSSO. Each link between synthetic peptides from different groups, or involving unknown peptides, is considered a false positive, allowing precise determination of the experimentally validated proportion of accepted discoveries that are incorrect (actual FDR)¹⁷. We first analyzed the data using xiSEARCH followed by xiFDR, resulting in the identification of 1395 CSMs and 789 peptide pairs at an estimated FDR of 1% on CSM and peptide pair level, while the actual FDR was 1.18% and 1.65% at the CSM and peptide pair levels, respectively (Fig. 3b). With the rescoring pipeline, the number of identifications improved modestly, by 14% (to 1591) on CSM level and by 12% (to 884) on peptide pair level. However, we also observed an increase in actual FDR to 2.53% at the CSM level and 3.67% at the peptide pair level. The slightly worse accuracy in FDR estimation may be the result of the small dataset size, which contains only 100 synthetic ground truth peptides. Overall, the results are in line with (and largely below) the reported FDR estimates of other software, including MeroX³⁸, MS Annika³⁹, XlinkX, pLink 2, MaxLynx⁴⁰, and xiSEARCH/xiFDR, whose 1% FDR estimates result in an actual FDR of 5.7%, 2.7%, 4.4%, 4.0%, 2.2%, and 3.2% at the unique residue pair (UXL) level (Methods), respectively. Further, we compared our rescoring pipeline to xiSEARCH at an actual FDR of 1%. The results improved after rescoring, with the number of identified CSMs increasing from 1175 to 1216 and the number of peptide pairs increasing from 641 to 651 (Supplementary Fig. 3b, Methods).

To further investigate the FDR estimate, we applied rescoring to a larger and more recent dataset¹⁰. Briefly, this dataset contains hundreds of recombinant proteins that were separately mixed and cross-linked by DSSO. Besides comparing CSM and peptide pair FDR estimates, this dataset also enabled us to verify the PPI-level FDR estimates of the rescoring pipeline. Since we did not have access to Scout’s FDR calculator as a standalone tool, we applied xiFDR to both Scout’s unfiltered results (Scout+xiFDR) and the rescoring results (Scout+Prosit-XL+xiFDR) to ensure a fair comparison. Filtered at 1% FDR on CSM, peptide pair, and PPI level (Methods), both pipelines produced less than 1% false positives (FP), as estimated from the known incorrect interactions: 0.53%, 0.57%, and 0.67% for rescoring and 0.63%, 0.66%, and 0.96% for Scout+xiFDR on CSM, peptide pair, and PPI level, respectively (Fig. 3c). Encouragingly, rescoring increased the number of CSMs, peptide pairs, and PPIs (between-links) by 34.9%, 33.4%, and 42.7%, respectively, compared to Scout+xiFDR. Next, we compared the rescoring results with Scout using its native FDR estimation (without xiFDR). Although Scout identified 24.9% more PPIs than the rescoring pipeline, the actual FDR for PPIs uniquely identified by Scout was 3.7%, whereas it was 0% for the PPIs identified only by rescoring (Supplementary Fig. 3c). The FDR for PPIs identified by both methods was 0.76%.

Evaluating Prosit-XL with large-scale datasets and extensive search space

Next, we benchmarked the rescoring pipeline on an even larger dataset that more closely resembles the quality and complexity of real data. To retain some level of control over the FDR estimation, we re-analyzed two distinct XL-MS experiments, investigating PPIs in E. coli⁵ and M. pneumoniae⁴¹, in a combined xiSEARCH run (Fig. 4a). Any identified E. coli-M. pneumoniae PPI, any E. coli-E. coli PPI supported by spectra from the M. pneumoniae dataset, and any M. pneumoniae-M. pneumoniae PPI supported by spectra from the E. coli dataset must be considered a false positive (mismatch); together, these provide a lower bound estimate of the actual FDR. This approach is similar to entrapment searches, but some false-positive within-species matches will remain and cannot be accounted for in the estimate of the actual FDR of the analysis.
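The mismatch rule can be written down compactly. The following is a minimal sketch in which the species labels, the tuple layout, and the function names `is_mismatch` and `mismatch_rate` are hypothetical; it only illustrates how a lower bound on the actual FDR follows from counting known-impossible matches.

```python
from typing import Iterable, Tuple

Match = Tuple[str, str, str]  # (species of protein 1, species of protein 2, species of source dataset)

def is_mismatch(species_1: str, species_2: str, dataset_species: str) -> bool:
    """A match is a known false positive if it links two species, or if a
    within-species match is supported by spectra from the other species' dataset."""
    return species_1 != species_2 or species_1 != dataset_species

def mismatch_rate(matches: Iterable[Match]) -> float:
    """Fraction of accepted matches that are mismatches (lower bound on the actual FDR)."""
    matches = list(matches)
    if not matches:
        return 0.0
    return sum(is_mismatch(*match) for match in matches) / len(matches)

toy_matches = [
    ("ecoli", "ecoli", "ecoli"),    # plausible
    ("ecoli", "mpneu", "ecoli"),    # cross-species: mismatch
    ("mpneu", "mpneu", "ecoli"),    # wrong source dataset: mismatch
    ("mpneu", "mpneu", "mpneu"),    # plausible
]
print(mismatch_rate(toy_matches))  # 0.5
```

The rate is only a lower bound because an incorrect within-species match from the correct dataset is indistinguishable from a true one under this rule.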

**Fig. 4: Evaluation of Prosit-XL versus large-scale datasets and extensive search space.**

The improvement after rescoring was substantial, with the number of identified between-links increasing ~5.6-, ~5.7-, and ~3.4-fold on CSM, peptide pair, and PPI level, respectively, compared to applying only xiSEARCH+xiFDR (Fig. 4b). Despite this substantial increase in identifications, the number of mismatches remained very low: out of 2431 CSMs, only 13 mismatches (0.53%); out of 1671 peptide pairs, 6 mismatches (0.35%); and out of 517 PPIs, 4 mismatches (0.77%). This is because the features generated by Prosit-XL, in combination with the Percolator-based CSM score generation approach, can effectively separate true positive target-target matches from false positive target-target matches due to an improved scoring of target-decoy (in part representing TP-FP) matches (Fig. 4c). As visible in the marginal histograms, the score distribution of target-decoy matches aligns much better with the false positive portion of the target-target matches when rescoring is used than with xiSEARCH alone. This is because the Percolator-based CSM score takes the minimum of the individual PSM scores, so a single incorrect peptide in an XL-peptide leads to an overall poor score irrespective of whether the other peptide is in fact a true positive match. This leads to a shift in the score cutoff necessary to achieve 1% FDR, resulting in a larger portion of matches surviving the FDR threshold. This experiment demonstrates the capability of the rescoring pipeline compared to xiSEARCH in an extremely large search space while maintaining control over the FDR. To the best of our knowledge, this is the highest increase in identifications after CSM rescoring reported to date. Further, we compared our rescoring pipeline to xiSEARCH by applying a lower bound estimate of the actual FDR of 1% (Methods). The results still show substantial improvement after rescoring, with the number of identified CSMs increasing from 1711 to 3389, peptide pairs from 1147 to 2378, and PPIs from 390 to 678 (Supplementary Fig. 5a).

Additionally, we examined the number of expected true positive matches by calculating #TT - (#TD - #DD) after the rescoring process for all analyzed datasets in this study (Supplementary Fig. 5b). The results clearly show that the estimated number of true positives is highest in the high-scoring region and, as expected for Percolator, drops to near zero around a score of 0. At lower scores, the number of estimated true positives remains around zero, indicating that the scoring approach and the use of machine learning did not introduce any unintended bias (i.e., artificially separating targets from decoys). This also suggests that the decoys (TD and DD) provide a reliable estimate for the number of false positive targets.
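The per-score-bin estimate #TT − (#TD − #DD) can be reproduced with a few lines of code. This is a minimal sketch: the bin width, the tuple layout, and the function name `expected_true_positives` are assumptions chosen for illustration, and the toy scores are made up.

```python
import math
from collections import Counter
from typing import Dict, Iterable, Tuple

def expected_true_positives(matches: Iterable[Tuple[float, str]],
                            bin_width: float = 1.0) -> Dict[float, int]:
    """Estimate true positives per score bin as #TT - (#TD - #DD).

    `matches` yields (score, label) with label in {"TT", "TD", "DD"}.
    Keys of the result are the lower edges of the score bins.
    """
    counts: Dict[float, Counter] = {}
    for score, label in matches:
        bin_start = math.floor(score / bin_width) * bin_width
        counts.setdefault(bin_start, Counter())[label] += 1
    return {
        bin_start: c["TT"] - (c["TD"] - c["DD"])
        for bin_start, c in sorted(counts.items())
    }

toy = [
    (2.5, "TT"), (2.1, "TT"), (2.2, "TD"),                 # high-scoring bin
    (-0.5, "TT"), (-0.2, "TD"), (-0.7, "TD"), (-0.9, "DD"),  # low-scoring bin
]
print(expected_true_positives(toy))  # {-1.0: 0, 2.0: 1}
```

An estimate hovering around zero in the low-scoring bins, as in the toy example, is the behavior described above for a score that does not artificially separate targets from decoys.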

Prosit-XL-assisted rescoring increases coverage and depth of 3D structure information and PPI mapping

In our final case study, we assessed the benefits of our rescoring pipeline for analyzing protein 3D structures and PPIs by applying it to a dataset that aimed to resolve the interactome map of intact human cytomegalovirus virions⁴². Briefly, we used xiSEARCH followed by the Prosit-XL-assisted rescoring as described earlier and compared the results to the original XlinkX analysis. The total number of unique interactions was 2427 using XlinkX (1% FDR at UXL level) and increased to 2910 with xiSEARCH+Prosit-XL+xiFDR (Fig. 5a). The biggest increases were observed at the human intra-protein level (1399 and 220 out of 2910), likely a result of the improved sensitivity with which UXLs can be detected by our pipeline. At the UXL level, rescoring showed a ~1.5-fold increase (from 4789 to 7396 UXLs) over XlinkX for human intra-protein connections (self-links) (Supplementary Fig. 6a). Similarly, our rescoring pipeline was compared to the xiSEARCH+xiFDR results. The most notable increase was observed for inter-protein interactions, with an approximately 3-fold increase (from 424 to 1203 inter-PPIs, Supplementary Fig. 6b). This was accompanied by a ~2.3-fold increase at the UXL level (from 2396 to 5460 UXLs for inter-PPIs, Supplementary Fig. 6b). Both xiSEARCH+xiFDR and xiSEARCH+Prosit-XL+xiFDR applied FDR in a more conservative manner at the CSM, peptide pair, and PPI level.

**Fig. 5: Evaluation of Prosit-XL for analyzing 3D protein structures and protein-protein interactions.**

To further investigate the gains and losses, we assessed the PPIs (inter- and intra-PPI) gained, lost, and shared between xiSEARCH+Prosit-XL+xiFDR and XlinkX for each category (Supplementary Fig. 6c). In general, rescoring added 548 new PPIs that were not detected by XlinkX. However, we also observed a loss of 351 PPIs (Fig. 5b, upper left Venn diagram), which are supported by 403 UXLs that our pipeline did not identify. Further, we investigated the gained, shared, and lost UXLs of PPIs shared between our pipeline and XlinkX. While rescoring led to a gain of 2235 UXLs, we also observed a loss of 1044 UXLs (Fig. 5b, upper right Venn diagram). Despite the improvements in identified PPIs and UXLs, the number of lost UXLs (1447 in total) is rather high compared to the losses observed in the earlier analyses. To investigate the reason, we checked whether the UXLs identified uniquely by XlinkX appeared in the unfiltered search results of xiSEARCH. About 89% of these UXLs were absent from them (Fig. 5b, bottom Venn diagram), indicating that only 155 were lost because they did not survive our conservative FDR cutoffs at all levels.

The increase in UXLs leads to the recovery of more PPIs at various cutoffs for the minimum number of UXLs required to call a PPI, a filter commonly applied to remove one-hit wonders. This increase ranges from 30% for PPIs with at least 1 UXL to 37.5% for PPIs that are supported by at least 10 UXLs (Supplementary Fig. 6d). This is confirmed by the observation that, on average, a PPI is supported by about 1.32 times as many UXLs (linear regression model y = 1.32x + 0.07) using our rescoring workflow (Fig. 5c). As a result, the number of interaction partners identified for each protein, e.g., UL32, UL25, and UL83, increases (Supplementary Fig. 6e, scatter plot). UL83 and UL25 are major tegument proteins in HCMV and play essential roles in viral assembly. UL83 is crucial for tegument formation, where it helps stabilize the virion structure⁴³. UL25, on the other hand, serves as a hub for assembling other viral proteins into the maturing virion, making it an organizing center during the virion maturation process⁴⁴. It is reported that UL83 facilitates the incorporation of UL25 into mature viral particles⁴⁵. The interaction between UL25 and UL83 is supported by our results, in which 62 UXLs were identified for this interaction using rescoring and 44 UXLs using XlinkX (Supplementary Fig. 6e, Table). Additionally, it has been suggested that host proteins such as Grb2 and DDX3 depend on UL83 for their incorporation into virions upon viral infection⁴⁶. However, whether DDX3X interacts directly with UL83 remains unclear. It is also worth mentioning that HCMV infection enhances the expression of Grb2 and DDX3X, facilitating viral replication and spread. As a result, DDX3X has emerged as a target for antiviral therapies due to its critical role in infection^46,47. In our results, we observed an increase in UXLs for the UL83-DDX3X interaction: we found 34 UXLs using rescoring, while XlinkX shows 15 UXLs (Fig. 5d). Similarly, our results support the interaction between UL25 and DDX3X, increasing the number of UXLs from the 8 previously reported with XlinkX to 21 with rescoring and revealing a larger interaction area (Fig. 5d). These findings suggest that there may be a more complex relationship between UL83, UL25, and DDX3X.

The UXL results show that the dataset contains a substantial number of self-links, which we evaluated separately for viral and human proteins. In the original study, the viral UXLs were evaluated using the self-links of the protein UL55 as a specific example⁴². According to that study, the distance between cross-linked residues should be under 40 Å⁴². We examined whether our results improve the detection of UXLs for this protein under this rule of thumb. Our analysis identified 44 UXLs (36 were reported in the XlinkX analysis), 9 of which exhibit distances greater than 40 Å in the post-fusion structure. Notably, all of these UXLs adhered to the acceptable threshold in the pre-fusion structure (Fig. 5e). The term “fusion” here refers to the merging of the viral and host cell membranes mediated by glycoproteins, a critical step in herpesvirus infection. When comparing post-fusion to pre-fusion data, our findings align with the original paper (Supplementary Note 2).

To evaluate the human self-link UXLs, we defined two metrics: link distance and average pLDDT (predicted local distance difference test). Link distance was calculated as the Euclidean distance between the α-carbons of the cross-linked lysine residues, using the protein structures retrieved from the EBI-AlphaFold2 (AF2) database⁴⁵. The average pLDDT was calculated as the mean AF2 pLDDT local confidence value of the two linked lysine residues, yielding one value per UXL. A pLDDT value above 70 is defined in AF2 as a “high confidence” prediction and indicates that the protein backbone structure is predicted well⁴⁵. Thus, we categorized our results into four distinct groups based on thresholds for distance (40 Å) and average pLDDT (70) (Supplementary Fig. 6f). Our analysis demonstrated that the majority (~86%) of the UXLs were observed at a distance of <40 Å. Moreover, the highest density was detected for distance <40 Å and average pLDDT >70, indicating that most detected UXLs resided within the “high confidence” predicted regions, at a distance short enough to establish a linkage. Notably, the number of UXLs detected at <40 Å with average pLDDT <70 indicates that the pLDDT estimate of AF2 may not be as well calibrated as expected and underpins the value of orthogonal information provided by XL-MS for resolving protein structures⁴⁸.
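The two metrics and the four-group categorization can be sketched as follows. This is a minimal illustration, not the authors' analysis script: the function names and group labels are hypothetical, and the coordinates and pLDDT values are assumed to have been extracted from the AF2 structure files beforehand.

```python
import math

DISTANCE_CUTOFF = 40.0  # Å, distance threshold stated in the text
PLDDT_CUTOFF = 70.0     # average pLDDT threshold stated in the text


def link_distance(ca_a, ca_b):
    """Euclidean distance between two C-alpha coordinates given as (x, y, z)."""
    return math.dist(ca_a, ca_b)


def categorize_uxl(distance, plddt_a, plddt_b):
    """Assign a self-link UXL to one of four groups (labels are illustrative)."""
    average_plddt = (plddt_a + plddt_b) / 2.0
    distance_group = "<40A" if distance < DISTANCE_CUTOFF else ">=40A"
    confidence_group = "pLDDT>70" if average_plddt > PLDDT_CUTOFF else "pLDDT<=70"
    return distance_group, confidence_group
```

For example, `categorize_uxl(link_distance((0, 0, 0), (3, 4, 0)), 90, 80)` places a link of 5 Å between two confidently predicted residues in the short-distance, high-confidence group.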

Discussion

In this study, we introduce Prosit-XL, a deep learning model that can predict the fragment ion intensities of XL-peptides with high accuracy. We evaluated it on various experimental holdout sets, demonstrating its strong generalizability to new, unseen data. Further, Prosit-XL was integrated into a data-driven rescoring pipeline which is, to the best of our knowledge, the only rescoring pipeline that includes CSM rescoring. For this, an approach was developed that splits CSMs into PSM-level entries to allow the effective combination of our intensity-based features into a single score by Percolator. The rescoring pipeline was analyzed using synthetic ground truth datasets, providing insight into its FDR estimation performance. Finally, the pipeline was applied to real-world datasets, resulting in substantial gains on CSM, UXL, and PPI levels, showing its utility and benefits in assisting protein 3D structure elucidation and interactome mapping at organism scale.

There has been notable progress in generating ground truth datasets in the XL-MS field, which is a positive step forward for benchmarking methods. However, small and less complex datasets, such as the analyzed synthetic peptide dataset, can pose challenges for benchmarking machine learning–based tools. These datasets may lack the necessary complexity to effectively benchmark tools for FDR calibration. Future work would benefit from incorporating larger and more complex datasets to enable more reliable benchmarking and further advance the field.

An important consideration when using post-processing tools such as Percolator is the potential for overfitting when the same data is used for both model training and scoring. While earlier versions of Percolator addressed this with cross-validation, this strategy did not fully prevent information leakage. The more recent RESET⁴⁹ approach improved upon this. While this strategy focuses on avoiding data leakage from decoys, all targets are still present during the training and scoring steps, and although our empirical results did not indicate any noticeable bias, the possibility of (subtle) residual biases cannot be entirely excluded. This remains an area where further methodological improvements could enhance the robustness of post-processing workflows, specifically for cross-linking.

Despite Prosit-XL’s high accuracy, several potential improvements, beyond the scope of this current work, could enhance its performance even further. One potential improvement is incorporating separate charge information for each peptide, instead of the current use of one charge for the entire XL-peptide precursor. Current XL-DBSEs (e.g., pLink 2) do not provide accurate individual peptide charges due to the complexity of separating charges in XL-peptides. This is a burden for the model since it has to learn how to correctly estimate the charge state of each peptide, in addition to predicting fragment ion intensities. Additionally, MS2 spectra in crosslinking experiments are often acquired using stepped CE, where multiple CEs are applied in a stepwise manner during fragmentation. This can negatively impact the NCE calibration process, as finding a single optimal CE per MS file for Prosit-XL may not be ideal for capturing the effects of the multiple CEs used in practice. More research is required to estimate the importance of stepped CE in XL-peptide fragmentation and to develop corresponding calibration methods. However, a lack of ground truth systematic data does not allow a comprehensive investigation at this moment.

Although not demonstrated in this work, there is no technical reason why Prosit-XL could not be adapted to other spectra or other peptide properties, such as retention time. A potential improvement to further extend Prosit-XL’s ability to predict spectra may be to accommodate other fragmentation techniques such as MS2-CID and MS2-electron-transfer dissociation⁵⁰ (ETD or EThcD), different fragment ion types (e.g., neutral losses), additional cross-linkers (e.g., DHSO⁵¹, DMTMM⁵¹), and different cross-linked residues¹ (e.g., S, T, Y). Extending the model’s capabilities in these areas could lead to even more comprehensive and accurate predictions.

As highlighted by our analysis, another future improvement in analyzing XL-MS datasets is the application of rescoring on search results from multiple XL-DBSEs, as losses observed when using different search engines may be attributed to differences in processing, rather than differences in confidence estimation. Although XL-DBSEs often identify overlapping sets of XL-peptides and PPIs, there are still cases that are uniquely identified by each XL-DBSE. Because our rescoring approach projects any CSM from any XL-DBSE to the same feature space, alternative explanations from different search engines can be compared, and the best can be picked. Taking advantage of this could lead to further substantial gains, increasing depth and coverage of XL experiments. Likewise, previous work in localizing PTMs has shown that true positive modified peptides may not be present as the highest scoring match produced by a DBSE. Because Prosit-XL takes both peptides into account for prediction, in combination with data-driven rescoring, a better ranking of XL-peptides may yield further gains in the future.

Overall, Prosit-XL represents a valuable advancement in XL-MS, not least because of its direct integration into Koina and Oktoberfest, which enables any scientist to benefit from deep-learning-assisted data analysis. Its ability to provide intensity predictions for complex datasets and enhance the identification of XL-peptides and PPIs through rescoring makes it a powerful tool for studying protein structures and protein-protein interactions, particularly for in vivo studies, as exemplified by the exceptional performance on the two-species mix dataset. As the field continues to evolve, and MS focusing on linear peptides already strongly relies on high-quality predictions, we are convinced that Prosit-XL’s capabilities and integration into rescoring will be key for advancing XL-MS and thus our understanding of protein interactions in vivo at proteome-scale.

Methods

Training data

In the process of data collection, we used 11 publicly available datasets: (1) DSBU: PXD012546; (2) DSSO: PXD019926, PXD017711, PXD011861; (3) DSS/BS3: PXD017620, PXD016554, PXD019926, PXD017695, PXD014675, PXD008550. MS2 spectra were acquired by HCD fragmentation, followed by analysis using the Orbitrap with high resolution and mass accuracy, except PXD019926, which also contains CID MS3 spectra. MS2 spectra were searched by pLink 2⁷, a high-speed search engine for proteome-scale identification of XL-peptides that is therefore well suited for generating large-scale XL benchmark datasets, which are valuable for deep learning models. pLink 2 was used with the following parameters: carbamidomethylation on cysteine and oxidation on methionine as fixed and variable modifications, respectively; peptide masses ranging from 600 to 6000 Da; precursor and fragment mass search tolerance set to 10 and 20 ppm; maximum allowed missed cleavages set to 3; crosslink specificities at lysine residues and protein N-termini; FDR set at 0.5% at the CSM level. The extracted spectra were further filtered to remove those meeting any of the following conditions: peptide length (peptide A or B) less than 6 or greater than 30, precursor charge greater than 6, or number of matched peaks less than the length of each peptide. Ultimately, the top 10 CSMs for each unique XL-peptide were kept to avoid too much redundancy. All extracted spectra were annotated: all expected b, y, b-xl, and y-xl ions for NMS2 spectra and b, y, b-short, y-short, b-long, and y-long ions for CMS2 spectra, with charges up to 2 for CMS2 and 3 for NMS2 spectra, were calculated and matched against experimentally acquired fragment peaks. Matching tolerances were 20 ppm for FTMS. Ultimately, the annotated spectra were split into three distinct sets: training (80%), validation (10%), and holdout set (10%). To minimize data leakage, if an XL-peptide pair A-B is included in the training set, neither peptide A nor peptide B can be in the validation or holdout set. Regarding NCE, all MS files were analyzed by MS Amanda⁵² with default parameters to identify linear peptides, which were then used for optimal NCE estimation. Specifically, the top 1000 PSMs were chosen based on MS Amanda’s score and were compared to spectra predicted by HCD Prosit 2020 at different NCEs ranging from 18 to 49. The optimal NCE was determined by identifying the NCE at which the highest SA was observed.
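The leakage constraint on the split can be illustrated with a short sketch. It is one possible way to satisfy the constraint, assuming peptides are grouped into connected components (any two peptides that share a pair stay together) and components are then assigned greedily to the three sets. The function name and the greedy assignment are illustrative choices, not the procedure used to build the published sets.

```python
import random
from collections import defaultdict


def split_pairs(pairs, fractions=(0.8, 0.1, 0.1), seed=0):
    """Split (peptide_a, peptide_b) pairs so that no peptide occurs in two sets."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving keeps trees shallow
            x = parent[x]
        return x

    # Union the two peptides of every pair: connected peptides must stay together.
    for a, b in pairs:
        parent[find(a)] = find(b)

    # Group pairs by the root of their connected component.
    groups = defaultdict(list)
    for pair in pairs:
        groups[find(pair[0])].append(pair)

    components = list(groups.values())
    random.Random(seed).shuffle(components)
    components.sort(key=len, reverse=True)  # place large components first

    names = ("train", "validation", "holdout")
    targets = dict(zip(names, (f * len(pairs) for f in fractions)))
    sets = {name: [] for name in names}
    for component in components:
        # Greedy: give the component to the set furthest below its target size.
        name = max(names, key=lambda n: targets[n] - len(sets[n]))
        sets[name].extend(component)
    return sets
```

Because whole components are assigned at once, the resulting proportions only approximate 80/10/10 when some components are large.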

Input and output of Prosit-XL

Inputs to the model are peptide sequence A, peptide sequence B, NCE, and precursor charge. Peptide sequences are encoded as integer vectors of length 30, with each integer representing a specific amino acid, and fed to an embedding layer. For cross-linked lysine residues, a unique integer is assigned to indicate the crosslinker, depending on the crosslinker type. Sequences shorter than 30 amino acids are padded with zeros. The precursor charge is represented using one-hot encoding. Prosit-XL’s output is an annotated spectrum, represented as a tensor. Ion intensities are normalized continuous values. A CMS2 and an NMS2 spectrum are represented by a 348-dimensional vector (y/b/ys/bs/yl/bl ions, 3 charges, 29 fragment ions) and a 174-dimensional vector (y/b/yxl/bxl ions, 3 charges, 29 fragment ions), respectively, ordered as follows: y1 (1+), y1 (2+), y1 (3+), b1 (1+), b1 (2+), b1 (3+), y2 (1+) and so on. The type of fragment ion can change based on the position of the crosslinker. For example, if the crosslinker is attached to the first amino acid in a non-cleavable XL-peptide, b1 actually represents b-xl 1, indicating a modified b-ion.
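The input encoding and the output ordering can be sketched as follows. The integer codes for amino acids and crosslinkers are hypothetical placeholders (the actual Prosit-XL alphabet is not given here), as are the function names; only the length of 30, the zero padding, the one-hot charge, and the y/b ordering with three charges over 29 fragment positions (29 × 2 × 3 = 174 for NMS2) follow the description above.

```python
MAX_LENGTH = 30
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Hypothetical integer codes; 0 is reserved for padding.
ALPHABET = {aa: i + 1 for i, aa in enumerate(AMINO_ACIDS)}
# Hypothetical codes marking the cross-linked residue, one per crosslinker type.
CROSSLINKER_TOKENS = {"DSSO": 21, "DSBU": 22, "DSS": 23}

ION_ORDER = ("y", "b")
N_CHARGES = 3


def encode_peptide(sequence, link_position, crosslinker):
    """Encode a peptide as 30 integers; link_position is 1-based."""
    if not 0 < len(sequence) <= MAX_LENGTH:
        raise ValueError("peptide length must be between 1 and 30")
    codes = [ALPHABET[aa] for aa in sequence]
    codes[link_position - 1] = CROSSLINKER_TOKENS[crosslinker]
    return codes + [0] * (MAX_LENGTH - len(codes))


def one_hot_charge(charge, max_charge=6):
    """One-hot encode the precursor charge (1..max_charge)."""
    return [1 if c == charge else 0 for c in range(1, max_charge + 1)]


def nms2_index(ion, number, charge):
    """Position of an ion in the 174-dimensional NMS2 vector.

    Order: y1 (1+), y1 (2+), y1 (3+), b1 (1+), b1 (2+), b1 (3+), y2 (1+), ...
    """
    per_fragment = len(ION_ORDER) * N_CHARGES
    return (number - 1) * per_fragment + ION_ORDER.index(ion) * N_CHARGES + (charge - 1)
```

With this layout, `nms2_index("b", 29, 3)` is 173, the last entry of the vector.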

Prosit-XL architecture

Encoders 1 and 2: Encoders 1 and 2 each contain an embedding layer, followed by two bi-directional recurrent neural networks (BDN) with gated recurrent units (GRU), connected to an attention layer. The recurrent layers use 512 memory cells each. Latent space: The latent space of each encoder is 512 units for each amino acid token. Encoder 3: The precursor charge and NCE encoder is a single dense layer followed by dropout. The latent vectors from Encoders 1 and 2 are first multiplied elementwise. The resulting product is then multiplied with the output of Encoder 3. Decoders 1 and 2: Both decoders consist of a one-layer BDN of length 29 with GRUs. It is important to note that Decoder 2 is specifically developed for CMS2 spectra covering y-long and b-long fragments.

Prosit-XL training process

We applied transfer learning using the HCD Prosit 2020 and CID Prosit 2020 weights as starting points and then trained these using the CMS2 and CMS3 training sets to develop Prosit-XL-CMS2 and Prosit-XL-CMS3, respectively. The model weights of Prosit-XL-CMS2 were used as the starting point for the development of Prosit-XL-NMS2 using the NMS2 training set. To control for overfitting, early stopping was employed on the validation set scores, employing a patience of 20 epochs. The holdout set was used after the model was fully trained to evaluate its generalization and potential biases. The loss function was the normalized spectral contrast loss. We used the Adam optimizer with a cyclic learning rate algorithm. During training, the learning rate cycled between a constant lower limit of 0.00001 and an upper limit of 0.0002, which is continuously scaled by a factor of 0.95 with the “triangular” mode. The model was trained with a batch size of 2000³⁶.
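The learning-rate schedule can be sketched as below. Two details are assumptions, since the text does not state them: the half-cycle length `step_size` is a hypothetical parameter, and the factor of 0.95 is applied to the upper limit once per completed cycle.

```python
def triangular_lr(iteration, step_size, base_lr=1e-5, max_lr=2e-4, decay=0.95):
    """Triangular cyclic learning rate with a decaying upper limit.

    The rate rises linearly from base_lr to the current upper limit over
    step_size iterations and falls back over the next step_size iterations.
    """
    cycle = iteration // (2 * step_size)            # completed cycles (0-based)
    x = abs(iteration / step_size - 2 * cycle - 1)  # 1 at cycle ends, 0 at the peak
    upper = max(base_lr, max_lr * decay ** cycle)   # assumed: scaled once per cycle
    return base_lr + (upper - base_lr) * max(0.0, 1.0 - x)
```

With `step_size=100`, the rate starts at 0.00001, peaks at 0.0002 at iteration 100, returns to 0.00001 at iteration 200, and peaks at 0.00019 at iteration 300.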

Prosit-XL’s performance on synthetic peptide datasets

Synthetic dataset cross-linked by DSSO

All 3 MS files (replicates 1, 2, and 3) were downloaded from the PRIDE repository with the identifier PXD029252. MS files were searched using xiSEARCH with the following parameters: report_top_ranking_only: false, delta_score_filter: false, enzymes: trypsin, missed_cleavages: 2, min_peptide_length: 6, max_peptide_length: 30, isotope_error_ximpa: 2, noncovalent_peptides: true, threads: 20, ms1_tol: 10 ppm, ms2_tol: 10 ppm, top_n_alpha_scores: 10, top_n_alpha_beta_scores: 10, crosslinker: {name: DSSO, mass: 158.0038, specificity: K}, conservative_n_multi_loss: 3, denoise_alpha: {top_n: 10, bin_size: 100}, denoise_alpha_beta: {top_n: 20, bin_size: 100}, fragmentation: {nterm_ions: b, cterm_ions: y, add_precursor: true, max_nloss: 4, match_missing_monoisotopic: true}, max_var_protein_mods: 2, max_modified_peps: 20, modification1: {name: cm, specificity: C, type: fixed, composition: C2H3N1O1}, modification2: {name: ox, specificity: M, type: variable, composition: O1}. Next, xiFDR was applied with the following parameters: the FDR level for CSMs, peptide pairs, residue pairs, and protein pairs was set to 1%, without boosting. The identified CSMs were verified against the provided groups to remove potential false-positive TTs. Finally, the final list of identified CSMs was submitted to Prosit-XL-CMS2. The NCE calibration was performed by Prosit-XL-CMS2. The SAs were calculated separately for peptide A and B (Fig. 2c).

Synthetic dataset cross-linked by DSS

The MS file was downloaded from the PRIDE repository with the identifier PXD014337 and analyzed by pLink⁵³ with the following parameters: crosslink mass: 138.068, monolink mass: 156.079, crosslinker reactivity: K-K, fixed modification: Carbamidomethyl, variable modification: Oxidation, enzyme: trypsin, max missed cleavages: 3, min peptide mass: 500, max peptide mass: 6000, min peptide length: 5, max peptide length: 60, MS1 tolerance (ppm): 5, MS2 tolerance (ppm): 20, FDR: 1% at PSM level. The identified CSMs were verified against the provided groups to remove potential false-positive TTs. Subsequently, we applied extra filtering, removing CSMs with scores less than 0.03, and applied Prosit-XL-NMS2. The NCE calibration was performed by Prosit-XL-NMS2. The SAs were calculated separately for peptide A and B (Fig. 2c).

Comparison of Prosit-XL and pDeepXL’s performance on synthetic peptide datasets

All identified CSMs described in the previous section (Fig. 2c) were submitted to pDeepXL for prediction. Some CSMs were removed due to pDeepXL’s limitations, such as restrictions on peptide length. For predicting MS/MS spectra of the synthetic peptide dataset linked by DSSO, the following pDeepXL parameters were used: instrument: QEHF, NCE_low: 21, NCE_medium: 27, NCE_high: 33, and crosslinker: DSSO. For the dataset linked by DSS, the parameters were: instrument: QEHFX, NCE_low: 0, NCE_medium: 28, NCE_high: 0, and crosslinker: DSS. It should be emphasized that SAs and PCCs were measured separately for peptide A and peptide B, and only for CSMs that both pDeepXL and Prosit-XL could predict (Supplementary Fig. 2d).

General rescoring pipeline

The rescoring pipeline (Fig. 3a) requires MS2 spectra files, either in RAW or mzML format, and the unfiltered output of an XL-DBSE (xiSEARCH or Scout) as inputs, the latter containing both target (TTs) and decoy (TDs/DDs) CSMs. Annotation of MS2 spectra is then performed by calculating all potential b- and y-ions for CMS2 spectra (b, y, b-short, y-short, b-long, y-long) and NMS2 spectra (b, y, b-xl, y-xl) with charges up to 2 and 3 for CMS2 and NMS2, respectively. These potential fragments are matched against the experimentally acquired fragment peaks with a 20 ppm mass tolerance for FTMS. Next, optimal NCEs are determined by calibrating Prosit-XL to each provided MS file. Specifically, the 20 highest-scoring CSMs are selected, and the NCE given to Prosit-XL as input is then varied over a reasonable range (18 to 49). The NCE that leads to the highest SA between predicted and acquired spectra is used as Prosit-XL’s input for that MS file. With the prediction from Prosit-XL at the optimal NCE, Oktoberfest then generates ~150 features per CSM (separately calculated for peptide A and B). The list of features with their corresponding descriptions is provided in Supplementary Table 1. Rescoring is performed on the PSM-level, where each peptide with its corresponding ~75 features is submitted to Percolator (v3.6.1). Percolator is only used to aggregate the features into a single score, and no q-value or other FDR estimate is taken from it. The final CSM-score is constructed by taking the minimum PSM-level Percolator discriminant for each CSM. Additionally, rescoring was also performed using the latest version of Percolator (v3.7.1), with the results presented in Supplementary Fig. 7. It is crucial to highlight that the order of peptides A and B in the xiSEARCH or Scout output does not affect Prosit-XL’s performance, the features generated by Oktoberfest, or the final Percolator scores. Ultimately, CSM, peptide pair, and PPI level FDR were estimated using xiFDR³, where all FDR levels (PSM, peptide pair, residue pairs, and protein pairs) are set to 1% without boosting. FDR calculations for self and between links are done separately using FDR = (TD-DD) / TT. It is important to note that CSMs provided by xiFDR are unique CSMs, meaning that for any given combination of peptide pair, modifications, link sites, and charge state, only the top-scoring one is reported. More information about xiFDR can be found at https://github.com/Rappsilber-Laboratory/xiFDR.
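Three steps of this pipeline lend themselves to a compact sketch: NCE calibration by maximizing SA, building the CSM score as the minimum of the two PSM-level scores, and the FDR formula. This is an illustration under stated assumptions, not the Oktoberfest or xiFDR implementation. The SA is written in the commonly used normalized form (1 − 2·arccos(cosine)/π), the mean SA over the calibration CSMs is assumed as the selection criterion, `predict_at` is a hypothetical callable standing in for a Prosit-XL prediction request, and the clamp of the FDR at zero is an added safeguard.

```python
import math


def spectral_angle(predicted, observed):
    """Normalized spectral angle in [0, 1]; 1 means identical intensity patterns."""
    dot = sum(p * o for p, o in zip(predicted, observed))
    norm = math.sqrt(sum(p * p for p in predicted)) * math.sqrt(sum(o * o for o in observed))
    if norm == 0:
        return 0.0
    cosine = max(-1.0, min(1.0, dot / norm))
    return 1.0 - 2.0 * math.acos(cosine) / math.pi


def calibrate_nce(observed_spectra, predict_at, nce_range=range(18, 50)):
    """Return the NCE (18..49) with the highest mean SA over the calibration CSMs.

    predict_at(nce) is a hypothetical callable returning one predicted
    intensity vector per observed spectrum.
    """
    def mean_sa(nce):
        predictions = predict_at(nce)
        angles = [spectral_angle(p, o) for p, o in zip(predictions, observed_spectra)]
        return sum(angles) / len(angles)

    return max(nce_range, key=mean_sa)


def csm_score(score_peptide_a, score_peptide_b):
    """A CSM is only as good as its weaker peptide: take the minimum PSM score."""
    return min(score_peptide_a, score_peptide_b)


def estimate_fdr(csms, threshold):
    """FDR = (TD - DD) / TT among CSMs scoring at or above the threshold.

    csms is a list of (label, score) with label in {"TT", "TD", "DD"}.
    """
    counts = {"TT": 0, "TD": 0, "DD": 0}
    for label, score in csms:
        if score >= threshold:
            counts[label] += 1
    if counts["TT"] == 0:
        return 0.0
    return max(0.0, (counts["TD"] - counts["DD"]) / counts["TT"])
```

Taking the minimum means a CSM scores well only if both of its peptides are well supported, and it makes the CSM score independent of the order of peptides A and B.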

Application of Prosit-XL and rescoring

Rescoring of synthetic peptide dataset cross-linked by DSSO

After running xiSEARCH on 3 MS files, rescoring was applied to each replicate separately, with all features in Oktoberfest config file set to false. The number of identified CSMs and peptide pairs (both self- and between-links), by xiSEARCH+xiFDR and xiSEARCH+Prosit-XL+xiFDR, shown in Fig. 3b, represents the average of CSMs and peptide pairs per replicate. The experimentally validated FDR is calculated using the following formula: FDR = (TTs not within the same XL group) / (TTs total) for each replicate. The experimentally validated FDR in Fig. 3b shows the average actual FDR per replicate.

Applying an actual FDR of 1% to the synthetic peptide dataset cross-linked by DSSO

All CSMs provided by Oktoberfest were submitted to xiFDR with an initial FDR of 100%. Next, for the files generated by xiFDR at the CSM and peptide pair levels, all TDs and DDs were removed. The remaining TT CSMs and peptide pairs were then sorted once based on the Percolator score and once based on the xiSEARCH score. Finally, an actual FDR of 1% was applied using the formula: FDR = (TTs not within the same XL group)/(total TTs).
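The experimentally validated FDR and the 1% cutoff on it can be sketched as follows. This is a minimal illustration with hypothetical function names; each TT match is assumed to carry a flag stating whether both peptides belong to the same XL group, and the cutoff keeps the largest top-scoring set whose actual FDR does not exceed the limit.

```python
def experimental_fdr(same_group_flags):
    """FDR = (TTs not within the same XL group) / (total TTs)."""
    if not same_group_flags:
        return 0.0
    wrong = sum(1 for same_group in same_group_flags if not same_group)
    return wrong / len(same_group_flags)


def accept_at_actual_fdr(scored_matches, max_fdr=0.01):
    """Keep the largest top-scoring set of TTs whose actual FDR is <= max_fdr.

    scored_matches is a list of (score, same_group) tuples.
    """
    ranked = sorted(scored_matches, key=lambda match: match[0], reverse=True)
    wrong = 0
    accepted = 0
    for rank, (_, same_group) in enumerate(ranked, start=1):
        if not same_group:
            wrong += 1
        if wrong / rank <= max_fdr:
            accepted = rank
    return ranked[:accepted]
```

Running the same function once with Percolator scores and once with xiSEARCH scores gives the two sets that are compared at an equal actual FDR.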

Rescoring of synthetic protein dataset cross-linked by DSSO

MS files were kindly shared with us by the Liu lab and were analyzed using Scout (v.1.4.14) with the following parameters: add contaminants = false; add decoys = true; fragment bin tolerance = 0.02; fragment bin offset = 0; max fragment bin m/z = 1800; min fragment bin m/z = 200; deconvolution for MS searching = false; deconvolution for ion pair = true; crosslinker = DSSO; target n-term = true; reaction residuals = k; enzyme = trypsin; enzyme specificity = FullySpecific; Isotopic possibilities precursor = 1; min peptide length = 6; max peptide length = 60; max variable modification per peptide = 2; min peptide mass = 500; max peptide mass = 6000; max miscleavages = 3; ppm error on MS1 level = 10; ppm error on MS2 level = 20; static modification = Carbamidomethyl; variable modification = Oxidation on Methionine; FDR 100% on all levels. The final Scout score (Classification score) was used as input for xiFDR (all levels set to 1% without boosting). After rescoring the unfiltered result of Scout, the Percolator score was used as xiFDR input. The actual FDR is calculated by FDR = (FPs)/(FPs + TPs). For TP identification, proteins need to be in the same group and batch. Notably, only identified between-links are presented in Fig. 3c. For more details, see ref.¹⁰.

Rescoring of large-scale dataset cross-linked by DSSO

Two distinct subsets of large-scale datasets containing E. coli (JPST000845) and M. pneumoniae (PXD017711) were analyzed together using xiSEARCH. The analysis used the same parameters as those used for the synthetic XL-peptide dataset. To control for FDR estimation, a combined database search including both E. coli and M. pneumoniae protein sequences was performed. In this combined search, any proposed interaction between E. coli and M. pneumoniae protein sequences identified by a CSM/UXL was considered a FP identification, labeled as mismatch. Similar to the synthetic XL-peptide dataset, xiFDR was used twice after the rescoring process, once with the xiSEARCH score and once with the Percolator score as input. It is important to note that only identified between-links are shown in Fig. 4b.

Applying a lower bound estimate of the actual FDR of 1% to the large-scale dataset cross-linked by DSSO

All CSMs provided by Oktoberfest were submitted to xiFDR with an initial FDR of 100%. Next, for the files generated by xiFDR at the CSM, peptide pair, and PPI levels, all TDs and DDs were removed. The remaining TT CSMs, peptide pairs, and PPIs were then sorted once based on the Percolator score and once based on the xiSEARCH score. Finally, a lower bound estimate of the actual FDR of 1% was applied.

Rescoring of human cytomegalovirus dataset cross-linked by DSSO

The dataset with the identifier PXD031911 was analyzed by xiSEARCH with the same parameters as those used for the synthetic dataset (linked by DSSO). Our evaluation of Prosit-XL’s performance involved comparing UXLs identified at 1% CSM-, peptide pair-, and PPI-level FDR by xiSEARCH+Prosit-XL+xiFDR with those identified at 1% UXL-level FDR by XlinkX. The comparison was done at two levels: PPIs and UXLs. PPIs were defined as a combination of inter-protein-protein interactions where both proteins are different and intra-protein-protein interactions where there is a sequence overlap on the peptide pair. The same definitions and approaches were applied to the xiSEARCH+xiFDR search results. For the PPI-level comparison, the gene names of the UXL were taken and sorted to unify naming between xiSEARCH+Prosit-XL+xiFDR and XlinkX (example PPI: (“UL25”, “UL83”)). For the UXL level, the protein link position was included for the respective proteins (example UXL: (“UL25”, “84”), (“UL83”, “557”)). To tackle the complexities of ambiguous UXLs, we implemented a standardized selection process. The dataset paper previously provided results from XlinkX along with a selection of UXLs for ambiguities⁴². Initially, we utilized the XlinkX results to determine which proteins were involved in ambiguous UXLs. We then aligned the UXLs from xiSEARCH+Prosit-XL+xiFDR with those obtained from XlinkX. Following this alignment, we selected the UXL based on those provided by XlinkX for the corresponding matched UXL. In cases where multiple UXLs were possible but did not align between the two methods, we chose the first candidate based on an alphabetically sorted list for xiSEARCH+Prosit-XL+xiFDR results. An initial comparative analysis was performed at the PPI level. In this context, PPIs were categorized as “gain” if they appeared only in the results from xiSEARCH+Prosit-XL+xiFDR, “shared” if they were found in both xiSEARCH+Prosit-XL+xiFDR and XlinkX results, and “loss” if they were present only in XlinkX results. A subsequent comparative analysis focused exclusively on the “shared” PPIs. We extracted all UXLs from these shared PPIs for comparison. In the final analysis, we combined the UXLs identified for the “loss” PPIs from the first analysis with the UXLs defined for “loss” UXLs (those appearing only in XlinkX) from the second analysis. These combined UXLs were then compared against the unfiltered results from xiSEARCH. Additionally, we explored all combinations of “Human” and “Viral” interaction types (Human-Human, Human-Viral, and Viral-Viral) while utilizing all PPIs and UXLs from xiSEARCH+Prosit-XL+xiFDR and XlinkX. The resulting findings were visualized using a Venn diagram (see Fig. 5b and Supplementary Fig. 6a, b). Regarding AF2-based UXL distance measurement, the human protein structures were extracted from the EBI-AF2 database with reference proteome UP000005640. This database contains 23,391 predicted structures. With these settings, we were able to calculate distances for 97% of the defined self-links for human proteins. For the human-human self-link UXL distance measurement, we utilized the Euclidean distance between atomic coordinates. The UXLs’ atomic coordinates were obtained using BioPython PDB or CIF file parsers. As all AF2 predictions only provide chain A, we measured the distance within only one chain. The distance measurements for UL55 were conducted using post-fusion (PDB: 7KDD) and pre-fusion (PDB: 7KDP) conformations. Since the UL55 structure is a trimer, UXL interactions can potentially take place between any of the chains. Consequently, distance measurements were conducted for UXL interactions located in all possible chain combinations. If the measurements for all the combinations showed a distance of more than 40 Å, we re-calculated the same pairs using the opposite structural conformation to validate the suitability of the UXL. Subsequently, these UXL interactions were manually evaluated using PyMOL.
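The naming unification and the “gain”, “shared”, and “loss” categories can be sketched with sorted tuples and set operations. The function names are hypothetical, and the sketch assumes ambiguous UXLs have already been resolved as described above.

```python
def ppi_key(gene_a, gene_b):
    """Order-independent PPI identifier, e.g. ("UL25", "UL83")."""
    return tuple(sorted((gene_a, gene_b)))


def uxl_key(gene_a, position_a, gene_b, position_b):
    """Order-independent UXL identifier, e.g. (("UL25", "84"), ("UL83", "557"))."""
    return tuple(sorted(((gene_a, position_a), (gene_b, position_b))))


def compare(rescored, reference):
    """Categorize identifiers found by rescoring versus a reference result."""
    rescored, reference = set(rescored), set(reference)
    return {
        "gain": rescored - reference,    # only in the rescored results
        "shared": rescored & reference,  # in both results
        "loss": reference - rescored,    # only in the reference results
    }
```

The same `compare` function works at both levels: pass PPI keys for the first analysis and the UXL keys of the shared PPIs for the second.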

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The MS files containing CMS2 spectra of the training, validation, and holdout sets are available via the PRIDE repository with the identifiers PXD012546, PXD017711, PXD019926, and PXD011861. For the NMS2 spectra, the relevant identifiers are PXD017620, PXD016554, PXD019926, PXD017695, PXD014675, and PXD008550. The MS files for CMS3 spectra can be found under the identifier PXD019926. The training, validation, and holdout sets are available on Zenodo. The Triton-compatible formats of Prosit-XL-CMS2, Prosit-XL-CMS3, and Prosit-XL-NMS2 have been deposited on Zenodo and can be downloaded via the following links: https://zenodo.org/records/10277646, https://zenodo.org/records/10281001, and https://zenodo.org/records/11259344, respectively. The MS files for synthetic peptide datasets using DSSO and DSS cross-linkers can be found under the identifiers PXD029252 and PXD014337, respectively, while the synthetic protein dataset is available with identifier PXD042173. Additionally, the E. coli and M. pneumoniae datasets were sourced from jPOST identifier JPST000845 and PRIDE identifier PXD017711, respectively. The MS files of the human cytomegalovirus dataset were downloaded from PRIDE with the identifier PXD031911. The unfiltered search engine results, FASTA files, rescoring results, and RAW files of the human cytomegalovirus dataset have been deposited in the PRIDE repository with the identifier PXD057705. Regarding supplementary data files, the names of the MS files used for data collection (training, validation, and holdout set) as well as those used in the rescoring process for each dataset are listed in Supplementary Data 1. The output of xiFDR (identified CSMs, peptide pairs, and PPIs) for the synthetic peptide dataset, synthetic protein dataset, E. coli and M. pneumoniae dataset, and human cytomegalovirus dataset is available in Supplementary Data 2–5, respectively. Source data are provided with this paper.

Code availability

Source code and scripts are available on GitHub at https://github.com/wilhelm-lab/koina and https://github.com/wilhelm-lab/oktoberfest. The Oktoberfest repository is released under the MIT License, with all original license and copyright information retained. Attribution to reused components and dependencies is provided within the repository. Custom scripts for data analysis (e.g., cross-link distance calculation) were implemented in Python and are available upon request.

References

O’Reilly, F. J. & Rappsilber, J. Cross-linking mass spectrometry: methods and applications in structural, molecular and systems biology. Nat. Struct. Mol. Biol. 25, 1000–1008 (2018).
Trnka, M. J., Baker, P. R., Robinson, P. J. J., Burlingame, A. L. & Chalkley, R. J. Matching cross-linked peptide spectra: only as good as the worse identification. Mol. Cell. Proteom. MCP 13, 420–434 (2014).
Fischer, L. & Rappsilber, J. Quirks of error estimation in cross-linking/mass spectrometry. Anal. Chem. 89, 3829–3833 (2017).
Walzthoeni, T. et al. False discovery rate estimation for cross-linked peptides identified by mass spectrometry. Nat. Methods 9, 901–903 (2012).
Lenz, S. et al. Reliable identification of protein-protein interactions by crosslinking mass spectrometry. Nat. Commun. 12, 3564 (2021).
Mendes, M. L. et al. An integrated workflow for crosslinking mass spectrometry. Mol. Syst. Biol. 15, e8994 (2019).
Chen, Z.-L. et al. A high-speed search engine pLink 2 with systematic evaluation for proteome-scale identification of cross-linked peptides. Nat. Commun. 10, 3404 (2019).
Liu, F., Lössl, P., Scheltema, R., Viner, R. & Heck, A. J. R. Optimized fragmentation schemes and data analysis strategies for proteome-wide cross-link identification. Nat. Commun. 8, 15473 (2017).
Hoopmann, M. R. et al. Kojak: efficient analysis of chemically cross-linked protein complexes. J. Proteome Res. 14, 2190–2198 (2015).
Clasen, M. A. et al. Proteome-scale recombinant standards and a robust high-speed search engine to advance cross-linking MS-based interactomics. Nat. Methods 21, 1–9 (2024).
Piersimoni, L., Kastritis, P. L., Arlt, C. & Sinz, A. Cross-linking mass spectrometry for investigating protein conformations and protein–protein interactions─a method for all seasons. Chem. Rev. 122, 7500–7531 (2022).
Käll, L., Canterbury, J. D., Weston, J., Noble, W. S. & MacCoss, M. J. Semi-supervised learning for peptide identification from shotgun proteomics datasets. Nat. Methods 4, 923–925 (2007).
Keller, A., Nesvizhskii, A. I., Kolker, E. & Aebersold, R. Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search. Anal. Chem. 74, 5383–5392 (2002).
Ma, K., Vitek, O. & Nesvizhskii, A. I. A statistical model-building perspective to identification of MS/MS spectra with PeptideProphet. BMC Bioinforma. 13, S1 (2012).
Kalhor, M., Lapin, J., Picciani, M. & Wilhelm, M. Rescoring peptide spectrum matches: boosting proteomics performance by integrating peptide property predictors into peptide identification. Mol. Cell. Proteom. 23, 100798 (2024).
Beveridge, R., Stadlmann, J., Penninger, J. M. & Mechtler, K. A synthetic peptide library for benchmarking crosslinking-mass spectrometry search engines for proteins and protein complexes. Nat. Commun. 11, 742 (2020).
Matzinger, M. et al. Mimicked synthetic ribosomal protein complex for benchmarking crosslinking mass spectrometry workflows. Nat. Commun. 13, 3975 (2022).
Gessulat, S. et al. Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning. Nat. Methods 16, 509–518 (2019).
Wilhelm, M. et al. Deep learning boosts sensitivity of mass spectrometry-based immunopeptidomics. Nat. Commun. 12, 3346 (2021).
Bouwmeester, R., Gabriels, R., Hulstaert, N., Martens, L. & Degroeve, S. DeepLC can predict retention times for peptides that carry as-yet unseen modifications. Nat. Methods 18, 1363–1369 (2021).
DART-ID increases single-cell proteome coverage | PLOS Computational Biology. https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1007082.
Giese, S. H., Sinn, L. R., Wegner, F. & Rappsilber, J. Retention time prediction using neural networks increases identifications in crosslinking mass spectrometry. Nat. Commun. 12, 3237 (2021).
Chen, Z.-L., Mao, P.-Z., Zeng, W.-F., Chi, H. & He, S.-M. pDeepXL: MS/MS spectrum prediction for cross-linked peptide pairs by deep learning. J. Proteome Res. 20, 2570–2582 (2021).
Picciani, M. et al. Oktoberfest: Open-source spectral library generation and rescoring pipeline based on Prosit. Proteomics e2300112 https://doi.org/10.1002/pmic.202300112 (2023).
Perez-Riverol, Y. et al. The PRIDE database resources in 2022: a hub for mass spectrometry-based proteomics evidences. Nucleic Acids Res. 50, D543–D552 (2022).
O’Reilly, F. J. et al. In-cell architecture of an actively transcribing-translating expressome. Science 369, 554–557 (2020).
Stieger, C. E., Doppler, P. & Mechtler, K. Optimized fragmentation improves the identification of peptides cross-linked by MS-cleavable reagents. J. Proteome Res. 18, 1363–1370 (2019).
Schnirch, L. et al. Expanding the depth and sensitivity of cross-link identification by differential ion mobility using high-field asymmetric waveform ion mobility spectrometry. Anal. Chem. 92, 10495–10503 (2020).
Götze, M., Iacobucci, C., Ihling, C. H. & Sinz, A. A simple cross-linking/mass spectrometry workflow for studying system-wide protein interactions. Anal. Chem. 91, 10236–10244 (2019).
Linden, A. et al. A cross-linking mass spectrometry approach defines protein interactions in yeast mitochondria. Mol. Cell. Proteomics 19, 1161–1178 (2020).
Parfentev, I., Schilbach, S., Cramer, P. & Urlaub, H. An experimentally generated peptide database increases the sensitivity of XL-MS with complex samples. J. Proteom. 220, 103754 (2020).
In Situ Structural Restraints from Cross-Linking Mass Spectrometry in Human Mitochondria | Journal of Proteome Research. https://pubs.acs.org/doi/10.1021/acs.jproteome.9b00541.
Zolg, D. P. et al. PROCAL: a set of 40 peptide standards for retention time indexing, column performance monitoring, and collision energy calibration. Proteomics 17, 1700263 (2017).
Zolg, D. P. et al. Building ProteomeTools based on a complete synthetic human proteome. Nat. Methods 14, 259–262 (2017).
Gabriel, W. et al. Prosit-TMT: deep learning boosts identification of TMT-labeled peptides. Anal. Chem. 94, 7181–7190 (2022).
Adams, C. et al. Fragment ion intensity prediction improves the identification rate of non-tryptic peptides in timsTOF. Nat. Commun. 15, 3956 (2024).
Lautenbacher, L. et al. Koina: Democratizing machine learning for proteomics research. 2024.06.01.596953 Preprint at https://doi.org/10.1101/2024.06.01.596953 (2024).
Götze, M. et al. Automated assignment of MS/MS cleavable cross-links in protein 3D-structure analysis. J. Am. Soc. Mass Spectrom. 26, 83–97 (2015).
Pirklbauer, G. J. et al. MS Annika: a new cross-linking search engine. J. Proteome Res. 20, 2560–2569 (2021).
Accurate and Automated High-Coverage Identification of Chemically Cross-Linked Peptides with MaxLynx | Analytical Chemistry. https://pubs.acs.org/doi/10.1021/acs.analchem.1c03688.
Rescuing error control in crosslinking mass spectrometry | Molecular Systems Biology. https://www.embopress.org/doi/full/10.1038/s44320-024-00057-2.
Bogdanow, B. et al. Spatially resolved protein map of intact human cytomegalovirus virions. Nat. Microbiol. 8, 1732–1747 (2023).
The Tegument Protein pp65 of Human Cytomegalovirus Acts as an Optional Scaffold Protein That Optimizes Protein Uploading into Viral Particles | Journal of Virology. https://journals.asm.org/doi/10.1128/jvi.01415-14.
Yeast Two-Hybrid Analyses Reveal Novel Binary Interactions between Human Cytomegalovirus-Encoded Virion Proteins | PLoS ONE. https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0017796.
Reyda, S. et al. The tegument protein pp65 of human cytomegalovirus acts as an optional scaffold protein that optimizes protein uploading into viral particles. J. Virol. 88, 9633–9646 (2014).
The Cellular Proteins Grb2 and DDX3 Are Increased upon Human Cytomegalovirus Infection and Act in a Proviral Fashion - PubMed. https://pubmed.ncbi.nlm.nih.gov/26121620/.
Bei, C. et al. DDX3X is hijacked by snakehead vesiculovirus phosphoprotein to facilitate virus replication via stabilization of the phosphoprotein. J. Virol. 97, e00035–23 (2023).
Stahl, K. et al. Modelling protein complexes with crosslinking mass spectrometry and deep learning. Nat. Commun. 15, 7866 (2024).
Freestone, J., Käll, L., Noble, W. S. & Keich, U. How to train a postprocessor for tandem mass spectrometry proteomics database search while maintaining control of the false discovery rate. J. Proteome Res. 24, 2266–2279 (2025).
Riley, N. M. & Coon, J. J. The role of electron transfer dissociation in modern proteomics. Anal. Chem. 90, 40–64 (2018).
Gutierrez, C. B. et al. Developing an acidic residue reactive and sulfoxide-containing ms-cleavable homobifunctional cross-linker for probing protein–protein interactions. Anal. Chem. 88, 8315–8322 (2016).
Dorfer, V., Strobl, M., Winkler, S. & Mechtler, K. MS Amanda 2.0: advancements in the standalone implementation. Rapid Commun. Mass Spectrom. 35, e9088 (2021).
Yang, B. et al. Identification of cross-linked peptides from complex samples. Nat. Methods 9, 904–906 (2012).
Combe, C. W., Graham, M., Kolbowski, L., Fischer, L. & Rappsilber, J. xiVIEW: visualisation of crosslinking mass spectrometry data. J. Mol. Biol. 436, 168656 (2024).


Acknowledgements

The authors acknowledge all members of the Wilhelmlab, as well as Manuel Matzinger, Micha Johannes Birklbauer, and Tara Bartolec, for their valuable input and discussions. We also thank the Liu Lab, especially Prof. Dr. Fan Liu and Max Ruwolt, for their input and for sharing their data on the human cytomegalovirus dataset. Additionally, we thank Si-Min He’s lab, particularly Prof. Dr. Si-Min He and Zhenlin Chen, for providing the pLink 2 search results. This work was funded in part by the European Union’s Horizon 2020 research and innovation program under the Marie Skłodowska-Curie grant agreement (Grant No. 956148), an ERC Starting Grant (Grant No. 101077037), and the German Federal Ministry of Education and Research (BMBF) (Grant No. 031L0305A).

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
Mostafa Kalhor, Cemil Can Saylan, Mario Picciani, Joel Lapin & Mathias Wilhelm
Bioanalytics, Institute of Biotechnology, Technical University Berlin, Berlin, Germany
Lutz Fischer, Falk Boudewijn Schimweg & Juri Rappsilber
Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh, UK
Juri Rappsilber
Si-M/“Der Simulierte Mensch”, a Science Framework of Technische Universität Berlin and Charité - Universitätsmedizin Berlin, Berlin, Germany
Juri Rappsilber
Munich Data Science Institute, Technical University of Munich, Garching, Germany
Mathias Wilhelm

Authors

Mostafa Kalhor
Cemil Can Saylan
Mario Picciani
Lutz Fischer
Falk Boudewijn Schimweg
Joel Lapin
Juri Rappsilber
Mathias Wilhelm

Contributions

M.W. and J.R. jointly supervised the research, providing guidance throughout the project. M.K., C.S., L.F., and F.S. analyzed the data. M.K. trained and evaluated Prosit-XL models. M.K. and M.P. extended XL rescoring in Oktoberfest. M.K., C.S., J.L., and M.W. wrote the manuscript. All authors reviewed and approved the manuscript.

Corresponding author

Correspondence to Mathias Wilhelm.

Ethics declarations

Competing interests

M.W. is a founder and shareholder of MSAID GmbH with no operational role and a member of the scientific advisory board of Momentum Biotechnologies. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Alexander Leitner and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Reporting Summary

Transparent Peer Review file

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Kalhor, M., Saylan, C.C., Picciani, M. et al. Prosit-XL: enhanced cross-linked peptide identification by fragment intensity prediction to study protein interactions and structures. Nat Commun 16, 5429 (2025). https://doi.org/10.1038/s41467-025-61203-4


Received: 02 December 2024
Accepted: 17 June 2025
Published: 01 July 2025
DOI: https://doi.org/10.1038/s41467-025-61203-4