Structure based functional identification of an uncharacterized protein from Coxiella burnetii involved in adipogenesis

Mamun, Tajul Islam; Bourhia, Mohammed; Neoaj, Taufiq; Akash, Shopnil; Azad, Md. A. K.; Hossain, Md. Sarowar; Rahman, Md. Masudur; Bin Jardan, Yousef A.; Ibenmoussa, Samir; Sitotaw, Baye

doi:10.1038/s41598-024-66072-3

Download PDF

Article
Open access
Published: 22 July 2024

Structure based functional identification of an uncharacterized protein from Coxiella burnetii involved in adipogenesis

Tajul Islam Mamun¹,
Mohammed Bourhia²,
Taufiq Neoaj³,
Shopnil Akash⁴,
Md. A. K. Azad⁴,
Md. Sarowar Hossain^4,9,
Md. Masudur Rahman⁵,
Yousef A. Bin Jardan⁶,
Samir Ibenmoussa⁷ &
…
Baye Sitotaw⁸

Scientific Reports volume 14, Article number: 16789 (2024) Cite this article

2455 Accesses
1 Citations
Metrics details

Subjects

An Author Correction to this article was published on 10 September 2024

This article has been updated

Abstract

Coxiella burnetii, the causative agent of Q fever, is an intracellular pathogen posing a significant global public health threat. There is a pressing need for dependable and effective treatments, alongside an urgency for further research into the molecular characterization of its genome. Within the genomic landscape of Coxiella burnetii, numerous hypothetical proteins remain unidentified, underscoring the necessity for in-depth study. In this study, we conducted comprehensive in silico analyses to identify and prioritize potential hypothetical protein of Coxiella burnetii, aiming to elucidate the structure and function of uncharacterized protein. Furthermore, we delved into the physicochemical properties, localization, and molecular dynamics and simulations, and assessed the primary, secondary, and tertiary structures employing a variety of bioinformatics tools. The in-silico analysis revealed that the uncharacterized protein contains a conserved Mth938-like ___domain, suggesting a role in preadipocyte differentiation and adipogenesis. Subcellular localization predictions indicated its presence in the cytoplasm, implicating a significant role in cellular processes. Virtual screening identified ligands with high binding affinities, suggesting the protein’s potential as a drug target against Q fever. Molecular dynamics simulations confirmed the stability of these complexes, indicating their therapeutic relevance. The findings provide a structural and functional overview of an uncharacterized protein from C. burnetii, implicating it in adipogenesis. This study underscores the power of in-silico approaches in uncovering the biological roles of uncharacterized proteins and facilitating the discovery of new therapeutic strategies. The findings provide valuable preliminary data for further investigation into the protein’s role in adipogenesis.

Quantitative proteomics identifies the core proteome of exosomes with syntenin-1 as the highest abundant protein and a putative universal biomarker

Article 09 June 2021

Whole-genome sequencing analysis of the cardiometabolic proteome

Article Open access 10 December 2020

Multiplatform tear proteomic profiling reveals novel non-invasive biomarkers for diabetic retinopathy

Article 09 February 2024

Introduction

Q fever is an influenza-like zoonotic disease caused by Gram-negative obligate intracellular pathogen Coxiella burnetii¹. Domesticated bovines, ovines, and caprices are the most common sources of human infections through inhalation². People who work with animals may have infection through direct or indirect contact^3,4. The potential transmission of C. burnetii to humans has been associated with the consumption of raw milk and other dairy products^5,6. The infection of C. burnetti in humans may result in a wide range of diseases, from acute to chronic. Although the clinical signs vary based on the forms, the acute form may result in coughing, headaches, weight loss, and even death^7,8. Coxiellosis is the name used for Q fever when it affects animals, and the disease is often asymptomatic. Cattle and camels might suffer from infertility, endrometritis, and mastitis, while sheep and goats might have abortion, stillbirth, and premature delivery^9,10. C. burnetti considered the most infectious organism ever studied that is capable of spreading disease with only a single viable organism¹¹. Some early studies suggest that over half of dairy animals shed C. burnetii in their milk and that the bacterium may live in powdered dairy products for 3 years^4,5.

Physicians still struggle with C. burnetii infection because there is no international collaboration to standardize treatment and reduce clinical outcomes¹². To combat C. burnetii and promote public health worldwide, more research is needed to find safe and effective therapies. However, the research on this specific biological agent has not been conducted to the same extent as other agents, resulting in inadequate information on its proteins’ structure and functional characteristics. Limited research has been conducted on the underlying structural processes that inhibit the C. burnetii protein¹³. About 35% of genes in the genomes of different bacterial species are in the “uncharacterized” category¹⁴. Hypothetical proteins (HPs) are uncharacterized protein families or domains with unclear functionality^9,15. Annotating HPs reduces pathogen knowledge gaps by anticipating structures and biological functions¹⁶. Consequently, annotation of potential protein functions has gained popularity, which may help in creating prospective medicines against targeting agents and developing effective pharmacological targets^17,18,19. There has been a lot of development in recent years of technology for computers that minimizes the time and expenses of functional annotation of unknown protein²⁰. Furthermore, the field of bioinformatics not only enhances the reliability of studies but also enhances the efficiency of data processing²¹. Using structure- and sequence-based methodologies, in silico approaches have been successfully used to a variety of microorganisms including Candida dubliniensis²², Clostridium tetani²³, and vibrio cholera¹⁹ to analyse the HPs of these diseases.

Despite its medical importance, a substantial portion of the C. burnetii genome remains uncharacterized, presenting a challenge for understanding the bacterium’s pathogenesis and developing effective therapeutic strategies. Among these unexplored elements are proteins whose functions and roles in cellular processes remain elusive. In this study, we employ a comprehensive in-silico approach to characterize an uncharacterized protein from C. burnetii, hypothesized to be involved in adipogenesis. By integrating advanced computational tools for structure prediction, ___domain analysis, and functional annotation, we aim to unravel the structural features and potential functional roles of this protein. Through this endeavor, we seek to shed light on the molecular basis of adipogenesis modulation by C. burnetii and identify novel avenues for therapeutic intervention against Q fever.

Methods

Sequence extraction from Coxiella burnetii

A hypothetical or uncharacterized protein from Coxiella burnetii (accession no. RQM76478.1) was selected from the NCBI database²⁴. The protein’s 123 amino acid residues were retained in FASTA format for further analysis. Supplementary Table 1 provides a comprehensive overview of all servers utilized to annotate the structural and functional attributes of HP. The workflow is shown in Fig. 1.

Analysis of physicochemical properties

Protein structure and function might be better understood by exploring its physicochemical properties. The uncharacterized protein was analyzed using the ExPASy server²⁵.

Protein subcellular localization

The effective functioning of proteins and precise genetic analysis depend on subcellular localization. Drug and vaccine targets are within cells, providing significant clues for identifying and developing novel pharmaceuticals. The subcellular ___location of Coxiella burnetii was predicted using the CELLO website²⁶. The findings were further verified using the PSORTb, CCTOP, SOSUIGramN, and PSLpred servers^27,28,29,30.

Function prediction by ___domain analysis

Protein functions were predicted from various database servers and tools. To identify a conserved ___domain and motif in the protein, the FASTA sequence was analyzed. Multiple databases and analytic tools were applied to the hypothesized protein sequence, including INTERPRO, MOTIF, Pfam, and the NCBI-hosted conserved ___domain database^31,32,33,34. The MOTIF server was used to analyze protein sequence motifs. Additionally, the I-TASSER site was used to predict and document the hypothetical protein’s molecular, biological, and cellular functions³⁵.

Multiple sequence alignment and phylogeny analysis

The NCBI database of non-redundant protein sequences (nr) was searched using BLASTp with the default settings. This search resulted in the retrieval of numerous protein sequences annotated as having similar functionality. The Clustal Omega tool was used for multiple sequence alignment (MSA) to assess the query sequence against homologous sequences³⁶. Finally, the tools MEGA X and BioEdit were used to conduct phylogenetic analysis and MSA^37,38.

Secondary structure prediction

The secondary structure of the putative protein was predicted to determine the relative placement of its amino acid residues. This analysis identified the presence of alpha-helices, beta-sheets, and random coils. To calculate the parameters of the secondary structure, we used the SOPMA tool with its default settings³⁹. Finally, PSIPRED⁴⁰ was used to further cross-check the findings from SOPMA.

Predicting protein tertiary structure and evaluating its quality

The tertiary structure of the unknown protein was predicted to determine its arrangement and folding. The tertiary model was predicted using the Swiss Model and D-I-TASSER servers^35,41. In the next phase of our investigation, the modeling features provided by the D-I-TASSER server were utilized. Subsequently, energy minimization was performed using the YASARA software⁴². Various web servers were used to assess the quality of the predicted protein structure. The PROCHECK web server was used to build the Ramachandran plot, which served as an indicator to evaluate the quality of the protein structure⁴³. Additionally, VERIFY 3D⁴⁴ and ERRAT⁴⁵ servers were also used to evaluate the predicted tertiary structure of the uncharacterized protein. The Z-scores of the predicted models were computed using the ProSA server⁴⁶.

Active site determination

The protein active site is the precise area that enables the protein to carry out its biological functions. In the current investigation, the CASTp server was used to identify the active sites⁴⁷. This server offers a web-based resource for identifying, delineating, and quantifying curved surface areas on three-dimensional structures of proteins. Active pockets on protein surfaces and within 3D structure interiors were precisely evaluated. Finally, PyMOL software was used to visually represent the CASTp results⁴⁸.

Protein–protein interaction

A computational method was applied for understanding interactions with other proteins and their significance in biological functions. The uncharacterized protein of Coxiella burnetii bacteria was searched in the STRING database for possible functional interaction networks⁴⁹.

Molecular docking analysis and dynamic simulation

Molecular docking studies were conducted between the uncharacterized protein and two distinct ligand molecules in order to further verify the conformation. In the previous literature, the ligand for the potential protein was not disclosed. Ligands were selected based on their known or predicted roles in adipogenesis or lipid-related pathways. This included compounds with reported activities as agonists or antagonists of adipogenic transcription factors, modulators of lipid metabolism enzymes, or regulators of adipocyte differentiation and function. Hence, the Protein Data Bank (PDB) server was implemented to identify potential ligands. Therefore, to investigate the interaction between the protein and potential inhibitors, docking studies were conducted using two distinct ligands: 2-[(2-phenylphenyl)amino]benzoic acid and 2-amino-4-phenylbenzoic acid. These compounds were chosen based on their structural properties, which suggest potential for high affinity binding to the target protein. This approach helps in understanding the ligands’ positions, orientations, and conformations within the binding site of the HP, thus offering valuable information for the design and optimization of more effective therapeutic agents targeting this protein.

The ligands used in the docking investigation were obtained from the PubChem server and then converted to the PDB format using PyMOL software⁵⁰. Docking analysis was performed using AutoDock Vina through PyRx tool to predict how ligands interact with macromolecules⁵¹. The outcomes of the docking process were assessed using PyMOL and Discovery Studio software⁵². The Chimaera tool was used to better understand the hydrophobic and hydrogen-bonding interactions occurring between the atoms of the receptor and ligand at a proximity of 5 Å ⁵³.

The mobility of molecules and atoms in dynamic systems like protein–ligand complexes are studied using molecular dynamic simulation (MDS) to explain crucial physicochemical processes. MDS were conducted on receptor-ligand complexes from molecular docking studies to confirm interaction mechanisms and stability in a dynamic environment. Normal mode analysis (NMA) on the iMODS website was implemented to determine the internal coordinates’ motion by applying MDS to the HP-ligand complexes⁵⁴. This software can compute the deformability, B-factors, eigenvalues, covariance, and elastic network⁵⁵.

Result and discussion

Protein sequence retrieval and similarity analysis

The hypothetical protein (accession no. RQM76478.1) was randomly selected from the NCBI server. It originates from the gram-negative organism Coxiella burnetii and comprises 123 amino acid residues. To facilitate further investigation, the protein sequence was obtained in FASTA format. The BLASTp program was then used to search the non-redundant protein sequences (nr) database for proteins structurally similar to the chosen hypothetical protein (Table 1). According to the BLASTp result, the uncharacterized protein shows similarities with several proteins, including Mth938-like ___domain-containing proteins.

Table 1 Proteins with identical features were obtained from the non-redundant (nr) database.

Full size table

Analysis of physicochemical properties

The hypothetical protein’s physical and chemical characteristics were identified via ExPASy ProtParam server. The amino acid composition was Ala (7.3%), Arg (4.9%), Asn (3.3%), Asp (3.3%), Cys (1.6%), Gln (1.6%), Glu (7.3%), Gly (8.9%), His (3.3%), Ile(6.5%), Leu (11.4%), Lys (3.3%), Met (2.4%), Phe (1.6%), Pro (5.7%), Ser (7.3%), Thr (7.3%), Trp (1.6%), Tyr (4.1%), and Val (7.3%). The protein’s atomic composition includes carbon (601, 31.7%), hydrogen (951, 50.1%), nitrogen (161, 8.5%), oxygen (179, 9.4%), and sulfur (5, 0.3%). The theoretical pI of the protein was calculated to be 5.74, and its molecular weight was 13,456.43 Dalton, and GRAVY, an indicator of its hydrophobicity, was 0.002 (Table 2). It was suggested that the unknown protein was hydrophobic based on the positive GRAVY score⁵⁶. Which indicates a lack of interaction possibilities with water. The protein was anticipated to be unstable with an instability index (II) of 57.75. A protein is considered to possess stability when its instability index value falls below 40, whereas a protein is classified as unstable if its instability index value exceeds 40⁵⁷. The protein’s pI was determined to be 5.74, making it acidic (pH 7) in composition. The predicted protein half-life in mammalian reticulocytes in vitro is 30 h; in yeast and E. coli in vivo, it is > 20 h and > 10 h, respectively. These approaches may adjust half-life and adapt to treatment methods and disease⁵⁸. The extinction coefficient stands in for a proportionality constant in the Beer-Lambert formula. It provides an approximation of the quantity of light at a certain wavelength absorbed by proteins⁵⁹.

Table 2 Physicochemical characteristics of the uncharacterized protein.

Full size table

Estimating the subcellular ___location of protein

Determining subcellular localization is crucial for understanding protein function and genome analysis. Essentially, a protein’s subcellular ___location indicates its position within a cell, which could be the outer membrane, inner membrane, periplasm, extracellular space, or cytoplasm⁶⁰. According to the CELLO server, the selected hypothetical protein is localized in the cytoplasm, with a reliability score of 3.301. This result was confirmed by PSORTb v3.0.3, SOSUIGramN, and PSLpred websites. Additionally, the PSLpred tool predicted a cytoplasmic localization for the protein with a score of 98.1% and a reliability index of 5. However, the CCTOP server predicts that the protein is not a transmembrane protein. Predicting the subcellular localization of an unidentified protein is valuable for genomic identification and drug development⁶¹. Membrane proteins may serve as vaccine targets, while cytoplasmic proteins often function as drug targets⁶². This suggests that the uncharacterized protein could be a promising candidate for drug targeting.

Protein ___domain analysis for predicting biological function

According to NCBI’s CD-search, the selected hypothetical protein contains only a single ___domain. The Mth938-like ___domain-containing protein was identified as the origin of this ___domain (accession No. cd05560). The server generated an e-value of 3.78e-51, indicating that amino acid residues 13–121 constitute the ___domain. This finding was corroborated by Pfam, D-I-TASSER, and INTERPRO servers to ensure its accuracy. While INTERPRO and MOTIF attempted to identify relevant motifs, they did not yield any results. The genome of the microorganism Methanobacterium thermoautotrophicum (Mth) encodes a potential protein known as Mth938. According to various servers, the Mth938 ___domain-containing protein has the potential to be involved in both preadipocyte differentiation and adipocyte formation. Additionally, the Mth938 protein is associated with functions such as 3-oxoacyl-[acyl-carrier-protein] synthase activity, fatty acid biosynthesis, cell wall macromolecule biosynthesis, response to oxygen levels, Actinobacterium-type cell wall biogenesis, mycolic acid metabolism, ketone biosynthesis, and phenylpropanoid biosynthesis. The utilization of the D-I-TASSER server for predicting the structure and function of an uncharacterized protein from Coxiella burnetii has provided valuable insights into its potential role in adipogenesis. D-I-TASSER’s predictions span across molecular function, biological processes, and cellular components, offering a comprehensive view of the protein’s role within the cellular environment (Fig. 2). D-I-TASSER suggests that the protein exhibits binding activities, possibly relating to lipid or receptor molecules, which is consistent with mechanisms known to influence adipocyte differentiation and function. This insight supports the hypothesis that the protein may directly interact with key regulators of adipogenesis. The tool predicts involvement in signaling pathways and cellular processes that are crucial for adipocyte differentiation. Such processes may include the regulation of transcription factors or activation of enzymes involved in lipid metabolism, highlighting a potential mechanism through which the protein influences adipogenesis. The protein is predicted to be associated with cellular locations that are integral to adipocyte function, such as the endoplasmic reticulum or lipid droplets. This localization supports its proposed role in adipogenesis and suggests a direct pathway through which the protein may exert its effects.

In aggressive estrogen receptor-positive breast cancers, the uncharacterized oncogene adipogenesis associated Mth938 ___domain containing (AAMDC) regulates metabolic enzymes involved in the one-carbon folate and methionine cycles, as well as lipid metabolism⁶³. A preliminary study suggests that the post-transcriptional regulation of AAMDC occurs through alternative polyadenylation (APA) and microRNAs⁶⁴. During adipogenesis, AAMDC might activate the caspase-3 pathway to turn on the preadipocytes from proliferation to differentiation and to keep it viable⁶⁵.

Studies of conserved domains show that the C11orf67 gene of human seems to fold very similarly to a predicted protein from the bacterium Methanobacterium thermoautotrophicum (Mth938)⁶⁶. Furthermore, it was determined that C11orf67 demonstrated overexpression in several breast cancer cell lines⁶⁷. No previous research has been conducted on Mth938 ___domain of the Coxiella burnetii bacterium. Nevertheless, there is some structural similarity between human C11orf67 and Mth938 ___domain of Coxiella burnetii. The expression of Mth938 ___domain, which is crucial for several biological processes, including preadipocyte differentiation and adipogenesis, has to be strictly monitored. Drawing on literature that describes the involvement of Mth938 ___domain-containing proteins in cellular processes relevant to adipocyte differentiation and lipid metabolism, we hypothesize potential mechanisms through which our protein might impact adipogenesis. This includes potential interactions with key adipogenic regulators and involvement in signaling pathways that drive adipocyte maturation and function. We also expanded our discussion to consider how the protein’s function might relate to the biology of Coxiella burnetii, especially given its intracellular lifestyle and interaction with host lipid metabolism. This aspect is particularly intriguing as it offers a dual perspective on the protein’s role: its contribution to the pathogen’s survival and replication within host cells, and its potential impact on host cellular processes, such as adipogenesis.

Alignment of sequences and analysis of phylogenetic relationships

BLASTp was performed on the NCBI protein database against a non-redundant protein sequence database, revealing similarities between the target protein and various proteins, particularly those containing Mth938-like domains. For multiple sequence alignment (MSA) and phylogenetic analysis, organisms with the most similar protein sequences, lowest e-values, and highest query coverage were selected (Table 1). Subsequently, MSA was conducted using BioEdit software, yielding consistent results with both MUSCLE and Clustal Omega servers (Fig. 3). The MSA analysis revealed a significant level of conservation among amino acid residues in the target protein.

A phylogenetic tree was generated using the MEGA 11 software, employing 1000 bootstrap replications and the neighbor-joining technique (Supplementary Fig. 1). Phylogenetic analysis determined the evolutionary distance between the target protein and aligned similar proteins, confirming their similarity. The tree root represents the ancestral lineage, while branch points represent descendants or proteins of the ancestor. A phylogeny illustrates common ancestry among lineages, with each lineage having distinct common ancestry. Proteins in the top five branches share a more recent common ancestor than those in the bottom half of the tree. Closer proximity on the phylogenetic tree indicates a more recent common ancestor among proteins.

Prediction of protein secondary structure

According to the SOMPA server, the hypothetical protein’s secondary structure was composed mostly of random coils (37.40%), alpha helices (34.15%), extended strands (23.58%), and beta turns (4.88%) (Supplementary Fig. 2). The data were compared using PSIPRED, and the results were found to be consistent (Fig. 4). Secondary structure analysis is crucial for understanding protein function, as there is a close relationship between protein structure and function. Predicting protein secondary structure can also aid in forecasting tertiary structure, establishing a connection between the primary sequence and the tertiary structure⁶⁸.

Predicting tertiary structures and assessing quality

The Coxiella burnetii protein tertiary structure was anticipated through D-I-TASSER, and Swiss Model websites. The Swiss Model server predicted that the protein’s 3D structure would share 37.40% identity with the top-scoring template (PDB ID: 2 gm2.1. A). Additionally, the oligo-state of the protein was found to be a monomer by additional verification of the 3D structure prediction using SWISS-MODEL interactive workspace⁶⁹. Following the generation of 3D structures from the D-I-TASSER server, the protein’s structure was verified through multiple quality assessment stages and use for further analysis (Supplementary Fig. 3). The PROCHECK software used Ramachandran plot analysis to evaluate the 3D model (Table 3; Fig. 5). According to the results given by the Ramachandran Plot, 80.8% of the residues were found in the most favorable locations. In the most favorable locations, it is commonly accepted that more than 90% of the residues will provide a reliable 3D model⁷⁰. After that, VERIFY 3D and ERRAT were used to double-check the results; and it was found that 52.03% of the residues had an average 3D-1D score > = 0.1. The YASARA energy minimization server subsequently modified the 3D structure. After energy reduction, − 39,228.4 kJ/mol becomes − 61,161.1 kJ/mol. The initial score was − 2.40, and the energy minimization was enhanced to − 0.73. Figure 6 shows that after energy reduction, ERRAT evaluated the model as having a high-quality factor of 84.07% compared to its initial value of 82.52%. A high-quality model is defined as one with a score greater than 50; higher scores indicating a better-quality model⁷¹. The anticipated structure’s quality factors pre and post energy minimization are given in Table 4. Subsequently, the predicted structure was evaluated using the ProSA-web tool, which revealed a Z score value of -6.79 (Supplementary Fig. 4).

Table 3 Ramachandran plot statistics for the predicted 3D model of the uncharacterized protein.

Full size table

Table 4 Comparison of the quality assessment of the protein before and after energy minimization.

Full size table

Determination of the active site

The CASTp server calculated putative protein active sites using a probe radius of 1.4 Å to determine solvent-accessible surface area. The active pockets’ volumes, areas, and mouth openings were also measured. According to the findings, the protein had 24 active pockets, with the optimal active site having a surface area of 375.367 and a volume surface area of 363.460. Analytical calculations were made using both the Lee and Richards’ surface⁷² model for solvent accessibility and Connolly’s surface model for molecule surfaces⁷³. Supplementary Fig. 5 illustrates the active sites of interactions. Accurately localizing the active site on a protein holds significant importance in the fields of molecular docking and the development of novel pharmaceuticals and vaccines.

Protein–protein interaction analysis

The STRING server provides comprehensive coverage of both direct and indirect associations among anticipated protein–protein interactions (PPIs). The protein of interest has connections to other proteins, according to protein–protein interaction analysis, some of which have been experimentally annotated with known functions and others that have not (Fig. 6). The target protein is anticipated to interact with nadE in a significant way. The identified anticipated interaction partners and their scores are nadE (0.773), bioC.1 (0.698), nuoG (0.592), CBU_1304 (0.509), and CBU_0095a (0.504). Furthermore, several proteins have engaged in interactions with some unidentified proteins whose functional characteristics have not yet been characterized. Among these, nadE is the most important because it is a glutamine-dependent NAD(+) synthetase that also accelerates the ATP-dependent amidation of deamido-NAD to produce NAD. L-glutamine is used as a source of nitrogen. BioC.1 and CBU_1304 are involved in synthesizing pimeloyl-ACP via the fatty acid synthetic pathway and the oligoketide cyclase/lipid transport protein. The mechanisms of disease processes may be better understood with the use of protein–protein interaction data⁷⁴.

Molecular docking and dynamic simulation

The molecular docking method exemplifies a structure-based drug design approach that mimics molecular interactions and predicts binding mechanisms between receptors and ligands⁷⁵. Consequently, in order to explore the interactions between the host protein and prospective inhibitors, docking analyses were carried out with two different ligands: 2-[(2-phenylphenyl)amino]benzoic acid and 2-amino-4-phenylbenzoic acid. The docking study incorporating the hypothetical protein and the ligand was carried out with the help of Autodock Vina software. The blind docking procedure was conducted in order to examine the specificity of the substrates. Afterwards, the software PyMOL and Discovery Studio were employed to do further investigation of the interactions subsequent to the docking study. The MD analysis revealed many intermolecular interactions between the protein and probable ligands. 2-[(2-phenylphenyl)amino]benzoic acid had a high binding affinity for the protein, with a docking score of − 7.00 (kcal/mol). A binding affinity of − 6.20 (kcal/mol) was observed for 2-amino-4-phenylbenzoic acid through docking (Fig. 7). The compounds exhibiting the lowest docking score were considered excellent and demonstrated a higher level of affinity⁷⁶. This approach helps in understanding the ligands’ positions, orientations, and conformations within the binding site of the HP, thus offering valuable information for the design and optimization of more effective therapeutic agents targeting this protein.

Blind docking shows that 4 and 5 amino acid residues of the HP interact with the docked complexes of HP-2-[(2-phenylphenyl)amino]benzoic acid and HP-2-Amino-4-phenylbenzoic acid. The study’s active site residue-ligand interaction helps us understand residue-drug binding mechanisms. It obtained substantial discoveries about biological structure and function and hydrogen bonding. The complexes of HP-2-[(2-phenylphenyl)amino]benzoic acid have 2 conventional hydrogen bonds. HP-2-amino-4-phenylbenzoic acid only possessed a single conventional hydrogen bond. The ligands of 2-[(2-phenylphenyl)amino]benzoic acid interact with the residues Glu63, Leu63, Ala62, Ly115, and Tyr38, and the 2-Amino-4-phenylbenzoic acid ligands interact with the residues Leu63, Tyr38, Lys115, Val116, and Pro37. Both ligands had similar interaction residues, including Leu63, Lys115, and Tyr38, in their structures. The molecules with a lower docking value were thought to be the best and had a greater degree of affinity (Table 5).

Table 5 The summary provides insights into the extent and specificity of protein binding interactions involving Coxiella burnetii, shedding light on the affinity levels.

Full size table

Normal mode analysis (NMA) on the iMODS server was used to measure the effectiveness and stability of the complexes. We selected HP-2-[(2-phenylphenyl)amino]benzoic for the molecular dynamic simulation investigation because of its high binding affinity and evaluated the atomic dynamic motions inside the complexes. NMA B-factor values, which are proportional to the RMS value, reflect the docked complex’s mobility. The complex’s deformability relies on the residues’ distortions, embodied in chain hinges. The main-chain deformability is a measure of the capability of a given molecule to deform at each of its residues. The ___location of the chain ‘hinges’ can be derived from high deformability regions (Fig. 8a). The correlation between the experimental B-factor from crystallography or NMR data and the calculated B-factor from Normal Mode Analysis (NMA) is a crucial aspect of structural biology, particularly in understanding protein dynamics. The calculated from NMA is obtained by multiplying the NMA mobility by 8pi^2^L. Actually, the B-factor column gives an averaged RMS (Fig. 8d).

. The correlation between experimental and calculated B-factors serves as a validation of the NMA model. A high correlation suggests that the NMA successfully captures the essential motions contributing to the protein’s dynamic behavior as observed experimentally. Conversely, discrepancies might indicate areas where the model fails to account for certain movements or where the experimental data might be limited or averaged (as in the case of NMR-derived B-factors). The eigenvalue obtained for your docked complex, 3.0429e−05, suggests that the complex has a relatively low stiffness along the corresponding mode of motion (Fig. 8b). In practical terms, this means that the complex is more prone to deformation or conformational changes along this particular mode compared to complexes with higher eigenvalues. Understanding the stiffness properties of protein complexes is crucial in drug design and therapeutics. Targeting protein–protein interactions with small molecules or peptides often involves disrupting specific modes of motion within the complex. For each normal mode, there existed a correlation between eigenvalue and variance⁷⁷. The covariance diagram describes the motion of a particular region of the molecule, with red, blue, and white colors representing movements that are linked, unrelated, and anti-related, accordingly (Fig. 8f). On the iMODS server, this correlation matrix is used to derive normal modes in a simplified manner, facilitating the exploration of possible collective motions under physiological conditions with minimal computational resources. The server utilizes the correlation matrix to identify patterns of concerted atomic fluctuations, indicating regions of the protein that move together in a correlated manner. The relationship between the variance associated with each normal mode and the eigenvalue is inversely related, meaning that higher eigenvalues correspond to lower variances and vice versa (Fig. 8c). This relationship is crucial for understanding the distribution of motions within a protein structure as described by normal mode analysis (NMA). In visual representations of NMA results, colored bars are commonly used to depict the individual and cumulative variances associated with each normal mode. Modes with higher variances contribute more significantly to the overall flexibility of the protein and may represent functionally relevant motions, such as conformational changes associated with ligand binding or enzyme catalysis. The graph displays the relative contribution of variance to equilibrium movements for each normal mode, with green and violate bars representing group and single variance. In addition, a model of an elastic network was constructed, which could identify spring-connected atomic pairs. The elastic network model defines which pairs of atoms are connected by springs. Each dot in the graph represents one spring between the corresponding pair of atoms. Dots are colored according to their stiffness; the darker grays indicate stiffer springs and vice versa. When visualizing the ENM using a graph representation, each dot in the graph represents a spring connecting a pair of atoms. The color of the dots corresponds to the stiffness of the springs they represent. Typically, darker shades of gray indicate stiffer springs, while lighter shades indicate fewer rigid connections (Fig. 8e).

Conclusion

In this study, we conducted a comprehensive in-silico analysis of an uncharacterized protein from Coxiella burnetii, unveiling insights into its structural and functional attributes. Our findings suggest its involvement in vital biological processes, such as preadipocyte differentiation and adipogenesis, inferred from the presence of a conserved Mth938-like ___domain. The protein’s localization in the cytoplasm underscores its significance in cellular function. Through computational analyses, we identified potential ligands with strong binding affinities, highlighting the protein’s potential as a promising drug target for Q fever and related diseases. Molecular dynamics simulations supported the stability of the protein-ligand complexes, reinforcing their therapeutic relevance. Further experimental validation is necessary to confirm the protein’s functions and interactions, providing potential avenues for the development of novel therapies. Overall, this study contributes to bridging the knowledge gap regarding the biological roles of uncharacterized proteins in C. burnetii.

Limitations

We acknowledge several limitations inherent to our study. First, despite the advancements in computational methods, such as D-I-TASSER for protein structure prediction and functional annotation, these in-silico predictions require experimental validation to confirm the hypothesized functions and interactions. The accuracy of these computational models can be constrained by the quality and extent of existing structural and functional databases, which may not fully represent the diversity of protein structures and functions in nature. Furthermore, the specific environmental conditions within a host organism, such as the presence of other interacting proteins, post-translational modifications, and local cellular conditions, can significantly influence protein function. These factors are challenging to fully replicate or account for in computational models. Consequently, our predictions, while informative, may not capture the full complexity of the protein’s role in adipogenesis as it occurs in vivo. Additionally, our focus on Coxiella burnetii and its potential implications in adipogenesis may overlook broader interactions and functions relevant to the organism’s lifecycle or pathogenicity. The role of the identified protein within the context of the bacterium’s intracellular environment and its interactions with host cells might also reveal further dimensions of its function that our current approach cannot elucidate. In conclusion, while our study employs state-of-the-art in-silico techniques to uncover potential functions of an uncharacterized protein, it is crucial to approach these findings as hypotheses that lay the groundwork for future experimental investigations. Validating these predictions through laboratory experiments is essential for confirming their accuracy and for understanding the protein’s role in the context of Coxiella burnetii’s biology and its interaction with host organisms.

Data availability

All data generated or analysed during this study are included in this published article.

Change history

10 September 2024
A Correction to this paper has been published: https://doi.org/10.1038/s41598-024-70969-4

References

Fenollar, F. et al. Risks factors and prevention of Q fever endocarditis. Clin. Infect. Dis. 33(3), 312–316 (2001).
Article CAS PubMed Google Scholar
Kazar, J. Coxiella burnetii infection. Ann. N. Y. Acad. Sci. 1063(1), 105–114 (2005).
Article ADS PubMed Google Scholar
Howe, G. B. et al. Real-time PCR for the early detection and quantification of Coxiella burnetii as an alternative to the murine bioassay. Mol. Cell. Probes 23(3–4), 127–131 (2009).
Article CAS PubMed Google Scholar
Koehler, L. M., Kloppert, B., Hamann, H.-P., El-Sayed, A. & Zschöck, M. Comprehensive literature review of the sources of infection and transmission routes of Coxiella burnetii, with particular regard to the criteria of “evidence-based medicine”. Comp. Immunol. Microbiol. Infect. Dis. 64, 67–72 (2019).
Article PubMed Google Scholar
Barandika, J. F., Alvarez-Alonso, R., Jado, I., Hurtado, A. & García-Pérez, A. L. Viable Coxiella burnetii in hard cheeses made with unpasteurized milk. Int. J. Food Microbiol. 303, 42–45 (2019).
Article CAS PubMed Google Scholar
Li, Q. et al. Protective immunity against Q fever induced with a recombinant P1 antigen fused with HspB of Coxiella burnetii. Ann. N. Y. Acad. Sci. 1063(1), 130–142 (2005).
Article ADS CAS PubMed Google Scholar
Akamine, C. M., Perez, M. L., Lee, J. H. & Ing, M. B. Q fever in southern California: A case series of 20 patients from a VA Medical Center. Am. J. Trop. Med. Hyg. 101(1), 33 (2019).
Article PubMed PubMed Central Google Scholar
Waag, D. M. et al. Comparative efficacy and immunogenicity of Q fever chloroform: Methanol residue (CMR) and phase I cellular (Q-Vax) vaccines in cynomolgus monkeys challenged by aerosol. Vaccine 20(19–20), 2623–2634 (2002).
Article CAS PubMed Google Scholar
Jaroszewski, L. et al. Exploration of uncharted regions of the protein universe. PLoS boil. 7(9), e1000205 (2009).
Article Google Scholar
To, H. et al. Prevalence of Coxiella burnetii infection in dairy cattle with reproductive disorders. J. Vet. Med. Sci. 60(7), 859–861 (1998).
Article CAS PubMed Google Scholar
Jones, R. M., Nicas, M., Hubbard, A. E. & Reingold, A. L. The infectious dose of Coxiella burnetii (Q fever). Appl. Biosaf. 11(1), 32–41 (2006).
Article Google Scholar
Melenotte, C., Million, M. & Raoult, D. New insights in Coxiella burnetii infection: Diagnosis and therapeutic update. Exp. Rev. Anti-infect. Ther. 18(1), 75–86 (2020).
Article CAS Google Scholar
Franklin, M. C. et al. Structural genomics for drug design against the pathogen Coxiella burnetii. Proteins Struct. Funct. Bioinform. 83(12), 2124–2136 (2015).
Article CAS Google Scholar
Hoskeri, J., Krishna, V. & Amruthavalli, C. Functional annotation of conserved hypothetical proteins in Rickettsia massiliae MTU5. J. Comput. Sci. Syst. Biol. 3(2), 50–51 (2010).
Article CAS Google Scholar
Naveed, M., Tehreem, S., Usman, M., Chaudhry, Z. & Abbas, G. Structural and functional annotation of hypothetical proteins of human adenovirus: Prioritizing the novel drug targets. BMC Res. Notes 10(1), 1–6 (2017).
Article Google Scholar
Baker, D. & Sali, A. Protein structure prediction and structural genomics. Science 294(5540), 93–96 (2001).
Article ADS CAS PubMed Google Scholar
de Azevedo Jr, W. F. et al. Molecular model for the binary complex of uropepsin and pepstatin. Biochem. Biophys. Res. Commun. 287(1), 277–281 (2001).
Article PubMed Google Scholar
de Azevedo, W. F. Molecular dynamics simulations of protein targets identified in Mycobacterium tuberculosis. Curr. Med. Chem. 18(9), 1353–1366 (2011).
Article PubMed Google Scholar
Islam, M. S., Shahik, S. M., Sohel, M., Patwary, N. I. & Hasan, M. A. In silico structural and functional annotation of hypothetical proteins of Vibrio cholerae O139. Genom. Inform. 13(2), 53 (2015).
Article Google Scholar
Hawkins, T. & Kihara, D. Function prediction of uncharacterized proteins. J. Bioinform. Comput. Biol. 5(01), 1–30 (2007).
Article CAS PubMed Google Scholar
Forouharmehr, A., Nassiri, M., Ghovvati, S. & Javadmanesh, A. Evaluation of different signal peptides for secretory production of recombinant bovine pancreatic ribonuclease A in gram negative bacterial system: an in silico study. Curr. Proteom. 15(1), 24–33 (2018).
Article CAS Google Scholar
Kumar, K. et al. Functional annotation of putative hypothetical proteins from Candida dubliniensis. Gene 543(1), 93–100 (2014).
Article CAS PubMed Google Scholar
Enany, S. Structural and functional analysis of hypothetical and conserved proteins of Clostridium tetani. J. Infect. Public Health 7(4), 296–307 (2014).
Article PubMed Google Scholar
Sayers, E. W. et al. Database resources of the national center for biotechnology information. Nucl. Acids Res. 50(D1), D20 (2022).
Article CAS PubMed Google Scholar
Gasteiger, E. et al. Protein identification and analysis tools on the ExPASy server (Springer, 2005).
Book Google Scholar
Yu, C. S., Lin, C. J. & Hwang, J. K. Predicting subcellular localization of proteins for Gram-negative bacteria by support vector machines based on n-peptide compositions. Protein Sci. 13(5), 1402–1406 (2004).
Article CAS PubMed PubMed Central Google Scholar
Bhasin, M., Garg, A. & Raghava, G. P. S. PSLpred: Prediction of subcellular localization of bacterial proteins. Bioinformatics 21(10), 2522–2524 (2005).
Article CAS PubMed Google Scholar
Dobson, L., Reményi, I. & Tusnády, G. E. CCTOP: A consensus constrained TOPology prediction web server. Nucl. Acids Res. 43(W1), W408–W412 (2015).
Article CAS PubMed PubMed Central Google Scholar
Imai, K. et al. SOSUI-GramN: High performance prediction for sub-cellular localization of proteins in gram-negative bacteria. Bioinformation 2(9), 417 (2008).
Article PubMed PubMed Central Google Scholar
Yu, N. Y. et al. PSORTb 3.0: Improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotes. Bioinformatics 26(13), 1608–1615 (2010).
Article CAS PubMed PubMed Central Google Scholar
Blum, M. et al. The InterPro protein families and domains database: 20 years on. Nucl. Acids Res. 49(D1), D344–D354 (2021).
Article CAS PubMed Google Scholar
El-Gebali, S. et al. The Pfam protein families database in 2019. Nucl. Acids Res. 47(D1), D427–D432 (2019).
Article CAS PubMed Google Scholar
Kanehisa, M., Sato, Y., Kawashima, M., Furumichi, M. & Tanabe, M. KEGG as a reference resource for gene and protein annotation. Nucl. Acids Res. 44(D1), D457–D462 (2016).
Article CAS PubMed Google Scholar
Lu, S. et al. CDD/SPARCLE: the conserved ___domain database in 2020. Nucl. Acids Res. 48(D1), D265–D268 (2020).
Article CAS PubMed Google Scholar
Zheng, W. et al. Folding non-homologous proteins by coupling deep-learning contact maps with I-TASSER assembly simulations. Cell Rep. Methods 1(3), 10014 (2021).
Google Scholar
Sievers, F. et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 7(1), 539 (2011).
Article PubMed PubMed Central Google Scholar
Hall, T., Biosciences, I. & Carlsbad, C. BioEdit: An important software for molecular biology. GERF Bull. Biosci. 2(1), 60–61 (2011).
Google Scholar
Kumar, S., Stecher, G., Li, M., Knyaz, C. & Tamura, K. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evolut. 35(6), 1547 (2018).
Article CAS Google Scholar
Geourjon, C. & Deleage, G. SOPMA: Significant improvements in protein secondary structure prediction by consensus prediction from multiple alignments. Bioinformatics 11(6), 681–684 (1995).
Article CAS Google Scholar
Buchan, D. W. & Jones, D. T. The PSIPRED protein analysis workbench: 20 years on. Nucl. Acids Res. 47(W1), W402–W407 (2019).
Article CAS PubMed PubMed Central Google Scholar
Waterhouse, A. et al. SWISS-MODEL: Homology modelling of protein structures and complexes. Nucl. Acids Res. 46(W1), W296–W303 (2018).
Article CAS PubMed PubMed Central Google Scholar
Krieger, E. et al. Improving physical realism, stereochemistry, and side-chain accuracy in homology modeling: Four approaches that performed well in CASP8. Proteins Struct. Funct. Bioinform. 77(S9), 114–122 (2009).
Article CAS Google Scholar
Laskowski, R. A., MacArthur, M. W., Moss, D. S. & Thornton, J. M. PROCHECK: A program to check the stereochemical quality of protein structures. J. Appl. Crystallogr. 26(2), 283–291 (1993).
Article ADS CAS Google Scholar
Colovos, C. & Yeates, T. O. Verification of protein structures: Patterns of nonbonded atomic interactions. Protein Sci. 2(9), 1511–1519 (1993).
Article CAS PubMed PubMed Central Google Scholar
Bowie, J. U., Lüthy, R. & Eisenberg, D. A method to identify protein sequences that fold into a known three-dimensional structure. Science 253(5016), 164–170 (1991).
Article ADS CAS PubMed Google Scholar
Wiederstein, M. & Sippl, M. J. ProSA-web: Interactive web service for the recognition of errors in three-dimensional structures of proteins. Nucl. Acids Res. 35(suppl_2), W407-10 (2007).
Article PubMed PubMed Central Google Scholar
Tian, W., Chen, C., Lei, X., Zhao, J. & Liang, J. CASTp 3.0: Computed atlas of surface topography of proteins. Nucl. Acids Res. 46(W1), W363–W367 (2018).
Article CAS PubMed PubMed Central Google Scholar
Rigsby, R. E. & Parker, A. B. Using the P y MOL application to reinforce visual understanding of protein structure. Biochem. Mol. Biol. Educ. 44(5), 433–437 (2016).
Article CAS PubMed Google Scholar
Szklarczyk, D. et al. STRING v11: Protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucl. Acids Res. 47(D1), D607–D613 (2019).
Article CAS PubMed Google Scholar
Kim, S. et al. PubChem substance and compound databases. Nucl. Acids Res. 44(D1), D1202–D1213 (2016).
Article MathSciNet CAS PubMed Google Scholar
Trott, O. & Olson, A. J. AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J. Comput. Chem. 31(2), 455–461 (2010).
Article CAS PubMed PubMed Central Google Scholar
Patel, H., Grüning, B. A., Günther, S. & Merfort, I. PyWATER: a PyMOL plug-in to find conserved water molecules in proteins by clustering. Bioinformatics 30(20), 2978–2980 (2014).
Article CAS PubMed Google Scholar
Pettersen, E. F. et al. UCSF Chimera—A visualization system for exploratory research and analysis. J. Comput. Chem. 25(13), 1605–1612 (2004).
Article CAS PubMed Google Scholar
López-Blanco, J. R., Aliaga, J. I., Quintana-Ortí, E. S. & Chacón, P. iMODS: Internal coordinates normal mode analysis server. Nucl. Acids Res. 42(W1), W271–W276 (2014).
Article PubMed PubMed Central Google Scholar
Awan, F. M., Obaid, A., Ikram, A. & Janjua, H. A. Mutation-structure-function relationship based integrated strategy reveals the potential impact of deleterious missense mutations in autophagy related proteins on hepatocellular carcinoma (HCC): A comprehensive informatics approach. Int. J. Mol. Sci. 18(1), 139 (2017).
Article PubMed PubMed Central Google Scholar
Kyte, J. & Doolittle, R. F. A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157(1), 105–132 (1982).
Article CAS PubMed Google Scholar
Guruprasad, K., Reddy, B. B. & Pandit, M. W. Correlation between stability of a protein and its dipeptide composition: A novel approach for predicting in vivo stability of a protein from its primary sequence. Protein Eng. Des. Sel. 4(2), 155–161 (1990).
Article CAS Google Scholar
Kontermann, R. E. Half-life extended biotherapeutics. Expert opinion on biological therapy 16(7), 903–915 (2016).
Article CAS PubMed Google Scholar
Herzog, B., Schultheiss, A. & Giesinger, J. On the validity of Beer–Lambert law and its significance for sunscreens. Photochem. Photobiol. 94(2), 384–389 (2018).
Article CAS PubMed Google Scholar
Huang, G.-H. & Xu, W.-B. Recent advance in new types of human adenovirus. Bing du xue bao = Chin. J. Virol. 29(3), 342–348 (2013).
CAS Google Scholar
Mahmud, S., Ahmed, M. S., Mia, M., Islam, S. & Rahman, T. Homology modelling, bioinformatics analysis and Insilico functional annotation of an antitoxin protein from Streptomyces coelicolor A3 (2). J. Proteom. Comput. Biol. 2(1), 1–7 (2015).
Article Google Scholar
Vahisalu, T. et al. SLAC1 is required for plant guard cell S-type anion channel function in stomatal signalling. Nature 452(7186), 487–491 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Golden, E. et al. The oncogene AAMDC links PI3K-AKT-mTOR signaling with metabolic reprograming in estrogen receptor-positive breast cancer. Nat. Commun. 12(1), 1920 (2021).
Article ADS PubMed PubMed Central Google Scholar
Xiao, R. et al. Adipogenesis associated Mth938 ___domain containing (AAMDC) protein expression is regulated by alternative polyadenylation and microRNAs. FEBS Lett. 593(14), 1724–1734 (2019).
Article CAS PubMed Google Scholar
Ma, X. et al. LOC66273 isoform 2, a novel protein highly expressed in white adipose tissue, induces adipogenesis in 3T3-L1 cells. J. Nutr. 142(3), 448–455 (2012).
Article CAS PubMed Google Scholar
Das, K. et al. Structure note X-ray crystal structure of MTH938 from methanobacterium thermoautotrophicum at 2.2 Å resolution reveals a novel tertiary protein fold. Proteins Struct. Funct. Genet. 45, 486–8 (2001).
Article CAS PubMed Google Scholar
Kwek, S. S. et al. Co-amplified genes at 8p12 and 11q13 in breast tumors cooperate with two major pathways in oncogenesis. Oncogene 28(17), 1892–1903 (2009).
Article CAS PubMed PubMed Central Google Scholar
Zhang, B., Li, J. & Lü, Q. Prediction of 8-state protein secondary structures by a novel deep learning architecture. BMC Bioinform. 19(1), 1–13 (2018).
Article Google Scholar
Bertoni, M., Kiefer, F., Biasini, M., Bordoli, L. & Schwede, T. Modeling protein quaternary structure of homo-and hetero-oligomers beyond binary interactions by homology. Sci. Rep. 7(1), 10480 (2017).
Article ADS PubMed PubMed Central Google Scholar
Hooda, V., babu Gundala, P. & Chinthala, P. Sequence analysis and homology modeling of peroxidase from Medicago sativa. Bioinformation 8(20), 974 (2012).
Article PubMed PubMed Central Google Scholar
Messaoudi, A., Belguith, H. & Hamida, J. B. Three-dimensional structure of Arabidopsis thaliana lipase predicted by homology modeling method. Evol. Bioinform. 7, EBO. S7122 (2011).
Article Google Scholar
Lee, B. & Richards, F. M. The interpretation of protein structures: Estimation of static accessibility. J. Mol. Biol. 55(3), 379 (1971).
Article CAS PubMed Google Scholar
Connolly, M. L. Solvent-accessible surfaces of proteins and nucleic acids. Science 221(4612), 709–713 (1983).
Article ADS CAS PubMed Google Scholar
Peng, X., Wang, J., Peng, W., Wu, F.-X. & Pan, Y. Protein–protein interactions: Detection, reliability assessment and applications. Brief. Bioinform. 18(5), 798–819 (2017).
CAS PubMed Google Scholar
Morris, G. M. & Lim-Wilby, M. Molecular docking, molecular modeling of proteins 365–382 (Humana Press, 2008).
Book Google Scholar
Simon, L., Imane, A., Srinivasan, K., Pathak, L. & Daoud, I. In silico drug-designing studies on flavanoids as anticolon cancer agents: Pharmacophore mapping, molecular docking, and Monte Carlo method-based QSAR modeling. Interdiscip. Sci. Comput. Life Sci. 9, 445–458 (2017).
Article CAS Google Scholar
Kovacs, J. A., Chacón, P. & Abagyan, R. Predictions of protein flexibility: First-order measures. Proteins Struct. Funct. Bioinform. 56(4), 661–668 (2004).
Article CAS Google Scholar

Download references

Acknowledgements

The authors would like to extend their sincere appreciation to the Researchers Supporting Project, King Saud University, Riyadh, Saudi Arabia for funding this work through the project number (RSP2025R457).

Funding

This work is financially supported by the Researchers Supporting Project number (RSP2025R457), King Saud University, Riyadh, Saudi Arabia.

Author information

Authors and Affiliations

Department of Epidemiology and Public Health, Sylhet Agricultural University, Sylhet, 3100, Bangladesh
Tajul Islam Mamun
Laboratory of Biotechnology and Natural Resources Valorization, Faculty of Sciences, Ibn Zohr University, 80060, Agadir, Morocco
Mohammed Bourhia
Department of Pharmacology and Toxicology, Sylhet Agricultural University, Sylhet, 3100, Bangladesh
Taufiq Neoaj
Department of Pharmacy, Faculty of Allied Health Sciences, Daffodil International University, Birulia, Ashulia, Dhaka, 1216, Bangladesh
Shopnil Akash, Md. A. K. Azad & Md. Sarowar Hossain
Department of Pathology, Sylhet Agricultural University, Sylhet, 3100, Bangladesh
Md. Masudur Rahman
Department of Pharmaceutics, College of Pharmacy, King Saud University, P.O. Box 11451, Riyadh, Saudi Arabia
Yousef A. Bin Jardan
Laboratory of Therapeutic and Organic Chemistry, Faculty of Pharmacy, University of Montpellier, 34000, Montpellier, France
Samir Ibenmoussa
Department of Biology, Bahir Dar University, P.O. Box 79, Bahir Dar, Ethiopia
Baye Sitotaw
Faculty of Pharmaceutical Science, Assam Down Town University, Guwahati, Assam, India
Md. Sarowar Hossain

Authors

Tajul Islam Mamun
View author publications
Search author on:PubMed Google Scholar
Mohammed Bourhia
View author publications
Search author on:PubMed Google Scholar
Taufiq Neoaj
View author publications
Search author on:PubMed Google Scholar
Shopnil Akash
View author publications
Search author on:PubMed Google Scholar
Md. A. K. Azad
View author publications
Search author on:PubMed Google Scholar
Md. Sarowar Hossain
View author publications
Search author on:PubMed Google Scholar
Md. Masudur Rahman
View author publications
Search author on:PubMed Google Scholar
Yousef A. Bin Jardan
View author publications
Search author on:PubMed Google Scholar
Samir Ibenmoussa
View author publications
Search author on:PubMed Google Scholar
Baye Sitotaw
View author publications
Search author on:PubMed Google Scholar

Contributions

Conceptualization: T.I.M., S.A. and T.N., Data curation: T.I.N. and A.K., Formal analysis: M.S.H. and M.R., Investigation: T.I.N., S.A. and A.K., Methodology: T.I.N., S.A., M.B. and A.K., Project administration: M.S.H., and M.R., Resources: S.A. and A.K., Software: S.A., and M.R., Supervision: S.A., Y.A.B.J. and M.R., Validation: T.N., A.K. and M.S.H., Visualization: T.I.N. and S.A., Writing—original draft—Preparation: T.I.N., S.A. and T.N., Writing—review and editing: M.S.H., S.I., B.S. and M.R., All authors have read and approved the final version of the manuscript.

Corresponding authors

Correspondence to Mohammed Bourhia or Baye Sitotaw.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The original online version of this Article was revised: The original version of this Article contained an error in the grant number listed the Acknowledgements and in the Funding section. Full information regarding the corrections made can be found in the correction for this Article.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Mamun, T.I., Bourhia, M., Neoaj, T. et al. Structure based functional identification of an uncharacterized protein from Coxiella burnetii involved in adipogenesis. Sci Rep 14, 16789 (2024). https://doi.org/10.1038/s41598-024-66072-3

Download citation

Received: 10 November 2023
Accepted: 26 June 2024
Published: 22 July 2024
DOI: https://doi.org/10.1038/s41598-024-66072-3

Subjects

Abstract

Similar content being viewed by others

Quantitative proteomics identifies the core proteome of exosomes with syntenin-1 as the highest abundant protein and a putative universal biomarker

Whole-genome sequencing analysis of the cardiometabolic proteome

Multiplatform tear proteomic profiling reveals novel non-invasive biomarkers for diabetic retinopathy

Introduction

Methods

Sequence extraction from Coxiella burnetii

Analysis of physicochemical properties

Protein subcellular localization

Function prediction by ___domain analysis

Multiple sequence alignment and phylogeny analysis

Secondary structure prediction

Predicting protein tertiary structure and evaluating its quality

Active site determination

Protein–protein interaction

Molecular docking analysis and dynamic simulation

Result and discussion

Protein sequence retrieval and similarity analysis

Analysis of physicochemical properties

Estimating the subcellular ___location of protein

Protein ___domain analysis for predicting biological function

Alignment of sequences and analysis of phylogenetic relationships

Prediction of protein secondary structure

Predicting tertiary structures and assessing quality

Determination of the active site

Protein–protein interaction analysis

Molecular docking and dynamic simulation

Conclusion

Limitations

Data availability

Change history

10 September 2024

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Quick links