Introduction

AP2/ERF (APETALA2/ETHYLENE RESPONSIVE FACTOR) is a widely present superfamily of transcription factors in plants. A significant structural feature of this family is the presence of an AP2 ___domain, which is a DNA binding ___domain composed of 60–70 amino acids1. Cytokinin Response Factors (CRFs) genes belong to the ERF-type members of the AP2/ERF transcription factor superfamily2. The CRF gene sequence consists of a single AP2/ERF transcription factor DNA-binding ___domain, a highly conserved CRF ___domain, and a variable C-terminal region. The latter is involved in protein-protein interactions, distinguishing CRFs from other AP2/ERF members3,4. The CRF genes are a branch of the two-component signaling system. In this system, cytokinins bind to self-phosphorylating histidine kinase receptors, which then transfer the phosphate group to histidine phosphotransfer proteins (HPT). The HPTs subsequently transfer the phosphate group to transcription factors, specifically the type-B response regulators (RRB)5. Currently, CRF genes have been found in mosses, lycophytes, ferns, and most flowering plants4.

CRF genes are widely involved in plant growth and development, hormone signaling, and responses to biotic and abiotic stresses6,7. In Arabidopsis, approximately 25% of the F2 progeny with knockouts of the CRF5 and CRF6 genes exhibit impaired embryo development8. Research on cork oak has found that CRF3 plays a role in embryo development, suggesting that CRF genes may be involved in seed embryo development in flowering plants9. Both AtCRF3 and AtCRF4 can enhance Arabidopsis tolerance to low-temperature stress10. Both AtCRF1 and AtCRF2 are associated with salt stress. Mutants of Arabidopsis lacking the functions of crf1 and crf2 exhibit higher photosynthetic efficiency than the wild type after salt treatment11. In Solanum lycopersicum, four genes, SlCRF1, SlCRF2, SlCRF3, and SlCRF5, respond to various abiotic stresses, such as low temperature, drought, and high temperature. Additionally, SlCRF5 is involved in the development of tomato roots, leaves, and flowers12,13,14. In Arabidopsis, plants overexpressing the CRF2 gene exhibit significantly increased levels of gene expression associated with disease resistance and demonstrate stronger resistance to pathogens compared to wild-type Arabidopsis15. In addition, the CRF gene interacts with auxin-transporting PIN proteins and plays a more direct role in regulating auxin distribution throughout the plant8[8]. And CRF4 in Arabidopsis was shown to be the earliest nitrogen-responsive transcription factor16.

Rice is one of the major staple food crops, with half of the world’s population consuming it. Previous studies have identified and analyzed the AP2/ERF transcription factor family in rice, but research on CRF genes has not been conducted17. And the CRF gene response to low temperature in rice is not clear. Moreover, rice has widely different genetic backgrounds depending on the classification of subspecies, so revealing the distribution and variation of the CRF genes in different rice subspecies is important for understanding its functional evolution. Therefore, three cultivated rice varieties (Oryza sativa Japonica Group (Os), Oryza sativa Indica Group (Oi), Oryza sativa circum-Aus1 var. N22 (Osn)) and one wild rice variety (Oryza rufipogon, Or) were selected in this study for CRF gene family Bioinformatics analysis of the CRF gene family was performed in Oryza sativa (circum-Aus1 var. N22) and a wild rice variety (Oryza rufipogon) with the aim of exploring the characteristics of the CRF gene in rice. The main components of the analysis included the physicochemical properties of the proteins, the construction of phylogenetic trees, the conserved patterns of the genes, the gene structures, the chromosomal localisation of the genes, the duplication types of the genes, the cis-acting elements and the expression patterns. Finally, we further investigated the expression levels of OsCRF members in response to low temperature, as well as the effects of melatonin and cytokinin on gene expression levels under low temperature stress. The results of this study will help to gain insights into the characterisation of CRF genes in rice and their response to low temperature stress, and provide a reference for understanding their functions.

Results

Identification and physicochemical analysis of CRF family members in four rice species

We identified 157, 162, 157, and 171 AP2/ERF members in the genomes of Os, Oi, Osn, and Or rice species, respectively. Subsequently, we constructed neighbor-joining phylogenetic trees using the protein sequences of AP2/ERF members from these four rice species and CRF member protein sequences from 14 other species including A. thaliana, G. max, P. tomentosa, T. hispida, M. caespitosa, M. mohrii, O. punctata, P. dulcis, T. cacao, S. asiatica, A. rufa, H. trionum, V. angularis, and A. annua. This enabled the identification of potential CRF genes in the four rice species (Fig. S1). The results showed that 9, 30, 13, and 13 members potentially belong to CRF genes in the AP2/ERF members of Os, Oi, Osn, and Or rice species, respectively. Further, we conducted multiple alignments of the AP2/ERF members obtained from the four rice species with the protein sequences of Arabidopsis thaliana CRF members, identifying members belonging to the CRF-type transcription factors (Fig. S2). The results indicated that 7, 6, 6, and 7 AP2/ERF members were determined to be CRF genes in Os, Oi, Osn, and Or rice species, respectively. These members were renamed based on their chromosomal positions in the respective genomes, as OsCRF1 to OsCRF6 (a pair of homologous genes renamed as OsCRF2a/2b), OiCRF1 to OiCRF6, OsnCRF1 to OsnCRF6, and OrCRF1 to OrCRF7 (Table S1). Further analysis of the physicochemical properties of the CRF members in the four rice species revealed that the protein lengths ranged from 241 amino acids (aa) (OsCRF4) to 382 aa (OsnCRF2), and the protein molecular weights ranged from 25487.02 Dalton (Da) (OsCRF4) to 40869.68 Da (OsnCRF2). The isoelectric points of 20 members were less than 7, while 6 members had isoelectric points greater than 7. The instability index of the proteins ranged from 43.99 (OiCRF1) to 90.17 (OsCRF5), and the aliphatic index ranged from 56.62 (OiCRF3) to 75.65 (OsnCRF6), indicating hydrophilic proteins. Subcellular localization analysis revealed that 21 members were localized in the nucleus, 3 members in the chloroplast, and 1 member in the cytoplasm (Table S2). Due to the protein sequence of OsCRF1 not starting with the amino acid M (non-start codon), the subcellular distribution of the protein could not be predicted.

Phylogenetic analysis

We constructed Neighbor-Joining phylogenetic trees utilizing the protein sequences of CRF members from five species: Os, Oi, Osn, Or, and Arabidopsis thaliana (At). The phylogenetic analysis revealed four distinct groups, labeled as Group I through Group IV (Fig. 1). In Group I, three CRF members were identified from each of the Os, Oi, Osn, and Or species. Group II was exclusively composed of eight AtCRF members. Group III included 2, 2, and 3 CRF members from Os, Oi, and Or, respectively. Finally, Group IV comprised 2, 3, and 2 CRF members from Os, Osn, and Or, respectively, along with 4 AtCRF members. These findings suggest a high degree of conservation among CRF members across the different rice varieties.

Fig. 1
figure 1

Phylogenetic tree of CRF members from Os, Oi, Osn, Or, and Arabidopsis thaliana. Different colored regions represent different groups, and solid circles in species colors represent the corresponding species names.

Protein conserved motif and gene structure analysis

Based on the protein sequences of CRF members from Os, Oi, Osn, and Or rice varieties, we constructed a Neighbor-joining phylogenetic tree. The clustering results for each member were consistent with the clustering results in Fig. 1, and they were grouped accordingly (Fig. 2A). Using the MEME tool, we annotated 10 motifs in the protein sequences of CRF members from the four rice varieties. Each motif’s conserved sequence was identified using the NCBI-CDD database. It was found that Motif 1 and Motif 2 corresponded to the AP2 ___domain, while Motif 3 to Motif 10 were not identified as any ___domain (Fig. S3). Except for the protein OiCRF2, all other members contained Motif 2 (Fig. 2B). The CRF member proteins in Group I were divided into two categories based on motif type and ___location. OiCRF2, OsCRF2a, OsCRF2b, OrCRF2, and OsnCRF2 proteins corresponded to nine types of motifs, namely Motif 10, Motif 3, Motif 6, Motif 2, Motif 1, Motif 4, Motif 9, Motif 7, and Motif 5. OiCRF3, OsnCRF3, OrCRF3, OsCRF3, OsnCRF4, OiCRF5, and OrCRF6 proteins mainly corresponded to five types of motifs, namely Motif 3, Motif 2, Motif 1, Motif 4, and Motif 8. The seven CRF member proteins in Group II mainly contained three types of motifs, namely Motif 3, Motif 2, and Motif 1. The seven CRF member proteins in Group IV mainly contained five types of motifs, namely Motif 3, Motif 2, Motif 1, Motif 5, and Motif 7.

The gene structure of CRF members exhibits a high degree of conservation (Fig. 2C). In Group I, except for the genes OiCRF2, OiCRF3, and OsCRF3, which contain 2 exons and 1 intron, the remaining 9 genes have only 1 exon and 0 introns. In Group II, the genes OiCRF1 and OiCRF6 have 2 exons and 1 intron, while the other 5 genes have only 1 exon and 0 introns. In Group III, all seven genes have only 1 exon and 0 introns. In summary, the conservation of protein motifs and gene structure suggests a high degree of conservation of CRF genes among different rice varieties during the evolutionary process.

Fig. 2
figure 2

Conservation motifs and gene structure of Os, Oi, Osn, Or, and CRF members. A: Neighbor-joining phylogenetic tree of Os, Oi, Osn, Or, and CRF members. B: Conserved protein motifs. C: Gene structure.

Gene chromosome ___location, collinearity, and selection pressure analysis

OsCRF members are distributed on chromosomes 1, 3, 7, and 9, with each chromosome containing 4, 1, 1, and 1 gene, respectively (Fig. 3A). OiCRF members are distributed on chromosomes 1, 5, and 6, with each chromosome containing 3, 1, and 2 genes, respectively (Fig. 3B). OsnCRF members are distributed on chromosomes 1, 6, 8, and 9, with each chromosome containing 3, 1, 1, and 1 gene, respectively (Fig. 3C). OrCRF members are distributed on chromosomes 1, 3, 5, 6, and 9, with each chromosome containing 3, 1, 1, 1, and 1 gene, respectively (Fig. 3D). The CRF members of the four rice species are all distributed in regions of relatively high gene density on the chromosomes. It is worth noting that all four rice species have a relatively high distribution on chromosome 1, each with three genes (OsCRF2a/2b is considered one gene), and these three genes are also relatively close in chromosomal distribution. Further analysis of chromosomal collinearity within the four rice species identified only one pair of segmental duplicated genes, OsCRF4/OsCRF5, in Oryza sativa.

We selected Os as the main reference species and identified homologous genes between Os and CRF members of Arabidopsis thaliana, Oi, Osn, and Or (Fig. 4). The results showed that Os shares one pair of homologous genes with Arabidopsis thaliana, which is OsCRF3/AtCRF6. Additionally, Os shares 2 (OsCRF2a/OiCRF2, OsCRF3/OiCRF3), 4 (OsCRF3/OsnCRF3, OsCRF1/OsnCRF1, OsCRF2a/OsnCRF2, and OsCRF6/OsnCRF6), and 6 (OsCRF1/OrCRF1, OsCRF2a/OrCRF2, OsCRF3/OrCRF3, OsCRF4/OrCRF4, OsCRF5/OrCRF4, and OsCRF6/OrCRF7) pairs of homologous genes with Oi, Osn, and Or, respectively. It is noteworthy that Os shows high homology and conservation in chromosomal distribution with the other three rice species, indicating low separation during evolution. The results with Arabidopsis thaliana suggest a greater degree of separation between dicots and monocots in terms of CRF genes, consistent with the results shown in Fig. 1.

We conducted Ka, Ks, and Ka/Ks value calculations for segment duplication and homologous gene pairs to explore the evolutionary pressures affecting CRF genes (Table 1). The results showed that for six pairs of homologous genes, including OsCRF3/OiCRF3, OsCRF1/OsnCRF1, OsCRF2a/OsnCRF2, OsCRF1/OrCRF1, OsCRF3/OrCRF3, and OsCRF4/OrCRF4, both Ka and Ks were 0, indicating no synonymous or non-synonymous mutations occurred, resulting in a Ka/Ks value of 0. For the pair OsCRF2a/OiCRF2, the Ka/Ks value was greater than 1, while for the remaining six pairs of homologous genes, the Ka/Ks value was less than 1. The combination of homologous gene pairs further validates the high conservation of CRF genes among rice varieties during the evolutionary process.

Fig. 3
figure 3

Chromosomal distribution of CRF members in Os, Oi, Osn, and Or. A: Oryza sativa Japonica Group. B: Oryza sativa Indica Group. C: Oryza sativa (circum-Aus1 var. N22). D: Oryza rufipogon. The left scale represents chromosome lengths, with “Chr” indicating chromosomes. Genetic distances were set at 200 kb to calculate the gene density of each chromosome, represented by a gradient color from blue (low gene density) to red (high gene density). Blank regions indicate genetic regions lacking gene distribution information. Red lines indicate segmental duplication gene pairs.

Fig. 4
figure 4

Distribution of CRF orthologous gene pairs among species. Each horizontal line represents a chromosome, with numbers indicating chromosome numbers. Green lines represent CRF orthologous gene pairs.

Table 1 CRF genes replication and homologous gene Ka/Ks values.

Os, Oi, Osn, and Or rice CRF members’ upstream 2000 bp promoter regions were collectively annotated with 86 cis-acting elements, of which 34 had confirmed functionalities (Table S3). The annotation revealed a plethora of basic elements such as CAAT-box and TATA-box, alongside various light-responsive elements including ACE, G-box, Box 4, and ATCT-motif. We focused on three functional cis-acting elements: Plant growth and development, Abiotic and biotic stresses, and Phytohormone responsive, for positional distribution display (Fig. 5, Table S4). Among them, the Plant growth and development-related elements comprised seven cis-acting elements: CAT-box, O2-site, NON-box, circadian, GCN4_motif, MBSI, and HD-Zip 1. Abiotic and biotic stresses-related elements included five cis-acting elements: ARE, MBS, GC-motif, LTR, and WUN-motif. Phytohormone responsive-related elements encompassed eight cis-acting elements: TGACG-motif, ABRE, CGTCA-motif, TCA-element, GARE-motif, TGA-element, P-box, and AuxRR-core. Notably, the conservation of cis-acting element types and distribution positions among different rice CRF members on the same branch of the phylogenetic tree was observed, such as OsnCRF4, OiCRF5, and OrCRF6 genes, OrCRF7, and OsnCRF7. In summary, the results of promoter cis-acting elements indicate the extensive involvement of CRF members in rice growth and development processes.

Fig. 5
figure 5

Annotation of cis-acting elements in the upstream 2000 bp promoter regions of Os, Oi, Osn, and Or CRF members. A: Neighbor-joining phylogenetic tree of CRF members from Os, Oi, Osn, and Or. B: Distribution of cis-acting elements. C: Names of cis-acting elements.

Analysis of the expression patterns of OsCRF members

We utilized heatmaps to illustrate the FPKM values obtained from root, stem, leaf, and leaf samples under low-temperature stress (Table S5). The results (Fig. 6A) showed that the FPKM values corresponding to OsCRF2b in roots, stems, and leaves were all less than 1, suggesting that this gene may not be expressed in rice tissues. OsCRF2a exhibited the highest FPKM value in leaves, indicating potentially high expression activity in leaves. OsCRF3 showed FPKM values greater than 1 in stems and leaves, while OsCRF6 showed FPKM values greater than 1 in roots and stems, and OsCRF5 displayed FPKM values greater than 1 in all three tissues, suggesting the involvement of these three genes in the growth and development processes of different rice tissues. It is noteworthy that both OsCRF1 and OsCRF4 exhibited high FPKM levels in all three tissues, with particularly high expression levels in leaves, suggesting that these two genes may mainly participate in leaf growth, development, and related functions. According to the results obtained under low-temperature stress (Fig. 6B), OsCRF1, OsCRF2a, OsCRF4, OsCRF5, and OsCRF6 all responded to low temperatures, showing a significant increase in FPKM. However, OsCRF2b and OsCRF3 showed no response to low-temperature stress. This result suggests that CRF genes may be involved in rice’s response to low-temperature stress.

Fig. 6
figure 6

Heatmaps illustrating FPKM values of OsCRF members. A: FPKM values of OsCRF members in roots, stems, and leaves. B: FPKM values of OsCRF members in leaves under normal temperature (CK) and low-temperature stress (4 °C). Green to yellow indicates an increase in FPKM values.

Fluorescence quantification analysis

To investigate whether OsCRF members occur in response to low-temperature stress (4 °C), we selected leaves of Longdao 18 variety of three-leaf stage plants for quantitative fluorescence analysis (Fig. 7). The results showed that the expression levels of two genes, OsCRF2b and OsCRF3, were less than 1 in all six time-point samples. five genes, OsCRF1, OsCRF2a, OsCRF4, OsCRF5, and OsCRF6, all showed significantly higher expression levels with the lapse of time of the low-temperature stress treatments, which was characterized by a significant temporal gradient of expression. In addition to this, we further explored whether melatonin and cytokinin treatments would affect the expression levels of OsCRF members in rice leaves after low-temperature stress treatment. The results showed that the expression levels of two genes, OsCRF2b and OsCRF3, varied little in the six time point samples and were less than 1. The expression levels of three genes, OsCRF4, OsCRF5 and OsCRF6, increased slightly in the six time point samples. the expression levels of OsCRF2a were significantly increased in all of the 48 h samples and increased slightly in all of the other time point samples. It is noteworthy that the expression level of OsCRF1 gene was greatly elevated after melatonin treatment corresponding to the four time points of 6, 12, 24 and 48 h. In summary, the expression level of CRF genes in rice under low temperature environment occurs a strong response and will be induced by melatonin and cytokines.

Fig. 7
figure 7

Time gradient fluorescence quantification of expression levels of OsCRF members. Red color indicates normal low temperature stress (4 °C) treated samples. Yellow indicates melatonin-treated post low temperature stress (4 °C) treated samples. Blue color indicates cytokinin-treated post low-temperature stress (4 °C) treated samples.

Target Gene Prediction Analysis

We obtained the binding motif of the CRF4 transcription factor from the JASPAR database and identified 23,301 target genes by searching the 2000 bp upstream promoter sequences of Os genes for CRF4 binding sites. These target genes include thirteen matching sequences: ACGCCGCC, CAGCCGCC, CCGCCGCC, CCGCCGTC, GAGCCGCC, GCGCCGAC, GCGCCGCA, GCGCCGCC, GCGCCGCG, GCGCCGGC, GCGCCGTC, GGGCCGCC, and CGCCGCC. Most of the target genes contain multiple matching sequences in their promoter sequences. (Fig. S4, Table S6). The results indicate that 16,284 target genes have clear structural domains, 7,936 target genes have annotated Gene Ontology (GO) functions, and 8,632 target genes are annotated to KEGG pathways (Fig. 8, Table S7). According to the GO enrichment analysis, in biological processes, most target genes are mainly enriched in various functions such as cellular process (GO:0009987), metabolic process (GO:0008152), and developmental process (GO:0032502). In molecular function, most target genes are primarily enriched in catalytic activity (GO:0003824), binding (GO:0005488), and signal transducer activity (GO:0004871). In cellular components, most target genes are mainly enriched in functions such as cell (GO:0005623), organelle (GO:0043226), and membrane (GO:0016020). The KEGG annotation results show that most target genes are mainly enriched in metabolism-related pathways. Target genes are enriched in various pathways related to growth, development, and stress responses, such as Carbon metabolism (259 target genes, ko01200), MAPK signaling pathway (179 target genes, ko04010), and Starch and sucrose metabolism (152 target genes, ko00500), among others.

Fig. 8
figure 8

Annotation of Target Genes with GO and KEGG. A: Annotation results of target genes with GO functions. B: Annotation results of target genes with KEGG pathways.

Subcellular localization analysis

The subcellular localization of rice CRF proteins was predicted (Table S4), multiple CRF proteins localised to the nucleus, chloroplasts and cytoplasm. Using tobacco leaves, OsCRF4-GFP and OsCRF5-GFP were selected for transient expression assays. Plasmids of OsCRF4-GFP and OsCRF5-GFP were transiently expressed together with the nuclear markers. As shown in Fig. 9, the fluorescence signals of OsCRF4-GFP and OsCRF5-GFP co-localised extensively with the nuclear markers, indicating that OsCRF4-GFP and OsCRF5-GFP were localised at the plasma membrane and endoplasmic reticulum.

Fig. 9
figure 9

Transient expression of OsCRF4-GFP and OsCRF5-GFP with nuclearbiomarker fusion proteins in tobacco epidermal cells.

Discussion

The AP2/ERF family, as one of the largest transcription factor families, has achieved significant research progress in various aspects such as growth and development, abiotic stress, and signal transduction. The CRF genes constitute a small fraction of the AP2/ERF transcription factor superfamily. Currently, CRF genes have been extensively studied in Arabidopsis thaliana and to a lesser extent in a few other plants such as Solanum lycopersicum14, Brassica rapa18, Glycine max19, Tamarix hispida20, and Quercus suber9. In this study, we conducted a preliminary screening of AP2/ERF members in four rice varieties, Os, Oi, Osn, and Or, and identified genes belonging to the CRF type transcription factors through multiple alignments with Arabidopsis CRF members.

We identified 7, 6, 6, and 7 CRF genes in the Os, Oi, Osn, and Or rice varieties, respectively. The number of CRF members varies across different plant species and does not correlate directly with genome size. For instance, Arabidopsis thaliana has 12 CRF genes, tomato has 11, and soybean has 26. Notably, there’s significant diversity in the number of CRF genes between diploid and polyploid varieties of the same species; for example, B. rapa has 21 CRF genes, while the allotetraploid B. napus has 44 21. The physicochemical properties of CRF proteins are similar among the four rice varieties, with most CRF proteins localized in the nucleus, consistent with the subcellular localization of CRF members in Solanum lycopersicum, Brassica rapa, and Brassica napus18. Phylogenetic analysis revealed distinct clustering of CRF members from the four rice varieties with significant divergence from Arabidopsis CRF members, unlike the significant conservation observed between B. napus and the 12 CRF genes of Arabidopsis. This suggests that there is a high degree of divergence in the CRF gene during the evolution of rice and Arabidopsis.

Gene function is closely associated with conserved motifs within protein sequences22. Based on the phylogenetic tree, it’s evident that CRF members from Group III and Group IV in Os, Oi, Osn, and Or have higher conservation of motifs. However, for the five CRF members in Group I (OiCRF2, OsCRF2a, OsCRF2b, OrCRF2, and OsnCRF2), corresponding to nine conserved motifs, this result aligns with the findings for CRF3 and CRF4 members in the phylogenetic tree of Brassica napus, where members of these two groups corresponded to nine motifs, while the other seven CRF members corresponded to CRF6 group members. Based on this, we infer that the degree of sequence variation in CRF genes in rice may not be as high as in dicotyledonous plants. Gene structure information can provide clues for the evolution of gene family members. The gene structure of CRF members in the four rice varieties is highly conserved, with 21 members having one exon and five members having two exons, consistent with the gene structure of corresponding CRF members in Brassica rapa and Brassica napus21.

The chromosomal ___location of a gene may influence its functional expression23. Chromosomal localization results indicate that CRF1, CRF2, and CRF3 genes in Os, Oi, Osn, and Or rice varieties are all distributed on chromosome 1, with relatively close positions, while the remaining genes are distributed on other chromosomes. Gene duplication is a major factor in the expansion and evolution of gene members, contributing to species adaptation to environmental changes and maintaining normal life processes24. We identified only one pair of segmental duplicate genes, OsCRF4/OsCRF5, in Os, indicating that gene duplication is not the main cause of CRF member expansion in rice. Inter-species collinearity results show that Os has 2, 3, and 6 pairs of homologous genes with Oi, Osn, and Or, respectively, with 2, 3, and 3 pairs of homologous genes on chromosome 1, indicating the high conservation of CRF members on chromosome 1 during the evolution of rice. It is noteworthy that Os and Or share 6 pairs of CRF homologous genes, indicating a close phylogenetic relationship between Or, originating from Asia, and Os. Similarly, the Ka and Ks values of the 6 pairs of homologous genes are all 0, indicating that some CRF member sequences among rice varieties are highly conserved, and the Ka/Ks values suggest that CRFs in rice have undergone strong purifying selection pressure during evolution, indicating similar functions.

Cis-acting elements in the promoter region regulate gene expression25. Annotation of promoter sequences reveals that CRF genes in Os, Oi, Osn, and Or rice varieties are involved in various functions such as growth, stress response, and hormone regulation. Investigating gene expression during tissue development and under adverse environmental conditions is important for understanding the molecular mechanisms of biological development26. Soybean CRF members are expressed in multiple tissues and respond to cold stress. For example, GmCRF15 and GmCRF25 are significantly expressed in seeds, GmCRF6 and GmCRF8 are expressed significantly in root hairs, and GmCRF3 and GmCRF15 are significantly expressed under cold stress19. CRF members in B. rapa show significant expression in nutritional assemblies and reproductive tissues. Some CRF genes in B. napus can enhance its tolerance to low phosphorus stress, such as the BnaCRF7. Similarly, FPKM values obtained from transcriptome data show that OsCRF members are expressed in roots, stems, and leaves, and also participate in resisting low-temperature stress. Fluorescence quantification showed that two genes, OsCRF2b and OsCRF3, might not be expressed in rice leaves. In contrast, five genes, OsCRF1, OsCRF2a, OsCRF4, OsCRF5, and OsCRF6, increased their expression levels over time under low-temperature stress in rice and were affected by melatonin and cytokinin to increase their expression levels. Taken together, CRF may have a function in rice to resist low-temperature stress.

Transcription factors regulate gene expression by binding to specific sequences in the promoter regions of target genes27. Numerous studies have identified target genes corresponding to AP2-type transcription factors. For instance, AgDREB1 and AgDREB2 may act as transcriptional activators by binding to corresponding DRE elements to enhance celery’s stress resistance28. Sm128 and Sm152 can interact with ERF to regulate the biosynthesis of tanshinones and phenolic acids29. AP2/ERF can interact with WRKY, bHLH, bZIP, MYB, NAC, and C2H2 to enhance plant resistance to cold stress30. However, there are no reports on the regulation of target genes by CRF-type transcription factors. In this study, we obtained the binding motif of the CRF4 transcription factor through the JASPAR database and identified a total of 23,301 target genes in Os, including 13 matching sequences. This indicates that CRF can bind to multiple sequences, demonstrating a broader range of target gene binding functions. Enrichment analysis of Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways revealed that target genes possess various protein functions related to stress resistance and are distributed across pathways associated with stress defense. This suggests that CRF genes regulate multiple pathways through the modulation of downstream genes, highlighting the importance of identifying and exploring these CRF-corresponding target genes in the rice genome.

Conclusions

Seven, six, six, and seven CRF genes were identified in the genomes of the four rice subspecies, respectively. These CRF members shared consistent features. Phylogenetic analysis classified these members into three distinct groups, revealing a high degree of conservation in protein motifs and gene structures within each group. Additionally, the amplification of CRF genes was not predominantly driven by segmental or tandem repeats, and there was significant divergence between CRF members in monocotyledons and dicotyledons. Promoter element analysis indicated that CRF genes contain cis-acting elements involved in growth, development, hormone regulation, and stress response in rice. OsCRF members were expressed in roots, stems, and leaves, responding to low-temperature stress with increased expression levels induced by melatonin and cytokinin. The nuclear localization of OsCRF4/5 was confirmed as predicted. These findings provide valuable insights into the characterization of rice CRF genes and their response mechanisms to low-temperature stress.

Materials and methods

Identification of AP2/ERF members and CRFs in four rice

Genome data for Oryza sativa Japonica Group (GCA_001433935.1, referred to as Os), Oryza sativa Indica Group (GCA_000004655.2, referred to as Oi), Oryza sativa (circum-Aus1 var. N22) (GCA_001952365.2, referred to as Osn), and Oryza rufipogon (GCA_000817225.1, referred to as Or) were obtained from the Ensembl Plants database (http://plants.ensembl.org/species.html) for the identification and analysis of CRF family members31. Using the Pfam database (http://pfam.xfam.org/), the hidden Markov model (HMM) of the AP2 ___domain (PF00847) was downloaded. The HMMER tool was then employed to search and align the protein sequences of the four rice genomes against the AP2 ___domain hidden Markov model, retaining protein sequences with an E-value of ≤-5 32,33. As the primary focus here is to study CRF-type members, no redundancy removal of sequences was conducted, and only screening of AP2/ERF members was performed.

We obtained the protein sequences of 12 Arabidopsis CRF members from the UniProt database. To ensure that the selected genes from the AP2/ERF members of the four rice species belong to the CRF-type transcription factors, we further retrieved protein sequences of CRF members from thirteen additional species, including Glycine max, Populus tomentosa, Tamarix hispida, Marshallia caespitosa, Marshallia mohrii, Oryza punctata, Prunus dulcis, Theobroma cacao, Striga asiatica, Actinidia rufa, Hibiscus trionum, Vigna angularis, and Artemisia annua, from the NCBI database (Table S8). Using the MEGA tool, multiple sequence alignment was performed with the MUSCLE algorithm for the AP2/ERF family members of the four rice species and the CRF member protein sequences of fourteen other species. Subsequently, the neighbor-joining method was employed to construct phylogenetic trees with 1,000 bootstrap replications, using the Poisson correction model and pairwise deletion. Next, potential CRF-type transcription factors selected from the AP2/ERF members of the four rice species were subjected to MUSCLE multiple sequence alignment analysis. Based on the structural ___domain features of CRF-type transcription factors, non-CRF-type members were removed. Finally, the physicochemical properties of the CRF members from the four rice species were analyzed using ExPASy (http://web.expasy.org/protparam/), and protein subcellular localization analysis was conducted using WoLF PSORT II (https://www.genscript.com/wolf-psort.html?src = leftbar).34,35.

Construction of phylogenetic trees

Using the MEGA X tool, multiple sequence alignment was performed with the MUSCLE algorithm for the CRF members of the four rice species and the AtCRF protein sequences of Arabidopsis. Subsequently, phylogenetic trees were constructed using the neighbor-joining method with 1,000 bootstrap replications, employing the Poisson correction model and pairwise deletion36. The generated phylogenetic trees were then visualized and refined using the Evolview tool37.

Conserved protein motifs and gene structures

The protein sequences of the CRF family members from the four rice species were annotated for conserved motifs using the MEME online tool (http://meme-suite.org/tools/meme). The number of motifs was set to 10, with other parameters kept at default values38. The exon and intron positions and quantities of the CRF family members were determined using the General Feature Format (GFF) annotation information from the four rice genomes. Finally, the evolutionary trees, motifs, and gene structures of the CRF family members from the four rice species were clustered and visualized using the TBtools software39.

Gene chromosomal localization, collinearity, and identification of homologous gene pairs

Using the TBtools software, the chromosomal distribution positions of CRF members in the four rice species were plotted. The MCScanX tool was employed to identify segmental duplications and tandem duplicate gene pairs among the CRF family members of the four rice species40. For interspecies collinearity analysis, Oryza sativa (Os) was selected as the reference, and homologous CRF genes were analyzed with Arabidopsis, Oryza indica (Oi), Oryza sativa N22 (Osn), and Oryza rufipogon (Or). Finally, the Ka (nonsynonymous substitution rate), Ks (synonymous substitution rate), and Ka/Ks values were calculated for all types of duplicated genes using the KaKs Calculator tool41.

Annotation of Cis-acting elements

Using the TBtools software, the upstream 2000 bp promoter sequences of CRF members from the four rice species were extracted. These sequences were then annotated for cis-acting elements using the PlantCARE online tool (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/)42. Functional categories such as abiotic and biotic stresses, phytohormone responsiveness, and plant growth and development were selected based on previous classifications. Subsequently, the distribution of cis-acting elements in the promoter regions of CRF members from the four rice species was plotted and analyzed using the TBtools software43.

Expression patterns of OsCRF members

We analyzed the differential tissue expression patterns of OsCRF members using root and leaf transcriptome data obtained from our sequencing efforts44,45, as well as stem transcriptome data from the NCBI SRA database (SRR26372771)46. Additionally, transcriptome data of leaves subjected to low-temperature stress (SRR22266757 and SRR22266758 at 4 °C, and SRR22266759 and SRR22266760 at normothermia (CK)) were retrieved from the NCBI SRA database47. These data were utilized to obtain FPKM (Fragments Per Kilobase of exon model per Million mapped fragments) values, allowing for the analysis of OsCRF member expression patterns under low-temperature stress.

Fluorescence quantification

We selected three-leaf stage plants of the Japonica rice variety Longdao 18 and implemented three treatment conditions: (1) exposure to 4 °C low-temperature stress for 0, 2, 6, 12, 24, and 48 h; and (2) separate treatments involving spraying 100µM/L melatonin and 50µM/L cytokinin solutions, followed by the same low-temperature stress intervals. Each treatment was replicated three times biologically, and the leaves were ultimately harvested and stored at -80 °C for subsequent analysis.

Three independent biological replicates, each consisting of three independent plants, were utilized for qRT-PCR detection. The qRT-PCR primers for the selected CRF genes were designed using Primer Premier 5 (Table S9). The fluorescence quantification procedure followed the previously described method48. The obtained cycle threshold (CT) values were quantitatively analysed by the 2-Ct method49.

Prediction of target genes

We obtained the transcription factor binding motif (MA0976.1) of CRF4 from the JASPAR Plantae database (https://jaspar.elixir.no/search?q=&collection=CORE&tax_group=plants)50. Subsequently, using TBtools, we extracted the 2000 bp promoter sequences of all Os genes and identified the genes bound by the CRF4 transcription factor motif using the Motif FIMO tool (https://meme-suite.org/meme/)36. Finally, target gene ___domain prediction was conducted based on the PFAM database, and KEGG (Kyoto Encyclopedia of Genes and Genomes) and GO (Gene Ontology) enrichment analyses of the target genes were performed using OmicShare Tools (https://www.omicshare.com/tools).

Subcellular localization of OsCRF4/5

To determine the subcellular localisation of CRF proteins, the full-length CDS of OsCRF4/5 was ligated to CaMV35S::GFP. and the CaMV35S::OsCRF4/5-GFP vector and nuclear marker were transformed into Agrobacterium EHA105 and co-transformed into N. benthamiana leaf blades. The fluorescence signals were observed by laser scanning confocal microscopy after 24 h of induction at 27 °C under dark conditions.