Abstract:
In order to find out soybean PPR genes and figure out the chromosomal distribution, the conserved domains, the subgrouping, the phylogenetic relationship and the expression characteristics of these genes, this study found 631 PPR genes through screening the soybean reference genome with the PPR model in the Pfam database. Clustering analysis was further performed to divide them into different subfamily with the softwares of MEME, ExPASy, TBtools and FigTree. The phylogenetic relationship and conserved domains of these subfamilies were also analyzed. We selected the representative genes from these subfamilies, and further evaluated their isoelectric points, UTR, CDS, and gene expression patterns. The results showed that, soybean PPR gene family could be divided into five subfamilies, DYW-, P-, PLS-, E/E+ and an unknown subfamily. Of them, DYW-subfamily was the biggest one, accounting for 57.2% of total soybean PPR genes. These subfamilies distributed unevenly in chromosomes, and their intron numbers varied greatly. DYW-subfamily contained fewer introns but contained more conserved motifs with two characteristical motifs of Motif7 and Motif4 locating in the C terminal. The representative genes of these subfamilies had a similar expression pattern with a high expression level in leaf but a low expression level in flower, root and stem. The UTR was missing in some members of these subfamilies. Glyma.19 G095500, Glyma.11 G086900, Glyma.02 G175900 and Glyma.01 G158100 were subgrouped into a new subgroup containing a unique motif of Motif8.