Complete Chloroplast Genomes of Chlorophytum comosum and Chlorophytum gallabatense: Genome Structures, Comparative and Phylogenetic Analysis

Paul Musili

The genus Chlorophytum includes many economically important species well-known for medicinal, ornamental, and horticultural values. However, to date, few molecular genomic resources have been reported for this genus. Therefore, there is limited knowledge of phylogenetic studies, and the available chloroplast (cp) genome of Chlorophytum (C. rhizopendulum) does not provide enough information on this genus. In this study, we present genomic resources for C. comosum and C. gallabatense, which had lengths of 154,248 and 154,154 base pairs (bp), respectively. They had a pair of inverted repeats (IRa and IRb) of 26,114 and 26,254 bp each in size, separating the large single-copy (LSC) region of 84,004 and 83,686 bp from the small single-copy (SSC) region of 18,016 and 17,960 bp in C. comosum and C. gallabatense, respectively. There were 112 distinct genes in each cp genome, which were comprised of 78 protein-coding genes, 30 tRNA genes, and four rRNA genes. The comparative analysis with fi...

plants Article Complete Chloroplast Genomes of Chlorophytum comosum and Chlorophytum gallabatense: Genome Structures, Comparative and Phylogenetic Analysis Jacinta N. Munyao 1,2,3,† , Xiang Dong 1,2,3,† , Jia-Xin Yang 1,2 , Elijah M. Mbandi 1,2,3 , Vincent O. Wanga 1,2,3 , Millicent A. Oulo 1,2,3 , Josphat K. Saina 1,2,3 , Paul M. Musili 4 and Guang-Wan Hu 1,2,3, * 1 2 3 4 * † CAS key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China; jacintandunge.jn@gmail.com (J.N.M.); directx0831@163.com (X.D.); yangjxgz@163.com (J.-X.Y.); mkala@wbgcas.cn (E.M.M.); vincentokelo@gmail.com (V.O.W.); millicentoulo@gmail.com (M.A.O.); jksaina@wbgcas.cn (J.K.S.) Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan 430074, China University of Chinese Academy of Sciences, Beijing 100049, China East Africa Herbarium, National Museums of Kenya, P.O. Box 45166 00100 Nairobi, Kenya; pmutuku@museums.or.ke Correspondence: guangwanhu@wbgcas.cn or guangwanhu@sohu.com These authors contributed equally to this work. Received: 3 January 2020; Accepted: 13 February 2020; Published: 1 March 2020 Abstract: The genus Chlorophytum includes many economically important species well-known for medicinal, ornamental, and horticultural values. However, to date, few molecular genomic resources have been reported for this genus. Therefore, there is limited knowledge of phylogenetic studies, and the available chloroplast (cp) genome of Chlorophytum (C. rhizopendulum) does not provide enough information on this genus. In this study, we present genomic resources for C. comosum and C. gallabatense, which had lengths of 154,248 and 154,154 base pairs (bp), respectively. They had a pair of inverted repeats (IRa and IRb) of 26,114 and 26,254 bp each in size, separating the large single-copy (LSC) region of 84,004 and 83,686 bp from the small single-copy (SSC) region of 18,016 and 17,960 bp in C. comosum and C. gallabatense, respectively. There were 112 distinct genes in each cp genome, which were comprised of 78 protein-coding genes, 30 tRNA genes, and four rRNA genes. The comparative analysis with five other selected species displayed a generally high level of sequence resemblance in structural organization, gene content, and arrangement. Additionally, the phylogenetic analysis confirmed the previous phylogeny and produced a phylogenetic tree with similar topology. It showed that the Chlorophytum species (C. comosum, C. gallabatense and C. rhizopendulum) were clustered together in the same clade with a closer relationship than other plants to the Anthericum ramosum. This research, therefore, presents valuable records for further molecular evolutionary and phylogenetic studies which help to fill the gap in genomic resources and resolve the taxonomic complexes of the genus. Keywords: C. comosum; C. gallabatense; chloroplast genome; phylogenetic analysis 1. Introduction Chlorophytum Ker-Gawl. is a large and taxonomically complex genus [1], distributed in the Old World tropics, particularly, in Africa (possibly the center of origin), Australia, Asia, Madagascar, and India [2,3], that comprises about 215 species, six subspecies, and 14 varieties [4,5] in the subfamily Agavoideae in the Asparagaceae family [6]. They are perennial rhizomatous herbs, Plants 2020, 9, 296; doi:10.3390/plants9030296 www.mdpi.com/journal/plants Plants 2020, 9, 296 2 of 18 commonly known as spider plants, based on the mosaic form of C. comosum leaves [7]. The genus is well-known for its medicinal characteristics [8]. Therefore, it has attracted the attention of scientists due to pharmacologically relevant steroidal and triterpenoidal saponin contents in their roots [9]. Other pharmacological activities such as aphrodisiac, anticancer, larvicidal, and immunomodulatory were also mentioned by Deore et al., 2008, and Khanam et al., 2013, [10,11]. C. comosum is used to treat burns, bronchitis, fractures [12], and also has antitumor activity [13]. There are several well-documented studies on phytochemistry and the medicinal properties of different species in this genus [14]. The leaves of C. borivilianum and C. breviscapum are used as a vegetable [15]. Additionally, some of the species are cultivated for their ornamental values such as C. comosum and C. filipendulum subsp. amaniense, and are also effective in controlling soil erosion [1]. The increased economic values of the Chlorophytum species have contributed to unregulated harvesting and habitat degradation. In addition, poor germination and regeneration rates have resulted in the scarcity of these plants. This has culminated in some species being listed as vulnerable, threatened, or critically endangered. C. rhizopendulum is one of the critically endangered species according to the International Union for Conservation of Nature (IUCN) Red List (www.iucnredlist.org). Moreover, this genus is experiencing difficulties in classification, and different taxonomists have misidentified the species when using morphological features [16]. C. comosum and C. gallabatense were reported to have these delimitation problems [17]. This problem of species delimitation has not been solved, and it is highly desirable to find other methods to authenticate the species of Chlorophytum. Phylogenetic analysis based on the shared protein-coding genes or complete (cp) genomes could solve these problems; however, the Chlorophytum’s complete (cp) genomic data is very insufficient. To date, Chlorophytum genomic data is underrepresented in literature, only one complete chloroplast genome (C. rhizopendulum) has been published in this genus [18], regardless of the chloroplast being illustrated as important for plant genetic molecular studies [19]. Consequently, this limits further phylogenetic analysis of this genus. Chloroplasts are important organelles in plants, where carbon fixation, photosynthesis, and pigment synthesis are carried out [20,21], and therefore they are the main plastid for green plants [22]. The cp is an independent genome (chloroplast DNA) with a preserved, circular structure and a low molecular weight [23]. It has a typical quadripartite structure, consisting of a pair of inverted repeat regions (IRs) of equal size, separating a large single copy (LSC) region from a small single copy (SSC) region [24]. The plastid size ranges from 107 kb to 218 kb [25,26]. It contains approximately 110 to 130 genes encoding ribosomal RNA (rRNA) genes, protein-coding genes (PCGs), and transfer RNAs genes (tRNAs) [27]. Chloroplast genomes are easier and efficient to use for evolutionary studies because of their self-replication and slow evolutionary process that is relatively independent [28,29]. Recently, complete (cp) genome sequencing and assembly have improved, becoming less costly and simpler [20,30], as the next-generation sequence technology has evolved rapidly [31]. Studying the structure and sequence information of the cp genome gives researchers very important information for species conservation, construction of phylogeny [28,32], understanding genomic evolution, and solving relationship issues at various taxonomic levels [28,33]. In addition, it provides information on the genetic structure, present genes, gene order, and nucleotide alignment according to Gao et al., 2009 [30]. This is possible because the chloroplast genome is basic, stable, and conservative [34]. Currently, there is little data available on the genetic structures of C. comosum and C. gallabatense, especially on their detailed phylogenetic placement or chloroplast genomes. Only one nucleotide sequence of Chlorophytum (C. rhizopendulum) has been published (GenBank no. KX931454). In this study, we sequenced the complete cp genome sequences of C. comosum and C. gallabatense for the first time. We aimed to elucidate the structural features in the cp genomes, codon usage, RNA editing, simple sequence repeat (SSR), highly divergent regions, and phylogenetic analyses of C. comosum and C. gallabatense, as well as compare them with five Asparagales species (four Asparagaceae and one Amaryllidaceae species). Our findings provide a basic genetic tool for species identification, evolution, genetic engineering, population genetics, and phylogenetic studies of the species in genus Plants 2020, 9, 296 3 of 18 Chlorophytum. Moreover, our results help to fill the gap in genomic resources and resolve the taxonomic complexes of the genus. 2. Results and Discussion 2.1. C. comosum and C. gallabatense Chloroplast Genome Features The cp genomes of C. comosum and C. gallabatense are 154,248 bp and 154,154 bp in size, respectively. The two sequences contained four regions; the LSC region of 84,004 bp and 83,686 bp length, the SSC regions of 18,016 bp and 17,960 bp length, and a pair of inverted repeat regions (IRa and IRb) of 26,114 bp and 26,254 bp each in length, in C. comosum and C. gallabatense, respectively (Figure 1 and Table 1). The overall guanine-cytosine (GC) ratio was 37.3% in the two cp genomes (Table 1). Both C. comosum and C. gallabatense cp genome features were similar to other reported species in the Asparagaceae family with respect to gene content, order, and orientation [35]. We detected a total of 112 unique genes in C. comosum and C. gallabatense cp genomes, which included 78 PCGs, four rRNA, and 30 tRNA genes (Table 1). Amongst them, 19 genes are replicated in the IR regions, comprising seven PCGs (ndhB, ycf1, ycf2, rpl2, rps7, rpl23, and rps12), four rRNA genes (rrn23, rrn4.5, rrn5, and rrn16), and eight tRNA genes (trnA-UGC, trnH-GUG, trnI-GAU, trnI-CAU, trnL-CAA, trnN-GUU, trnV-GAC, and trnR-ACG) (Table 2). There are fifteen intron-containing genes, of which thirteen have one intron (ndhA, atpF, ndhB, rpoC1, rpl2, rps16, rps12, trnV-UAC, trnl-GAU, trnL-UAA, trnK-UUU, trnG-UCC, and trnA-UGC), whereas two genes (ycf3 and clpP) contained two introns. In total, nine of these genes are PCGs and six are tRNA genes (Table 2). This result is similar with previous report by Sheng et al., 2017 [35]. The rps12 gene is unequally distributed (trans-splicing gene), with the 5′ exon positioned in the LCS part, and the 3′ exons are positioned in the IRa/IRb regions. A similar observation in cp genomes of other species has been previously documented [36,37]. The presence of introns in the plant plastomes can lead to this expression of exogenous genes at precise locations and times [38]. Moreover, two pseudogenes (rps2 and infA) were positioned in the LCS region. They are known as pseudogenes due to the reverse repetitive nature of the IR regions, and therefore not wholly duplicated, losing the capacity to encode whole proteins [39]. There were no significant rearrangements in gene order or inversions observed in comparisons of the two cp genomes of Chlorophytum with those of other selected genome sequences. Gene loss has been reported in some of the Asparagales species [40,41]. McKain et al., 2016 [18], confirmed that rps19 was either pseudogenized in various positions or absent. Comparative analysis of seven species of Asparagales showed that the rps19 gene was pseudogenized in all the species apart from the species of Chlorophytum, where rps19 was missing as reported by McKain et al., 2016 [18]. 2.2. Codon Usage Analysis Codons are useful for the transmission of genetic information used in the evolution of genomes because they connect proteins and nucleic acids [42]. Codon usage is the use of similar codons with which an organism encodes the amino acids in the genes of their proteins [43]. This codon usage, however, has been shown to vary across different species [44] due to different factors such as codon hydrophilicity, tRNA abundance, gene length, gene expression rate, and base group composition [45]. Codon usage bias signifies that each gene of a species has its own preferred amino acid codon [46]. In the two species, the similarity was found in the codon usage and amino acid frequencies. The PCGs contained 20,479 and 20,453 codons in C. comosum and C. gallabatense cp genomes, respectively (Table S1). A comparison was made with other selected species of Asparagales, and codon encoded the genes ranging from 19,986 (Asparagus officinalis) to 20,532 (A. ramosum), as shown in (Table S1). In addition, leucine with a percentage of 10.41% (2,132) and 10.43% (2,133), was the most frequent amino acid in the cp genomes, followed by isoleucine with 8.77% (1797) and 8.75% (1789) in C. comosum and C. gallabatense, respectively. Cysteine, however, encoded the least frequently used amino acids Plants 2020, 9, 296 4 of 18 with only 1.11% (228) and 1.12% (230) in C. comosum and C. gallabatense, respectively (Figure 2 and Table S1). Cysteine are found less frequently due to their high sensitivity to changes in physiological and environmental conditions [47]. This was also similar to the rest of the Asparagaceae family (Table S1). If the relative value of synonymous codon usage (RSCU) is equal to one, the codon usage is not preferred (not biased), but highly preferred if the value is greater than one [36], indicating that the codon is used more often than expected [48] and less preferred with values of less than one. Almost all of the amino acid codons in C. comosum and C. gallabatense, cp genomes have preferences, which is due to the amino acid activity preventing error all through the transcription process, other than tryptophan (UGG) and methionine (AUG), which did not have a bias (RSCU values equal to one) (Table S1). The results, moreover, showed that all preferred synonymous codons (RSCU > 1) have an A or U position at the third codon, except for UUG (Table S1). In the cp genome, natural selection and direction of variation play a very significant role in influencing codon usage bias [49]. This analysis provides the required amino acids for protein biosynthesis in the cp genomes of C. comosum and C. gallabatense. These results are similar to cp genome reports of other terrestrial plants [50,51]. Figure 1. Gene map of the C. comosum and C. gallabatense complete cp genomes. Genes drawn within the circle are transcribed in the clockwise direction, and genes drawn out are transcribed in the counterclockwise direction. Genes are color-filled, basing on different functions. Inverted repeat (IR), small single-copy (SSC), and large single-copy (LSC) regions are indicated. Plants 2020, 9, 296 5 of 18 Table 1. Features of the chloroplast genomes of C. comosum and C. gallabatense and the related Asparagaceae species. Genome Features Chlorophytum comosum Chlorophytum gallabatense Allium victorialis Asparagus officinalis Anemarrhena asphodeloides Anthericum ramosum Chlorophytum rhizopendulum Size (bp) LSC (bp) IR (bp) SSC (bp) Total genes PCGs tRNA genes rRNA genes Adenine (A) Thymine (T) Guanine (G) Cytosine (C) GC content 154,248 84,004 52,228 18,016 112 78 30 4 31.0% 31.7% 18.3% 19.0% 37.3% 154,154 83,686 52,508 17,960 112 78 30 4 31.0% 31.7% 18.3% 19.0% 37.3% 154,074 83,169 53,052 17,853 116 82 30 4 31.2% 31.8% 18.2% 18.8% 37.0% 156,699 84,999 53,062 18,638 112 78 30 4 30.9% 31.5% 18.5% 19.1% 37.6% 156,917 85,027 53,320 18,570 112 78 30 4 30.8% 31.4% 18.6% 19.3% 37.8% 155,812 84,817 52,756 18,239 114 80 30 4 31.0% 31.7% 18.3% 19.0% 37.3% 153,504 93,446 42,102 17,956 112 78 30 4 31.0% 31.7% 18.3% 19.0% 37.3% Table 2. Genes present in C. comosum and C. gallabatense chloroplast genomes. Category Group of Genes Name of Genes Self-replication Large subunit ribosomal proteins rpl2 a,b , rpl14, rpl16, rpl20, rpl22, rpl23 a , rpl32, rpl33, rpl36 Small subunit ribosomal proteins ψrps2, rps3, rps4, rps7 a , rps8, rps11, rps12 a,b , rps14, rps15, rps16 b , rps18 DNA-dependent RNA polymerase rpoA, rpoB, rpoC1 b , rpoC2 rRNA genes rrn4.5, rrn5, rrn16, rrn23 tRNA genes trnl-CAU a , trnL-CAA a , trnV-GAC a , trnl-GAU a,b , trnA-UGC a,b , trnR-ACG a , trnN-GUU a , trnL-UAG, trnP-UGG, trnW-CCA, trnQ-UUG, trnS-GCU, trnR-UCU, trnC-GCA, trnD-GUC, trnY-GUA, trnE-UUC, trnT-GGU, trnS-UGA, trnG-UCC b , trnG-GCC, trnfM-CAU, trnS-GGA, trnH-GUG a , trnT-UGU, trnL-UAA b , trnF-GAA, trnV-UAC b , trnM-CAU, trnK-UUU b photosynthesis Other genes unknown Photosystem I psaA, psaB, psaC, psal, psaJ Photosystem II psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbl, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ Cytochrome b/f complex petA, petB, petD, petG, petL, petN ATP synthase atpA, atpB, atpE, atpF b , atpH, atpl RuBisCO rbcL Subunits of NADH-dehydrogenase ndhA b , ndhB a,b , ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhl, ndhJ, ndhK Maturase MatK Proteolysis clpP c Translation initiation factor ψinfA Carbon metabolism cemA Fatty acid synthesis accD Cytochrome c synthesis gene ccsA Ycf1 a , ycf2 a , ycf3 c , ycf4 Conserved open reading frames a, two gene copies in IRs; b , genes with one intron; c , genes with two introns. Plants 2020, 9, 296 6 of 18 Figure 2. Amino acid proportion in C. comosum and C. gallabatense protein-coding sequences. 2.3. RNA Editing Sites RNA editing is a common molecular process in the cp genomes of plants [52]. It has different functions that can change a transcribed RNA’s protein-coding sequence by altering, deleting, or inserting nucleotides during transcription [53]. The predictive plant RNA editor (PREP) software [54], which uses 35 genes as references for possible RNA editing sites by comparing the predicted genes to homologous protein genes from other plants, identified editing sites in 26 PCGs studied (Table S2). As a result, in C. comosum and C. gallabatense a total of 64 and 66 potential RNA editing sites were predicted, respectively. The ndhB gene had the highest number (12 sites) in the two cp genomes of RNA editing sites followed by ndhF, rpoB, and matK, which had more than five editing sites. The gene rpoC2, atpF, rpoC1, rps14, ycf3, accD, psbF, rpl20, clpP, petB, rpoA, rps8, rpl2, ccsA, ndhG, and ndhA had editing sites more than one. The remaining five genes (rps16, atpA, atpI, atpB, and petG) did not have RNA predicting sites. Additionally, variations in amino acid as a result of RNA editing occurred most frequently in serine to leucine which reflects a conversion into a hydrophobic amino acid from a hydrophilic, while arginine to cysteine occurred less frequently (Table S2). These general features of advanced RNA editing in the plants cp genomes have also been documented by previous studies [55,56] which found that important RNA editing sites direct amino acid to alter from polar to non-polar in order to form a fundamental organization of protein to enhance protein hydrophobicity [57]. 2.4. Analysis of SSRs Microsatellites or simple sequence repeats (SSRs) are tandemly repetitive DNA sequences, comprising of one to six (mono-, di-, tri-, tetra-, penta-, and hexa-) repeat nucleotide units. Also known as microsatellites, SSRs are good molecular markers often used in plant species phylogenetics, identification and population genetic studies [58], as they are highly reliable, reproductive, and highly polymorphic [37,44]. They are highly spread in the PCGs, introns, and intergenic regions. We detected a total of 111 SSRs in C. comosum and 90 SSRs in C. gallabatense (Figure 3 and Table 3). Likened to other Asparagales species, A. ramosum had the largest number of repeats with 73 SSRs, C. rhizopendulum with 71, Allium victorialis with 64 SSRs, Asparagus officianalis with 55 SSRs, and the least number of repeats in Anemarrhena asphodeloides with 49 SSRs. C. comosum was comprised of 86 mononucleotide repeats, 14 dinucleotide repeats, two trinucleotide repeats, and nine tetranucleotide repeats, and C. gallabatense Plants 2020, 9, 296 7 of 18 included 68 repeats for mononucleotides, 12 repeats for dinucleotides, one repeat for trinucleotides, eight repeats for tetranucleotides, and one repeat for pentanucleotides. Ten hexanucleotides were only present in A. ramosum cp genome (Figure 4 and Table S3). The mononucleotide SSRs had the richest content in all the species, followed by dinucleotides and tetranucleotide. In the cp genomes, short polyadenine (poly A/T) repeats are the most frequent SSRs as compared with guanine (G) or cytosine (C) repeats [59]. In this research, the A/T mononucleotide repeats were the most abundant type in all the cp genomes. This result supports another study where the A/T repeats were abundant [60]. Genetic diversity is reflected in the regions with high mutation rates [61]. The identified loci, in this study, could be good molecular markers, useful in the study of population genetics of C. comosum and C. gallabatense and phylogenetic study in the future. Mono Di Tri Tetra Penta Hexa Figure 3. Number of different types of SSRs. Table 3. Numbers of simple sequence repeats (SSRs) in the C. comosum and C. gallabatense chloroplast genomes. SSR Type Mono Di Tri Tetra Penta Repeat Unit A/T AG/CT AT/AT AAT/ATT AAAG/CTTT AAAT/ATTT AATG/ATTC ACAT/ATGT AAGAT/ATCTT Amount Ratio (%) C. comosum C. gallabatense C. comosum C. gallabatense 86 3 11 2 1 6 1 1 0 68 3 9 1 1 6 0 1 1 100% 21.4% 78.6% 100% 11.1% 66.7% 11.1% 11.1% 0% 100% 25% 75% 100% 12.5% 75% 0% 12.5% 100% Plants 2020, 9, 296 8 of 18 Figure 4. Number of different repeat units of SSRs. 2.5. IR Expansion and Contraction Variations in the size of the cp genome are primarily due to the contraction and expansion of the border regions [62]. Such modifications have long-term effects on the size of cp genomes. According to Zhang et al., 2017 [63], IR regions are possible distinctive features in most of the angiosperms. The arrangement of genes in the junctions of cp genomes differs from one species to another [64]. Most of the cp genomes tend to be rearranged due to the loss of the IR regions [65], although, Chumley et al., 2006 [25], discovered that the pelargonium x hortorum genome was rearranged even without the loss of the IR regions. A comparison was made among Asparagales species on the border regions and the adjacent genes and some remarkable variations were discovered (Figure 5). Similar to other typical cp genomes, these genomes showed clear variances at the junctions, but the general gene structures, contents, and orientations were the same. The rps19 gene was located at IRb, 37 bp, 49 bp, 53 bp, and 82 bp away from the junction of LSC/IRb in A. ramosum, A. asphodeloides, A. officinalis, and A. victorialis, respectively; however, this gene was missing in all three Chlorophytum species proving that the common ancestry lost this gene independently. The rpl2 gene was positioned at the IRb in all chloroplast genomes at variable extensions from the junction. Similarly, the psbA gene occurred in the LSC region in all the genome sequences 1 bp to 101 bp away from IRa/LSC boundary. The distance from the IRa/SSC junction to ndhF gene differed by 5 bp in A. asphodeloides to 70 bp in A. ramosum. Still, there was an overlap between ycf1/ndhF in A. victorialis, A. officinalis, C. comosum, C. gallabatense, and C. rhizopendulum. Interestingly, the ndhF gene overlapped the SSC and IRb junction with 2 bp in C. rhizopendulum and 31 bp in A. victorialis. The border region in the middle of IRa and SSC was positioned within the ycf1 gene, and the ψycf1 pseudogene situated at the IRa region ranging from 802 bp to 1174 bp in size. The length of the SSC regions ranged from 17,853 bp in A. victorialis to 18, 638 bp in A. officinalis. The ψycf1 gene was located at the junction between SSC and IRb. This pattern of IR expansion and contraction of partial copies of non-coding genes is a common occurrence in Plants 2020, 9, 296 9 of 18 most terrestrial species [66]. These outcomes can provide insight into the evolutionary processes of chloroplast genomes, as well as being a source of DNA barcodes. Figure 5. Comparison of the junctions of LSC, SSC, and IR regions among seven chloroplast genomes. 2.6. Analysis of Nucleotide Diversity To examine the divergence of sequence in the cp genomes, the nucleotide variability (Pi) value was calculated using the DnaSP v5.10 program. A comparison was made with the other randomly selected five Asparagales species on the two cp genomes of Chlorophytum. The Pi values of the seven Asparagales species ranged from 0 to 0.15897 (Figure 6). This indicates that the species in the order Asparagales could be undergoing rapid nucleotide substitution [36]. The IR regions, as predicted, had a much lower nucleotide variability than the SSC and LSC regions. Other nucleotide diversity analyses have also shown that the IR region has a lower value of nucleotide diversity than the SSC and LSC regions [67,68]. This shows that IRa/IRb regions are much more conserved as compared with the LSC and the SSC regions as a result of gene replication [69]. Six divergent regions revealed higher nucleotide diversity values (Pi values > 0.1). All these variable regions were found in the intergenic spacer. This indicated that the intergenic regions were more divergent than the coding regions. Four divergent regions are located in LSC (psbK/psbI, trnS/trnG, psbM/trnD, and rps12/clpP), one in SSC Plants 2020, 9, 296 10 of 18 (rps15/ycf1), and one in IR (trnV/rps12). Some of these regions have been previously reported in other species cp genomes [37,67]. Such highly variable regions can provide potential molecular markers for the authentication of plants and assist in phylogenetic analysis studies in this genus [70]. Figure 6. Nucleotide diversity of different regions of Asparagales chloroplast genomes. 2.7. Comparative Genome Analysis Comparative genomic analysis and the available DNA sequences make it possible to have a comprehensive view of a genus [71]. The cp genome sequences of C. comosum and C. gallabatense were compared to those of C. rhizopendulum, A. victorialis, A. asphodeloides, A. officinalis, and A. ramosum using the mVISTA program (Figure 7) and MAUVE program (Figure 8). Among the cp genomes compared, there were no significant rearrangements observed, except for the slight variations in size and gene positioning (Figure 8). A. asphodeloides (156,917 bp) was the largest, followed by A. officinalis (156,699 bp), then A. ramosum (155,812 bp), then C. comosum (154,248 bp), then C. gallabatense (154,154 bp), then A. victorialis (154,074 bp), and finally the shortest was C. rhizopendulum (153,504 bp). The results revealed that the genomes with some level of divergence were greatly conserved. The non-coding regions were less conserved as compared with the coding regions. The LSC and the SSC regions were equally less conserved than the IR regions. In addition, it was also detected that the intergenic spacer regions had the highest divergent regions. Some of the high divergent regions included psbI-trnG-UCC, matK-rps16, psbZ-rps14, rps16-psbI, atpH-atpI, trnC-GCA-petN, and trnS-GGA- trnL-UAA (Figure 7). The comparative study showed the conservation of gene order, signifying the evolutionary conservation of these species [71]. These regions can be used as unique barcodes for DNA and also give phylogenetic information. Plants 2020, 9, 296 11 of 18 Figure 7. Comparison of seven cp genomes using mVISTA alignment program. Genome regions are color-coded as protein-coding, rRNA coding, tRNA coding, or conserved non-coding sequences. The vertical scale shows the percentage of identity, varying from 50% to 100%. Figure 8. Comparison of the genome structure of seven Asparagales species using Mauve program. The DNA sequences above the line are presented in a clockwise direction, and those below the line in a counterclockwise direction. 2.8. Phylogenetic Analysis Due to the rapid development of the sequencing technologies, cp genomes, which are valuable genomic resources, have been used to a greater extent in the reconstruction of plant phylogenies and evolutionary relationships [72]. The phylogeny of Asparagales was reconstructed using whole Plants 2020, 9, 296 12 of 18 chloroplast genome sequences of 48 species belonging to the family Asparagaceae (thirty-eight species), the family Amaryllidaceae (eight species), and Iridaceae (two species) as outgroup. All the 46 cp genomes were downloaded from GenBank (Table S4), and the 48 cp genomes were aligned by MAFFT v7.308 [73]. Then, the phylogeny was reconstructed using the maximum likelihood (ML) approach to reveal the evolutionary position of C. comosum and C. gallabatense and the relationship with other Asparagales species. Almost all of the nodes had high bootstrap support values (BP). The result indicated that genus Chlorophytum is closely related to genus Anthericum. The Chlorophytum species (C. comosum, C. gallabatense, and C. rhizopendulum) were clustered together in the same clade with a bootstrap value of 100%. Bjora et al., 2008 [17], in their study, stated that C. gallabatense clusters together with C. comosum, C. filipendulum, and C. macrophyllum. These species had a closer relationship to the Anthericum ramosum which formed an independent clade (Figure 9). This shows the affinity relationship among the species. During the early classification system, genus Chlorophytum was classified to be under the family Anthericaceae together with Anthericum. After several revisionary works, the two genera were separated using morphological characters (stamen filament ornamentation) and some of the species of the Anthericum were transferred into genus Chlorophytum [74], hence, the close relationship in the phylogenetic tree. These findings show that information from the whole cp genome sequences can give more reliable phylogenetic outcomes, therefore, more species of Chlorophytum can be sequenced in order to have a more comprehensive phylogenetic tree in this genus and give insight into the Asparagaceae relationship. Our results confirm the position of the selected Asparagales species supporting the previous whole genome and multigene-based analyses [35,75,76] (Figure 9). Therefore, this study has provided an in-depth molecular analysis within the family Asparagaceae and can be used in the taxonomy of the family especially using the molecular markers upon further analysis. Figure 9. Phylogenetic trees of Asparagales species based on whole chloroplast genome sequences, with two species from family Iridaceae used as outgroup. Plants 2020, 9, 296 13 of 18 3. Materials and Methods 3.1. DNA Isolation and Chloroplast Genome Sequencing Young, fresh, and healthy leaf samples of C. comosum and C. gallabatense were collected from Kenya (C. comosum SAJIT-004032 and C. gallabatense SAJIT-006319) and dried in silica gel to preserve the DNA [77]. All the samples were taken to the Herbarium of Wuhan Botanical Garden (HIB) and stored at −80 ◦ C until DNA extraction. The genomic DNA was isolated from 100 micrograms of leaves using a modified cetyltrimethylammonium bromide (CTAB) technique [78] and sequenced using the Illumina platform at Novogene Company (Beijing, China). 3.2. Genome Assembly and Annotation The clean data obtained after filtering the low-quality data and adaptors were assembled using GetOrganelle version 1.6.2 software [79], and then, manually corrected. The assembled chloroplasts were used for gene annotation by the GeSeq online tool (https://chlorobox.mpimp-golm.mpg.de/geseq. Html) with default settings [80]. Manual verification was done using Sequin Viewer to relate the cp genomes of Chlorophytum species with those of Asparagaceae. Annotations of tRNAs were confirmed using the tRNAscan-SE [81]. The boundaries of the introns and the start and stop codons were manually corrected. Gene maps of complete cp genome sequences were drawn using OrganellarGenome DRAW software [82] (Figure 1). The annotated cp genomes of the C. comosum and C. gallabatense were submitted to the GenBank (GenBank numbers: C. comosum: MT076065 and C. gallabatense: MT036265). 3.3. Genome Structure and Comparison The genome features of C. comosum and C. gallabatense were determined using MEGA7 [83] by comparing the five other available cp genomes of Asparagales which were downloaded from NCBI (Table S4). The PCGs from the Chlorophytum species and the four Asparagales species were extracted manually and the Predictive RNA Editor for plants (PREP) suite online software was used to identify the RNA editing sites [54] where the cutoff value was set to 0.8. PREP server. The default settings used 35 genes to compare and predict the potential RNA editing sites. For comparative analysis, the cp genomes of C. comosum and C. gallabatense with the five randomly selected Asparagales species was constructed using the mVISTA program [84]. Furthermore, in the rearrangement analysis, the cp genomes were aligned and constructed in MAUVE program [85]. The expansion and contraction of IRs were analyzed using the IRscope online program (https: //irscope.shinyapps.io/irapp/), then manually modified. To detect the divergence hotspots, the complete chloroplast genomes of the selected seven Asparagales species were aligned by MAFFT v7.308 [73]. DnaSP v5.10 was used to calculate the nucleotide divergence values of the chloroplast genome sequence alignment of seven analyzed species using the sliding window method [86] with a window length of 600 bp and a 200 bp step size. 3.4. Codon Usage The codon usage frequency in each of the seven Asparagales species was analyzed for all the PCGs using MEGA7 [83]. Synonymous codon usage and the relative synonymous codon usage (RSCU) was conducted to determine if the plastid genes were under selection. 3.5. Simple Sequence Repeats Simple sequence repeats (SSRs) present in cp genomes of C. comosum and C. gallabatense were analyzed using the software MISA (MicroSAtellite) [87] with the settings of ten repeat units for mononucleotides, five for dinucleotides and trinucleotides, and three for tetranucleotides, pentanucleotides, and hexanucleotide repeats. Plants 2020, 9, 296 14 of 18 3.6. Phylogenetic Analysis To confirm the phylogenetic positions of C. comosum and C. gallabatense within the family Asparagaceae, we downloaded forty-six cp genomes representing three families of Asparagales (Amaryllidaceae, Asparagaceae, and Iridaceae as outgroup) from the NCBI database (Table S4) for analysis. Multiple sequence alignment of the forty-eight complete cp genome sequences was done using MAFFT v7.308 [73] with default parameters and the maximum likelihood (ML) tree with default settings and 1000 bootstrap replicates was reconstructed using the program IQ-TREE v6.10 with the best fit model TVM + I + G4. TreeDyn [88,89] was used to visualize and refine the tree. 4. Conclusions Chlorophytum is one of the essential genera in the family Asparagaceae as it is used for its medicinal properties, however, its cp genomes are not well studied. In this study, we analyzed the complete cp genomes of C. comosum and C. gallabatense of the family Asparagaceae for the first time. These genomes provided a basic genetic tool for species identification within the genus. We compared the cp genomes of C. comosum and C. gallabatense with five randomly selected Asparagales species. The results showed that the gene size, content, and order were all similar. The loss of rps19 gene in the cp genomes of Chlorophytum, displays a distinctive feature of evolution within precise characteristics. The codon usage, RNA editing, and microsatellites were determined. The ndhB and ndhF genes had the highest number of editing sites. These results provide useful insights regarding editing sites that were gained or lost during the evolution of the Angiosperm genomes. Additionally, such findings provide genetic information for the genus Chlorophytum for the creation of molecular markers and for research into genetic diversity. The contraction and expansion of the boundary regions showed the genome size variation in the selected Asparagales species. Nucleotide analysis revealed the highest nucleotide diversity was in rps12/clpP and rps15/ycf1 regions, which are highly variable regions that can be used as potential markers in family Asparagaceae for species identification and phylogeny. Additionally, the phylogenetic analysis confirmed the previous phylogeny and produced a phylogenetic tree with similar topology. It showed that the Chlorophytum species (C. comosum, C. gallabatense, and C. rhizopendulum) were clustered together in the same clade with a closer relationship than other plants to the A. ramosum. However, more complete cp genomes on Chlorophytum species should be sequenced which would help solve the relationship within Asparagaceae. Generally, this study provides valuable genetic information of Chlorophytum which can aid in further phylogenetic studies, species identification, and evolutionary relationships between Chlorophytum and Anthericum. Supplementary Materials: The following are available online at http://www.mdpi.com/2223-7747/9/3/296/s1, Table S1: Codon usage, Table S2: RNA editing, Table S3: Simple sequence repeats, and Table S4: NCBI accession number. Author Contributions: Formal analysis was performed by J.N.M. and X.D.; drafting the manuscript and preparation of figures and tables was done by J.N.M.; revision and manuscript editing was done by J.-X.Y., E.M.M., V.O.W., M.A.O., and J.K.S.; conceptualization and supervision was done by P.M.M. and G.-W.H.; resources and funding acquisition was provided by G.-W.H. All authors have read and agreed to the published version of the manuscript. Funding: This work was supported by grants from the Backbone Talents Project of Wuhan Botanical Garden, CAS (Y655301M01), Sino-Africa Joint Research Center, CAS (SAJC201614) and the National Natural Science Foundation of China (31970211). Acknowledgments: Sincere thanks to John Nzei and Virginia Mwanzia for providing helpful suggestions and support related to the analysis performed. We also thank the Managing Editor Harry Fu and the anonymous reviewers for their useful comments and precious remarks on the manuscript. Conflicts of Interest: The authors affirm no conflict of interest. Plants 2020, 9, 296 15 of 18 References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18. 19. 20. 21. 22. 23. 24. Basu, S.; Jha, T.B. Cytogenetic studies in four species of Chlorophytum Ker-Gawl. (Liliaceae). The Nucleus 2011, 54, 123–132. [CrossRef] Govaerts, R.; Zonneveld, B.; Zona, S. World Checklist of Asparagaceae; Facilitated by the Royal Botanic Gardens: Kew, UK, 2012; Available online: http://apps.kew.org/wcsp/ (accessed on 26 February 2020). Malpure, N.V.; Yadav, S. Chlorophytum gothanense, a new species of Anthericaceae from the Western Ghats of India. Kew Bull. 2009, 64, 739–741. [CrossRef] Govaerts, R. World Checklist of Selected Plant Families Asparagaceae; Royal Botanic Gardens: Kew, UK, 2015. Lekhak, M.; Adsul, A.; Yadav, S. Cytotaxonomical investigations into the genus Chlorophytum from India. Kew Bull. 2012, 67, 285–292. [CrossRef] Group, A.P. An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG III. Bot. J. Linn. Soc. 2009, 161, 105–121. Adsul, A.A. Taxonomic Revision of Genus Chlorophytum Ker Gawl for India. Ph.D. Thesis, Shivaji University, Kolhapur India, 2015. Kumar, M.; Meena, P.; Verma, S.; Kumar, M.; Kumar, A. Anti-tumour, anti-mutagenic and chemomodulatory potential of Chlorophytum borivilianum. Asian Pac. J. Cancer Prev. 2010, 11, 327–334. Kalra, S.; Kumar, S.; Lakhanpal, N.; Kaur, J.; Singh, K. Characterization of squalene synthase gene from Chlorophytum borivilianum (Sant. and Fernand.). Mol. Biotechnol. 2013, 54, 944–953. [CrossRef] Deore, S.; Khadabadi, S. Standardisation and pharmaceutical evaluation of Chlorophytum borivilianum mucilage. Rasayan J. Chem. 2008, 1, 887–892. Khanam, Z.; Singh, O.; Singh, R.; Bhat, I.U.H. Safed musli (Chlorophytum borivilianum): A review of its botany, ethnopharmacology and phytochemistry. J. Ethnopharmacol. 2013, 150, 421–441. [CrossRef] Kaushik, N. Saponins of Chlorophytum species. Phytochem. Rev. 2005, 4, 191–196. [CrossRef] Haque, R.; Saha, S.; Bera, T. A peer reviewed literature on medicinal activity of Chlorophytum borivilianum commercial medicinal plant. Int. J. Drug Dev. Res. 2011, 3, 1116, 1130. O’Donnell, G.; Bucar, F.; Gibbons, S. Phytochemistry and antimycobacterial activity of Chlorophytum inornatum. Phytochemistry 2006, 67, 178–182. [CrossRef] [PubMed] Maiti, S.; Geetha, K. Characterization, genetic improvement and cultivation of Chlorophytum borivilianum—an important medicinal plant of India. Plant Genet. Resour. 2005, 3, 264–272. [CrossRef] Adsul, A.A.; Lekhak, M.M.; Yadav, S.R. Chlorophytum sharmae (Asparagaceae): A new species from Kerala, India. Kew Bull. 2014, 69. [CrossRef] Bjorå, C.S.; Hemp, A.; Hoell, G.; Nordal, I. A taxonomic and ecological analysis of two forest Chlorophytum taxa (Anthericaceae) on Mount Kilimanjaro, Tanzania. Plant Syst. Evol. 2008, 274, 243–253. [CrossRef] McKain, M.R.; McNeal, J.R.; Kellar, P.R.; Eguiarte, L.E.; Pires, J.C.; Leebens-Mack, J. Timing of rapid diversification and convergent origins of active pollination within Agavoideae (Asparagaceae). Am. J. Bot. 2016, 103, 1717–1729. [CrossRef] Deguilloux, M.-F.; Pemonge, M.-H.l.N.; Petit, R.M.J. Use of chloroplast microsatellites to differentiate oak populations. Ann. For. Sci. 2004, 61, 825–830. [CrossRef] Daniell, H.; Lin, C.-S.; Yu, M.; Chang, W.-J. Chloroplast genomes: Diversity, evolution, and applications in genetic engineering. Genome Biol. 2016, 17, 134. [CrossRef] Neuhaus, H.; Emes, M. Nonphotosynthetic metabolism in plastids. Annu. Rev. Plant Biol. 2000, 51, 111–140. [CrossRef] Zhou, T.; Wang, J.; Li, W.; Zhang, X.; Xu, Y.; Xu, F.; Zhu, H.; Wang, X. The complete chloroplast genome of Euphrasia regelii, pseudogenization of ndh genes and the phylogenetic relationships within Orobanchaceae. Front. Genet. 2019, 10, 444. [CrossRef] Wicke, S.; Schneeweiss, G.M.; Depamphilis, C.W.; Müller, K.F.; Quandt, D. The evolution of the plastid chromosome in land plants: Gene content, gene order, gene function. Plant Mol. Biol. 2011, 76, 273–297. [CrossRef] Bendich, A.J. Circular chloroplast chromosomes: The grand illusion. Plant Cell 2004, 16, 1661–1666. [CrossRef] [PubMed] Plants 2020, 9, 296 25. 26. 27. 28. 29. 30. 31. 32. 33. 34. 35. 36. 37. 38. 39. 40. 41. 42. 43. 44. 45. 46. 47. 16 of 18 Chumley, T.W.; Palmer, J.D.; Mower, J.P.; Fourcade, H.M.; Calie, P.J.; Boore, J.L.; Jansen, R.K. The complete chloroplast genome sequence of Pelargonium× hortorum: Organization and evolution of the largest and most highly rearranged chloroplast genome of land plants. Mol. Biol. Evol. 2006, 23, 2175–2190. [CrossRef] [PubMed] Lin, C.-P.; Huang, J.-P.; Wu, C.-S.; Hsu, C.-Y.; Chaw, S.-M. Comparative chloroplast genomics reveals the evolution of Pinaceae genera and subfamilies. Genome Biol. Evol. 2010, 2, 504–517. [CrossRef] [PubMed] Wyman, S.K.; Jansen, R.K.; Boore, J.L. Automatic annotation of organellar genomes with DOGMA. Bioinformatics 2004, 20, 3252–3255. [CrossRef] Moore, M.J.; Soltis, P.S.; Bell, C.D.; Burleigh, J.G.; Soltis, D.E. Phylogenetic analysis of 83 plastid genes further resolves the early diversification of eudicots. Proc. Natl. Acad. Sci. USA 2010, 107, 4623–4628. [CrossRef] Parks, M.; Cronn, R.; Liston, A. Increasing phylogenetic resolution at low taxonomic levels using massively parallel sequencing of chloroplast genomes. BMC Biol. 2009, 7, 84. [CrossRef] Gao, L.; Yi, X.; Yang, Y.-X.; Su, Y.-J.; Wang, T. Complete chloroplast genome sequence of a tree fern Alsophila spinulosa: Insights into evolutionary changes in fern chloroplast genomes. BMC Evol. Biol. 2009, 9, 130. [CrossRef] Mardis, E.R. Next-generation sequencing platforms. Annu. Rev. Anal. Chem. 2013, 6, 287–303. [CrossRef] Li, B.; Li, Y.; Cai, Q.; Lin, F.; Huang, P.; Zheng, Y. Development of chloroplast genomic resources for Akebia quinata (Lardizabalaceae). Conserv. Genet. Resour. 2016, 8, 447–449. [CrossRef] Sun, M.; Li, J.; Li, D.; Shi, L. Complete chloroplast genome sequence of the medical fern Drynaria roosii and its phylogenetic analysis. Mitochondrial Dna Part B 2017, 2, 7–8. [CrossRef] Wolfe, K.H.; Li, W.-H.; Sharp, P.M. Rates of nucleotide substitution vary greatly among plant mitochondrial, chloroplast, and nuclear DNAs. Proc. Natl. Acad. Sci. USA 1987, 84, 9054–9058. [CrossRef] Sheng, W.; Chai, X.; Rao, Y.; Tu, X.; Du, S. The Complete Chloroplast Genome Sequence of Asparagus (Asparagus officinalis L.) and its phylogenetic positon within Asparagales. Int. J. Plant Biol. Res. 2017, 5, 121–128. Cui, Y.; Nie, L.; Sun, W.; Xu, Z.; Wang, Y.; Yu, J.; Song, J.; Yao, H. Comparative and Phylogenetic Analyses of Ginger (Zingiber officinale) in the Family Zingiberaceae Based on the Complete Chloroplast Genome. Plants 2019, 8, 283. [CrossRef] [PubMed] Iram, S.; Hayat, M.Q.; Tahir, M.; Gul, A.; Ahmed, I. Chloroplast Genome Sequence of Artemisia scoparia: Comparative Analyses and Screening of Mutational Hotspots. Plants 2019, 8, 476. [CrossRef] [PubMed] Xu, J.; Feng, D.; Song, G.; Wei, X.; Chen, L.; Wu, X.; Li, X.; Zhu, Z. The first intron of rice EPSP synthase enhances expression of foreign gene. Sci. China Ser. C: Life Sci. 2003, 46, 561. [CrossRef] Huang, Y.; Yang, Z.; Huang, S.; An, W.; Li, J.; Zheng, X. Comprehensive Analysis of Rhodomyrtus tomentosa Chloroplast Genome. Plants 2019, 8, 89. [CrossRef] Meerow, A.W. Convergence or reticulation? Mosaic evolution in the canalized American Amaryllidaceae. In Diversity, Phylogeny and Evolution in the Monocotyledons; Seberg, O., Petersen, G., Barfod, A.S., Davis, J.I., Eds.; Aarhus University Press: Aarhus, Denmark, 2010; pp. 145–168. Steele, P.R.; Hertweck, K.L.; Mayfield, D.; McKain, M.R.; Leebens-Mack, J.; Pires, J.C. Quality and quantity of data recovered from massively parallel sequencing: Examples in Asparagales and Poaceae. Am. J. Bot. 2012, 99, 330–348. [CrossRef] Liu, Q.; Dou, S.; Ji, Z.; Xue, Q. Synonymous codon usage and gene function are strongly related in Oryza sativa. Biosystems 2005, 80, 123–131. [CrossRef] Campbell, W.H.; Gowri, G. Codon usage in higher plants, green algae, and cyanobacteria. Plant Physiol. 1990, 92, 1–11. [CrossRef] Srivastava, D.; Shanker, A. Identification of simple sequence repeats in chloroplast genomes of Magnoliids through bioinformatics approach. Interdiscip. Sci. Comput. Life Sci. 2016, 8, 327–336. [CrossRef] Li, Y.; Kuang, X.J.; Zhu, X.X.; Zhu, Y.; Sun, C. Codon usage bias of Catharanthus roseus. Zhongguo Zhong Yao Za Zhi = Zhongguo Zhongyao Zazhi = China J. Chin. Mater. Med. 2016, 41, 4165–4168. Wu, X.-M.; Wu, S.-F.; Ren, D.-M.; Zhu, Y.-P.; He, F.-C. The analysis method and progress in the study of codon bias. Yi Chuan = Hered. 2007, 29, 420–426. [CrossRef] [PubMed] Marino, S.M.; Gladyshev, V.N. Analysis and functional prediction of reactive cysteine residues. J. Biol. Chem. 2012, 287, 4419–4425. [CrossRef] [PubMed] Plants 2020, 9, 296 48. 49. 50. 51. 52. 53. 54. 55. 56. 57. 58. 59. 60. 61. 62. 63. 64. 65. 66. 67. 17 of 18 Sharp, P.M.; Li, W.-H. The codon adaptation index-a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res. 1987, 15, 1281–1295. [CrossRef] [PubMed] Liu, Q.; Xue, Q. Comparative studies on codon usage pattern of chloroplasts and their host nuclear genes in four plant species. J. Genet. 2005, 84, 55–62. [CrossRef] Park, I.; Kim, W.; Yeo, S.-M.; Choi, G.; Kang, Y.-M.; Piao, R.; Moon, B. The complete chloroplast genome sequences of Fritillaria ussuriensis Maxim. and Fritillaria cirrhosa D. Don, and comparative analysis with other Fritillaria species. Molecules 2017, 22, 982. [CrossRef] Zuo, L.-H.; Shang, A.-Q.; Zhang, S.; Yu, X.-Y.; Ren, Y.-C.; Yang, M.-S.; Wang, J.-M. The first complete chloroplast genome sequences of Ulmus species by de novo sequencing: Genome comparative and taxonomic position analysis. PLoS ONE 2017, 12, e0171264. [CrossRef] Freyer, R.; Hoch, B.; Neckermann, K.; Maier, R.M.; Kössel, H. RNA editing in maize chloroplasts is a processing step independent of splicing and cleavage to monocistronic mRNAs. Plant J. 1993, 4, 621–629. [CrossRef] Tang, W.; Luo, C. Molecular and functional diversity of RNA editing in plant mitochondria. Mol. Biotechnol. 2018, 60, 935–945. [CrossRef] Mower, J.P. The PREP suite: Predictive RNA editors for plant mitochondrial genes, chloroplast genes and user-defined alignments. Nucleic Acids Res. 2009, 37, W253–W259. [CrossRef] Corneille, S.; Lutz, K.; Maliga, P. Conservation of RNA editing between rice and maize plastids: Are most editing events dispensable? Mol. Gen. Genet. 2000, 264, 419–424. [CrossRef] Jain, B.P.; Chauhan, P.; Tanti, G.K.; Singarapu, N.; Ghaskadbi, S.; Goswami, S.K. Tissue specific expression of SG2NA is regulated by differential splicing, RNA editing and differential polyadenylation. Gene 2015, 556, 119–126. [CrossRef] [PubMed] Taylor, F.; Coates, D. The code within the codons. Biosystems 1989, 22, 177–187. [CrossRef] Qi, W.; Lin, F.; Liu, Y.; Huang, B.; Cheng, J.; Zhang, W.; Zhao, H. High-throughput development of simple sequence repeat markers for genetic diversity research in Crambe abyssinica. BMC Plant Biol. 2016, 16, 139. [CrossRef] [PubMed] Kuang, D.-Y.; Wu, H.; Wang, Y.-L.; Gao, L.-M.; Zhang, S.-Z.; Lu, L. Complete chloroplast genome sequence of Magnolia kwangsiensis (Magnoliaceae): Implication for DNA barcoding and population genetics. Genome 2011, 54, 663–673. [CrossRef] [PubMed] Zhou, J.; Chen, X.; Cui, Y.; Sun, W.; Li, Y.; Wang, Y.; Song, J.; Yao, H. Molecular structure and phylogenetic analyses of complete chloroplast genomes of two Aristolochia medicinal species. Int. J. Mol. Sci. 2017, 18, 1839. [CrossRef] Qi, W.-H.; Jiang, X.-M.; Yan, C.-C.; Zhang, W.-Q.; Xiao, G.-S.; Yue, B.-S.; Zhou, C.-Q. Distribution patterns and variation analysis of simple sequence repeats in different genomic regions of bovid genomes. Sci. Rep. 2018, 8, 14407. [CrossRef] Wang, W.; Messing, J. High-throughput sequencing of three Lemnoideae (duckweeds) chloroplast genomes from total DNA. PLoS ONE 2011, 6, e24670. [CrossRef] Zhang, H.; Hall, N.; McElroy, J.S.; Lowe, E.K.; Goertzen, L.R. Complete plastid genome sequence of goosegrass (Eleusine indica) and comparison with other Poaceae. Gene 2017, 600, 36–43. [CrossRef] Yang, J.-B.; Tang, M.; Li, H.-T.; Zhang, Z.-R.; Li, D.-Z. Complete chloroplast genome of the genus Cymbidium: Lights into the species identification, phylogenetic implications and population genetic analyses. BMC Evol. Biol. 2013, 13, 84. [CrossRef] Strauss, S.H.; Palmer, J.D.; Howe, G.T.; Doerksen, A.H. Chloroplast genomes of two conifers lack a large inverted repeat and are extensively rearranged. Proc. Natl. Acad. Sci. USA 1988, 85, 3898–3902. [CrossRef] [PubMed] Wang, M.; Cui, L.; Feng, K.; Deng, P.; Du, X.; Wan, F.; Weining, S.; Nie, X. Comparative analysis of Asteraceae chloroplast genomes: Structural organization, RNA editing and evolution. Plant Mol. Biol. Report. 2015, 33, 1526–1538. [CrossRef] Yu, X.; Tan, W.; Zhang, H.; Gao, H.; Wang, W.; Tian, X. Complete Chloroplast Genomes of Ampelopsis humulifolia and Ampelopsis japonica: Molecular Structure, Comparative Analysis, and Phylogenetic Analysis. Plants 2019, 8, 410. [CrossRef] [PubMed] Plants 2020, 9, 296 68. 69. 70. 71. 72. 73. 74. 75. 76. 77. 78. 79. 80. 81. 82. 83. 84. 85. 86. 87. 88. 89. 18 of 18 De Souza, U.J.B.; Nunes, R.; Targueta, C.P.; Diniz-Filho, J.A.F.; de Campos Telles, M.P. The complete chloroplast genome of Stryphnodendron adstringens (Leguminosae-Caesalpinioideae): Comparative analysis with related Mimosoid species. Sci. Rep. 2019, 9, 1–12. [CrossRef] [PubMed] Khakhlova, O.; Bock, R. Elimination of deleterious mutations in plastid genomes by gene conversion. Plant J. 2006, 46, 85–94. [CrossRef] Ma, P.-F.; Zhang, Y.-X.; Zeng, C.-X.; Guo, Z.-H.; Li, D.-Z. Chloroplast phylogenomic analyses resolve deep-level relationships of an intractable bamboo tribe Arundinarieae (Poaceae). Syst. Biol. 2014, 63, 933–950. [CrossRef] Lin, H.; Yu, M.; Wang, X.; Zhang, X.-H. Comparative genomic analysis reveals the evolution and environmental adaptation strategies of vibrios. BMC Genom. 2018, 19, 135. [CrossRef] Tonti-Filippini, J.; Nevill, P.G.; Dixon, K.; Small, I. What can we do with 1000 plastid genomes? Plant J. 2017, 90, 808–818. [CrossRef] Katoh, K.; Standley, D.M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 2013, 30, 772–780. [CrossRef] Kunth, C. Melanthaceae [sic]. Enumeratio Plant. 1843, 4, 1. Wang, Y.; Sun, J.; Yuan, Q.; Guo, L. Complete chloroplast genome sequence of Dracaena cochinchinensis: Genome structure and genomic resources. Mitochondrial DNA Part B 2019, 4, 1750–1751. [CrossRef] Pfanzelt, S.; Albach, D.C.; von Hagen, K.B. Extremely low levels of chloroplast genome sequence variability in Astelia pumila (Asteliaceae, Asparagales). PeerJ 2019, 7, e6244. [CrossRef] Chase, M.W.; Hills, H.H. Silica gel: An ideal material for field preservation of leaf samples for DNA studies. Taxon 1991, 40, 215–220. [CrossRef] Doyle, J. DNA protocols for plants. In Molecular Techniques in Taxonomy; Hewitt, G.M., Johnston, A.W.B., Young, J.P.W., Eds.; Springer: Berlin/Heidelberg, Germany, 1991; pp. 283–293. Jin, J.-J.; Yu, W.-B.; Yang, J.-B.; Song, Y.; Yi, T.-S.; Li, D.-Z. GetOrganelle: A simple and fast pipeline for de novo assembly of a complete circular chloroplast genome using genome skimming data. BioRxiv 2018, 256479. [CrossRef] Tillich, M.; Lehwark, P.; Pellizzer, T.; Ulbricht-Jones, E.S.; Fischer, A.; Bock, R.; Greiner, S. GeSeq–versatile and accurate annotation of organelle genomes. Nucleic Acids Res. 2017, 45, W6–W11. [CrossRef] [PubMed] Lowe, T.M.; Chan, P.P. tRNAscan-SE On-line: Integrating search and context for analysis of transfer RNA genes. Nucleic Acids Res. 2016, 44, W54–W57. [CrossRef] Lohse, M.; Drechsel, O.; Kahlau, S.; Bock, R. OrganellarGenomeDRAW—a suite of tools for generating physical maps of plastid and mitochondrial genomes and visualizing expression data sets. Nucleic Acids Res. 2013, 41, W575–W581. [CrossRef] Kumar, S.; Stecher, G.; Tamura, K. MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 2016, 33, 1870–1874. [CrossRef] Frazer, K.A.; Pachter, L.; Poliakov, A.; Rubin, E.M.; Dubchak, I. VISTA: Computational tools for comparative genomics. Nucleic Acids Res. 2004, 32, W273–W279. [CrossRef] Darling, A.C.; Mau, B.; Blattner, F.R.; Perna, N.T. Mauve: Multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004, 14, 1394–1403. [CrossRef] Librado, P.; Rozas, J. DnaSP v5: A software for comprehensive analysis of DNA polymorphism data. Bioinformatics 2009, 25, 1451–1452. [CrossRef] Thiel, T.; Michalek, W.; Varshney, R.; Graner, A. Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor. Appl. Genet. 2003, 106, 411–422. [CrossRef] [PubMed] Dereeper, A.; Guignon, V.; Blanc, G.; Audic, S.; Buffet, S.; Chevenet, F.; Dufayard, J.-F.; Guindon, S.; Lefort, V.; Lescot, M. Phylogeny. fr: Robust phylogenetic analysis for the non-specialist. Nucleic Acids Res. 2008, 36, W465–W469. [CrossRef] [PubMed] Chevenet, F.; Brun, C.; Bañuls, A.-L.; Jacq, B.; Christen, R. TreeDyn: Towards dynamic graphics and annotations for analyses of trees. BMC Bioinform. 2006, 7, 439. [CrossRef] [PubMed] © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

RELATED PAPERS

RELATED TOPICS

Log In

Complete Chloroplast Genomes of Chlorophytum comosum and Chlorophytum gallabatense: Genome Structures, Comparative and Phylogenetic Analysis

Complete Chloroplast Genomes of Chlorophytum comosum and Chlorophytum gallabatense: Genome Structures, Comparative and Phylogenetic Analysis

Related Papers

RELATED PAPERS

RELATED TOPICS