International Journal of Genomics

International Journal of Genomics / 2020 / Article

Research Article | Open Access

Volume 2020 |Article ID 6162802 |

Shaofeng Peng, Jia Lu, Zhen Zhang, Li Ma, Caixia Liu, Yongzhong Chen, "Global Transcriptome and Correlation Analysis Reveal Cultivar-Specific Molecular Signatures Associated with Fruit Development and Fatty Acid Determination in Camellia oleifera Abel", International Journal of Genomics, vol. 2020, Article ID 6162802, 16 pages, 2020.

Global Transcriptome and Correlation Analysis Reveal Cultivar-Specific Molecular Signatures Associated with Fruit Development and Fatty Acid Determination in Camellia oleifera Abel

Academic Editor: Marco Gerdol
Received07 Apr 2020
Revised02 Jul 2020
Accepted15 Jul 2020
Published01 Sep 2020


Background. Oil-tea Camellia is a very important edible oil plant widely distributed in southern China. Tea oil extracted from the oil-tea Camellia seeds is beneficial to health and is considered as a health edible oil. We attempt to identify genes related to fatty acid biosynthesis in an oil-tea Camellia seed kernel, generated a comprehensive transcriptome analysis of the seed kernel at different developmental stages, and explore optimal picking time of fruit. Material and Methods. A gas chromatography-mass spectrometer was used to detect the content of various fatty acids in samples. Transcriptome analysis was performed to detect gene dynamics and corresponding functions. Results. Multiple phenotypic data were counted in detail, including the oil content, oleic acid content, linoleic acid content, linolenic acid content, fruit weight, fruit height, fruit diameter, single seed weight, seed length, and seed width in different developmental stages, which indicate that a majority of indicators increased with the development of oil-tea Camellia. The transcriptomics was conducted to perform a comprehensive and system-level view on dynamic gene expression networks for different developmental stages. Short Time-series Expression Miner (STEM) analysis of XL106 (the 6 time points) and XL210 (8 time points) was performed to screen related fatty acid (FA) gene set, from which 1041 candidate genes related to FA were selected in XL106 and 202 related genes were screened in XL210 based on GO and KEGG enrichment. Then, candidate genes and trait dataset were combined to conduct correlation analysis, and 10 genes were found to be strongly connected with several key traits. Conclusions. The multiple phenotypic data revealed the dynamic law of changes during the picking stage. Transcriptomic analysis identified a large number of potential key regulatory factors that can control the oil content of dried kernels, oleic acid, linoleic acid, linolenic acid, fresh seed rate, and kernel-to-seed ratio, thereby providing a new insight into the molecular networks underlying the picking stage of oil-tea Camellia, which provides a theoretical basis for the optimal fruit picking point.

1. Introduction

Oil-tea Camellia (Camellia oleifera Abel.) is an ancient oilseed plant that has been cultivated for more than two thousand years and is mainly planted in tropical and subtropical regions [1]. Oil-tea Camellia is widely distributed in 11 provinces (municipality) in southern China, such as Hunan, Jiangxi, and Guangxi. Oil-tea Camellia has the characteristics of strong adaptability, high product value, and good comprehensive benefits. Owing to the high production of oils, oil-tea Camellia is considered one of the four woody oil plants, and the other three oil plants are oil palm, oil olive, and oil coconut. Tea oil is extracted from the oil-tea Camellia seeds and is widely used as an edible oil and a traditional medicine in our country. In particular, the Camellia oil is beneficial to health and is considered as a dietary oil [2]. Research indicated that tea oil contains abundant unsaturated FAs (fatty acids), including oleic acid, monounsaturated fatty acid, and linoleic acid [3]. Some reports systematically investigated the oil contents and fatty acid (FA) compositions of 11 wild oil-tea Camellia cultivars, revealing that oil contents in Camellia seeds varied from 41.92% to 53.30%. The average ratio and compositions of single FA were similar, and all involved palmitic acid (C16:0, PA), palmitoleic acid (C16:1), stearic acid (C18:0, SA), oleic acid (C18:1, OA), linoleic acid (C18:2, LA), linolenic acid (C18:3), eicosenoic acid (C20:1), and tetracosenoic acid (C24:1) [4].

Optimal picking time of fruit plays a pivotal role in the quality of the fruit. If the picking time is too early, the nutrients such as protein, sugar, and lipid in the fruit have not yet reached the optimal value. On the contrary, late harvest may lead to fruit aging and loss of nutrients. Hence, the optimal picking time of fruit of oil-tea Camellia can ensure the highest contents of FA. Usually, fruits of most cultivars are picked in October. However, the optimal picking time is still unknown. Therefore, 6 time points and 8 time points were separately selected in two common cultivars (XL106 and XL210) to systemically explore the indicator value of characters, including weight of single fruit, fruit width, seed number of fruit, seed length, seed thick, kernel weight, linolenic acid, oleic acid, and palmitic acid. Furthermore, changes in gene expression were analyzed by using transcriptomics.

On the one hand, we aimed to explore whether the oil content increases or decreases when fruits were picked in the early or late stages. If the difference is not significant, we can pick the fruit at any time from September to November, which is conducive to guiding the production. On the other hand, we attempt to identify genes related to the FA biosynthesis in oil-tea Camellia seeds, generated a comprehensive transcriptome of the seed kernel at different developmental stages, and explored the molecular regulatory mechanisms.

2. Materials and Methods

2.1. Plant Materials

The experimental materials of XL210 and XL106 were obtained from the Germplasm Resources Preservation Base of Hunan Academy of Forestry (National Oil-Tea Engineering Technology Research Center), which is located in Mingyue Village, Lukou Town, Changsha, Hunan Province, China (N28° 24.29, E113° 15.19). XL106 is an excellent clone by the Hunan Academy of Forestry Sciences in 1994. The flowering stage is from early October to mid-late December. The fruit maturity stage is in early October, 25-60 fruits per 500 grams. The yield of fresh seed is 47.7%, and seed kernel oil content is 42.95%. XL210 is a good strain of seeds in 2003 by the Hunan Academy of Forestry Sciences. It has a white flowering period from late October to late December. The fruit ripening stage is in late October, 15-30 fruits per 500 grams. The fresh seed rate is 44.8%, and the seed kernel oil rate is 44.32%. XL210 and XL106 are planted and preserved in the germplasm resource garden of oil-tea Camellia in the Hunan Academy of Forestry Sciences.

The sampling time of XL210 is from September 15, 2018, to November 5, 2018, sampling every other week, a total of 8 samples. From September 15, 2018, to October 22, 2018, the XL106 was sampled every other week for a total of 6 samples. At each sample, 10 fruits were collected from the middle and upper parts of the outer canopy of the tree. After picking, it was immediately put into self-sealing bags and put into dry iceboxes. When they were taken back to the laboratory, the peel and seed coat were stripped off, and the seed kernel was taken out and stored at -80°C for later use. In total, 8 samples in XL210 and 6 samples in XL106 without biological duplication were obtained for subsequent transcriptome. Details are as follows: we find a beaker with a diameter slightly larger than the fruit of Camellia oil and fill it with clear water. We tie the fruit with a string and let the fruit fully immerse in water. Then, the overflow was caught with a measuring cylinder to measure the volume that is the volume of the fruit.

2.2. Analysis of Phenotypic Data

Ten fruits were randomly selected at each time, and the fruit weight, fruit volume, fruit height, fruit diameter, pericarp thickness, and ventricular number per seed were determined. Fruit height, fruit diameter, and peel thickness were determined by a vernier caliper. The fruit volume was measured by the drainage method.

About one g seed kernel of oil-tea Camellia was randomly collected in each time period. Firstly, the wall was broken mechanically, and then, the FA was extracted with the Lipid Extraction Kit. Then, the fatty acid in the seed kernel was determined in accordance with GB 5009.168-2016 food safety national standard for methyl ester treatment.

2.3. Total RNA Isolation, Library Construction, and Transcriptome Sequencing

As mentioned earlier, RNA isolation and purification as well as cDNA library construction and sequencing were conducted [5]. In brief, total RNA was extracted from the seed kernel from both oil-tea Camellia cultivars using the TRIzol Reagent (Invitrogen, Beijing, China). The NanoDrop ND1000 spectrophotometer (NanoDrop Technologies, Wilmington, DE, USA) and the Agilent Bioanalyzer 2100 system (Agilent Technologies, Palo Alto, CA, USA) were used to determine the quantity and quality of RNA, respectively, and 1% agarose gel electrophoresis was used to detect RNA integrity. The mRNA was isolated from total RNA using magnetic beads with oligo (dT). The cDNA synthesis kit (TaKaRa) was selected to synthesize cDNA and linked the sequencing adapter at both ends of cDNA [6]. All RNA-Seq libraries were sequenced on a NovaSeq, and the whole set of the transcriptome raw data have been deposited in the Sequence Read Archive (SRA) database of the National Center for Biotechnology Information (NCBI) (accession number SRP241856).

2.4. De Novo Transcriptome Assembly and Analysis of Differentially Expressed Genes (DEGs)

For further analysis, the adaptor reads and low-quality reads were removed to obtain the clean reads, and then, the trinity platform ( was used for transcriptome assembly without a reference genome. The unigene sequences obtained were integrated for annotation.

For gene expression analysis, the counts were mapped to the reading of each gene by HTSeq v0.5.4p3 and then normalized to fragments per kilobase of transcript per million mapped reads (FPKM) according to the report [7]. DEGs were screened following and corrected . GOseq (1.10.0) [8] and KOBAS software [9] were utilized to conduct Gene Ontology (GO) enrichment and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment for all DEGs. The time series analysis software Mfuzz was used to perform the clustering algorithm [10]. Mfuzz is a software package that can classify genes into multiple clusters based on similar expression profiles, which helps find genes with similar functions. At the same time, genes with the same expression trend may participate in the same biological process, which helps to find gene regulation.

2.5. Real-Time PCR Analysis

Real-time PCR was performed according to a previous report [11]. Briefly, the PrimeScript RT Kit (TaKaRa, Dalian, China) was used to reverse-transcribe RNA into cDNA according to the manufacturer’s instructions. A 20 μL scale was selected to conduct the PCR reaction using the SYBR Premix ExTaq™ (TaKaRa, Dalian, China) on an ABI (Applied Biosystems) StepOne Plus System. The relative expression of target genes was calculated by the 2ΔΔCt method, and 12 correlated genes were selected for RT-qPCR using specific primers designed by Primer Premier 5 software. Each measurement was carried out in triplicate, and data is expressed as .

3. Results

3.1. Indicator Value of Characters of Two Oil-Tea Camellia Cultivars in the Developmental Stages

From the appearance of the fruit, it has no significant difference between the two cultivars at different stages of development (Figure 1). Indicator values in different periods of XL106 and XL210 are detailedly described in Tables 1 and 2. Results indicated that a majority of indicators keep increasing with growth of fruit. In particular, the oil content of dried kernels is significantly increased during the developmental stages, XL106 increased from 37.70% to 64.51%, and XL210 increased from 32.09% to 59.92%. Many characters have a larger difference between the two cultivars. XL106 matures early, with small flowers, small leaves, and small fruits, but the oil content is relatively high. However, XL210 matures later, with larger flowers, leaves, and fruits and low oil content. Throughout the development process, the palmitic acid content of XL106 was higher than that of XL210. The average content of oleic acid of XL106 was 75.69%, which was higher than XL210. Simultaneously, other related traits of fruit were systematically measured including fruit weight, fruit height, fruit diameter, single seed weight, seed length, and seed width, which is relatively higher in XL210.

Trait indicatorsSample collection date

Palmitic acid (%)10.97611.78911.66411.32711.16511.217
Palmitoleic acid (%)0.14800.099000
Stearic acid (%)0.891.131.6321.5231.6672.529
Oleic acid (%)66.51676.83976.39477.15479.05878.149
Linoleic acid (%)21.27210.0349.7099.4728.1098.105
Linolenic acid (%)0.1980.2080.5020.51400
Oil content of dried kernels (%)37.7049.9355.2557.1662.2864.51
Single fruit weight (g)8.82249.741211.46327.78628.29611.6192
Fruit volume (cm3)
Fruit height (mm)29.38830.30831.32428.0628.3230.422
Fruit diameter (mm)23.83424.46826.40422.88622.93626.384
Peel thickness (mm)2.8382.5982.862.862.7563.308
Seeds per fruit (pcs)
Fresh seed weight (g)4.13624.51785.65663.59423.67045.7838
Fresh seed rate (%)46.8846.3849.3546.1644.2449.78
Single seed weight (g)2.64742.75243.17923.05163.10582.6524
Seed volume (cm3)2.1222.381.962.322.14
Seed length (mm)17.68818.41219.87216.85419.09218.152
Seed width (mm)18.59818.9820.98219.32617.73618.428
Seed thickness (mm)15.43613.30812.3615.90415.8812.366
Seed coat thickness (mm)0.8260.7120.7460.7340.770.756
Kernel-seed ratio0.660.690.680.710.700.72

Trait indicatorsDifferent sampling periods

Palmitic acid (%)16.2215.3516.4016.6514.7814.3416.7213.39
Palmitoleic acid (%)0.1850.1680.2320.1820.1780.1600
Stearic acid (%)1.0751.020.911.2282.3791.0630.7581.233
Oleic acid (%)65.0771.5063.1466.8871.0570.4571.9076.52
Linoleic acid (%)17.0211.7719.1714.8511.2413.6510.478.75
Linolenic acid (%)0.430.
Oil content of dried kernels (%)32.0959.3443.5746.5255.3363.6461.1359.92
Single fruit weight (g)32.0435.9432.6231.0031.5840.0231.4032.62
Fruit volume (cm3)30.7535.753230.2530.253931.1332.5
Fruit height (mm)33.4334.7136.4135.1835.4237.6135.1034.48
Fruit diameter (mm)40.2739.9139.2338.0537.2141.3639.2438.82
Peel thickness (mm)
Seeds per fruit (pcs)66.2554.255.57.2544.5
Fresh seed weight (g)14.2313.9413.7312.2811.4415.9712.709.46
Fresh seed rate (%)44.4238.7842.0839.6136.2339.8940.4528.99
Single seed weight (g)2.19782.6373.11663.22582.33962.42463.09322.633
Seed volume (cm3)1.761.962.
Seed length (mm)17.17216.47220.36818.09215.45218.44816.2817.37
Seed width (mm)19.0218.09619.94620.97620.61817.88421.55219.604
Seed thickness (mm)12.99413.1714.51616.61611.72813.18217.35214.16
Seed coat thickness (mm)0.7580.8320.7880.6880.6120.60.7120.738
Kernel-seed ratio0.62100.65380.67810.67630.67310.67870.69110.6848
DGAT (U/g)0.25310.17940.21280.25430.18650.17200.18150.1874
AI (U/g)4768.008757.177772.196450.675235.866746.177419.235391.82
G6PD (U/g)79.384265.178977.787286.528961.312464.086259.883578.2915
CWI (U/g)1180.621361.162244.741675.512038.712329.71291.061471.60
ACCase (U/g)0.43170.39940.42040.40600.38370.37380.29500.2523

DGAT: diacylglycerol acyltransferase; AI: invertase; G6PD: glucose-6-phosphatase dehydrogenase; CWI: cell wall-bound invertase; ACCase: acetyl coenzyme A carboxylase.
3.2. RNA-Seq Analysis

In order to explore the molecular basis of the above-described morphological differences in seed development, RNA-Seq analyses were conducted to generate transcriptome profiles. A total of 11.52 million clean reads from the samples were screened following a series of data filtering. The proportion of the GC accounts for 49% (Supplementary Table 1), and the percentage of reads with an average quality score of more than 30 was more than 94% (Supplementary Table 1), indicating that the sequencing data was accurate, quality, and sufficient for further analysis. 174310 unigenes were assembled by using trinity, with an average length of 591.16 bp (Supplementary Table 2). The median length of contig (N50) was 816 bp, and the range of 69097 unigenes is from 200 to 300 bp (Supplementary Table 3).

3.3. Differential Gene Expression during Seed Kernel Development in Two Cultivars

The transcriptome analysis of different stages of seed kernel development can provide crucial system-level insights into the molecular mechanisms underlying seed development. By convention, the picking time of XL106 and XL210 is selected on October 7 and October 22, respectively, where the oil content is usually the highest. Therefore, this time point with other groups was compared, and the corresponding results are exhibited in Figure 2. In the XL106 group, compared with XL106_1007, the number of DEG in the initial stage (XL106_0915 with 1518 DEG) and the mature stage (XL106_1022 with 1507 DEG) is higher than other developmental stages (including XL106_0922 with 626, XL106_0930 with 503DEG, and XL106_1014 with 824 DEG), in which the number of downregulated genes is more than that of the number of upregulated genes (Figure 2(a)).

Among the eight stages of XL210, 1617 DEGs and 1253 DEGs were detected in the early stage (XL210_0915 and XL210_0922), in which the number of upregulated genes is lower than that of downregulated genes. 1396 and 854 DEGs separately were screened in the middle developmental stage (XL210_1007 and XL210_1014), in which the number of upregulated genes was more than that of downregulated genes. At seed maturity (XL210_1029 and XL210_1105), 1173 and 951 DEGs are identified, and the number of upregulated genes is more than that of the downregulated genes (Figure 2(b)).

3.4. KEGG Analysis of Differential Genes

KEGG pathway analysis was conducted to explore the function of DEGs at different stages of two cultivars. The results of XL106 showed that the enriched pathway of KEGG in the initial period (XL106_1007 vs. XL106_0915) and mature period (XL106_1007 vs. XL106_1022) was roughly similar and mainly enriched in the pathway, including energy metabolism, amino acid metabolism, FA metabolism, carbohydrate metabolism, and biosynthesis of other secondary metabolites. These pathways are positively related to the FA metabolism of oil-tea Camellia (Figure 3(a)). However, compared with the initial period, the pathway of membrane transport was significantly enriched in the mature period. For the XL210, although the initial and mature stages (XL210_1022 vs. XL210_0915 and XL210_1022 vs. XL210_1105) were enriched in pathways similar to XL106, the number of genes in the pathways varied greatly, respectively, reducing nearly half of the gene (Figure 3(b)).

3.5. Dynamic Changes of DEGs in Different Developmental Processes of Two Cultivars

Based on STEM time series analysis, trend analysis was performed on XL106 (the 6 time points) (Figure 4(a)) and XL210 (8 time points) (Figure 4(b)) to screen related trend gene sets. All of the genes were clustered into 26 profiles, of which nine trend profiles were significantly enriched () (colored block). Through continuously observing the fatty acid content, we more focus on the gene set: continuously increased gene sets and gene set of “down-up-down.” In XL106, the trend gene set of No.9 and No.16 was screened, and the analysis of KEGG enrichment was conducted. The results suggested that many genes annotated to translation (62 genes), overview (58 genes), and carbohydrate metabolism (55 genes) in gene set of No.9. The No.16 gene set indicated that 73 genes annotated carbohydrate metabolism, 54 genes for energy metabolism, and 53 genes for translation. Similarly, we more focus on cluster 0 in the XL210, indicating that carbohydrate metabolism, translation, amino acid metabolism, and fatty acid metabolism were the mainly annotated pathways.

3.6. Correlation Analysis between Fatty Acid-Related Genes and Phenotype Indicates the Differential Regulatory Network of Two Cultivars

Firstly, the FA-related genes were screened from the top 20 data of GO and KEGG pathway (cluster 9 and cluster 16 in XL106 and cluster 0 in XL210), including 1041 candidate genes related to fatty acid was selected in the XL106 group (Supplementary Table 4) and 202 related genes were screened in the XL210 group (Supplementary Table 5). Subsequently, candidate genes and trait dataset were combined to conduct correlation analysis (Figure 5). We further obtained a pair of strong correlations between genes and traits and finally screened the top 10 genes with positive correlations for each trait. The connection between genes and traits represents a strong correlation, and the genes connecting multiple traits were the key genes.

In the trait-gene correlation network diagram of XL106, a number of genes with strong correlations with different physiological indicators were found including G37003_c1_g1 (DnaJ protein), G33655_c2_g1 (glycerol-3-phosphate acyltransferase), G36811_c3_g1 (lipid transfer protein), G22894_c1_g1 (chlorophyll a-b binding protein), G37717_c0_g3 (respiratory burst oxidase homolog protein), G35286_c1_g4 (mannitol dehydrogenase), G_3241_g3 (uncharacterized protein), and G3241_g2 (uncharacterized protein), which have a strong connection with several key traits such as the oil content of dried kernels, oleic acid, linoleic acid, linolenic acid, fresh seed rate, and kernel-to-seed ratio.

Similarly, in XL210, many genes were found, including G29262_c0_g1, G34899_c4_g3 (scarecrow-like protein), G26200_c0_g1 (uncharacterized protein), G7298_c0_g1 (polygalacturonase At1g48100), G34002_c3_g6 (sucrose synthase), and G34160_c3_g3 (uncharacterized protein). And the genes have a strong connection with dried kernel oil content, oleic acid, linoleic acid, linolenic acid, fresh seed rate, and kernel-seed ratio. Specific related genes and molecular functions are also listed in Table 3.

Gene IDNR database IDNR database notesKO notesMetabolic pathway

G37003_c1_g1OVA02738.1DnaJ domain (Macleaya cordata)
G33655_c2_g1XP_017231148.1Predicted: probable glycerol-3-phosphate acyltransferase 8 (Daucus carota)Glycerol-3-phosphate acyltransferaseGlycerophospholipid metabolism
G36811_c3_g1CAN67019.1Hypothetical protein VITISV_027707 (Vitis vinifera)
G22894_c1_g1XP_002284493.1Predicted: chlorophyll a-b binding protein 13, chloroplastic (Vitis vinifera)Light-harvesting complex II chlorophyll a/b binding protein 3Photosynthesis—antenna proteins
G37717_c0_g3AII25876.1NADPH oxidase A (Camellia sinensis)Respiratory burst oxidaseMAPK signaling pathway
G35286_c1_g4XP_010095885.1Putative mannitol dehydrogenase (Morus notabilis)Cinnamyl-alcohol dehydrogenasePhenylpropanoid biosynthesis
G32414_c2_g5XP_007010484.1Predicted: dnaJ protein homolog (Theobroma cacao)DnaJ homolog subfamily A member 2Protein processing in endoplasmic reticulum
G35432_c0_g2NP_001291336.111S globulin seed storage protein 2 precursor (Sesamum indicum)
G29262_c0_g1XP_011100902.1Alkane hydroxylase MAH1-like (Sesamum indicum)
G34899_c4_g3XP_015062026.1Predicted: scarecrow-like protein 22 (Solanum pennellii)
G26200_c0_g1CDO98741.1Unnamed protein product (Coffea canephora)
G7298_c0_g1XP_002266600.1Predicted: polygalacturonase At1g48100 (Vitis vinifera)
G34002_c3_g6AHL29281.1Sucrose synthase 1 (Camellia sinensis)Sucrose synthaseStarch and sucrose metabolism
G34160_c3_g3XP_011075467.2Kirola-like (Sesamum indicum)

3.7. Specific Genes Involved in the Production of Particular Fatty Acids and the Regulation and Synthesis of Neutral Lipid or Triacylglycerol

Linoleic acid and linolenic acid are essential fatty acids for the human body and must be supplied by food. Thus, we focused on the genes related to synthesis. As shown in Figure 6, we screened the genes related to linoleic acid including G35432_c0_g3 (globulin seed storage protein), G35764_c0_g1 (protein SCAR3), G36627_c1_g1 (polyadenylate-binding protein), g33751_c3_g1 (actin-related protein), G27835_c5_g2 (heterogeneous nuclear ribonucleoprotein), G12584_c0_g1 (calmodulin-like protein), G35432_c9_g1 (globulin seed storage protein), G35432_c5_g1 (globulin seed storage protein), G35432_c0_g2 (globulin seed storage protein), and G19961_c0_g2 (uncharacterized protein). G37599_c0_g1 (pleiotropic drug resistance protein), G36542_c0_g6 (ribonucleotide reductase large subunit B family protein), G22894_c1_g1 (chlorophyll a-b binding protein 13), G34392_c4_g1 (hypothetical protein), and G20726_c0_g1 (histone H2A-like) were found to be related to linolenic acid.

3.8. RT-qPCR Validation of the Transcriptomic Data

Finally, 12 genes related to FA (8 genes in XL106 and 4 genes in XL210) were screened, and the expression levels in different developmental stages were analyzed using RT-qPCR. The results indicated that the majority of 8 genes in XL106 had similar expression patterns between the RNA-Seq results and RT-qPCR (Figure 6). Most of the genes showed a continuous increase over time and maintained a relatively stable state.

In XL210, the expression line graph showed that the expression trend of 4 genes at eight time points was almost identical, indicating that the expression quantity increased gradually with time and fluctuates at the last two time points. The results validated the good consistency of the RNA-Seq data and RT-qPCR gene expression.

4. Discussion

Understanding the molecular regulation mechanisms of FA is of great importance to instruct the production and FA yield in oil-tea Camellia. In this study, transcriptomic comparison was performed between several developmental stages of XL106 and XL210 seed kernel. The results provided comprehensive information about genes related to the determination of FA.

According to the changes of FA and combined current meteorological conditions, two cultivars were selected for systematic analysis. Usually, XL106 was picked around October 7, and XL210 was picked on October 22; thus, a few key points before and after picking were chosen. We have studied the phenotypic traits and studied the different developmental stages of two cultivars in detail, demonstrating that most indicators maintain growth over time. However, many indicators, including palmitic acid, stearic acid, oleic acid, linoleic acid, and linolenic acid, showed a trend of “down-up-down.” With the development of plants and the maturation of fruit, fatty acid, protein, fat, and other nutrients continue to increase. The phenomenon is commonplace and has been reported in many plants, especially in oil crops including peanut [12] and rapeseed [13]. However, related FAs have not continued to increase during the stages of the time period of picking (September to October). In particular, compared with the conventional picking point, the content of FA including palmitic acid, stearic acid, oleic acid, linoleic acid, and linolenic acid was gradually decreased in the later period, speculating that fruits are starting to senescence after the optimal picking period and fatty acids start to decompose and consume [14].

Therefore, transcriptome analysis was performed and the molecular regulatory mechanisms were explored. As shown in Supplementary Table 1, the average data volume of each sample reached 2.3 G, and the proportion of base quality reaching Q30 is 94%, which is better than other researches, indicating that quality of sequencing is normal and satisfied the following analysis [15, 16]. DEG analysis showed that the number and variation of upregulated and downregulated genes of the two cultivars at different stages have great changes. The differential regulation of gene members in different cultivars may lead to different regulatory networks, which can determine the fruit development and FA content of specific varieties.

KEGG analysis showed that the results of XL106_1007 vs. XL106_0915 (early stage) and XL106_1007 vs. XL106_1022 (later stage) were similar and were mainly enriched in the pathway including energy metabolism, amino acid metabolism, fatty acid metabolism, carbohydrate metabolism, and biosynthesis of other secondary metabolites. These massively concentrated pathways have a great relationship with the synthesis of oil-tea Camellia [17, 18]. In the two stages, the number of KEGG-enriched genes was 385 and 346, respectively, which indicated that a large number of the genes and pathways related to FA remained active in the pre- and poststage of XL106. A large number of genes and pathways remain active. Compared with XL106_1007 vs. XL106_0915, it is worth mentioning that the enrichment results of XL106_1007 vs. XL106_1022 found that there was a distinct and significant enrichment in the metabolic pathways related to membrane transport. The results may mean that more pathways are needed to perform more transmembrane transport that adapted to accommodate the large accumulation of FA at the later stages [19, 20]. Aiming at XL210, the enrichment pathways were found to be similar to XL106, but the initial and later stage numbers of genes on the pathways have greatly changed, 292 and 152 genes, respectively, which means that XL210 was delayed by 2 periods at the time point. It was speculated that the long-term accumulation of fatty acid in the later period could negatively inhibit the activity of relevant metabolic pathways, thereby maintaining the normal growth and metabolic balance of the organism [21].

STEM time series analysis was carried out, and the trend gene set related to the targeted expectation was found according to 6 time points of XL106 and 8 time points of XL210. It was also found that several genes were strongly correlated with different physiological indicators at the same time. In particular, many genes with a strong correlation with the character were screened, including dry kernel oil content, oleic acid, linoleic acid, linolenic acid, fresh seed ratio, and kernel-to-seed ratio.

G33655_c2_g1 (glycerol-3-phosphate acyltransferase) is a kind of acyltransferase, which can play a key role in the metabolic pathway of glycerophosphofatty acid [22, 23]. Previous studies have found that glycerol-3-phosphate acyltransferase played essential roles in the production of triacylglycerol in Phaeodactylum tricornutum [24, 25]. G37717_c0_g3 gene is NADPH oxidase, which can activate or respond to the MAPK signal transduction pathway [26], thus responding to endogenous or exogenous stimuli including growth factors [27], cytokines, radiation, and osmotic pressure [28]. It is speculated that it can respond to the flow of metabolic products, such as oleic acid. G35286_c1_g4 (mannitol dehydrogenase) and G34002_c3_g6 (sucrose synthase) participate in the regulation of nutrients or energy metabolism and fatty acid metabolism by affecting ammonium phenylpropionate metabolism, endoplasmic reticulum protein processing, and starch and sugar metabolism pathways [29].

In the synthesis process of oil-tea Camellia fatty acid metabolites and other metabolites, it will be affected by the regulation of a lot of genes, including signal transduction, energy metabolism, the product of base material synthesis, and key synthetase gene functions [3032]. These different levels of gene interaction formed a complex biomolecular regulatory network, which in turn affects the synthesis, accumulation, and metabolism of fatty acid metabolites [33, 34]. Overall, on the one hand, we revealed the dynamic change of various FAs. On the other hand, the analyses of the comprehensive transcriptome dataset in this study provided a useful genomic resource for fruit picking and provided molecular insights into the related FA genes.

Data Availability

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Conflicts of Interest

All authors declare that they have no financial or other conflicts of interest in relation to this research and its publication.


This study is grateful to Yanming Xu, Xiangnan Wang, Rui Wang, Zhilong He, and Yinghe Peng for their help in data analysis. This research was supported by the Science and Technology Major Program of Hunan Province, China (Grant No. 2018NK1030).

Supplementary Materials

Supplementary Table 1: transcriptome base statistics after filtration. Supplementary Table 2: the length range of transcript and unigene. Supplementary Table 3: the length of transcript, unigene, and N50. Supplementary Table 4: candidate genes related to fatty acid in XL106. Supplementary Table 5: candidate genes related to fatty acid in 202. (Supplementary Materials)


  1. J.-L. Feng, Z. J. Yang, S. P. Chen, Y. A. el-Kassaby, and H. Chen, “High throughput sequencing of small RNAs reveals dynamic microRNAs expression of lipid metabolism during Camellia oleifera and C. meiocarpa seed natural drying,” BMC Genomics, vol. 18, no. 1, p. 546, 2017. View at: Publisher Site | Google Scholar
  2. Y. T. Cheng, C. C. Lu, and G. C. Yen, “Beneficial effects of Camellia oil (Camellia oleifera Abel.) on hepatoprotective and gastroprotective activities,” Journal of Nutritional Science and Vitaminology, vol. 61, pp. S100–S102, 2015. View at: Publisher Site | Google Scholar
  3. P. Lin, K. Wang, C. Zhou, Y. Xie, X. Yao, and H. Yin, “Seed transcriptomics analysis in Camellia oleifera uncovers genes associated with oil content and fatty acid composition,” International Journal of Molecular Sciences, vol. 19, no. 1, 2018. View at: Publisher Site | Google Scholar
  4. C. Yang, X. Liu, Z. Chen, Y. Lin, and S. Wang, “Comparison of oil content and fatty acid profile of ten new Camellia oleifera cultivars,” Journal of Lipids, vol. 2016, Article ID 3982486, 6 pages, 2016. View at: Publisher Site | Google Scholar
  5. H. Liu, J. Gu, Q. Lu et al., “Transcriptomic analysis reveals the high-oleic acid feedback regulating the homologous gene expression of stearoyl-ACP desaturase 2 (SAD2) in peanuts,” International Journal of Molecular Sciences, vol. 20, no. 12, p. 3091, 2019. View at: Publisher Site | Google Scholar
  6. F. Li, K. E. Kaczor-Urbanowicz, J. Sun et al., “Characterization of human salivary extracellular RNA by next-generation sequencing,” Clinical Chemistry, vol. 64, no. 7, pp. 1085–1095, 2018. View at: Publisher Site | Google Scholar
  7. X. Mao, T. Cai, J. G. Olyarchuk, and L. Wei, “Automated genome annotation and pathway identification using the KEGG Orthology (KO) as a controlled vocabulary,” Bioinformatics, vol. 21, no. 19, pp. 3787–3793, 2005. View at: Publisher Site | Google Scholar
  8. S. Gotz, J. M. Garcia-Gomez, J. Terol et al., “High-throughput functional annotation and data mining with the Blast2GO suite,” Nucleic Acids Research, vol. 36, no. 10, pp. 3420–3435, 2008. View at: Publisher Site | Google Scholar
  9. A. Mortazavi, B. A. Williams, K. McCue, L. Schaeffer, and B. Wold, “Mapping and quantifying mammalian transcriptomes by RNA-Seq,” Nature Methods, vol. 5, no. 7, pp. 621–628, 2008. View at: Publisher Site | Google Scholar
  10. L. Kumar and M. E. Futschik, “Mfuzz: a software package for soft clustering of microarray data,” Bioinformation, vol. 2, no. 1, pp. 5–7, 2007. View at: Publisher Site | Google Scholar
  11. H. Liu, S. Dong, F. Gu et al., “NBS-LRR protein Pik-H4 interacts with OsBIHD1 to balance rice blast resistance and growth by coordinating ethylene-brassinosteroid pathway,” Frontiers in Plant Science, vol. 8, p. 127, 2017. View at: Publisher Site | Google Scholar
  12. H. Liu, Y. Hong, Q. Lu et al., “Integrated analysis of comparative lipidomics and proteomics reveals the dynamic changes of lipid molecular species in high-oleic acid peanut seed,” Journal of Agricultural and Food Chemistry, vol. 68, no. 1, pp. 426–438, 2020. View at: Publisher Site | Google Scholar
  13. P. Liao, H. K. Woodfield, J. L. Harwood, M. L. Chye, and S. Scofield, “Comparative transcriptomics analysis of Brassica napus L. during seed maturation reveals dynamic changes in gene expression between embryos and seed coats and distinct expression profiles of acyl-CoA-binding proteins for lipid accumulation,” Plant & Cell Physiology, vol. 60, no. 12, pp. 2812–2825, 2019. View at: Publisher Site | Google Scholar
  14. N. Iqbal, N. A. Khan, A. Ferrante, A. Trivellini, A. Francini, and M. I. R. Khan, “Ethylene role in plant growth, development and senescence: interaction with other phytohormones,” Frontiers in Plant Science, vol. 8, p. 475, 2017. View at: Publisher Site | Google Scholar
  15. W. Xie, J. Zhang, X. Zhao, Z. Zhang, and Y. Wang, “Transcriptome profiling of Elymus sibiricus, an important forage grass in Qinghai-Tibet Plateau, reveals novel insights into candidate genes that potentially connected to seed shattering,” BMC Plant Biology, vol. 17, no. 1, p. 78, 2017. View at: Publisher Site | Google Scholar
  16. J. Hu, D. Yang, H. Wang, C. Li, Y. Zeng, and W. Chen, “CpG oligodeoxynucleotides induce differential cytokine and chemokine gene expression profiles in Dapulian and landrace pigs,” Frontiers in Microbiology, vol. 7, article 1992, 2016. View at: Publisher Site | Google Scholar
  17. B. Dong, B. Wu, W. Hong et al., “Transcriptome analysis of the tea oil camellia (Camellia oleifera) reveals candidate drought stress genes,” PLoS One, vol. 12, no. 7, article e0181835, 2017. View at: Publisher Site | Google Scholar
  18. B. Wu and C. Ruan, “Comparative transcriptomic analysis of high- and low-oil Camellia oleifera reveals a coordinated mechanism for the regulation of upstream and downstream multigenes for high oleic acid accumulation,” 3 Biotech, vol. 9, no. 7, p. 257, 2019. View at: Publisher Site | Google Scholar
  19. G. Z. Jiang, M. Zhou, D. D. Zhang, X. F. Li, and W. B. Liu, “The mechanism of action of a fat regulator: glycyrrhetinic acid (GA) stimulating fatty acid transmembrane and intracellular transport in blunt snout bream (Megalobrama amblycephala),” Comparative Biochemistry and Physiology Part A, Molecular & Integrative Physiology, vol. 226, pp. 83–90, 2018. View at: Publisher Site | Google Scholar
  20. N. Guan, B. Du, J. Li et al., “Comparative genomics and transcriptomics analysis-guided metabolic engineering of Propionibacterium acidipropionici for improved propionic acid production,” Biotechnology and Bioengineering, vol. 115, no. 2, pp. 483–494, 2018. View at: Publisher Site | Google Scholar
  21. N. Meng, Z. Y. Ren, X. F. Yang, and Q. H. Pan, “Effects of simple rain-shelter cultivation on fatty acid and amino acid accumulation in 'Chardonnay' grape berries,” Journal of the Science of Food and Agriculture, vol. 98, no. 3, pp. 1222–1231, 2018. View at: Publisher Site | Google Scholar
  22. X. Men, J. Shi, W. Liang et al., “Glycerol-3-phosphate acyltransferase 3 (OsGPAT3) is required for anther development and male fertility in rice,” Journal of Experimental Botany, vol. 68, no. 3, pp. 513–526, 2017. View at: Publisher Site | Google Scholar
  23. N. Sui, S. Tian, W. Wang, M. Wang, and H. Fan, “Overexpression of glycerol-3-phosphate acyltransferase from Suaeda salsa improves salt tolerance in Arabidopsis,” Frontiers in Plant Science, vol. 8, p. 1337, 2017. View at: Publisher Site | Google Scholar
  24. M. B. Garcia-Fabiani, M. A. Montanaro, P. Stringa et al., “Glycerol-3-phosphate acyltransferase 2 is essential for normal spermatogenesis,” The Biochemical Journal, vol. 474, no. 18, pp. 3093–3107, 2017. View at: Publisher Site | Google Scholar
  25. Y. F. Niu, X. Wang, D. X. Hu et al., “Molecular characterization of a glycerol-3-phosphate acyltransferase reveals key features essential for triacylglycerol production in Phaeodactylum tricornutum,” Biotechnology for Biofuels, vol. 9, no. 1, p. 60, 2016. View at: Publisher Site | Google Scholar
  26. Y. Kadota, K. Shirasu, and C. Zipfel, “Regulation of the NADPH oxidase RBOHD during plant immunity,” Plant & Cell Physiology, vol. 56, no. 8, pp. 1472–1480, 2015. View at: Publisher Site | Google Scholar
  27. Y. Qu, M. Yan, and Q. Zhang, “Functional regulation of plant NADPH oxidase and its role in signaling,” Plant Signaling & Behavior, vol. 12, no. 8, article e1356970, 2017. View at: Publisher Site | Google Scholar
  28. W. Wang, D. Chen, X. Zhang, D. Liu, Y. Cheng, and F. Shen, “Role of plant respiratory burst oxidase homologs in stress responses,” Free Radical Research, vol. 52, no. 8, pp. 826–839, 2018. View at: Publisher Site | Google Scholar
  29. C. Liu, X. Chen, P. Ma et al., “Ethylene responsive factor MeERF72 negatively regulates sucrose synthase 1 gene in cassava,” International Journal of Molecular Sciences, vol. 19, no. 5, article 1281, 2018. View at: Publisher Site | Google Scholar
  30. Q. Xu, L. Cheng, Y. Mei et al., “Alternative splicing of key genes in LOX pathway involves biosynthesis of volatile fatty acid derivatives in tea plant (Camellia sinensis),” Journal of Agricultural and Food Chemistry, vol. 67, no. 47, pp. 13021–13032, 2019. View at: Publisher Site | Google Scholar
  31. Q. Li, S. Lei, K. Du et al., “RNA-seq based transcriptomic analysis uncovers α-linolenic acid and jasmonic acid biosynthesis pathways respond to cold acclimation in Camellia japonica,” Scientific Reports, vol. 6, no. 1, article 36463, 2016. View at: Publisher Site | Google Scholar
  32. Y. Xie and X. Wang, “Comparative transcriptomic analysis identifies genes responsible for fruit count and oil yield in the oil tea plant Camellia chekiangoleosa,” Scientific Reports, vol. 8, no. 1, article 6637, 2018. View at: Publisher Site | Google Scholar
  33. J. A. Aznar-Moreno and T. P. Durrett, “Simultaneous targeting of multiple gene homeologs to alter seed oil production in Camelina sativa,” Plant & Cell Physiology, vol. 58, no. 7, pp. 1260–1267, 2017. View at: Publisher Site | Google Scholar
  34. Q. Y. Yao, H. Huang, Y. Tong, E. H. Xia, and L. Z. Gao, “Transcriptome analysis identifies candidate genes related to triacylglycerol and pigment biosynthesis and photoperiodic flowering in the ornamental and oil-producing plant, Camellia reticulata (Theaceae),” Frontiers in Plant Science, vol. 7, p. 163, 2016. View at: Publisher Site | Google Scholar

Copyright © 2020 Shaofeng Peng et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

More related articles

 PDF Download Citation Citation
 Download other formatsMore
 Order printed copiesOrder

Related articles

Article of the Year Award: Outstanding research contributions of 2020, as selected by our Chief Editors. Read the winning articles.