What is 18S /ITS Amplicon Sequencing Analysis?
16S/18S/Internal Transcribed Spacer (ITS) amplicon sequencing is widely used in microbial research because of its low cost, short cycle time, and low sample quality requirements. Among them, 18S and ITS sequencing is commonly used to identify fungi and other eukaryotes in environmental samples. Both methods use PCR to amplify DNA with specific primers, and the results are obtained after sequence processing, sequence analysis, and comparison. ITS sequencing is more focused on studying the intraspecific genetic diversity of fungi due to greater ITS variability, while 18S rRNA focuses on phylogenetic taxonomic studies of fungi. In recent years, scientists are developing full-length 18S/ITS amplicon sequencing methods based on PacBio and Oxford Nanopore sequencing technologies to identify a wider range of species and distinguish strains within species.
Fig. 1. Taxonomic classification of eukaryotic microbes using 18S amplicon sequencing. (Popovic A, et al, 2018)
Application of Full-length 18S / ITS Amplicon Sequencing Analysis
Our full-length 18S/ITS amplicon sequencing analysis can be used for but not limited to the following research:
Environmental microbiology: our full-length 18S/ITS amplicon sequencing analysis can help you characterize microbial communities in the environment and reveal their role in nutrient cycling, bioremediation, and overall ecosystem function.
Human microbiome research: our full-length 18S/ITS amplicon sequencing analysis can characterize microbial communities in different body parts, such as the gut, mouth, skin and reproductive tract. This analysis helps reveal the complex relationships between the human microbiome and various health conditions.
Agriculture and food safety: our full-length 18S/ITS amplicon sequencing analysis can be used to study microbial communities associated with crop health, soil fertility and plant-microbe interactions. It also helps to detect and monitor foodborne pathogens and spoilage microbes, ensuring food safety and quality throughout the production and distribution chain.
Biotechnology and pharmaceutical research: our full-length 18S/ITS amplicon sequencing analysis allows monitoring and characterizing these microbial communities, helping you to identify potential contaminants, optimize production conditions, and develop effective microbial control strategies.
Full-length 18S/ITS Sequencing Process
Fig 2 The Process of Full-length 18S/ITS Sequencing
Sample Submission Guidelines of Sequencing
CD Genomics Data Analysis Pipeline:
CD Genomics is a leading company in genomics and sequencing services, providing comprehensive and advanced bioinformatics analysis of full-length 18S/ITS amplicon sequencing data. Utilizing cutting-edge technology and expertise in the field, CD Genomics assists researchers and scientists in unraveling the complexity of microbial communities and their functions.
(1) Data pre-processing
Quality control and filtering of raw sequencing data obtained by sequencing full-length 18S/ITS amplicons to remove low-quality reads, adapter sequences and artifacts. CD Genomics employs stringent quality control measures to ensure reliable downstream analysis.
(2) Sequence alignment and clustering
CD Genomics performs sequence alignment using specialized algorithms and databases to map sequencing reads to reference databases, such as SILVA or UNITE for the 18S and ITS regions, respectively.
(3) Taxonomic analysis and diversity analysis
CD Genomics uses advanced bioinformatics tools to assign taxonomic identities to OTUs and generate taxonomic analysis histograms and abundance heat maps.
(4) Functional analysis:
CD Genomics can predict the abundance of functional pathways based on 16S copy number.
(5) Statistical analysis and visualization
CD Genomics uses various statistical methods and visualization techniques to reveal significant differences and relationships in the data. Multivariate statistical analyses are used to identify changes in microbial communities under different conditions or treatments. In addition, visualization methods such as principal component analysis (PCA) and ternary plots facilitate visualization and interpretation of complex microbial data sets.
Full-length 18S / ITS Amplicon Sequencing Analysis Content:
Species Annotation | Tag assembly OTU clustering Taxonomic analysis Abundance heat map generation Phylogenetic tree construction |
Diversity Analysis | Alpha diversity Beta diversity Meta-analysis Multivariate statistical analysis |
What Are the Advantages of Our Services?
Stringent Quality Control
Our protocols are meticulously designed to ensure high-quality RNA extraction and effective ribosomal RNA removal, which are crucial for obtaining accurate sequencing results.
Reliable Data
We maintain data integrity by implementing rigorous quality assessment steps, including adapter trimming and the removal of low-quality sequences.
State-of-the-Art Sequencing Platforms
By utilizing cutting-edge sequencing technology, we achieve high-throughput and precise 18S/ITS amplicon sequencing.
Innovative Techniques
Our advanced methodologies significantly enhance the detection of target RNA molecules, thereby improving the sensitivity and specificity of our sequencing processes.
Sophisticated Bioinformatics Tools
Our comprehensive data analysis pipeline includes quality control, sequence alignment, and functional annotation, ensuring detailed insights into microbial communities.
Functional and Pathway Analysis
We conduct in-depth analysis of differentially expressed genes and their enrichment in biological pathways, elucidating the functional roles of microbial species.
Example Data Analysis Report
To demonstrate the quality and detail of a CD Genomics report for Full-length 18S Amplicon data analysis, we have a sample report available. Contact us to request our Full-length 18S Amplicon data report. You can also refer to a client-published article, " Blocking IL-17A prevents oxycodone-induced depression-like effects and elevation of IL-6 levels in the ventral tegmental area and reduces oxycodone-derived physical dependence in rats." which includes some of the data we provided.
How It Works
CD Genomics is a high-tech company specializing in multiomic data analysis. We provide services such as project design, data analysis, and database construction. With a focus on developing breakthrough products and services, we are a pioneer in the biotechnology industry, serving researchers and partners worldwide.
CD Genomics provides valuable support and insight in areas such as these applications with its extensive experience and expertise in bioinformatics analysis of full-length 18S / ITS amplicon sequencing. We can help you gain a deeper understanding of the diversity, composition and function of your microbial community. From species annotation and diversity analysis to functional prediction and enterotype analysis, CD Genomics offers a comprehensive solution for unlocking the mysteries of the microbial world. If you are interested in our services, please contact us for more detailed information.
Reference
- Popovic A, Parkinson J. Characterization of eukaryotic microbiome using 18S amplicon sequencing[J]. Microbiome analysis: methods and protocols, 2018: 29-48.
What Does Analysis of Full-length 18S / ITS Amplicon Sequenceing Show?
Data Processing and Statistics
Table 1 Data statistics of the quality control (Top 10).
Sample Name | Raw_Reads(nt) | Clean Tags (nt) | Avglen (nt) | GC Conter (%) | Q20 (%) | Q30 (%) |
C1_1_M | 228272 | 98032 | 252 | 50.7 | 99.23 | 97.11 |
C1_3_M | 210990 | 91908 | 252 | 50.64 | 99.26 | 97.18 |
C1_2_M | 196918 | 85044 | 252 | 50.49 | 99.3 | 97.32 |
C2_2_F | 182560 | 79198 | 252 | 51.21 | 99.24 | 97.11 |
C4_1_M | 174606 | 75278 | 252 | 54.84 | 99.24 | 97.11 |
C4_3_F | 168318 | 70309 | 252 | 55.61 | 99.22 | 97.03 |
C1_2_F | 165110 | 70902 | 252 | 50.27 | 99.21 | 97.04 |
C16_2_M | 159294 | 69376 | 252 | 49.1 | 99.18 | 96.96 |
C8_2_F | 153876 | 64221 | 252 | 55.16 | 99.26 | 97.17 |
C3_4_M | 148624 | 61667 | 252 | 56.79 | 99.27 | 97.12 |
Fig 3 Effective tags length distribution of sample ID C1_1_M.
Feature table construction
Fig 4 The frequency of each sample
Species Annotation and Taxonomic Analysis
Taxonomy Distribution Histogram of All Samples
Fig 5 The taxonomy distribution of all samples in the Phylum classification level.
Species Abundance Heatmap
Fig 6 Species abundance Heatmap. Phylum.
Alpha Diversity Analysis
Statistical Data of Alpha Diversity
Table 2. Statistics of Alpha diversity indices (Top 10).
Sample | Observed species | ace | Chao 1 | Simpson | Shannon |
S145_1 | 910 | 910 | 910 | 0.654985035 | 3.914181389 |
S145_2 | 1891 | 1891 | 1891 | 0.830694297 | 5.83462709 |
S146_1 | 158 | 158 | 158 | 0.961684233 | 5.833334444 |
S146_2 | 87 | 87 | 87 | 0.55398131 | 1.616963214 |
S147_1 | 100 | 100 | 100 | 0.529850969 | 1.550806285 |
S147_2 | 802 | 802 | 802 | 0.594395889 | 2.948466552 |
S148_1 | 1141 | 1141 | 1141 | 0.789908199 | 4.70196164 |
S148_2 | 233 | 233 | 233 | 0.46312489 | 2.44111006 |
S149_1 | 3312 | 3312 | 3312 | 0.998912511 | 10.74202729 |
S149_2 | 199 | 199 | 199 | 0.538326826 | 1.745920168 |
Rarefaction Curve
Fig 7 Rarefaction curve of the sequenced reads for all samples (The above figure) and The depth of the sequencing samples (The below figure).
Beta Diversity Analysis
Boxplot Analysis
Fig 8 Boxplot analysis based on Bray Curtis.
PCoA Analysis
Fig 9 PCoA analysis based on Bray Curtis.
UPGMA Analysis
Fig 10 UPGMA clustering tree based on unweighted unifrac. The different colors represent different groups.
Significant Difference Analysis
ANCOM Analysis
Table 3. ANCOM analysis partial result in phylum-level in GROUP.
Percentile | 0 | 25 | 50 | 75 | 100 | 0 | 25 | 50 | 75 | 100 |
Group | AHCC | AHCC | AHCC | AHCC | AHCC | Antibiotic | Antibiotic | Antibiotic | Antibiotic | Antibiotic |
d__Bacteria;p__Bacteroidota | 1007 | 2247 | 5634 | 9116 | 14790 | 1 | 16 | 4664 | 16765 | 24293 |
d__Bacteria;p__Proteobacteria | 1 | 1 | 1 | 18 | 174 | 16 | 6645 | 7417 | 10599 | 25746 |
d__Bacteria;p__Firmicutes | 3488 | 7413 | 9790 | 10722 | 18251 | 3285 | 5717 | 7584 | 13450 | 19714 |
d__Bacteria;p__Deferribacterota | 19 | 32 | 91 | 190 | 591 | 1 | 1 | 1 | 1 | 27 |
Fig 11 Mean proportion of treated and control group.
Title: PacBio next-generation sequencing uncovers Apicomplexa diversity in different habitats
Publication: Scientific Reports
Main Methods: full-length 18S rRNA amplicon sequencing
Abstract:
The phylum Apicomplexa includes intracellular protozoan parasites causing diseases like toxoplasmosis and cryptosporidiosis. This study used next-generation 18S rRNA amplicon sequencing with PacBio technology to explore Apicomplexa diversity in riverine environments. PCA, PCoA, PERMANOVA, ANOSIM, Cluster analysis, and Venn diagrams indicated habitat heterogeneity. Dominant genera in inlet samples were Gregarina, Cryptosporidium, and Leidyana, while outlet samples were dominated by Babesia, Cryptosporidium, and Theileria. Surface water had 16% and 8.33% relative abundance of Toxoplasma and Cryptosporidium, respectively. This is the first comprehensive study of these parasites in Egypt, aiding water quality standards.
Research Results:
The study identified 52,214 microeukaryotic OTUs from 136,127 high-quality reads. PCA showed clustering of physicochemical parameters by habitat, notably between inlet and surface water samples. Lower-quality outlet samples resembled inlet samples, while higher-quality outlet samples were similar to surface water samples. Significant differences in environmental parameters were confirmed by PERMANOVA and ANOSIM. The difference between inlet and surface water parameters was most pronounced compared to inlet vs. outlet and outlet vs. surface water.
Fig 12 Principal component analysis (PCA) plot for spatial variation of physicochemical parameters between different habitats (principal components (PC1 and PC2) explained 69.6% of the total variation).
Table 4. PERMANOVA and ANOSIM test the significance of the differences in physicochemical parameters between the habitats.
Group | PERMANOV | ANOSIM | ||
t | P | R | P | |
Inlet vs. outlet | 2.10 | 0.004 | 0.38 | 0.003 |
Inlet vs. surface water | 5.19 | <0.001 | 0.94 | <0.001 |
Outlet vs. surface water | 3.27 | <0.001 | 0.67 | <0.001 |
The PCoA and Cluster Analysis revealed significant differences in the Apicomplexa community across habitats, with inlet samples distinct from outlet and surface water samples. Apicomplexan species were less than 0.01% of microeukaryotic communities. Comparable patterns were observed in studies from China and the USA. The Venn diagram showed no shared apicomplexan species across habitats, with 89% unique species in inlet samples, and fewer unique species in outlet (4) and surface water samples (7). The study suggests that physicochemical parameters influence apicomplexan community distribution, necessitating further research on specific environmental factors.
Fig 13 Principal coordinate analysis (PCoA) based on Bray Curtis similarity index showing the β-diversity patterns of the Apicomplexa community
Fig 14 Cluster analysis of Apicomplexa OTUs in different environments
Fig 15 Venn shape shows the unique and shared Apicomplexa OTUs in different habitats.
The heatmap revealed more Apicomplexan genera in inlet samples compared to outlet and surface water samples, indicating effective parasite removal during wastewater treatment. This decrease in parasite abundance, from raw sewage to Nile River water, aligns with Freudenthal et al.'s findings on the efficacy of wastewater treatment technology in reducing health risks. Effective pathogen removal is crucial for safe wastewater reuse. Network analysis showed associations between parasitic protists (e.g., Dientamoeba, Entamoeba, Giardia) and potential predators (ciliates, rotifers), suggesting predation as a factor in parasite reduction.
Fig 16 Heatmap showing the distribution of Apicomplexa taxa in different environments.
The study found that the dominant apicomplexan parasites in inlet samples were Gregarina (38.54%), Cryptosporidium (32.29%), and Leidyana (11.90%), while outlet samples were dominated by Babesia (33.33%), Cryptosporidium (25%), and Theileria (16.67%). Apicomplexa were the most abundant protozoa in both inlet (57.2%) and outlet (46.9%) samples, similar to findings in New Zealand wastewater. The relative abundance of Cryptosporidium decreased from inlet to outlet, consistent with previous research showing wastewater as a hotspot for parasites.
Fig 17 Taxonomic composition of the Apicomplexa community. Others refer to unclassified and less contributed taxa.
Conclusions:
Utilizing next-generation 18S rRNA amplicon PacBio sequencing, this study investigated Apicomplexa in Egypt's aquatic habitats, detecting a wide range of parasites. Findings revealed the presence of Cryptosporidium in treated sewage and Nile River samples, indicating potential health risks due to insufficient wastewater treatment. The study suggests the need for national legislation on drinking water and wastewater reuse standards based on WHO guidelines. Further research is needed to explore microeukaryotic parasite diversity in other environments, such as drinking water and swimming pools, using advanced sequencing technologies.
1. What is 18S rRNA sequencing for?
The genes encoding 18S rRNA are known as 18S rRNA genes. These sequences are extensively utilized in molecular analyses to reconstruct the evolutionary history of organisms, particularly in vertebrates. Due to their slow rate of evolution, 18S rRNA genes are ideal for studying ancient divergences.
2. What is the size of 18S rRNA amplicon?
The 18S rRNA gene in eukaryotic microbes, ranging in length from 1500 to 2000 base pairs, contains both conserved and variable regions.
3. What is the difference between ITS and 18S sequencing?
ITS amplicon sequencing is more suitable for studying fungal diversity, offering high accuracy in species identification. In contrast, 18S amplicon sequencing targets the diversity of eukaryotic microbes, providing a broader range of species annotations but with relatively lower accuracy.
4. How do 16S rRNA and 18S rRNA differ from each other?
Although both 16S and 18S rRNA are crucial for ribosome function and protein synthesis, they primarily differ in the organisms they are found in and their uses in molecular biology. Specifically, 16S rRNA is used for identifying and classifying prokaryotic organisms, while 18S rRNA serves the same purpose for eukaryotic organisms.
5. How were the Full-length 18S/ITS Amplicon sequencing data analyzed?
The process begins with merging raw data, performing quality checks, filtering, and removing chimeras to create ASV clusters. This is followed by alpha diversity analysis (including rarefaction, Chao1, and Shannon curves), species annotation (using KRONA and phylogenetic trees), and beta diversity analysis (such as UniFrac distances and PCA). Diversity statistics are then calculated (using NMDS and DCA). Advanced analyses include Metastats, LEfSe, and RDA/CCA.